ORACLE大表关联更新提速方法--688IT编程网

ORACLE⼤表关联更新提速⽅法

前提知识

在oracle数据库中，每⼀⾏记录都有⼀个该记录的唯⼀标识rowid，rowid⼀旦确定不会随意变动。rowid由10个字节存储，在数据库查询中显⽰为18位的字符串，在其内部存储以下信息：1.对象编号。2.记录所在数据⽂件。3.记录所在⽂件上块的编号。4.记录所在块的⾏编号。

在两表的关联更新时，⼀般都会在表上建⽴索引。在表上建⽴索引时，oracle会根据索引字段的内容（key）和该⾏的rowid（value）建⽴⼀个B-tree，⼀般为三层，达到4层时会影响索引性能。当执⾏查询时，先根据关键字段到对应的rowid，再根据rowid到磁盘中到记录的位置将记录查询出来。

⼤表关联更新的瓶颈

执⾏两表关联更新时，oracle⼀般采⽤类似nested_loop的更新⽅式。当依据关联关系将表B的值更新到A表的字段中时，先遍历A表的，对A表的每⾏记录查询B表的索引，得到b表的结果后再更新到A表。由于存在查询索引的操作，更新每条记录都会⾄少执⾏两次io操作。第⼀次查询索引、第⼆次根据索引的rowid查询数据。当执⾏⼤规模数据更新时，速度会很慢。

性能瓶颈突破思路

所谓“成也萧何败萧何”，既然在⼤规模的数据更新中索引存在性能瓶颈，那就想办法在⼤规模数据更新时避免索引的使⽤。由于rowid是记录的唯⼀标识且根据rowid去更新时会⾃动定位记录所在位置（⽐通过索引更快），可以考虑将rowid和要更新的结果批量查询出来再批量去更新。批量查询时，oracle⼀般会采⽤hash关联的⽅式。在两表数据量⽐较⼤时，hash关联⽐通过索引nested_loop关联快很多倍。

测试结果及源码

有了以上思路，可以通过存储过程代替update语句实现更新。在存储过程中可以暂时保存批量查询的结果，依据批量查询的结果执⾏更新。后⾯的测试结果是我在虚拟机上跑出来的，性能差异不明显。如果是物理机性能差距会更⼤。

⾸先建两张表并造⼀些测试数据

drop table test_user purge;

create table test_user

( user_id number(11),

user_name varchar2(64),

user_acct number(11)

) tablespace tbs_all_in_one;

drop table test_relations purge;

create table test_relations

(

user_id number(11),

user_acct number(11),

start_date date,

end_date date

)tablespace tbs_all_in_one;

insert into test_user values (NULL,NULL,NULL);

commit;

--造200w条A表数据

set serveroutput on;

begin

for i in 1..21 loop

insert into test_user select * from test_user;

commit;

dbms_output.put_line('insert loop: '||i);

end loop;

end;

select count(*) from test_user;

--根据rownum更新A表的user_id

update test_user a

set a.user_id=rownum+100000000,

a.user_name='user_name'||to_char(rownum+100000000),

a.user_acct = 0;

commit;

--造400w条B表数据

insert into test_relations

select user_id,user_id+200000000,sysdate-365,sysdate-30-1/86400

from test_user;

commit;

insert into test_relations

select user_id,user_id+200000000,sysdate-30, sysdate+365

from test_user;

commit;

然后建⽴索引，并执⾏oracle的统计命令，统计两个表的信息

create index indx_test_user on test_user(user_id) tablespace tbs_all_in_one;

create index indx_test_relations on test_relations(user_id) tablespace tbs_all_in_one;

exec dbms_stats.gather_table_stats(OWNNAME=>'luhao',tabname=>'test_user',cascade=>true); exec dbms_stats.gather_table_stats(OWNNAME=>'luhao',tabname=>'test_relations',cascade=>true);

普通更新语句的执⾏计划和时间

存储过程更新⽅法

分析上图的源码截图，先通过⼤表关联得到A表中的rowid和要更新的结果。将数据放到数组中，再通过forall语法根据rowid批量更新。存储过程更新时间

存储过程源码

批量更新sql语句create or replace procedure p_test_update

iv_sql VARCHAR2(4000) ;

TYPE tab_acct IS TABLE OF NUMBER(11) INDEX BY binary_integer;

TYPE ref_cur IS REF CURSOR;

rowid_arry dbms_sql.urowid_table;

acct_array tab_acct;

iv_cur ref_cur;

start_time date;

finish_time date;

begin

select sysdate into start_time from dual;

iv_sql := 'wid, b.user_acct from test_user a, test_relations b

where a.user_id = b.user_id and sysdate between b.start_date d_date'; open iv_cur for iv_sql;

loop

fetch iv_cur bulk collect into rowid_arry,acct_array limit 10000;

exit when unt = 0;

--更新acct

forall i unt

update test_user a

set a.user_acct = acct_array(i)

where rowid = rowid_arry(i);

end loop;

close iv_cur;

commit;

select sysdate into finish_time from dual;

dbms_output.put_line('cost seconds:'||(finish_time-start_time)*86400);

end;

限制

只能更新静态数据，不能⽀持事务。

688IT编程网

ORACLE大表关联更新提速方法

发表评论

推荐文章

应用程序的安全检测方法、装置、电子设备和存储介质

nginx map用法正则

VBA之正则表达式(1)--基础篇

Prometheus监控学习笔记之初识PromQL

关于PHP中的webshell

热门文章

m函数数字提取

jest断言方法大全

中兴ZXSEC US 管理员手册

keras系列(一):参数设置

Qt从QString中提取出数字

element input 金额千分位格式化

freemaker 参数解析正则

C#正则验证数字

form表单验证正则

scanf正则表达式用法

grafana value的正则表达式

Android平台浮点数运算应用

js-(JS正则表达式验证数字)

判断Python输入是否是整数,字符,或浮点数

c语言 sscanf 正则规则

从文本中提取数值技巧

js将整数转换成两位浮点数的方法

vue正则限制浮点数

8到20的结尾的正则

shell 正则表达式最后一行

最新文章

应用程序的安全检测方法、装置、电子设备和存储介质

VBA之正则表达式(1)--基础篇

代码编辑的辅助方法、装置及电子设备

SHELL查字符串中包含字符的命令

String方法中replace和replaceAll的区别详解(源码分析)

双字节符号正则

标签列表

688IT编程网

ORACLE大表关联更新提速方法

发表评论

推荐文章

应用程序的安全检测方法、装置、电子设备和存储介质

nginx map用法 正则

VBA之正则表达式(1)--基础篇

Prometheus监控学习笔记之初识PromQL

关于PHP中的webshell

热门文章

m函数数字提取

jest断言方法大全

中兴ZXSEC US 管理员手册

keras系列(一):参数设置

Qt从QString中提取出数字

element input 金额千分位格式化

freemaker 参数解析正则

C#正则验证数字

form表单验证正则

scanf正则表达式用法

grafana value的正则表达式

Android平台浮点数运算应用

js-(JS正则表达式验证数字)

判断Python输入是否是整数,字符,或浮点数

c语言 sscanf 正则规则

从文本中提取数值技巧

js将整数转换成两位浮点数的方法

vue正则限制浮点数

8到20的结尾的正则

shell 正则表达式 最后一行

最新文章

应用程序的安全检测方法、装置、电子设备和存储介质

VBA之正则表达式(1)--基础篇

代码编辑的辅助方法、装置及电子设备

SHELL查字符串中包含字符的命令

String方法中replace和replaceAll的区别详解(源码分析)

双字节符号正则

标签列表

nginx map用法正则

shell 正则表达式最后一行