OLAP 大表和小表並行hash join

乙個表50mb 乙個表10gb 50m表做驅動表,放在pga裡這時候慢在對對 10g 的全表掃瞄對10個g掃瞄塊需要開並行我有這樣乙個演算法乙個程序讀 50mb 8程序來掃瞄 10gb 乙個程序掃瞄 1.25gb 50mb 都分發到 8個程序超大表和小表之間做hash join，一般會啟用用並行，oracle在並行hash join的時候會用到很多技術，比如 hash hash, 或者broadcast 對於超大表和小表做hash join,一定要讓小表進行廣播(broadcast)，通常情況下cbo會選擇正確，但是如果統計資訊不准，或者基數計算錯誤cbo選擇了 hash hash join，這個時候就很慢，觀察現象就是它在做direct path write temp,這個時候就可以用hint pq_distribute 進行調整 pq_distribute(驅動表 none, broadcast) 如果外層表很小(hash_aj)，這個時候可以用 pq_distribute(驅動表 broadcast,none) 下面就是乙個具體的例子, f 是乙個超大表 t 是乙個小表 sql&get; explain plan for select /*+ parallel(f 8) parallel(t 8) use_hash(t,f) full(f) full(t) pq_distribute(f hash, hash) */ * 2 from crs_data_fct f 3 join crs_time_perd_fdim t on t.time_perd_id = f.time_perd_id; explained. elapsed: 00:00:00.83

sql&get; select * from table(dbms_xplan.display);

OLAP 大表和小表並行hash join

Hash表和Hash衝突

小表驅動大表

MySQL 小表驅動大表

OLAP 大表和小表並行hash join

Hash表和Hash衝突

小表驅動大表

MySQL 小表驅動大表

相關推薦