程式師世界 >> 數據庫知識 >> Oracle數據庫 >> Oracle數據庫基礎 >> 循序漸進講解Oracle數據庫的Hash join

循序漸進講解Oracle數據庫的Hash join

編輯：Oracle數據庫基礎

在開發過程中，很多人經常會使用到Hash Map或者Hash Set這種數據結構，這種數據結構的特點就是插入和訪問速度快。當向集合中加入一個對象時，會調用hash算法來獲得hash code，然後根據hash code分配存放位置。訪問的時，根據hashcode直接找到存放位置。

　　Oracle Hash join 是一種非常高效的join 算法，主要以CPU(hash計算)和內存空間(創建hash table)為代價獲得最大的效率。Hash join一般用於大表和小表之間的連接，我們將小表構建到內存中，稱為Hash cluster，大表稱為probe表。

　　效率

　　Hash join具有較高效率的兩個原因：

　　1.Hash 查詢，根據映射關系來查詢值，不需要遍歷整個數據結構。

　　2.Mem 訪問速度是Disk的萬倍以上。

　　理想化的Hash join的效率是接近對大表的單表選擇掃描的。

　　首先我們來比較一下，幾種join之間的效率，首先 optimizer會自動選擇使用hash join。

　　注意到Cost= 221

　　SQL> select * from vendition t,customer b WHERE t.customerid = b.customerid;

　　100000 rows selected.

　　Execution Plan

　　----------------------------------------------------------

　　Plan hash value: 3402771356

　　--------------------------------------------------------------------------------

　　--------------------------------------------------------------------------------

　　| 0 | SELECT STATEMENT | | 106K| 22M| 221 (3)| 00:00:03 |

　　|* 1 | HASH JOIN | | 106K| 22M| 221 (3)| 00:00:03 |

　　| 2 | TABLE Access FULL| CUSTOMER | 5000 | 424K| 9 (0)| 00:00:01 |

　　| 3 | TABLE Access FULL| VENDITION | 106K| 14M| 210 (2)| 00:00:03 |

　　--------------------------------------------------------------------------------

　　不使用hash，這時optimizer自動選擇了merge join。。

　　注意到Cost=3507大大的增加了。

　　SQL> select /*+ USE_MERGE (t b) */* from vendition t,customer b WHERE t.customerid = b.customerid;

　　100000 rows selected.

　　Execution Plan

　　----------------------------------------------------------

　　Plan hash value: 1076153206

　　-----------------------------------------------------------------------------------------

　　-----------------------------------------------------------------------------------------

　　| 0 | SELECT STATEMENT | | 106K| 22M| | 3507 (1)| 00:00:43 |

　　| 1 | MERGE JOIN | | 106K| 22M| | 3507 (1)| 00:00:43 |

　　| 2 | SORT JOIN | | 5000 | 424K| | 10 (10)| 00:00:01 |

　　| 3 | TABLE Access FULL| CUSTOMER | 5000 | 424K| | 9 (0)| 00:00:01 |

　　|* 4 | SORT JOIN | | 106K| 14M| 31M| 3496 (1)| 00:00:42 |

　　| 5 | TABLE Access FULL| VENDITION | 106K| 14M| | 210 (2)| 00:00:03 |

　　-----------------------------------------------------------------------------------------

　　那麼Nest loop呢，經過漫長的等待後，發現Cost達到了驚人的828K，同時伴隨3814337 consistent gets(由於沒有建索引)，可見在這個測試中，Nest loop是最低效的。在給customerid建立唯一索引後，減低到106K，但仍然是內存join的上千倍。

　　SQL> select /*+ USE_NL(t b) */* from vendition t,customer b WHERE t.customerid = b.customerid;

　　100000 rows selected.

　　Execution Plan

　　----------------------------------------------------------

　　Plan hash value: 2015764663

　　--------------------------------------------------------------------------------

　　--------------------------------------------------------------------------------

　　| 0 | SELECT STATEMENT | | 106K| 22M| 828K (2)| 02:45:41 |

　　| 1 | NESTED LOOPS | | 106K| 22M| 828K (2)| 02:45:41 |

　　| 2 | TABLE Access FULL| VENDITION | 106K| 14M| 210 (2)| 00:00:03 |

　　|* 3 | TABLE Access FULL| CUSTOMER | 1 | 87 | 8 (0)| 00:00:01 |

　　HASH的內部

　　HASH_AREA_SIZE在Oracle 9i 和以前，都是影響hash join性能的一個重要的參數。但是在10g發生了一些變化。Oracle不建議使用這個參數，除非你是在MTS模式下。Oracle建議采用自動PGA管理(設置PGA_AGGREGATE_TARGET和WORKAREA_SIZE_POLICY)來，替代使用這個參數。由於我的測試環境是mts環境，自動內存管理，所以我在這裡只討論mts下的hash join。

　　Mts的PGA中，只包含了一些棧空間信息，UGA則包含在large pool中，那麼實際類似hash，sort，merge等操作都是有large pool來分配空間，large pool同時也是auto管理的，它和SGA_TARGET有關。所以在這種條件下，內存的分配是很靈活。

　　Hash連接根據內存分配的大小，可以有三種不同的效果：

　　1.optimal 內存完全足夠

　　2.onepass 內存不能裝載完小表

　　3.multipass workarea executions 內存嚴重不足

　　下面，分別測試小表為50行，500行和5000行，內存的分配情況(內存都能完全轉載)。

　　Vendition表 10W條記錄

　　Customer表 5000

　　Customer_small 500，去Customer表前500行建立

　　Customer_pity 50，取Customer表前50行建立

　　表的統計信息如下：

　　SQL> SELECT s.table_name,S.BLOCKS,S.AVG_SPACE,S.NUM_ROWS,S.AVG_ROW_LEN,S.EMPTY_BLOCKS FROM user_tables S WHERE table_name IN ('CUSTOMER','VENDITION','CUSTOMER_SMALL','CUSTOMER_PITY') ;

　　TABLE_NAME BLOCKS AVG_SPACE NUM_ROWS AVG_ROW_LEN EMPTY_BLOCKS

　　CUSTOMER 35 1167 5000 38 5

　　CUSTOMER_PITY 4 6096 50 37 4

　　CUSTOMER_SMALL 6 1719 500 36 2

　　VENDITION 936 1021 100000 64 88打開10104事件追蹤：(hash 連接追蹤)

　　ALTER SYSTEM SET EVENTS ‘ 10104 TRACE NAME CONTEXT,LEVEL 2’;

　　測試SQL

　　SELECT * FROM vendition a,customer b WHERE a.customerid = b.customerid;

　　SELECT * FROM vendition a,customer_small b WHERE a.customerid = b.customerid;

　　SELECT * FROM vendition a,customer_pity b WHERE a.customerid = b.customerid;

　　小表50行時候的trace分析：

　　*** 2008-03-23 18:17:49.467

　　*** SESSION ID:(773.23969) 2008-03-23 18:17:49.467

　　kxhfInit(): enter

　　kxhfInit(): exit

　　*** RowSrcId: 1 HASH JOIN STATISTICS (INITIALIZATION) ***

　　Join Type: INNER join

　　Original hash-area size: 3883510

　　PS:hash area的大小，大約380k，本例中最大的表也不過250塊左右，所以內存完全可以完全裝載

　　Memory for slot table: 2826240

　　Calculated overhead for partitions and row/slot managers: 1057270

　　Hash-join fanout: 8

　　Number of partitions: 8

　　PS:hash 表數據連一個塊都沒裝滿，Oracle仍然對數據進行了分區，這裡和以前在一些文檔上看到的，當內存不足時才會對數據分區的說法，發生了變化。

　　Number of slots: 23

　　Multiblock IO: 15

　　Block size(KB): 8

　　Cluster (slot) size(KB): 120

　　PS:分區中全部行占有的cluster的size

　　Minimum number of bytes per block: 8160

　　Bit vector memory allocation(KB): 128

　　Per partition bit vector length(KB): 16

　　Maximum possible row length: 270

　　Estimated build size (KB): 0

　　Estimated Build Row Length (includes overhead): 45

　　# Immutable Flags:

　　Not BUFFER(execution) output of the join for PQ

　　Evaluate Left Input Row Vector

　　Evaluate Right Input Row Vector

　　# Mutable Flags:

　　IO sync

　　kxhfSetPhase: phase=BUILD

　　kxhfAddChunk: add chunk 0 (sz=32) to slot table

　　kxhfAddChunk: chunk 0 (lbs=0x2a97825c38, slotTab=0x2a97825e00) successfuly added

　　kxhfSetPhase: phase=PROBE_1

　　qerhjFetch: max build row length (mbl=44)

　　*** RowSrcId: 1 END OF HASH JOIN BUILD (PHASE 1) ***

　　Revised row length: 45

　　Revised build size: 2KB

　　kxhfResize(enter): resize to 12 slots (numAlloc=8, max=23)

　　kxhfResize(exit): resized to 12 slots (numAlloc=8, max=12)

　　Slot table resized: old=23 wanted=12 got=12 unload=0

　　*** RowSrcId: 1 HASH JOIN BUILD HASH TABLE (PHASE 1) ***

　　Total number of partitions: 8

　　Number of partitions which could fit in memory: 8

　　Number of partitions left in memory: 8

　　Total number of slots in in-memory partitions: 8

　　Total number of rows in in-memory partitions: 50

　　(used as preliminary number of buckets in hash table)

　　Estimated max # of build rows that can fit in avail memory: 66960

　　### Partition Distribution ###

　　Partition:0 rows:5 clusters:1 slots:1 kept=1

　　Partition:1 rows:6 clusters:1 slots:1 kept=1

　　Partition:2 rows:4 clusters:1 slots:1 kept=1

　　Partition:3 rows:9 clusters:1 slots:1 kept=1

　　Partition:4 rows:5 clusters:1 slots:1 kept=1

　　Partition:5 rows:9 clusters:1 slots:1 kept=1

　　Partition:6 rows:4 clusters:1 slots:1 kept=1

　　Partition:7 rows:8 clusters:1 slots:1 kept=1

　　PS:每個分區只有不到10行，這裡有一個重要的參數Kept，1在內存中，0在磁盤

　　*** (continued) HASH JOIN BUILD HASH TABLE (PHASE 1) ***

　　PS:hash join的第一階段，但是要觀察更多的階段，需提高trace的level，這裡略過

　　Revised number of hash buckets (after flushing): 50

　　Allocating new hash table.

　　*** (continued) HASH JOIN BUILD HASH TABLE (PHASE 1) ***

　　Requested size of hash table: 16

　　Actual size of hash table: 16

　　Number of buckets: 128

　　Match bit vector allocated: FALSE

　　kxhfResize(enter): resize to 14 slots (numAlloc=8, max=12)

　　kxhfResize(exit): resized to 14 slots (numAlloc=8, max=14)

　　freeze work area size to: 2359K (14 slots)

　　*** (continued) HASH JOIN BUILD HASH TABLE (PHASE 1) ***

　　Total number of rows (may have changed): 50

　　Number of in-memory partitions (may have changed): 8

　　Final number of hash buckets: 128

　　Size (in bytes) of hash table: 1024

　　kxhfIterate(end_iterate): numAlloc=8, maxSlots=14

　　*** (continued) HASH JOIN BUILD HASH TABLE (PHASE 1) ***

　　### Hash table ###

　　# NOTE: The calculated number of rows in non-empty buckets may be smaller

　　# than the true number.

　　Number of buckets with 0 rows: 86

　　Number of buckets with 1 rows: 37

　　Number of buckets with 2 rows: 5

　　Number of buckets with 3 rows: 0

　　PS:桶裡面的行數，最大的桶也只有2行，理論上，桶裡面的行數越少，性能越佳。

　　Number of buckets with 4 rows: 0

　　Number of buckets with 5 rows: 0

　　Number of buckets with 6 rows: 0

　　Number of buckets with 7 rows: 0

　　Number of buckets with 8 rows: 0

　　Number of buckets with 9 rows: 0

　　Number of buckets with between 10 and 19 rows: 0

　　Number of buckets with between 20 and 29 rows: 0

　　Number of buckets with between 30 and 39 rows: 0

　　Number of buckets with between 40 and 49 rows: 0

　　Number of buckets with between 50 and 59 rows: 0

　　Number of buckets with between 60 and 69 rows: 0

　　Number of buckets with between 70 and 79 rows: 0

　　Nmber of buckets with between 80 and 89 rows: 0

　　Number of buckets with between 90 and 99 rows: 0

　　Number of buckets with 100 or more rows: 0

　　### Hash table overall statistics ###

　　Total buckets: 128 Empty buckets: 86 Non-empty buckets: 42

　　PS:創建了128個桶，Oracle 7開始的計算公式

　　Bucket數=0.8*hash_area_size/(hash_multiblock_io_count*db_block_size)

　　但是不准確，估計10g發生了變化。

　　Total number of rows: 50

　　Maximum number of rows in a bucket: 2

　　Average number of rows in non-empty buckets: 1.190476

　　小表500行時候的trace分析

　　Original hash-area size: 3925453

　　Memory for slot table: 2826240

　　。。。

　　Hash-join fanout: 8

　　Number of partitions: 8

　　。。。

　　### Partition Distribution ###

　　Partition:0 rows:52 clusters:1 slots:1 kept=1

　　Partition:1 rows:63 clusters:1 slots:1 kept=1

　　Partition:2 rows:55 clusters:1 slots:1 kept=1

　　Partition:3 rows:74 clusters:1 slots:1 kept=1

　　Partition:4 rows:66 clusters:1 slots:1 kept=1

　　Partition:5 rows:66 clusters:1 slots:1 kept=1

　　Partition:6 rows:54 clusters:1 slots:1 kept=1

　　Partition:7 rows:70 clusters:1 slots:1 kept=1

　　PS：每個partition的行數增加

　　。。。

　　Number of buckets with 0 rows: 622

　　Number of buckets with 1 rows: 319

　　Number of buckets with 2 rows: 71

　　Number of buckets with 3 rows: 10

　　Number of buckets with 4 rows: 2

　　Number of buckets with 5 rows: 0

　　。。。

　　### Hash table overall statistics ###

　　Total buckets: 1024 Empty buckets: 622 Non-empty buckets: 402

　　Total number of rows: 500

　　Maximum number of rows in a bucket: 4

　　Average number of rows in non-empty buckets: 1.243781

　　小表5000行時候的trace分析

　　Original hash-area size: 3809692

　　Memory for slot table: 2826240

　　。。。

　　Hash-join fanout: 8

　　Number of partitions: 8

　　Nuber of slots: 23

　　Multiblock IO: 15

　　Block size(KB): 8

　　Cluster (slot) size(KB): 120

　　Minimum number of bytes per block: 8160

　　Bit vector memory allocation(KB): 128

　　Per partition bit vector length(KB): 16

　　Maximum possible row length: 270

　　Estimated build size (KB): 0

　　。。。

　　### Partition Distribution ###

　　Partition:0 rows:588 clusters:1 slots:1 kept=1

　　Partition:1 rows:638 clusters:1 slots:1 kept=1

　　Partition:2 rows:621 clusters:1 slots:1 kept=1

　　Partiton:3 rows:651 clusters:1 slots:1 kept=1

　　Partition:4 rows:645 clusters:1 slots:1 kept=1

　　Partition:5 rows:611 clusters:1 slots:1 kept=1

　　Partitio:6 rows:590 clusters:1 slots:1 kept=1

　　Partition:7 rows:656 clusters:1 slots:1 kept=1

　　。。。

　　# than the true number.

　　Number of buckets with 0 rows: 4429

　　Number of buckets with 1 rows: 2762

　　Number of buckets with 2 rows: 794

　　Number of buckets with 3 rows: 182

　　Number of buckets with 4 rows: 23

　　Number of buckets with 5 rows: 2

　　Number of buckets with 6 rows: 0

　　。。。

　　### Hash table overall statistics ###

　　Total buckets: 8192 Empty buckets: 4429 Non-empty buckets: 3763

　　Total number of rows: 5000

　　Maximum number of rows in a bucket: 5

　　PS:當小表上升到5000行的時候，bucket的rows最大也不過5行。注意，如果bucket行數過多，遍歷帶來的開銷會帶來性能的嚴重下降。

　　Average number of rows in non-empty buckets: 1.328727

　　結論：

　　Oracle數據庫10g中，內存問題並不是干擾Hash join的首要問題，現今硬件價格越來越便宜，內存2G，8G，64G的環境也很常見。大家在針對hash join調優的過程，更要偏重於partition和bucket的數據分配診斷。

上一頁:詳細講解Oracle數據庫後台進程的具體功能
下一頁:循序漸進講解Oracle數據庫管理員的職責

Oracle數據庫基礎

oracle游標(顯示和隱示)之使用

要求:定義一個游標，使用FOR循環將班級編號為200105

oracle7.3.4 for openserver 507的安裝！

Oracle7.3.4 for openserver 50

用Instant client批量安裝Oracle客戶端

做了一個C#和Oracle 的系統，想到每個客戶端還要裝O

基於Oracle ADF的應用程序開發過程

ADF簡介ADF(Application Developm

Oracle Pro*C 動態SQL技術

Pro*C的常用動態SQL技術一共有3種:用於處理不包含宿

Oracle中時間、數字、字符格式詳解

字符格式Oracle中字符串的比較有兩類1.填補空格的語義

閱讀排行榜

Oracle跟蹤事件的包括那些？全面透析Oracle Archive Log模式 Oracle初學者必知的100個問題 Oracle死鎖的處理 Oracle數據庫操作匯總大全 Oracle數據庫的備份及恢復策略研究 Oracle RMAN快速入門指南--(3) Oracle 的數據庫的數據備份與恢復 Oracle 10R2在Solaris10的安裝過程 Oracle多條件查詢的分頁存儲如何操作？開放源代碼將挑戰甲骨文王國

熱門圖文

Mysql全局ID生成方法， MySQL在Windows下解決中文亂碼怎樣縮小SQL Server數據庫日志文件 BBED的安裝 oracle中從指定日期中獲取月份或者部分數據如何為Access數據庫表添加日期或時間戳怎麼樣恢復MYSQL地ROOT口令調整MYSQL校對規則

欄目導航

關於Oracle數據庫 Oracle數據庫基礎 Oracle教程