今天是2014-03-19,對oracle TAF技術整理一下學習筆記,記錄如下:
####################################################################################
failover_mode 參數 描述
####################################################################################
backup 指定用於創建備份連接的本地服務名,當使用preconnect預創建連接
的時候應該指明這個參數值
method TAF的配置包含如下兩種failover切換方式
preconnect:創建到切換實例的預連接,提供快速failover的能力
basic:在發生failover的時候創建連接
retries failover發生後嘗試連接的次數,如果指明了delay參數,那麼retries默認為5
type Taf的配置包含如下三種failover的類型:
session:如果用戶連接丟失,新的會話將自動被創建。這種類型的failover不能
嘗試恢復select操作
select:如果用戶連接丟失,新創建的會話將繼續之前失敗之後的select操作
none 這是默認值,不具備failover能力。這個能被明確的指明用於防止failover
的發生。
注意:這些參數只能手動設置,不能在listener.ora文件中SID_LIST_<LISTENER_NAME>條目中設置global_dbname參數,靜態配置的全局數據
庫名稱不能使用TAF功能。另外jdbc thin驅動方式無法使用TAF技術。
實現TAF有兩種方式一種為client-side TAF 另一種為server-side TAF ,下面先介紹第一種client-side TAF:
RAC = (DESCRIPTION = (FAILOVER=ON) (LOAD_BALANCE=ON) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-one)(PORT = 1521)) (ADDRESS = (PROTOCOL = TCP)(HOST = rac-two)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac2) (FAILOVER_MODE= (TYPE=SELECT) (METHOD=BASIC)) ) )
該方式使用了連接時failover、client-side TAF 和客戶端負載均衡,當該客戶端嘗試連接數據庫的時候會在address中隨即挑選一個用於連接數據庫,假如選擇rac-one如果連接失敗,那麼就會使用rac-two進行連接,如果都失敗那麼將提出連接錯誤。當客戶端已經連接到數據庫的時候,突然rac-one實例關閉,那麼該客戶端隨即創建與rac-two的會話連接這個過程報錯select的操作。例外我們可以使用retries和delay參數來指定重新連接次數和延遲重新連接的秒數。如下是重試連接rac-one5次每次120秒。
eg:
RAC = (DESCRIPTION = (FAILOVER=ON) (LOAD_BALANCE=OFF) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-one-vip)(PORT = 1521)) (ADDRESS = (PROTOCOL = TCP)(HOST = rac-two-vip)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac2) (FAILOVER_MODE= (TYPE=SELECT) (METHOD=BASIC) (RETRIES=5) (DELAY=120) ) ) )
另外在failover_mode中的method中有preconnect(預連接),該說明在client-side TAF中分配一個主連接的同時預先分配備用連接。
eg:
RAC1 = (DESCRIPTION = (FAILOVER=ON) (LOAD_BALANCE=OFF) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-one)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac2) (FAILOVER_MODE= (METHOD=PRECONNECT) (BACKUP=RAC2) (TYPE=SELECT) (RETRIES=5) (DELAY=30) ) ) ) RAC2 = (DESCRIPTION = (FAILOVER=ON) (LOAD_BALANCE=OFF) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-two)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac2) (FAILOVER_MODE= (METHOD=PRECONNECT) (BACKUP=RAC1) (TYPE=SELECT) (RETRIES=5) (DELAY=30) ) ) )
驗證client-side TAF:
首先確認/etc/hosts文件如下:
[root@rac-one ~]# more /etc/hosts 127.0.0.1 localhost localhost.localdomain 192.168.2.11 openfiler1 192.168.1.112 rac-two-priv 192.168.1.111 rac-one-priv 192.168.4.111 rac-one rac-one.localdomain 192.168.4.112 rac-two rac-two.localdomain 192.168.4.113 rac-one-vip 192.168.4.114 rac-two-vip [root@rac-one ~]#
查看客戶端tnsname.ora配置:
RAC = (DESCRIPTION = (FAILOVER=ON) (LOAD_BALANCE=ON) (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-two-vip)(PORT = 1521)) (ADDRESS = (PROTOCOL = TCP)(HOST = rac-one-vip)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac) (FAILEOVER_MODE= (METHOD=BASIC) (TYPE=SELECT) (RETRIES=5) (DELAY=60)) ) )
用戶連接數據庫,然後查看數據庫會話信息,(注意還需要修改windows的hosts名稱否則無法識別rac-two-vip或是rac-one-vip):
rac-one節點:
SQL> r 1 select inst_id,username,failover_type,failover_method,failed_over from gv$session where username in ('SYSTEM','SYS') 2* INST_ID USERNAME FAILOVER_TYPE FAILOVER_M FAI ---------- ------------------------------ ------------- ---------- --- 2 SYS NONE NONE NO 2 SYS NONE NONE NO 2 SYS NONE NONE NO 2 SYS NONE NONE NO 2 SYS NONE NONE NO 2 SYS NONE NONE NO 1 SYS NONE NONE NO 1 SYS NONE NONE NO 1 SYS NONE NONE NO 1 SYSTEM SELECT BASIC NO 10 rows selected. SQL>
可知目前system用戶已經具備failover功能。
注意:在配置client-side TAF的時候尤其注意參數的設置位置,否則無法實現failover。
其實在11G中scan功能也實現了負載均衡的作用,它是從dns解析中的三個地址輪詢負載的分配給scan listener進而采去和本地listener進行通信。
另外實現TAF的方式為server-side TAF。說白了,就是通過服務端設置service來實現,先比client-side TAF有很多簡便的方式。
eg:
增加服務名rac1和rac2:
oracle@rac-two ~]$ srvctl add service -d Rac -s rac1 -r Rac1 -a Rac2 -P basic -y automatic -e select -m basic -z 5 -w 120 [oracle@rac-two ~]$ srvctl add service -d Rac -s rac2 -r Rac2 -a Rac1 -P basic -y automatic -e select -m basic -z 5 -w 120
查看服務名狀態
[oracle@rac-two ~]$ srvctl status service -d RAc Service rac1 is not running. Service rac2 is not running.
啟動服務資源:
[oracle@rac-two ~]$ srvctl start service -d Rac [oracle@rac-two ~]$ srvctl status service -d Rac Service rac1 is running on instance(s) Rac1 Service rac2 is running on instance(s) Rac2
查看配置信息:
[oracle@rac-two ~]$ srvctl config service -d Rac Service name: rac1 Service is enabled Server pool: Rac_rac1 Cardinality: 1 Disconnect: false Service role: PRIMARY Management policy: AUTOMATIC DTP transaction: false AQ HA notifications: false Failover type: SELECT Failover method: BASIC TAF failover retries: 5 TAF failover delay: 120 Connection Load Balancing Goal: LONG Runtime Load Balancing Goal: NONE TAF policy specification: BASIC Edition: Preferred instances: Rac1 Available instances: Rac2 Service name: rac2 Service is enabled Server pool: Rac_rac2 Cardinality: 1 Disconnect: false Service role: PRIMARY Management policy: AUTOMATIC DTP transaction: false AQ HA notifications: false Failover type: SELECT Failover method: BASIC TAF failover retries: 5 TAF failover delay: 120 Connection Load Balancing Goal: LONG Runtime Load Balancing Goal: NONE TAF policy specification: BASIC Edition: Preferred instances: Rac2 Available instances: Rac1 [oracle@rac-two ~]$
注意這個時候,實例Rac1已經注冊了rac1服務,且主要實例為Rac1備用實例為Rac2,實例Rac2注冊了rac2服務,且主要實例為Rac2備用實例為Rac1;
本地監聽只會注冊本地服務名,scan監聽將注冊所有的監聽服務名。
驗證:
首先明確客戶端配置:
RAC = (DESCRIPTION = (ADDRESS_LIST = (ADDRESS = (PROTOCOL = TCP)(HOST = rac-two-cluster-scan.grid.example.com)(PORT = 1521)) ) (CONNECT_DATA = (SERVICE_NAME = Rac2) ) )
登錄數據庫查看會話信息如下:
會話一使用system用戶登錄數據庫:
用戶沒有登錄之前狀態:
[oracle@rac-one ~]$ sqlplus / as sysdba SQL*Plus: Release 11.2.0.4.0 Production on Wed Mar 19 20:59:56 2014 Copyright (c) 1982, 2013, Oracle. All rights reserved. Connected to: Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production With the Partitioning, Real Application Clusters, Automatic Storage Management, OLAP, Data Mining and Real Application Testing options SQL> col username for a20 SQL> set linesize 200 SQL> select inst_id,username,failover_type,failover_method,failed_over from gv$session where username='SYSTEM'; no rows selected
登錄之後狀態:
SQL> r 1* select inst_id,username,failover_type,failover_method,failed_over from gv$session where username='SYSTEM' INST_ID USERNAME FAILOVER_TYPE FAILOVER_M FAI ---------- -------------------- ------------- ---------- --- 2 SYSTEM SELECT BASIC NO 2 SYSTEM SELECT BASIC NO SQL>
這個時候關閉該節點,且在客戶端執行select * from dba_objects;語句,
查看節點二 用戶會話狀態。
SQL> r 1* select inst_id,username,failover_type,failover_method,failed_over from gv$session where username='SYSTEM' INST_ID USERNAME FAILOVER_TYPE FAILOVER_M FAI ---------- -------------------- ------------- ---------- --- 1 SYSTEM SELECT BASIC YES SQL>
可以看到用戶只在幾秒中停頓後繼續完成了select操作,且failed_over狀態為yes,證明failover已經生效。
查看狀態信息如下:
c[grid@rac-two ~]$ crsctl status res -t -------------------------------------------------------------------------------- NAME TARGET STATE SERVER STATE_DETAILS -------------------------------------------------------------------------------- Local Resources -------------------------------------------------------------------------------- ora.DATADG.dg ONLINE ONLINE rac-two ora.GIDG.dg ONLINE ONLINE rac-two ora.LISTENER.lsnr ONLINE ONLINE rac-two ora.asm ONLINE ONLINE rac-two Started ora.gsd OFFLINE OFFLINE rac-two ora.net1.network ONLINE ONLINE rac-two ora.ons ONLINE ONLINE rac-two ora.registry.acfs ONLINE ONLINE rac-two -------------------------------------------------------------------------------- Cluster Resources -------------------------------------------------------------------------------- ora.LISTENER_SCAN1.lsnr 1 ONLINE ONLINE rac-two ora.LISTENER_SCAN2.lsnr 1 ONLINE ONLINE rac-two ora.LISTENER_SCAN3.lsnr 1 ONLINE ONLINE rac-two ora.cvu 1 ONLINE ONLINE rac-two ora.oc4j 1 ONLINE ONLINE rac-two ora.rac-one.vip 1 ONLINE INTERMEDIATE rac-two FAILED OVER ora.rac-two.vip 1 ONLINE ONLINE rac-two ora.rac.db 1 ONLINE ONLINE rac-two Open 2 ONLINE OFFLINE Instance Shutdown ora.rac.rac1.svc 1 ONLINE ONLINE rac-two ora.rac.rac2.svc 1 ONLINE ONLINE rac-two ora.scan1.vip 1 ONLINE ONLINE rac-two ora.scan2.vip 1 ONLINE ONLINE rac-two ora.scan3.vip 1 ONLINE ONLINE rac-two [grid@rac-two ~]$
可知目前ora.ora-one.vip已經failed over,且ora.rac.rac1.svc運行到了rac-two中。
重啟節點後查看資源如下:
[grid@rac-one ~]$ crsctl status res -t -------------------------------------------------------------------------------- NAME TARGET STATE SERVER STATE_DETAILS -------------------------------------------------------------------------------- Local Resources -------------------------------------------------------------------------------- ora.DATADG.dg ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.GIDG.dg ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.LISTENER.lsnr ONLINE OFFLINE rac-one ONLINE ONLINE rac-two ora.asm ONLINE ONLINE rac-one Started ONLINE ONLINE rac-two Started ora.gsd OFFLINE OFFLINE rac-one OFFLINE OFFLINE rac-two ora.net1.network ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.ons ONLINE OFFLINE rac-one STARTING ONLINE ONLINE rac-two ora.registry.acfs ONLINE ONLINE rac-one ONLINE ONLINE rac-two -------------------------------------------------------------------------------- Cluster Resources -------------------------------------------------------------------------------- ora.LISTENER_SCAN1.lsnr 1 ONLINE ONLINE rac-two STOPPING ora.LISTENER_SCAN2.lsnr 1 ONLINE ONLINE rac-two ora.LISTENER_SCAN3.lsnr 1 ONLINE ONLINE rac-two ora.cvu 1 ONLINE ONLINE rac-two ora.oc4j 1 ONLINE ONLINE rac-two ora.rac-one.vip 1 ONLINE INTERMEDIATE rac-two FAILED OVER,STOPPING ora.rac-two.vip 1 ONLINE ONLINE rac-two ora.rac.db 1 ONLINE ONLINE rac-two Open 2 ONLINE OFFLINE Instance Shutdown ora.rac.rac1.svc 1 ONLINE ONLINE rac-two ora.rac.rac2.svc 1 ONLINE ONLINE rac-two ora.scan1.vip 1 ONLINE ONLINE rac-two ora.scan2.vip 1 ONLINE ONLINE rac-two ora.scan3.vip 1 ONLINE ONLINE rac-two [grid@rac-one ~]$ [grid@rac-one ~]$ crsctl status res -t -------------------------------------------------------------------------------- NAME TARGET STATE SERVER STATE_DETAILS -------------------------------------------------------------------------------- Local Resources -------------------------------------------------------------------------------- ora.DATADG.dg ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.GIDG.dg ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.LISTENER.lsnr ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.asm ONLINE ONLINE rac-one Started ONLINE ONLINE rac-two Started ora.gsd OFFLINE OFFLINE rac-one OFFLINE OFFLINE rac-two ora.net1.network ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.ons ONLINE ONLINE rac-one ONLINE ONLINE rac-two ora.registry.acfs ONLINE ONLINE rac-one ONLINE ONLINE rac-two -------------------------------------------------------------------------------- Cluster Resources -------------------------------------------------------------------------------- ora.LISTENER_SCAN1.lsnr 1 ONLINE ONLINE rac-one ora.LISTENER_SCAN2.lsnr 1 ONLINE ONLINE rac-two ora.LISTENER_SCAN3.lsnr 1 ONLINE ONLINE rac-two ora.cvu 1 ONLINE ONLINE rac-two ora.oc4j 1 ONLINE ONLINE rac-two ora.rac-one.vip 1 ONLINE ONLINE rac-one ora.rac-two.vip 1 ONLINE ONLINE rac-two ora.rac.db 1 ONLINE ONLINE rac-two Open 2 ONLINE ONLINE rac-one Open ora.rac.rac1.svc 1 ONLINE ONLINE rac-two ora.rac.rac2.svc 1 ONLINE ONLINE rac-two ora.scan1.vip 1 ONLINE ONLINE rac-one ora.scan2.vip 1 ONLINE ONLINE rac-two ora.scan3.vip 1 ONLINE ONLINE rac-two [grid@rac-one ~]$
如果當Rac1失效(節點關閉),那麼select將再次移動到Rac2上來
SQL> r 1* select inst_id,username,failover_type,failover_method,failed_over from gv$session where username='SYSTEM' INST_ID USERNAME FAILOVER_TYPE FAILOVER_M FAI ---------- ------------------------------ ------------- ---------- --- 2 SYSTEM SELECT BASIC YES
到了11G R2 使用scan和server-side TAF是最佳選擇。
That’s all!!!!!
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++Rhys↖(^ω^)↗Amy+++++++++++++++++++++++++++