Oracle FAQ Your Portal to the Oracle Knowledge Grid

RE: How to prevent single point-of_failure with ONAMES

From: <Rick_Cale_at_teamhealth.com>
Date: Tue, 01 Apr 2003 08:28:51 -0800
Message-ID: <F001.00577249.20030401082851@fatcity.com>

What had happened was that the data in the table onrs_region got corrupted, but the region database was still up. Somehow this caused all 3 name services to shut down on all 3 servers. I do not have a clue how that is possible. There were checkpoint files on all servers, but still no one could connect. Perhaps the corrupted data got replicated to the other servers. The only error was the NNO-50.

Thanks
Rick

"Jesse, Rich" <Rich.Jesse_at_qtiworld.com>
Sent by: root_at_fatcity.com
04/01/2003 10:18 AM
Please respond to ORACLE-L
To: Multiple recipients of list ORACLE-L <ORACLE-L_at_fatcity.com>
cc:
Subject: RE: How to prevent single point-of_failure with ONAMES

Weird. Our primary ONAMES server is 8.0.5 on OpenVMS, with the Region DB on the same box. Our secondary ONAMES server is 9.2.0.2 on WinTuke, using the same Region DB as the primary (amazing that 9iR2 can still use an 8.0.5 Region DB). Yes, we are working on "upgrading" this config because Oracle Corp. no longer supports it, even though it has been extremely stable.

Once upon a time, a controller on the primary ONAMES / Region DB server puked, causing the server to crash. While it was down, all Oracle Net connections eventually failed over to the secondary ONAMES server, which was indeed using the local checkpoint files for names resolution. It worked well; the only downside was that clients had to wait for a TCP/IP timeout on the primary ONAMES server before failing over to the secondary, but at least connections could be made.
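
The failover behavior described above can be sketched roughly as follows. This is only an illustration of "try each names server in order, pay the timeout on a dead one, then move on"; it is not Oracle's actual client code, and the names `resolve_with_failover`, `fake_query`, "primary" and "secondary" are hypothetical stand-ins.

```python
from typing import Callable, Optional, Sequence


def resolve_with_failover(
    name: str,
    servers: Sequence[str],
    query: Callable[[str, str], str],
) -> Optional[str]:
    """Ask each names server in order; return the first answer, else None."""
    for server in servers:
        try:
            return query(server, name)
        except (TimeoutError, ConnectionError):
            # This server did not answer. In real life the client has already
            # waited out the network timeout at this point, then moves on.
            continue
    return None


# Hypothetical stand-in for a names lookup: the primary is down,
# any other server answers.
def fake_query(server: str, name: str) -> str:
    if server == "primary":
        raise TimeoutError("no response from primary")
    return f"{name} resolved by {server}"


result = resolve_with_failover("PROD", ["primary", "secondary"], fake_query)
```

If every server in the list fails, the sketch returns None, which corresponds to the case in this thread where no names server would start and nobody could connect.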

So when your primary failed, the secondary/tertiary failed, too??? You stated that they wouldn't start, as though they hadn't been running. If they hadn't been running, there are probably no local checkpoint files, which I would think would prevent them from starting successfully. Were there any error messages?

Rich

Rich Jesse                        System/Database Administrator
rich.jesse_at_qtiworld.com           Quad/Tech International, Sussex, WI USA


-----Original Message-----
Sent: Tuesday, April 01, 2003 5:09 AM
To: Multiple recipients of list ORACLE-L

Hi All,

Oracle 8.1.7.4 Win 2000 ONAMES with region database

I have the ONAMES repository on server1.
I have 2 additional ONAMES services on server2 and server3 (no database/repository).

I received an NNO-50 error, "serious errors in configuration data". I do not know how it happened, but I had to truncate and reload nmo_master and the 3 onrs_* tables. However, this error caused a single point of failure: none of the ONAMES services on any of the 3 servers would start. I was under the impression that under these circumstances Oracle would use the CHKP* files.

How can I prevent a single point-of-failure other than some type of hot standby?
Can I have 2 repositories? If so where can I get doc to set this up?

Thanks
Rick

--
Please see the official ORACLE-L FAQ: http://www.orafaq.net
--
Author: Jesse, Rich
  INET: Rich.Jesse_at_qtiworld.com

Fat City Network Services    -- 858-538-5051 http://www.fatcity.com
San Diego, California        -- Mailing list and web hosting services
---------------------------------------------------------------------
To REMOVE yourself from this mailing list, send an E-Mail message
to: ListGuru_at_fatcity.com (note EXACT spelling of 'ListGuru') and in
the message BODY, include a line containing: UNSUB ORACLE-L

(or the name of mailing list you want to be removed from). You may
also send the HELP command for other information (like subscribing).
Received on Tue Apr 01 2003 - 10:28:51 CST
