Mirrored you say? Can't you just replace the primary and let the mirror =
re-image it?
j.
On May 27, 2012, at 12:38 PM, ANDY LENNARD wrote:
> Dear All,=20
>=20
> It's a long time since I've posted here as I haven't worked with =
Informix for=20
> a while. However, I'm 'last man standing' and there's a problem at a =
customer=20
> site.=20
>=20
> The system has been installed and working for many years, but now the =
primary=20
> drive of their mirrored pair is beginning to show soft errors and =
could do=20
> with replacing. Regrettably, I've forgotten how to and I don't have =
the luxury=20
> of a development system to practice on. Nightmare scenario...=20
>=20
> This is how the server is configured on Solaris 2.6:=20
>=20
> server# onstat -d=20
>=20
> Informix Dynamic Server Version 7.31.UC6 -- On-Line -- Up 20 days =
19:43:44 --=20
> 189696 Kbytes=20
>=20
> Dbspaces=20
> address number flags fchunk nchunks flags owner name=20
> 1104a150 1 2 1 2 M informix rootdbs=20
> 1 active, 2047 maximum=20
>=20
> Chunks=20
> address chk/dbs offset size free bpages flags pathname=20
> 1104a210 1 1 0 1026000 457502 PO- /INFORMIX_DBSPACE_CHUNK0=20
> 1104a2f0 1 1 0 1026000 0 MO- /.INFORMIX_INSTANCE_0.m0=20
> 11073798 2 1 0 852120 105896 PO- /INFORMIX_DBSPACE_CHUNK1=20
> 11073878 2 1 0 852120 0 MO- /.INFORMIX_INSTANCE_0.m1=20
> 2 active, 2047 maximum=20
>=20
> server# ls -ld /INFORMIX_DBSPACE_CHUNK*=20
> lrwxrwxrwx 1 root other 18 May 29 2010 /INFORMIX_DBSPACE_CHUNK0 ->=20
> /dev/rdsk/c0t2d0s3=20
> lrwxrwxrwx 1 root other 18 May 29 2010 /INFORMIX_DBSPACE_CHUNK1 ->=20
> /dev/rdsk/c0t2d0s4=20
> server# ls -ld /.INFORMIX_INSTANCE_0.m*=20
> lrwxrwxrwx 1 root other 18 May 29 2010 /.INFORMIX_INSTANCE_0.m0 ->=20
> /dev/rdsk/c0t1d0s3=20
> lrwxrwxrwx 1 root other 18 May 29 2010 /.INFORMIX_INSTANCE_0.m1 ->=20
> /dev/rdsk/c0t1d0s4=20
>=20
> server# prtvtoc /dev/rdsk/c0t1d0s2=20
> * /dev/rdsk/c0t1d0s2 partition map=20
> *=20
> * Dimensions:=20
> * 512 bytes/sector=20
> * 135 sectors/track=20
> * 16 tracks/cylinder=20
> * 2160 sectors/cylinder=20
> * 3882 cylinders=20
> * 3880 accessible cylinders=20
> *=20
> * Flags:=20
> * 1: unmountable=20
> * 10: read-only=20
> *=20
> * First Sector Last=20
> * Partition Tag Flags Sector Count Sector Mount Directory=20
>=20
> 0 0 00 0 4320 4319=20
>=20
> 1 3 01 4320 864000 868319=20
>=20
> 2 5 01 0 8380800 8380799=20
>=20
> 3 0 00 868320 4104000 4972319=20
>=20
> 4 0 00 4972320 3408480 8380799=20
>=20
> server# prtvtoc /dev/rdsk/c0t2d0s2=20
> * /dev/rdsk/c0t2d0s2 partition map=20
> *=20
> * Dimensions:=20
> * 512 bytes/sector=20
> * 135 sectors/track=20
> * 16 tracks/cylinder=20
> * 2160 sectors/cylinder=20
> * 3882 cylinders=20
> * 3880 accessible cylinders=20
> *=20
> * Flags:=20
> * 1: unmountable=20
> * 10: read-only=20
> *=20
> * First Sector Last=20
> * Partition Tag Flags Sector Count Sector Mount Directory=20
>=20
> 0 0 00 0 4320 4319=20
>=20
> 1 3 01 4320 864000 868319=20
>=20
> 2 5 01 0 8380800 8380799=20
>=20
> 3 0 00 868320 4104000 4972319=20
>=20
> 4 0 00 4972320 3408480 8380799=20
>=20
> server# iostat -En=20
>=20
> <..snip..>=20
>=20
> c0t1d0 Soft Errors: 230 Hard Errors: 0 Transport Errors: 0=20
> Vendor: IBM Product: DDRS34560SUN4.2G Revision: S98E Serial No: =
9848310333=20
> RPM: 5400 Heads: 16 Size: 4.29GB <4292075520 bytes>=20
> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 230=20
> Illegal Request: 0 Predictive Failure Analysis: 0=20
>=20
> c0t2d0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0=20
> Vendor: IBM Product: DDRS34560SUN4.2G Revision: S98E Serial No: =
99285G2252=20
> RPM: 5400 Heads: 16 Size: 4.29GB <4292075520 bytes>=20
> Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0=20
> Illegal Request: 0 Predictive Failure Analysis: 0=20
>=20
> server# cat onconfig.INSTANCE_0=20
> =
#*************************************************************************=
*=20
> #=20
> # INFORMIX SOFTWARE, INC.=20
> #=20
> # Title: onconfig.ows=20
> # Description: INFORMIX-OnLine Configuration Parameters=20
> #=20
> =
#*************************************************************************=
*=20
> # Root Dbspace Configuration=20
> ROOTNAME rootdbs # Root dbspace name=20
> ROOTPATH /INFORMIX_DBSPACE_CHUNK0=20
> ROOTOFFSET 0 # Offset of root dbspace into device (Kbytes)=20
> ROOTSIZE 2052000 # Size of root dbspace (Kbytes)=20
> # Disk Mirroring Configuration Parameters=20
> MIRROR 1 # Mirroring flag (Yes =3D 1, No =3D 0)=20
> MIRRORPATH /.INFORMIX_INSTANCE_0.m0=20
> MIRROROFFSET 0 # Offset into mirrored device (Kbytes)=20
> # Physical Log Configuration=20
> PHYSDBS rootdbs # Location (dbspace) of physical log=20
> PHYSFILE 10000 # Physical log file size (Kbytes)=20
> # Logical Log Configuration=20
> LOGFILES 20 # Number of logical log files=20
> LOGSIZE 2000 # Logical log size (Kbytes)=20
> # Diagnostics=20
> MSGPATH /usr/informix/etc/online.log # System message log file path=20
> CONSOLE /dev/console # System console message path=20
> ALARMPROGRAM /tanit/tcl/log_warn/log_full.sh # Alarm program path=20
> # System Archive Tape Device=20
> TAPEDEV /dev/rmt/0=20
> TAPEBLK 128 # Tape block size (Kbytes)=20
> TAPESIZE 4000000 # Maximum amount of data to put on tape (Kbytes)=20
> # Log Archive Tape Device=20
> LTAPEDEV /dev/rmt/1=20
> LTAPEBLK 128 # Log tape block size (Kbytes)=20
> LTAPESIZE 4000000 # Max amount of data to put on log tape (Kbytes)=20
> # Optical=20
> STAGEBLOB # INFORMIX-OnLine/Optical staging area=20
> # System Configuration=20
> SERVERNUM 0 # Unique id corresponding to a OnLine instance=20
> DBSERVERNAME INSTANCE_0 # Name of default database server=20
> DBSERVERALIASES INSTANCE_0_SHM,INSTANCE_0_TCP # List of alternate=20
> dbservernames=20
> NETTYPE ipcstr,1,200,NET=20
> NETTYPE ipcshm,1,200,CPU=20
> NETTYPE tlitcp,1,100,NET=20
> # increase tcp connections from 5 to 50. AL 1-mar-00=20
> # read tasks failed to start correctly, error -25553 in database =
selection=20
> # AL 23-nov-01=20
> # increase again: str 100 -> 200=20
> # shm 100 -> 200=20
> # tcp 50 -> 100=20
> # as a result of error -25560 from read tasks=20
> DEADLOCK_TIMEOUT 60 # Max time to wait of lock in distributed env.=20
> # AL 15-oct-00 change RESIDENT from 0 to 1=20
> RESIDENT 1 # Forced residency flag (Yes =3D 1, No =3D 0)=20
> MULTIPROCESSOR 0 # 0 for single-processor, 1 for multi-processor=20
> NUMCPUVPS 1 # Number of user (cpu) vps=20
> SINGLE_CPU_VP 1 # If non-zero, limit number of cpu vps to one=20
>=20
> # AL 30-aug-2000=20
> # disable process ageing=20
> NOAGE 1 # Process aging=20
> AFF_SPROC 0 # Affinity start processor=20
> AFF_NPROCS 0 # Affinity number of processors=20
> # Shared Memory Parameters=20
> LOCKS 2000 # Maximum number of locks=20
> #BUFFERS 1000 # Maximum number of shared buffers=20
> # buffers increased from 1000 to 10000. al 22/12/98=20
> # buffers increased to 20000. al 28/07/00=20
> # buffers increased to 30000. al 17/4/02=20
> # buffers increased to 40000. al 23/4/02=20
> # buffers increased to 50000. al 16/10/02=20
> BUFFERS 50000 # Maximum number of shared buffers=20
> NUMAIOVPS # Number of IO vps=20
> PHYSBUFF 32 # Physical log buffer size (Kbytes)=20
> LOGBUFF 32 # Logical log buffer size (Kbytes)=20
> LOGSMAX 20 # Maximum number of logical log files=20
> #CLEANERS 1 # Number of buffer cleaner processes=20
> # cleaners incread in line with LRUS. AL 16/10/02=20
> CLEANERS 63 # Number of buffer cleaner processes=20
> SHMBASE 0xa000000 # Shared memory base address=20
> # SHMVIRTSIZE increased from 8000 to 37000, 5 jun 2000 by AL=20
> # from 37000 to 75000, 16 Jun 2000 by AL=20
> SHMVIRTSIZE 75000 # initial virtual shared memory segment size=20
> SHMADD 8192 # Size of new shared memory segments (Kbytes)=20
> SHMTOTAL 0 # Total shared memory (Kbytes). 0=3D>unlimited=20
> CKPTINTVL 300 # Check point interval (in sec)=20
> #LRUS 8 # Number of LRU queues=20
> # increase LRUs from 8 to 32. al 22/12/98=20
> # increase LRUs to 64. al 28/07/00=20
> # decrease LRUs to 63. suspicion that even numbers are bad... al 19 =
apr 02=20
> LRUS 63 # Number of LRU queues=20
> #LRU_MAX_DIRTY 60 # LRU percent dirty begin cleaning limit=20
> #LRU_MIN_DIRTY 50 # LRU percent dirty end cleaning limit=20
> # reduce max/min to 20/10. al 16/10/02=20
> LRU_MAX_DIRTY 20 # LRU percent dirty begin cleaning limit=20
> LRU_MIN_DIRTY 10 # LRU percent dirty end cleaning limit=20
> LTXHWM 50 # Long transaction high water mark percentage=20
> LTXEHWM 60 # Long transaction high water mark (exclusive)=20
> TXTIMEOUT 0x12c # Transaction timeout (in sec)=20
> STACKSIZE 32 # Stack size (Kbytes)=20
> # System Page Size=20
> # BUFFSIZE - OnLine no longer supports this configuration parameter.=20=
> # To determine the page size used by OnLine on your platform=20
> # see the last line of output from the command, 'onstat -b'.=20
> # Recovery Variables=20
> # OFF_RECVRY_THREADS:=20
> # Number of parallel worker threads during fast recovery or an offline=20=
> restore.=20
> # ON_RECVRY_THREADS:=20
> # Number of parallel worker threads during an online restore.=20
> OFF_RECVRY_THREADS 10 # Default number of offline worker threads=20
> ON_RECVRY_THREADS 1 # Default number of online worker threads=20
> # Data Replication Variables=20
> # DRAUTO: 0 manual, 1 retain type, 2 reverse type=20
> DRAUTO 0 # DR automatic switchover=20
> DRINTERVAL 30 # DR max time between DR buffer flushes (in sec)=20
> DRTIMEOUT 30 # DR network timeout (in sec)=20
> DRLOSTFOUND /usr/informix/etc/dr.lostfound # DR lost+found file path=20=
> # Read Ahead Variables=20
> RA_PAGES 10 # Number of pages to attempt to read ahead=20
> #RA_THRESHOLD 4 # Number of pages left before next group=20
> # AL 22 may 02 increase threshold from 4 to 8 to try to get bufwaits =
down=20
> # read ahead util is around 100%, so just try to get the=20
> # pages a bit sooner=20
> RA_THRESHOLD 8 # Number of pages left before next group=20
> # DBSPACETEMP:=20
> # OnLine equivalent of DBTEMP for SE. This is the list of dbspaces=20
> # that the OnLine SQL Engine will use to create temp tables etc.=20
> # If specified it must be a colon separated list of dbspaces that =
exist=20
> # when the OnLine system is brought online. If not specified, or if=20
> # all dbspaces specified are invalid, various ad hoc queries will =
create=20
> # temporary files in /tmp instead.=20
> DBSPACETEMP # Default temp dbspaces=20
> # DUMP*:=20
> # The following parameters control the type of diagnostics information =
which=20
> # is preserved when an unanticipated error condition (assertion =
failure)=20
> occurs=20
> # during OnLine operations.=20
> # For DUMPSHMEM, DUMPGCORE and DUMPCORE 1 means Yes, 0 means No.=20
> DUMPDIR /tmp # Preserve diagnostics in this directory=20
> DUMPSHMEM 1 # Dump a copy of shared memory=20
> DUMPGCORE 0 # Dump a core image using 'gcore'=20
> DUMPCORE 0 # Dump a core image (Warning:this aborts OnLine)=20
> DUMPCNT 1 # Number of shared memory or gcore dumps for=20
>=20
> # a single user's session=20
> FILLFACTOR 90 # Fill factor for building indexes=20
> # method for OnLine to use when determining current time=20
> USEOSTIME 0 # 0: use internal time(fast), 1: get time from OS(slow)=20
> DS_MAX_QUERIES # Maximum number of decision support queries=20
> DS_TOTAL_MEMORY # Decision support memory (Kbytes)=20
> DS_MAX_SCANS 1048576 # Maximum number of decision support scans=20
> DATASKIP off # List of dbspaces to skip=20
> # OPTCOMPIND=20
> # 0 =3D> Nested loop joins will be preferred (where=20
> # possible) over sortmerge joins and hash joins.=20
> # 1 =3D> If the transaction isolation mode is not=20
> # "repeatable read", optimizer behaves as in (2)=20
> # below. Otherwise it behaves as in (0) above.=20
> # 2 =3D> Use costs regardless of the transaction isolation=20
> # mode. Nested loop joins are not necessarily=20
> # preferred. Optimizer bases its decision purely=20
> # on costs.=20
> OPTCOMPIND 2 # To hint the optimizer=20
> ONDBSPACEDOWN 2 # Dbspace down option: 0 =3D CONTINUE, 1 =3D ABORT, 2 =
=3D WAIT=20
> # AL 15-oct-00 change LBU_PRESERVE from 0 to 1=20
> LBU_PRESERVE 1 # Preserve last log for log backup=20
> OPCACHEMAX 0 # Maximum optical cache size (Kbytes)=20
>=20
> # HETERO_COMMIT (Gateway participation in distributed transactions)=20
> # 1 =3D> Heterogeneous Commit is enabled=20
> # 0 (or any other value) =3D> Heterogeneous Commit is disabled=20
> HETERO_COMMIT 0=20
> #=20
> # al 22/12/98=20
> DD_HASHMAX 50=20
> DD_HASHSIZE 53=20
> #=20
> # al 23-feb-2000=20
> # added from onconfig.std on upgrade to v7.31.uc4=20
> # Optimization goal: -1 =3D ALL_ROWS(Default), 0 =3D FIRST_ROWS=20
> OPT_GOAL -1=20
>=20
> # Optimizer DIRECTIVES ON (1/Default) or OFF (0)=20
> DIRECTIVES 1=20
> # Status of restartable restore=20
> RESTARTABLE_RESTORE off=20
> SYSALARMPROGRAM /usr/informix/etc/evidence.sh # System Alarm program =
path=20
> TBLSPACE_STATS 1=20
> CDR_LOGBUFFERS 2048 # size of log reading buffer pool (Kbytes)=20
> CDR_EVALTHREADS 1,2 # evaluator threads (per-cpu-vp,additional)=20
> CDR_DSLOCKWAIT 5 # DS lockwait timeout (seconds)=20
> CDR_QUEUEMEM 4096 # Maximum amount of memory for any CDR queue =
(Kbytes)=20
> CDR_LOGDELTA 30 # % of log space allowed in queue memory=20
> CDR_NUMCONNECT 16 # Expected connections per server=20
> CDR_NIFRETRY 300 # Connection retry (seconds)=20
> CDR_NIFCOMPRESS 0 # Link level compression (-1 never, 0 none, 9 max)=20=
> BAR_ACT_LOG /tmp/bar_act.log=20
> BAR_MAX_BACKUP 4=20
> BAR_RETRY 1=20
> BAR_NB_XPORT_COUNT 10=20
> BAR_XFER_BUF_SIZE 31=20
> ISM_DATA_POOL ISMData # If the data pool name is changed, be sure to=20=
> ISM_LOG_POOL ISMLogs=20
> MAX_PDQPRIORITY 100 # Maximum allowed pdqpriority=20
>=20
> There is absolutely no prospect of upgrading or anything like that and =
in any=20
> case the system is hopefully going to get pensioned off soon.=20
>=20
> What I'd greatly appreciate is the most reliable way to replace that =
failing=20
> drive, given that it is the primary of the mirrored pair. The server =
can be=20
> taken down to replace the drive. If anyone can give me the process & =
tell me=20
> the commands I'll be very grateful.=20
>=20
> Many, many thanks,=20
> Andy.=20
>=20
>=20
> =
**************************************************************************=
*****=20
> Forum Note: Use "Reply" to post a response in the discussion forum.=20=
>=20