ipcs、ipcrm、sysresv、kernel.shmmax
1.1 BLOG文档结构图
1.2 前言部分
1.2.1 导读和注意事项
各位技术爱好者,看完本文后,你可以掌握如下的技能,也可以学到一些其它你所不知道的知识,~O(∩_∩)O~:
① ipcs的使用
② ipcrm释放oracle内存段
③ sysresv的使用
④ 内核参数kernel.shmmax
⑤ 如何快速的清理Oracle的进程
⑥ 其它维护操作
Tips:
① 本文在itpub(http://blog.itpub.net/26736162)、博客园(http://www.cnblogs.com/lhrbest)和微信公众号(xiaomaimiaolhr)上有同步更新。
② 文章中用到的所有代码、相关软件、相关资料及本文的pdf版本都请前往小麦苗的云盘下载,小麦苗的云盘地址见:http://blog.itpub.net/26736162/viewspace-1624453/。
③ 若网页文章代码格式有错乱,请下载pdf格式的文档来阅读。
④ 在本篇BLOG中,代码输出部分一般放在一行一列的表格中。
⑤ 本文适合于初中级人员阅读,数据库大师请略过本文。
⑥ 不喜勿喷。
本文若有错误或不完善的地方请大家多多指正,您的批评指正是我写作的最大动力。
1.3 本文简介
最近有朋友因为kernel.shmmax内核参数的问题导致数据库不能启动。小麦苗之前碰到过一次,只是没有记录下来,而且以前安装数据库的时候也没有详细介绍这几个参数的含义,趁这次机会就把这个参数在详细介绍一下吧。
1.4 相关文章链接
@H_403_447@① @H_403_447@【故障解决】IPCS和IPCRM使用@H_403_447@:http://blog.itpub.net/26736162/viewspace-2112518
@H_403_447@② @H_403_447@ORACLE内核参数@H_403_447@:http://blog.itpub.net/26736162/viewspace-2112447/
@H_403_447@③ @H_403_447@sysresv@H_403_447@:http://blog.itpub.net/26736162/viewspace-2112443/
@H_403_447@④ @H_403_447@视频讲解@H_403_447@IPCS和IPCRM使用@H_403_447@:http://www.iqiyi.com/w_19rs33qqsp.html
@H_403_447@⑤ @H_403_447@有关“TNS-12518: TNS:listener could not hand off client connection”的更多内容请参考:【故障|监听】TNS-12518、TNS-00517和 Linux Error:32:Broken pipe@H_403_447@:http://blog.itpub.net/26736162/viewspace-2135468/
第二章ipcs/ipcrm命令
更多内容请参考:http://blog.itpub.net/26736162/viewspace-2112518
unix/linux下的共享内存、信号量、队列信息管理
在Unix或Linux下,经常有因为共享内存、信号量,队列等共享信息没有干净地清除而引起一些问题。
查看共享内存的命令是:ipcs [-m|-s|-q]。若ipcs命令不带参数,则默认会列出共享内存、信号量,队列信息,而-m列出共享内存,-s列出共享信号量,-q列出共享队列。
清除命令是:ipcrm [-m|-s|-q] id,其中,-m删除共享内存,-s删除共享信号量,-q删除共享队列。
@H_403_447@[oracle@rhel6lhr ~]$ ipcs -h @H_403_447@ipcs provides information on ipc facilities for which you have read access. @H_403_447@Resource Specification: @H_403_447@ -m : shared_mem @H_403_447@ -q : messages @H_403_447@ -s : semaphores @H_403_447@ -a : all (default) @H_403_447@Output Format: @H_403_447@ -t : time @H_403_447@ -p : pid @H_403_447@ -c : creator @H_403_447@ -l : limits @H_403_447@ -u : summary @H_403_447@-i id [-s -q -m] : details on resource identified by id @H_403_447@usage : ipcs -asmq -tclup @H_403_447@ ipcs [-s -m -q] -i id @H_403_447@ ipcs -h for help. |
2.1 ipcs
1. 命令格式
ipcs [resource-option] [output-format]
ipcs [resource-option] -i id
2. 命令功能
提供IPC设备的信息
3. 使用方法
resource选项:
ipcs -m 查看系统共享内存信息
ipcs -q 查看系统消息队列信息
ipcs -s 查看系统信号量信息
ipcs [-a] 系统默认输出信息,显示系统内所有的IPC信息
@H_403_447@[martin@localhost data]$ ipcs -a @H_403_447@ @H_403_447@------ Message Queues -------- @H_403_447@key msqid owner perms used-bytes messages @H_403_447@ @H_403_447@------ Shared Memory Segments -------- @H_403_447@key shmid owner perms bytes nattch status @H_403_447@0x00000000 229376 martin 600 4194304 2 dest @H_403_447@0x00000000 196609 martin 600 524288 2 dest @H_403_447@0x00000000 327682 martin 600 393216 2 dest @H_403_447@0x00000000 491525 martin 600 2097152 2 dest @H_403_447@ @H_403_447@------ Semaphore Arrays -------- @H_403_447@key semid owner perms nsems @H_403_447@ |
输出格式控制:
ipcs -c 查看IPC的创建者和所有者
ipcs -l 查看IPC资源的限制信息
ipcs -p 查看IPC资源的创建者和使用的进程ID
ipcs -t 查看最新调用IPC资源的详细时间
ipcs -u 查看IPC资源状态汇总信息
@H_403_447@[martin@localhost data]$ ipcs -u --human @H_403_447@ @H_403_447@------ Messages Status -------- @H_403_447@allocated queues = 0 @H_403_447@used headers = 0 @H_403_447@used space = 0B @H_403_447@ @H_403_447@------ Shared Memory Status -------- @H_403_447@segments allocated 4 @H_403_447@pages allocated 1760 @H_403_447@pages resident 339 @H_403_447@pages swapped 0 @H_403_447@Swap performance: 0 attempts 0 successes @H_403_447@ @H_403_447@------ Semaphore Status -------- @H_403_447@used arrays = 0 @H_403_447@allocated semaphores = 0 |
额外格式控制:
ipcs -l --human
以人类可以阅读的方式显示size
@H_403_447@[martin@localhost data]$ ipcs -l --human @H_403_447@ @H_403_447@------ Messages Limits -------- @H_403_447@max queues system wide = 3644 @H_403_447@max size of message = 8K @H_403_447@default max size of queue = 16K @H_403_447@ @H_403_447@------ Shared Memory Limits -------- @H_403_447@max number of segments = 4096 @H_403_447@max seg size = 16E @H_403_447@max total shared memory = 16E @H_403_447@min seg size = 1B @H_403_447@ @H_403_447@------ Semaphore Limits -------- @H_403_447@max number of arrays = 128 @H_403_447@max semaphores per array = 250 @H_403_447@max semaphores system wide = 32000 @H_403_447@max ops per semop call = 32 @H_403_447@semaphore max value = 3276 @H_403_447@ |
@H_403_447@[oracle@rhel6lhr ~]$ ipcs -l @H_403_447@ @H_403_447@------ Shared Memory Limits -------- @H_403_447@max number of segments = 4096 @H_403_447@max seg size (kbytes) = 98442 @H_403_447@max total shared memory (kbytes) = 3221512 @H_403_447@min seg size (bytes) = 1 @H_403_447@ @H_403_447@------ Semaphore Limits -------- @H_403_447@max number of arrays = 2048 @H_403_447@max semaphores per array = 250 @H_403_447@max semaphores system wide = 256000 @H_403_447@max ops per semop call = 100 @H_403_447@semaphore max value = 32767 @H_403_447@ @H_403_447@------ Messages: Limits -------- @H_403_447@max queues system wide = 7643 @H_403_447@max size of message (bytes) = 65536 @H_403_447@default max size of queue (bytes) = 65536 @H_403_447@ |
2.2 ipcrm
1. 命令功能
通过指定ID删除删除IPC资源,同时将与IPC对象关联的数据一并删除,只有超级用户或IPC资源创建者能够删除
2. 使用方法
ipcrm -M shmkey
移除用shmkey创建的共享内存段
ipcrm -m shmid
移除用shmid标识的共享内存段
ipcrm -S semkey
移除用semkey创建的信号量
ipcrm -s semid
移除用semid标识的信号量
ipcrm -Q msgkey
移除用msgkey创建的消息队列
ipcrm -q msgid
移除用msgid标识的消息队列
2.3 如何快速的清理Oracle的进程?
真题1、如何快速的清理Oracle的进程?
答案:若想要快速清理掉Oracle的进程,则最直接的办法是杀pmon进程。有如下3条命令可供选择,其中加粗的orcl替换成ORACLE_SID的值即可。
@H_403_447@kill -9 `ps -ef|grep orcl| grep -v grep | awk '{print $2}'` @H_403_447@ps -ef |grep orcl|grep -v grep|awk '{print $2}' | xargs kill -9 @H_403_447@ipcs -m | grep oracle | awk '{print $2}' | xargs ipcrm shm |
若想要快速杀掉集群的进程,则可以执行如下命令:
@H_403_447@kill -9 `ps -ef|grep d.bin| grep -v grep | awk '{print $2}'` |
注意,生产库上严禁使用,否则可能导致集群不能正常启动。
第三章sysresv命令
3.1 若是一个主机上有多个oracle实例的话该如何确定哪个共享内存段属于我们该清掉的oracle实例的内存段?
答案:使用sysresv命令。sysresv是Oracle在Linux/Unix平台提供的工具,用来查看Oracle实例使用的共享内存和信号量等信息。sysresv存放的路径:$ORACLE_HOME/bin/sysresv。使用时需要设置LD_LIBRARY_PATH环境变量,用来告诉Oracle共享库文件的位置。sysresv用法如下:
@H_403_447@[oracle@rhel6lhr ~]$ sysresv -h @H_403_447@sysresv: invalid option -- 'h' @H_403_447@usage : sysresv [-if] [-d ] [-l sid1 ...] @H_403_447@ -i : Prompt before removing ipc resources for each sid @H_403_447@ -f : Remove ipc resources silently,oevrrides -i option @H_403_447@ -d : List ipc resources for each sid if on @H_403_447@ -l sid1 .. : apply sysresv to each sid @H_403_447@Default : sysresv -d on -l $ORACLE_SID @H_403_447@Note : ipc resources will be attempted to be deleted for a @H_403_447@ sid only if there is no currently running instance @H_403_447@ with that sid. @H_403_447@[oracle@rhel6lhr ~]$ which sysresv @H_403_447@/u01/app/oracle/product/11.2.0/dbhome_1/bin/sysresv @H_403_447@ |
来看一下简单使用:
@H_403_447@oracle@sunvs-b@/oracle/oracle $ uname -a @H_403_447@SunOS sunvs-b 5.10 Generic_139555-08 sun4u sparc SUNW,Sun-Fire-480R @H_403_447@oracle@sunvs-b@/oracle/oracle $ ps -ef|grep pmon @H_403_447@ oracle 26257 1 0 5@H_403_447@月 @H_403_447@24 ? 140:42 ora_pmon_H2 @H_403_447@ oracle 15479 14078 0 14:01:36 pts/4 0:00 grep pmon @H_403_447@oracle 12449 1 0 8@H_403_447@月 @H_403_447@17 ? 17:44 ora_pmon_U2 @H_403_447@ @H_403_447@oracle@sunvs-b@/oracle/oracle $ sysresv -l H2 @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "H2" : @H_403_447@Shared Memory: @H_403_447@ID KEY @H_403_447@1979711594 0x00000000 @H_403_447@1979711595 0x00000000 @H_403_447@1979711596 0x00000000 @H_403_447@1979711597 0xce653c24 @H_403_447@Semaphores: @H_403_447@ID KEY @H_403_447@16777316 0x25393874 @H_403_447@Oracle Instance alive for sid "H2" @H_403_447@ @H_403_447@oracle@sunvs-b@/oracle/oracle $ ipcs -ms @H_403_447@IPC status from as of 2011@H_403_447@年08月29日 星期一 14时11分51秒 @H_403_447@ CST @H_403_447@T ID KEY MODE OWNER GROUP @H_403_447@Shared Memory: @H_403_447@m 1577058426 0xf5649758 --rw-r----- oracle oinstall @H_403_447@m 1577058425 0 --rw-r----- oracle oinstall @H_403_447@m 1577058424 0 --rw-r----- oracle oinstall @H_403_447@m 1577058423 0 --rw-r----- oracle oinstall @H_403_447@m 1979711605 0x4e65af --rw-r--r-- oracle oinstall @H_403_447@m 1979711604 0x3e65af --rw-r--r-- oracle oinstall @H_403_447@m 1979711603 0x1e65af --rw-r--r-- oracle oinstall @H_403_447@m 1979711602 0xe65af --rw-r--r-- oracle oinstall @H_403_447@m 1979711597 0xce653c24 --rw-r----- oracle oinstall @H_403_447@m 1979711596 0 --rw-r----- oracle oinstall @H_403_447@m 1979711595 0 --rw-r----- oracle oinstall @H_403_447@m 1979711594 0 --rw-r----- oracle oinstall @H_403_447@m 1979711511 0x31f4002 --rw-rw-rw- cupsz cupucuse @H_403_447@m 754974788 0xc93f --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974787 0xc93e --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974786 0xc93d --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974785 0xc93c --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974784 0xc93b --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974783 0xc93a --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974782 0xc939 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974781 0xc938 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974780 0xc937 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974779 0xc936 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974778 0xc935 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974777 0xc934 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974776 0xc933 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974775 0xc932 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974774 0xc930 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974773 0xc92f --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974772 0xc92e --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974771 0xc92d --rw-rw-rw- hsm1 cupucuse @H_403_447@m 754974770 0xc931 --rw-rw-rw- hsm1 cupucuse @H_403_447@m 45 0x741cc1a6 --rw-rw-rw- root root @H_403_447@m 44 0x741cc1a5 --rw-rw-rw- root root @H_403_447@m 43 0x741cc1a4 --rw-rw-rw- root root @H_403_447@m 42 0x741cc1a3 --rw-rw-rw- root root @H_403_447@m 41 0x741cc1a2 --rw-rw-rw- root root @H_403_447@m 40 0x741cc1a1 --rw-rw-rw- root root @H_403_447@m 39 0x741cc1a0 --rw-rw-rw- root root @H_403_447@m 37 0x435dce60 --rw-rw-rw- root root @H_403_447@m 0 0x22bb --rw-rw---- root dba @H_403_447@Semaphores: @H_403_447@s 16777324 0x25393ad4 --ra-r----- oracle oinstall @H_403_447@s 16777320 0x1e65af --ra-ra-ra- oracle oinstall @H_403_447@s 16777319 0xe65af --ra-ra-ra- oracle oinstall @H_403_447@s 16777316 0x25393874 --ra-r----- oracle oinstall @H_403_447@s 16777296 0 --ra-ra-ra- cupst cupucuse @H_403_447@s 16777294 0 --ra-ra-ra- cupst cupucuse @H_403_447@s 16777289 0 --ra-ra-ra- cuput cupucuse @H_403_447@s 16777287 0 --ra-ra-ra- cuput cupucuse @H_403_447@s 16777282 0 --ra-ra-ra- cupvip cupucuse @H_403_447@s 16777280 0 --ra-ra-ra- cupvip cupucuse @H_403_447@s 16777279 0 --ra-ra-ra- cupfb cupucuse @H_403_447@s 16777277 0 --ra-ra-ra- cupfb cupucuse @H_403_447@s 16777268 0 --ra-ra-ra- cupuc cupucuse @H_403_447@s 16777266 0 --ra-ra-ra- cupuc cupucuse @H_403_447@s 16777261 0 --ra-ra-ra- cuphx cupucuse @H_403_447@s 16777259 0 --ra-ra-ra- cuphx cupucuse @H_403_447@s 16777258 0 --ra-ra-ra- cupsz cupucuse @H_403_447@s 16777256 0 --ra-ra-ra- cupsz cupucuse @H_403_447@s 1 0x55064bec --ra-r--r-- root root @H_403_447@s 0 0x710644ac --ra-ra-ra- root root @H_403_447@ |
说明一下:在安装ORACLE产品前,需要设置系统的共享内存段的最大值和个数限制,实例在启动后,应尽量保证SGA在一个共享内存段上,这里由于我是在RAC的一个节点上进行的测试,所以实例内存被分配到4个共享内存段上。
IPC的清理可以使用sysresv –if,如果实例正在运行,清理操作会被终止:
@H_403_447@oracle@sunvs-b@/oracle/oracle $ sysresv -fi -l H2 @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "H2" : @H_403_447@Shared Memory: @H_403_447@ID KEY @H_403_447@1979711594 0x00000000 @H_403_447@1979711595 0x00000000 @H_403_447@1979711596 0x00000000 @H_403_447@1979711597 0xce653c24 @H_403_447@Semaphores: @H_403_447@ID KEY @H_403_447@16777316 0x25393874 @H_403_447@Oracle Instance alive for sid "H2" @H_403_447@SYSRESV-005: Warning @H_403_447@ Instance maybe alive - aborting remove for sid "H2" @H_403_447@ |
另外如果需要清理内存段和信号量,而sysresv发现实例是alive的,可以使用ipcrm命令:
@H_403_447@ipcrm -m @H_403_447@ipcrm -s @H_403_447@ |
3.1.1 实验
@H_403_447@[ZFXDESKDB2:oracle]:/oracle>ps -ef|grep ora_pmon_ @H_403_447@@H_403_447@ oracle 12255344 21626964 0 17:43:01 pts/0@H_403_447@ 0:00 grep ora_pmon_ @H_403_447@@H_403_447@ oracle 17629238 1 0 18:57:42 - 0:09ora_pmon_raclhr2 @H_403_447@@H_403_447@ oracle 20250806 1 0 18:57:42 -@H_403_447@ 0:10 ora_pmon_oraESKDB2 @H_403_447@[ZFXDESKDB2:oracle]:/oracle>which sysresv @H_403_447@/oracle/app/oracle/product/11.2.0/db/bin/sysresv @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ORACLE_SID=raclhr2 @H_403_447@[ZFXDESKDB2:oracle]:/oracle>sysresv @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "raclhr2" : @H_403_447@Shared Memory: @H_403_447@ID@H_403_447@ @H_403_447@KEY @H_403_447@5242886@H_403_447@ @H_403_447@0xffffffff @H_403_447@5242883@H_403_447@ @H_403_447@0xffffffff @H_403_447@1048583@H_403_447@ @H_403_447@0xd92489e0 @H_403_447@Oracle Instance alive for sid "raclhr2" @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ipcs @H_403_447@IPC status from /dev/mem as of Wed Jun@H_403_447@@H_403_447@ 1 17:43:47 BEIST 2016 @H_403_447@T@H_403_447@ ID KEY MODE OWNER @H_403_447@GROUP @H_403_447@Message Queues: @H_403_447@q@H_403_447@ 0 0x9283a0d2 -Rrw------- root @H_403_447@system @H_403_447@q@H_403_447@ 1 0xffffffff ----------- root @H_403_447@system @H_403_447@ @H_403_447@Shared Memory: @H_403_447@m@H_403_447@ 1048576 00000000 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 1048577 00000000 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 1048578 0x210000aa --rw-rw---- root @H_403_447@system @H_403_447@m@H_403_447@ 5242883 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048580 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048581 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 5242886 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048583 0xd92489e0 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048584 0xd1a4a5d8 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 8388617 0x3f516768 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m 759169034 0x21000148 --rw-rw----@H_403_447@ oracle @H_403_447@dba @H_403_447@Semaphores: @H_403_447@s@H_403_447@ 3145728 0x0100324a --ra-ra-r-- root @H_403_447@system @H_403_447@s@H_403_447@ 1 0x620025b4 --ra-r--r-- root @H_403_447@system @H_403_447@s@H_403_447@ 2 0x02001958 --ra-ra-ra- root @H_403_447@system @H_403_447@s@H_403_447@ 3 0x01001958 --ra-ra-ra- root @H_403_447@system @H_403_447@s@H_403_447@ 9 0x010024be --ra------- root @H_403_447@system @H_403_447@s@H_403_447@ 1048590 0x410000a8 --ra-ra---- root @H_403_447@system @H_403_447@s@H_403_447@ 11534361 0x41000147 --ra-ra---- oracle @H_403_447@dba @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ipcs -m @H_403_447@IPC status from /dev/mem as of Wed Jun@H_403_447@@H_403_447@ 1 17:43:56 BEIST 2016 @H_403_447@T@H_403_447@ ID KEY MODE OWNER @H_403_447@GROUP @H_403_447@Shared Memory: @H_403_447@m@H_403_447@ 1048576 00000000 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 1048577 00000000 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 1048578 0x210000aa --rw-rw---- root @H_403_447@system @H_403_447@m@H_403_447@ 5242883 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048580 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048581 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 5242886 00000000 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048583 0xd92489e0 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m@H_403_447@ 1048584 0xd1a4a5d8 --rw-r----- grid @H_403_447@dba @H_403_447@m@H_403_447@ 8388617 0x3f516768 --rw-r----- @H_403_447@oracle asmadmin @H_403_447@m 759169034 0x21000148 --rw-rw----@H_403_447@ oracle @H_403_447@dba @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ipcrm -m 5242886 @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ipcrm -m 5242883 @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ipcrm -m 1048583 @H_403_447@[ZFXDESKDB2:oracle]:/oracle>sysresv @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "raclhr2" : @H_403_447@Shared Memory @H_403_447@ID@H_403_447@ @H_403_447@KEY @H_403_447@No shared memory segments used @H_403_447@Oracle Instance not alive for sid "raclhr2" @H_403_447@Oracle Instance not alive for sid "raclhr2" @H_403_447@[ZFXDESKDB2:oracle]:/oracle>ps -ef|grep ora_pmon_ @H_403_447@@H_403_447@ oracle 17629238 1 0 18:57:42 -@H_403_447@ 0:09 ora_pmon_raclhr2 @H_403_447@@H_403_447@ oracle 20250806 1 0 18:57:42 -@H_403_447@ 0:10 ora_pmon_oraESKDB2 @H_403_447@@H_403_447@ oracle 23330844 21626964 0 17:44:46 pts/0@H_403_447@ 0:00 grep ora_pmon_ @H_403_447@[ZFXDESKDB2:oracle]:/oracle>sqlplus / as sysdba @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.4.0 Production on Wed Jun 1 17:44:52 2016 @H_403_447@ @H_403_447@Copyright (c) 1982,2013,Oracle.@H_403_447@@H_403_447@ All rights reserved. @H_403_447@ @H_403_447@Connected to an idle instance. @H_403_447@ @H_403_447@SYS@raclhr2> shutdown abort @H_403_447@ORACLE instance shut down. @H_403_447@SYS@raclhr2> exit @H_403_447@Disconnected from Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production @H_403_447@With the Partitioning,Real Application Clusters,Automatic Storage Management,OLAP, @H_403_447@Data Mining and Real Application Testing options |
第四章Oracle内核参数
查看:more /proc/sys/kernel/shmmax
临时生效:echo 3145728 > /proc/sys/kernel/shmmax
永久生效,修改文件:/etc/sysctl.conf,并使修改参数立即生效:/sbin/sysctl -p
重要的几个参数如下所示:
@H_403_447@kernel.shmall = 2097152 @H_403_447@kernel.shmmax = 1054472192 @H_403_447@kernel.shmmni = 4096 @H_403_447@kernel.sem = 250 32000 100 128 |
其含义分别如下所示:
(一)kernel.shmall = 2097152 # kernel.shmall参数是控制共享内存页数。Linux 共享内存页大小为4KB,共享内存段的大小都是共享内存页大小的整数倍。如果一个共享内存段的最大大小是16G,那么需要共享内存页数是 16GB/4KB = 16777216KB/4KB = 4194304(页),也就是64Bit系统下16GB物理内存,设置kernel.shmall = 4194304才符合要求(几乎是原来设置2097152的两倍)。简言之,该参数的值始终应该至少为: ceil(SHMMAX/PAGE_SIZE)。这个值太小有可能导致数据库启动报错(ORA-27102: out of memory)。
(二)kernel.shmmax = 1054472192 #定义一个内存段最大可以分配的内存空间,单位为字节。如果定义太小,那么会导致启动实例失败,或者SGA就会被分配到多个共享内存段。那么内存中的指针连接会给系统带来一定的开销,从而降低系统性能。这个值的设置应该大于SGA_MAX_TARGET或MEMORY_MAX_TARGET的值,最大值可以设置成大于或等于实际的物理内存。如果kernel.shmmax为100M,sga_max_size为500M,那么启动Oracle实例至少会分配5个共享内存段;如果设置kernel.shmmax为2G,sga_max_size为500M,那么启动Oracle实例只需要分配1个共享内存段。
(三)kernel.shmmni = 4096 #设置系统级最大共享内存段数量,该参数的默认值是4096。这一数值已经足够,通常不需要更改。。
(四)kernel.sem = 250 32000 100 128 #信号灯的相关配置,信号灯semaphores是进程或线程间访问共享内存时提供同步的计数器。可以通过命令“cat /proc/sys/kernel/sem”来查看当前信号灯的参数配置,如下所示:
@H_403_447@[root@edsir4p1 ~]# cat /proc/sys/kernel/sem @H_403_447@250 32000 100 128 |
其4个值的含义分别如下:
① 250表示SEMMSL,设置每个信号灯组中信号灯最大数量,推荐的最小值是250。对于系统中存在大量并发连接的系统,推荐将这个值设置为PROCESSES初始化参数加10。
② 32000表示SEMMNS,设置系统中信号灯的最大数量。操作系统在分配信号灯时不会超过LEAST(SEMMNS,SEMMSL*SEMMNI)。事实上,如果SEMMNS的值超过了SEMMSL*SEMMNI是非法的,因此推荐SEMMNS的值就设置为SEMMSL*SEMMNI。Oracle推荐SEMMNS的设置不小于32000。
③ 100表示SEMOPM,设置每次系统调用可以同时执行的最大信号灯操作的数量。由于一个信号灯组最多拥有SEMMSL个信号灯,因此有推荐将SEMOPM设置为SEMMSL的值。Oracle验证的10.2和11.1的SEMOPM的配置为100。
④ 128表示SEMMNI,设置系统中信号灯组的最大数量。Oracle10g和11g的推荐值为142。
4.1 kernel.shmmax参数
4.1.1 实验1
下面临时设置kernel.shmmax为3M,会导致Oracle不能启动,设置sqlplus不能进入:
@H_403_447@[root@edsir4p1 ~]# echo 3145728 > /proc/sys/kernel/shmmax@H_403_447@ <<<==== @H_403_447@临时设置@H_403_447@3M @H_403_447@[oracle@edsir4p1- ~]$ more /proc/sys/kernel/shmmax@H_403_447@<<<==== 查看是否生效 @H_403_447@3145728 @H_403_447@[root@edsir4p1 ~]# /sbin/sysctl -a | grep shm @H_403_447@vm.hugetlb_shm_group = 0 @H_403_447@kernel.shmmni = 4096 @H_403_447@kernel.shmall = 2097152 kernel.shmmax = 3145728 @H_403_447@[root@edsir4p1 ~]# more /etc/sysctl.conf | grep kernel.shm @H_403_447@kernel.shmall = 2097152 kernel.shmmax = 2147483648 @H_403_447@kernel.shmmni = 4096 @H_403_447@[root@edsir4p1 ~]# su - oracle @H_403_447@[oracle@edsir4p1- ~]$ . PROD1_env @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sqlplus / as sysdba @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.1.0 Production on Tue Nov 14 10:09:08 2017 @H_403_447@ @H_403_447@Copyright (c) 1982,2009,Oracle. All rights reserved. @H_403_447@ @H_403_447@ERROR: @H_403_447@ORA-12547: TNS:lost contact @H_403_447@ @H_403_447@ @H_403_447@Enter user-name: @H_403_447@ @H_403_447@ @H_403_447@[oracle@edsir4p1-PROD1 ~]$ oerr ora 12547 @H_403_447@12547,00000,"TNS:lost contact" @H_403_447@// *Cause: Partner has unexpectedly gone away,usually during process @H_403_447@// startup. @H_403_447@// *Action: Investigate partner application for abnormal termination. On an @H_403_447@// Interchange,this can happen if the machine is overloaded. @H_403_447@ |
告警日志:
@H_403_447@ Linux Error: 32: Broken pipe @H_403_447@Tue Nov 14 10:00:38 2017 @H_403_447@14-NOV-2017 10:00:38 * (CONNECT_DATA=(SID=PROD1)(CID=(PROGRAM=emagent)(HOST=edsir4p1.us.oracle.com)(USER=oracle))) * (ADDRESS=(PROTOCOL=tcp)(HOST=10.190.104.111)(PORT=26305)) * establish * PROD1 * 12518 @H_403_447@TNS-12518: TNS:listener could not hand off client connection @H_403_447@TNS-12547: TNS:lost contact @H_403_447@ TNS-12560: TNS:protocol adapter error @H_403_447@ TNS-00517: Lost contact @H_403_447@ Linux Error: 32: Broken pipe |
或启动报错:
@H_403_447@SYS@PROD1> startup @H_403_447@ORA-00443: background process "PMON" did not start @H_403_447@SYS@PROD1> startup @H_403_447@ORA-12547: TNS:lost contact @H_403_447@SYS@PROD1> |
有关“TNS-12518: TNS:listener could not hand off client connection”的更多内容请参考:
【故障|监听】TNS-12518、TNS-00517和 Linux Error:32:Broken pipe:http://blog.itpub.net/26736162/viewspace-2135468/
4.1.2 实验2
下面临时设置kernel.shmmax为100M,sga_max_size为500M,则至少需要5个共享内存段,查看临时段的个数:
@H_403_447@[root@edsir4p1 ~]# echo 104857600 > /proc/sys/kernel/shmmax @H_403_447@[root@edsir4p1 ~]# more /proc/sys/kernel/shmmax @H_403_447@104857600 @H_403_447@[root@edsir4p1 ~]# su - oracle @H_403_447@[oracle@edsir4p1- ~]$ . PROD1_env @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sysresv @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "PROD1" : @H_403_447@Shared Memory @H_403_447@ID KEY @H_403_447@No shared memory segments used@H_403_447@<<<==== @H_403_447@无实例的共享内存段 @H_403_447@Semaphores: @H_403_447@ID KEY @H_403_447@98304 0xa3dda878 @H_403_447@Oracle Instance not alive for sid "PROD1" @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ipcs @H_403_447@ @H_403_447@------ Shared Memory Segments -------- @H_403_447@key shmid owner perms bytes nattch status @H_403_447@0x00000000 32768 vncuser 644 790528 2 dest @H_403_447@0x00000000 65537 vncuser 644 790528 2 dest @H_403_447@0x00000000 98306 vncuser 644 790528 2 dest @H_403_447@ @H_403_447@------ Semaphore Arrays -------- @H_403_447@key semid owner perms nsems @H_403_447@0xa3dda878 98304 oracle 660 154 @H_403_447@ @H_403_447@------ Message Queues -------- @H_403_447@key msqid owner perms used-bytes messages @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sqlplus / as sysdba @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.1.0 Production on Tue Nov 14 10:29:07 2017 @H_403_447@ @H_403_447@Copyright (c) 1982,Oracle. All rights reserved. @H_403_447@ @H_403_447@Connected to an idle instance. @H_403_447@SYS@PROD1> startup @H_403_447@ORACLE instance started. @H_403_447@ @H_403_447@Total System Global Area 313860096 bytes @H_403_447@Fixed Size 1336232 bytes @H_403_447@Variable Size 251661400 bytes @H_403_447@Database Buffers 54525952 bytes @H_403_447@Redo Buffers 6336512 bytes @H_403_447@Database mounted. @H_403_447@Database opened. @H_403_447@SYS@PROD1> show parameter sga @H_403_447@ @H_403_447@NAME TYPE VALUE @H_403_447@------------------------------------ ----------- ------------------------------ @H_403_447@lock_sga boolean FALSE @H_403_447@pre_page_sga boolean FALSE sga_max_size big integer 500M @H_403_447@sga_target big integer 300M @H_403_447@SYS@PROD1> exit @H_403_447@Disconnected from Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - Production @H_403_447@With the Partitioning,Data Mining and Real Application Testing options @H_403_447@[oracle@edsir4p1-PROD1 dbs]$ @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sysresv @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "PROD1" : @H_403_447@Shared Memory: @H_403_447@ID KEY 1245194 0x00000000 1277963 0x00000000 1310732 0x00000000 1343501 0x00000000 1376270 0x00000000 1409039 0x90c3be20 @H_403_447@Semaphores: @H_403_447@ID KEY @H_403_447@917504 0xa3dda878 @H_403_447@Oracle Instance alive for sid "PROD1" @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ipcs @H_403_447@ @H_403_447@------ Shared Memory Segments -------- @H_403_447@key shmid owner perms bytes nattch status @H_403_447@0x00000000 32768 vncuser 644 790528 2 dest @H_403_447@0x00000000 65537 vncuser 644 790528 2 dest @H_403_447@0x00000000 98306 vncuser 644 790528 2 dest @H_403_447@0x00000000 1245194 oracle 660 8388608 30 @H_403_447@<<<==== @H_403_447@该共享内存段为@H_403_447@8M 0x00000000 1277963 oracle 660 104857600 30 0x00000000 1310732 oracle 660 104857600 30 0x00000000 1343501 oracle 660 104857600 30 0x00000000 1376270 oracle 660 104857600 30 @H_403_447@0x90c3be20 1409039 oracle 660 100663296 30 @H_403_447@<<<==== @H_403_447@每个共享内存段为@H_403_447@100M @H_403_447@ @H_403_447@------ Semaphore Arrays -------- @H_403_447@key semid owner perms nsems @H_403_447@0xa3dda878 917504 oracle 660 154 @H_403_447@ @H_403_447@------ Message Queues -------- @H_403_447@key msqid owner perms used-bytes messages @H_403_447@ |
下面临时设置kernel.shmmax为2G,sga_max_size为500M,则只需要1个共享内存段,查看临时段的个数:
@H_403_447@[oracle@edsir4p1-PROD1 ~]$ ss @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.1.0 Production on Tue Nov 14 10:49:21 2017 @H_403_447@ @H_403_447@Copyright (c) 1982,Oracle. All rights reserved. @H_403_447@ @H_403_447@ @H_403_447@Connected to: @H_403_447@Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - Production @H_403_447@With the Partitioning,Data Mining and Real Application Testing options @H_403_447@ @H_403_447@SYS@PROD1> select 2*1024*1024*1024 from dual; @H_403_447@ @H_403_447@2*1024*1024*1024 @H_403_447@---------------- @H_403_447@ 2147483648 @H_403_447@ @H_403_447@SYS@PROD1> exit @H_403_447@Disconnected from Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - Production @H_403_447@With the Partitioning,Data Mining and Real Application Testing options @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sudo echo 2147483648 > /proc/sys/kernel/shmmax @H_403_447@-bash: /proc/sys/kernel/shmmax: Permission denied @H_403_447@[oracle@edsir4p1-PROD1 ~]$ su - root @H_403_447@Password: @H_403_447@[root@edsir4p1 ~]# echo 2147483648 > /proc/sys/kernel/shmmax @H_403_447@[root@edsir4p1 ~]# exit @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ipcs -m @H_403_447@ @H_403_447@------ Shared Memory Segments -------- @H_403_447@key shmid owner perms bytes nattch status @H_403_447@0x00000000 32768 vncuser 644 790528 2 dest @H_403_447@0x00000000 65537 vncuser 644 790528 2 dest @H_403_447@0x00000000 98306 vncuser 644 790528 2 dest 0x00000000 1245194 oracle 660 8388608 30 0x00000000 1277963 oracle 660 104857600 30 0x00000000 1310732 oracle 660 104857600 30 0x00000000 1343501 oracle 660 104857600 30 0x00000000 1376270 oracle 660 104857600 30 @H_403_447@0x90c3be20 1409039 oracle 660 100663296 30 <<<==== 需要重启数据库,重新分配共享内存段 @H_403_447@ @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ss @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.1.0 Production on Tue Nov 14 10:50:23 2017 @H_403_447@ @H_403_447@Copyright (c) 1982,Data Mining and Real Application Testing options @H_403_447@ @H_403_447@SYS@PROD1> startup force @H_403_447@ORACLE instance started. @H_403_447@ @H_403_447@Total System Global Area 523108352 bytes @H_403_447@Fixed Size 1337632 bytes @H_403_447@Variable Size 343934688 bytes @H_403_447@Database Buffers 171966464 bytes @H_403_447@Redo Buffers 5869568 bytes @H_403_447@Database mounted. @H_403_447@Database opened. @H_403_447@SYS@PROD1> exit @H_403_447@Disconnected from Oracle Database 11g Enterprise Edition Release 11.2.0.1.0 - Production @H_403_447@With the Partitioning,Data Mining and Real Application Testing options @H_403_447@[oracle@edsir4p1-PROD1 ~]$ sysresv @H_403_447@ @H_403_447@IPC Resources for ORACLE_SID "PROD1" : @H_403_447@Shared Memory: @H_403_447@ID KEY 1474570 0x90c3be20 @H_403_447@Semaphores: @H_403_447@ID KEY @H_403_447@1081344 0xa3dda878 @H_403_447@Oracle Instance alive for sid "PROD1" @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ipcs @H_403_447@ @H_403_447@------ Shared Memory Segments -------- @H_403_447@key shmid owner perms bytes nattch status @H_403_447@0x00000000 32768 vncuser 644 790528 2 dest @H_403_447@0x00000000 65537 vncuser 644 790528 2 dest @H_403_447@0x00000000 98306 vncuser 644 790528 2 dest @H_403_447@0x90c3be20 1474570 oracle 660 528482304 31 <<<====@H_403_447@共享内存段为@H_403_447@500M @H_403_447@ @H_403_447@------ Semaphore Arrays -------- @H_403_447@key semid owner perms nsems @H_403_447@0xa3dda878 1081344 oracle 660 154 @H_403_447@ @H_403_447@------ Message Queues -------- @H_403_447@key msqid owner perms used-bytes messages @H_403_447@ @H_403_447@ |
4.2 kernel.shmall
该参数设置过小,有可能导致数据库启动报错。很多人调整系统内核参数的时候只关注SHMMAX参数,而忽略了SHMALL参数的设置。
@H_403_447@[root@edsir4p1 ~]# echo 10 > /proc/sys/kernel/shmall @H_403_447@[root@edsir4p1 ~]# @H_403_447@[root@edsir4p1 ~]# @H_403_447@[root@edsir4p1 ~]# @H_403_447@[root@edsir4p1 ~]# @H_403_447@[root@edsir4p1 ~]# more /proc/sys/kernel/shmall @H_403_447@10 @H_403_447@[oracle@edsir4p1-PROD1 ~]$ ss @H_403_447@ @H_403_447@sql*Plus: Release 11.2.0.1.0 Production on Tue Nov 14 11:13:53 2017 @H_403_447@ @H_403_447@Copyright (c) 1982,Oracle. All rights reserved. @H_403_447@ @H_403_447@Connected to an idle instance. @H_403_447@ @H_403_447@SYS@PROD1> startup @H_403_447@ORA-27102: out of memory @H_403_447@Linux Error: 28: No space left on device @H_403_447@SYS@PROD1> @H_403_447@ |
4.3 其它博客内容
@H_403_447@4.3.1 @H_403_447@原文地址:@H_403_447@ORACLE内核参数作者:it_newbalance
@H_403_447@服务器内存为@H_403_447@4G的情况下@H_403_447@
修改/etc/sysctl.conf@H_403_447@文件@H_403_447@ (ROOT账户)@H_403_447@
kernel.shmmax = 2147483648
@H_403_447@//公式:2G*1024*1024*1024=2147483648(字节)@H_403_447@
@H_403_447@//表示最大共享内存,如果小的话可以按实际情况而定,一般为物理内存的一半(单位:字节)@H_403_447@
kernel.shmmni=4096
@H_403_447@//表示最小共享内存固定4096KB(由于32位操作系统默认一页为4K)@H_403_447@
kernel.shmall=1048576
@H_403_447@//公式:4G*1024*1024/4K = 1048576(页)@H_403_447@
@H_403_447@//表示所有内存大小(单位:页)@H_403_447@
kernel.sem=250 32000 100 128
@H_403_447@//4个参数依次是SEMMSL:每个用户拥有信号量最大数,SEMMNS:系统信号量最大数,SEMOPM:每次semopm系统调用操作数,SEMMNI:系统辛苦量集数最大数。这4个参数为固定内容大小@H_403_447@
fs.file-max=65536
@H_403_447@//file-max固定大小65536@H_403_447@
net.ipv4.ip_local_port_range=1024 65000
@H_403_447@//ip_local_port_range表示端口的范围,为指定的内容@H_403_447@
@H_403_447@以上步骤做完执行@H_403_447@/sbin/sysctl -p使内核生效
@H_403_447@验证参数(@H_403_447@root账户执行):@H_403_447@
#/sbin/sysctl -a | grep shm
#/sbin/sysctl -a | grep sem
#/sbin/sysctl -a | grep file-max
#/sbin/sysctl -a | grep ip_local_port_range
@H_403_447@最近解决了一些这方面的问题,并在网络上查询了一些相关资料终于发现一个比较全面解释这类问题的官方文档。本来打算当一次活雷锋全文翻译的,后来考虑自己英文一般,并且对于其中一些@H_403_447@OS相关的知识也没有深入了解。就保留英文大家自己去领会其中的要领,自己简单总结了一下解决这类问题的关键点并整理一下英文原文。这个文档是oracle官方技术支持网站Metalink@H_403_447@的资料,里面引用了一些其它的文档例如NOTE:115235.1。
@H_403_447@ @H_403_447@对于@H_403_447@unix操作系统中Semaphores问题只是针对和oracle相关问题作一些解释。对于信号量和共享内存段参数在不同的系统中可能有不同的参数对应,具体你去查询对应的OS@H_403_447@文档。
@H_403_447@在解决这类问题的时候我发现大部分问题都是因为在安装@H_403_447@oracle时没有仔细阅读针对指定OS的安装说明造成安装实例失败,一般oracle的官方文档都详细说明在对应操作系统上如何设置这些内核参数。还有就是因为其他原因OS管理人员调整了参数,但是没有通知DBA,一旦oracle崩溃再次重新启动的时候就可能因为新的内核参数不合适而无法启动。如果是oracle意外停机之后重新启动不成功,并出现类似ora-27123@H_403_447@的错误那么一定要询问是否有其他人修改过内核参数,有时候你没有修改并不代表其他人没有修改哟,我遇到过不少这样的情况!
@H_403_447@1@H_403_447@、与oracle@H_403_447@相关的信号量和共享内存段参数
@H_403_447@ @H_403_447@一般@H_403_447@unix系统中和信号量相关的是三个参数SEMMNI SEMMSL SEMMNS。他们相互关联决定系统可以分配的信号量。Oracle@H_403_447@使用信号量完成内部进程之间的通信。
@H_403_447@ @H_403_447@关于共享内存段使用@H_403_447@shmmx@H_403_447@参数进行总体控制。它指定了系统可以分配的共享内存段最大大小,实际并没有分配那么多只是给出一个可以使用的最大限制。
对于类核参数的修改必须要重新启动系统之后才会生效。
2、出现@H_403_447@信号量和共享内存段相关问题的情况
@H_403_447@ @H_403_447@oracle只有在startup nomount的时候才会请求os的这些资源,用于建立SGA@H_403_447@和启动后台进程。
@H_403_447@ @H_403_447@有些情况下因为@H_403_447@oracle崩溃之后os没有清除oracle分配的SGA@H_403_447@,也可能造成共享内存段不足,需要人工清除。
@H_403_447@3@H_403_447@、如何解决相关的问题
@H_403_447@ @H_403_447@你可以简单的修改@H_403_447@init参数减少oracle@H_403_447@对共享内存段和信号量的需求。
@H_403_447@ @H_403_447@对于控制信号量的三个参数@H_403_447@SEMMNI SEMMSL SEMMNS。最终可以使用的信号量由下面公式提取(semmsl * semmni)或者semmns中最小的值。
例如在linux下@H_403_447@. 进入目录/proc/sys/kernel;用cat命令或more命令查看semaphore当前参数的值:@H_403_447@
@H_403_447@命令运行后将会出现如下的结果:@H_403_447@
@H_403_447@250 32000 32 128@H_403_447@
@H_403_447@其中@H_403_447@,250 是参数SEMMSL的值,32000是参数SEMMNS的值,32是参数SEMOPM的值,而128则是参数SEMMNI的值。250*128=32000
对于oracle7需要信号量的设置等于init中processes的设置。对于8i 9i需要等于processes*2。
对于信号量参数的设定一定要小心,因为不正确的设置可能会让系统使用默认值。这个值一般比oracle系统要求的低。在HP unix上遇到过这样的问题,当时在参数配置的时候指定两个不同的sem-mni造成系统使用默认的设置。
@H_403_447@对于共享内存段,系统的设置至少要等于@H_403_447@SGA@H_403_447@的大小。
Semaphores and Shared Memory
BULLETIN Status: PUBLISHED Content Type: TEXT/PLAIN Creation Date: 05-AUG-2001
Last Revision Date: 05-AUG-2002
PURPOSE-------
To provide an overview of shared memory and semaphores,answer common questions related to these OS resources and provide links to more detailed information.
SCOPE & APPLICATION
-------------------
This document is intended for anyone who is responsible for creating or
administering an Oracle Database. It is intended to compliment the semaphore and
shared memory information already provided in the Oracle Installation Guides.
关于信号量和共享内存段的背景知识
----------------------------------------------------------------------------------
Semaphores and shared memory are two very distinct sets of Operating System
resources. Semaphores are a system resource that Oracle utilizes for interprocess
communication and they occupy a relatively small memory space,while shared memory is utilized to contain the SGA and can garner a large portion of physical memory.
How many of these resources are available and how they are allocated is controlled
by the configuration of the operating system kernel('kernel' referring to the
centralized core components of the underlying operating system).
There are three OS kernel parameters that work together to limit semaphore
allocation and one OS kernel paramater that dictates the maximum size of a shared
memory segment.
Operating System kernel parameters generally cannot be tuned on the fly. If they
are modified,the changes will not take place until the system is rebooted.
Remember also that the kernel parameters related to semaphores and shared memory represent 'high-water' marks.Meaning that the OS will not automatically
allocate a given amount,but will allow up to that given amount to be available
upon request.
什么时候信号量和共享内存段问题最有可能发生
----------------------------------------------------------------------------------
Both semaphore or shared memory errors appear primarily at instance startup(The
'startup nomount' stage specifically).This is the only time that Oracle tries to
acquire semaphores and shared memory for the instance.Errors related to
semaphores or shared memory rarely appear during normal database operations.
The most common circumstance in which these errors occur is during the creation of
a new database.
Sometimes when an Oracle instance crashes,however,it's shared memory segments may not be released by the OS. This limits the overall amount of shared memory available for the instance to start up again. In this case,you will need to remove those segments manually.
如何解决信号量和共享内存段问题:
How to resolve semaphore and shared memory errors:
----------------------------------------------------------------------------------
In addressing both semaphore and shared memory errors at instance startup,there
are two separate areas that should be considered for reconfiguration.
The first and most simple fix is to modify the init.ora to reduce the number of semaphores or the amount of shared memory Oracle will try to grab at instance startup.
If your situation requires that you not reduce the appropriate init.ora
parameters,you will have to modify the operating system kernel to allow the OS to
provide more semaphores or allow larger shared memory segments.
SEMAPHORES
================================================== ================================
IMPORTANT NOTE: ORACLE DOES NOT UTILIZE SEMAPHORES ON AIX OR DIGITAL/TRU64.
与信号量相关的的ORA错误
What kind of ORA errors are related to semaphores?
----------------------------------------------------------------------------------
'Out of memory' type errors are seldom related to semaphores. Error messages which reference a 'SEMM*****' function are related to semaphores.
IMPORTANT NOTE: THESE ERRORS ONLY OCCUR AT INSTANCE STARTUP.
ORA-7250 "spcre: semget error,unable to get first semaphore set."
ORA-7279 "spcre: semget error,unable to get first semaphore set."
ORA-7251 "spcre:semget error,could not allocate any semaphores."
ORA-7252 "spcre: semget error,could not allocate any semaphores."
ORA-7339 "spcre: maximum number of semaphore sets exceeded."
[NOTE:115235.1] Resolving ORA-7279 or ORA-27146 errors when starting instance
VERY COMMON On Oracle8i and Oracle9i:
ORA-3113 "end-of-file on communication channel" at instance startup.
ORA-27146 "post/wait initialization Failed"
[NOTE:115235.1] Resolving ORA-7279 or ORA-27146 errors when starting instance
If you want a very specific explanation of causes for the above errors,refer to:
[NOTE:15566.1] TECH Unix Semaphores and Shared Memory Explained
However,while their exact cause varies,all these error messages indicate that
your init.ora is configured to grab more semaphores than the OS has available.
If you configure your OS as indicated in the following sections,you will not get any of the errors indicated above.
成功配置信号量的步骤
The Basic Steps to Semaphore Success:
----------------------------------------------------------------------------------
1. Understand The Basic Concept Behind Semaphores
2. Understand How Many Semaphores Your Oracle Instance(s) Will Attempt to Grab
From The Operating System.
3. Configure Your OS Kernel To Accomodate all Your Oracle Instance(s) And also
Allow For Future Growth.
[STEP 1] How are semaphores released by the OS for use by an application?
----------------------------------------------------------------------------------
There are 3 OS kernel parameters that work together to limit semaphore allocation.
When an application requests semaphores,the OS releases them in 'sets'.
Illustrated here as 2 sets: +---+ +---+
| | | |
| | | |
+---+ +---+
Controlled by SEMMNI -->OS limit on the Number of Identifiers or sets.
Each set contains a tunable number of individual semaphores.
Illustrated here as 2 semaphores per semaphore set: +---+ +---+
| S | | S | S | | S |
+---+ +---+
Controlled by SEMMSL -->The number of semaphores in an identifier or
set.(Semaphore List)
Ultimately however,the OS can limit the total number of semaphores available
from the OS. Controlled by:
SEMMNS --> The total Number of Semaphores allowed system wide.
For instance: Let's say SEMMNI = 100000000 and SEMMSL= 100000000 while SEMMNS=10
Even though SEMMNI is 100000000 and SEMMSL is 100000000,the max # of semaphores available on your system will only be 10,because SEMMNS is set to 10.
Inversely: Let's say SEMMNI = 10 and SEMMSL = 10 while SEMMNS=
100000000000000000000000000 Because SEMMNI is 10 and SEMMSL is 10,the max # of semaphores avail on your system will only be 100 or (10 X 10),despite what SEMMNS is set too.
THIS NOTION CAN BE SUMMARIZED BY THE FOLLOWING STATEMENT:
The max # of semaphores that can be allocated on a system will be the lesser of:
(semmsl * semmni) or semmns.
On HP: semmsl is hardcoded to 500. [NOTE:74367.1] HP-UX SEMMSL Kernel Parameter
SEMMNI,SEMMSL & SEMMNS are the basic names for OS semaphore kernel parameters,the full name may vary depending on your OS. Consult your OS specific Oracle Install guide.
[NOTE:116638.1] Understanding and Obtaining Oracle Documentation)
[STEP 2] How many semaphores will my Oracle instance(s) require?
----------------------------------------------------------------------------------
With Oracle7: The number of semaphores required by an instance is equal to the
setting the 'processes' parameter in the init.ora for the instance.
With Oracle8,Oracle8i and Oracle9i: The number of semaphores required by an
instance is equal to 2 times the setting of the 'processes' parameter in the init.ora for the instance. Keep in mind,that Oracle only momentarilygrabs 2 X 'processes' then releases half at instance startup. This measure was apparently introduced to ensure Oracle could not exhaust a system of semaphores.
Oracle may also grab a couple of additional semaphores per instance for internal
use.
[STEP 3] Configure your OS kernel to accomodate all your Oracle instances.
----------------------------------------------------------------------------------
There seems to be some confusion of how to deal with lack of semaphore errors. The
popular theory being that if Oracle cannot find enough semaphores on a system,
increase semmns. This is not always the case,as illustrated in [STEP 1].
Once you have determined your semaphore requirements for Oracle and compensated for future growth,contact your System Administrator or OS vendor for assistance in modifying the OS kernel.
What should I set 'semmni','semmsl' & 'semmns' to?
----------------------------------------------------------------------------------
Oracle Support typically does not recommend specific values for semaphore kernel
parameters. Instead,use the information provided in this document to set the parameters to values that are appropriate for your operating environment.
For more info please look at the following note : [NOTE:15654.1] TECH: Calculating
Oracle's SEMAPHORE Requirements
Quick fix for resolving lack of semaphore errors:
----------------------------------------------------------------------------------
Reduce the number of semaphores Oracle requires from the OS.
The first and most simple fix is to modify the init.ora to reduce the
number of semaphores or the amount of shared memory Oracle will try to grab at
instance startup.
Keep in mind,with Oracle8,we grab 2 X 'processes' then release half. This measure
was apparently introduced to ensure Oracle could not exhaust a system of semaphores.
如何查找OS配置的信号量
How can I find out how my OS kernel is configured for semaphores?
----------------------------------------------------------------------------------
The files that are used to tune kernel parameters varies depending on your
Operating System. Consult your system administrator or OS vendor,becauseviewing the system file may not show accurate information about the runtime values.
However,an important point to remember is that if a typographical error is made
while editing these files,the OS will defer to a default value which is usually to low to accomodate Oracle. So it's a good idea to check runtime values with utilities like '/etc/sysdef'.
I've tuned my OS kernel parameters,but I am still having semaphore problems....
----------------------------------------------------------------------------------
常见问题!!
This may mean that you made a typographical error or did not rebuild your
Operating System kernel correctly(if a typographical error is made while editing these files,the OS will defer to a default value which is usually to low to accomodate Oracle).
On Solaris,check current OS kernel values with this command:
> /etc/sysdef|grep -i semm
If these values do not reflect what you put in your 'system' file,you likely made a typographically error.
On HP,be sure the OS kernel was rebuilt correctly and that the OS was booted off the correct file. Contact your System Administrator or HP for more information.
在Linux系统上
@H_403_447@进入目录@H_403_447@/proc/sys/kernel;用cat命令或more命令查看semaphore当前参数的值:@H_403_447@
@H_403_447@命令运行后将会出现如下的结果:@H_403_447@
@H_403_447@250 32000 32 128@H_403_447@
@H_403_447@其中@H_403_447@,32是参数SEMOPM的值,而128则是参数SEMMNI的值。250*128=32000
如何获得当前正在使用的信号量
How can I determine how many semaphores are currently being utilized?
----------------------------------------------------------------------------------
On most Unix systems,current semaphore allocation can be displayed with the OS
command 'ipcs -s'.
% ipcs -s
While good to know,this command is seldom used as part of troubleshooting semaphore errors.
SHARED MEMORY
==================================================
OS如何分配共享内存段
How is shared memory allocated by the OS?
----------------------------------------------------------------------------------
This process varies slightly depending on Unix platform,but the basic premise is this:
An application requests a given amount of contiguous shared memory from the OS. The OS dictates how large of a shared memory segment it will allow with the kernel
parameter SHMMAX(Shared Memory Maximum). If the amount of shared memory requested by the application is greater than SHMMAX,the OS may be granted the shared memory in multiple segments. Ideally,you want the amount requested by the application to be less than SHMMAX so that the application's request can be fulfilled with one shared memory segment.
SHMMAX和SGA的关系
How does SHMMAX relate to my SGA?
----------------------------------------------------------------------------------
Since the SGA is comprised of shared memory,SHMMAX can potentially limit how large your SGA can be and/or prevent your instance from starting.
What limits the size of my SGA?
----------------------------------------------------------------------------------
In no particular order.
5. The amount of Physical Memory and Swap space available on your system.
6. The kernel paramater SHMMAX.
7. Other OS specific limitations on shared memory.
Memory SHMMAX OS Limits +----------+ +----------+ +----------+
| | | | | | +------+
| | | | | | | S |
| | | | | | > | G |
| | | | | | | A |
| | | | | | +------+
+----------+ +----------+ +----------+
Some OS specific limitations are discussed in the following documents:
"Oracle Administrator's Reference" available on the Oracle Install CD
Additionallly:
HP-UX: [NOTE:77310.1] HP-UX Large SGA support for HP,Memory Windows
[NOTE:69119.1] HP-UX SGA Sizing Issues on HP-UX
Solaris: [NOTE:61896.1] SOLARIS: SGA size,sgabeg attach address and Sun
与共享内存当相关的错误
What kind of ORA errors are related to shared memory?
----------------------------------------------------------------------------------
Error Messages referencing a 'SHMM****' function are related to shared memory.
ORA-7306,ORA-7336,ORA-7329,ORA-7307,ORA-7337,ORA-7320,ORA-7334
VERY COMMON IN 8i:ORA-27100 "shared memory realm already exists"ORA-27102 "out of memory"
ORA-27125 "unable to create shared memory segment" and/or "linux 43 identifier removed"
ORA-27123 "unable to attach to shared memory segment"
[NOTE:115753.1] UNIX Resolving the ORA-27123 error
[NOTE:1028623.6] SUN SOLARIS: HOW TO RELOCATE THE SGA
如何设置SHMMAX
What should I set 'shmmax' to?
----------------------------------------------------------------------------------
On some Unix platforms,the Install Guide recommends specific values. PrevIoUs
versions of the Install Guide recommended setting SHMMAX to .5 *(physical memory present in machine). Most recently it's been suggested SHMMAX be set to 4294967295 (4GB). This may not seem appropriate,particularly if the system has considerably less physical memory available,but it does prevent you from having to modify your system kernel everytime a new instance is created or additional physical memory is added to the system. Remember that SHMMAX is a high water mark,meaning that the OS will attempt to allow up to that amount for an application.
解决缺少共享内存段的问题
Quick fix for resolving lack of shared memory errors:
-----------------------------------------------------------------------------------
NOTE: If you have never configured your OS kernel for shared memory,you cannot employ this 'Quick Fix'. You will have to first configure the OS kernel. The amount of shared memory Oracle requests is roughly equal to the size of the SGA. The first and most simple fix is to modify the init.ora to reduce the amount of shared memory Oracle will try to grab at instance startup.
This document lists the init.ora parameters that contribute to the size
of the SGA:
[NOTE:1008866.6] HOW TO DETERMINE SGA SIZE (8.0,8i,7.x)
oracle崩溃之后重新启动失败的问题
My instance crashed. When I try to restart it,I receive errors related to shared
memory. What should I do?
-----------------------------------------------------------------------------------
This may indicate that the shared memory segment associated with the SGA of the crashed instance is still in memory. In this case it may be appropriate to manually remove the segment using OS commands.
THIS PROCESS SHOULD NOT BE ATTEMPTED UNLESS YOU FULLY UNDERSTAND THE CONCEPTS BEHIND IT!!!
The basic steps are:
1. Identify the shared memory segment that is 'stuck' in memory.
2. Remove the 'stuck' shared memory segment using the OS command 'ipcrm'.
[NOTE:68281.1] DETERMINING WHICH INSTANCE OWNS WHICH SHARED MEMORY & SEMAPHORE SEGMENTS
[NOTE:69642.1] also describes this process - Step 9.
[NOTE:123322.1] SYSRESV UTILITY: This note describes the new 8i 'sysresv' utility that can be used on Solaris to associate a given ORACLE_SID with it's shared memory segment(s). .
4.3.2 Oracle 性能优化之内核的shmall 和shmmax 参数
1. 内核的 shmall 和 shmmax 参数
SHMMAX= 配置了最大的内存segment的大小 ——>这个设置的比SGA_MAX_SIZE大比较好。
SHMMAX参数:Linux进程可以分配的单独共享内存段的最大值。一般设置为内存总大小的一半。这个值的设置应该大于SGA_MAX_TARGET或MEMORY_MAX_TARGET的值,因此对于安装Oracle数据库的系统,shmmax的值应该比内存的二分之一大一些。
SHMMIN= 最小的内存segment的大小 。
SHMMNI= 整个系统的内存segment的总个数 。设置系统级最大共享内存段数量。Oracle10g推荐最小值为4096,可以适当比4096增加一些。
SHMSEG= 每个进程可以使用的内存segment的最大个数
shmall=是全部允许使用的共享内存大小,shmmax 是单个段允许使用的大小。这两个可以设置为内存的 90%。例如 16G 内存,16*1024*1024*1024*90% = 15461882265,shmall 的大小为 15461882265/4k(getconf PAGESIZE可得到) = 3774873。
shmall设置共享内存总页数。这个值太小有可能导致数据库启动报错。很多人调整系统内核参数的时候只关注SHMMAX参数,而忽略了SHMALL参数的设置。
-
2.配置信号灯( semphore )的参数
信号灯semaphores是进程或线程间访问共享内存时提供同步的计数器。
SEMMSL= 设置每个信号灯组中信号灯最大数量,推荐的最小值是250。对于系统中存在大量并发连接的系统,推荐将这个值设置为PROCESSES初始化参数加10。
SEMMNI= 设置系统中信号灯组的最大数量。Oracle10g和11g的推荐值为142。
SEMMNS=设置系统中信号灯的最大数量。操作系统在分配信号灯时不会超过LEAST(SEMMNS,SEMMSL*SEMMNI)。事实上,如果SEMMNS的值超过了SEMMSL*SEMMNI是非法的,因此推荐SEMMNS的值就设置为SEMMSL*SEMMNI。Oracle推荐SEMMNS的设置不小于32000,假如数据库的PROCESSES参数设置为600,则SEMMNS的设置应为:
sql> select (600+10)*142 from dual;
(600+10)*142
------------
86620
1
2
3
4
5
SEMOPM参数:设置每次系统调用可以同时执行的最大信号灯操作的数量。由于一个信号灯组最多拥有SEMMSL个信号灯,因此有推荐将SEMOPM设置为SEMMSL的值。Oracle验证的10.2和11.1的SEMOPM的配置为100。
通过下面的命令可以检查信号灯相关配置:
# cat /proc/sys/kernel/sem
250 32000 100 128
1
2
对应的4个值从左到右分别为SEMMSL、SEMMNS、SEMOPM和SEMMNI
-
3.修改 /etc/sysctl.conf
kernel.shmmax=15461882265
kernel.shmall=3774873
kernel.msgmax=65535
kernel.msgmnb=65535
执行 sudo sysctl -p
可以使用 ipcs -l 看结果,ipcs -u 可以看到实际使用的情况
-------------------------------------------------------------------------
About Me
.............................................................................................................................................
● 本文作者:小麦苗,部分内容整理自网络,若有侵权请联系小麦苗删除
● 本文在itpub(http://blog.itpub.net/26736162/abstract/1/)、博客园(http://www.cnblogs.com/lhrbest)和个人微信公众号(xiaomaimiaolhr)上有同步更新
● 本文itpub地址:http://blog.itpub.net/26736162/viewspace-2147273/
● 本文博客园地址:http://www.cnblogs.com/lhrbest/p/7838547.html
● 本文pdf版、个人简介及小麦苗云盘地址:http://blog.itpub.net/26736162/viewspace-1624453/
● 数据库笔试面试题库及解答:http://blog.itpub.net/26736162/viewspace-2134706/
● DBA宝典今日头条号地址:http://www.toutiao.com/c/user/6401772890/#mid=1564638659405826
.............................................................................................................................................
● QQ群号:230161599(满)、618766405
● 微信群:可加我微信,我拉大家进群,非诚勿扰
● 联系我请加QQ好友(646634621),注明添加缘由
● 于 2017-11-01 09:00 ~ 2017-11-30 22:00 在魔都完成
● 文章内容来源于小麦苗的学习笔记,部分整理自网络,若有侵权或不当之处还请谅解
● 版权所有,欢迎分享本文,转载请保留出处
.............................................................................................................................................
● 小麦苗的微店:https://weidian.com/s/793741433?wfr=c&ifr=shopdetail
● 小麦苗出版的数据库类丛书:http://blog.itpub.net/26736162/viewspace-2142121/
.............................................................................................................................................
使用微信客户端扫描下面的二维码来关注小麦苗的微信公众号(xiaomaimiaolhr)及QQ群(DBA宝典),学习最实用的数据库技术。
小麦苗的微信公众号 小麦苗的DBA宝典QQ群2 《DBA笔试面宝典》读者群 小麦苗的微店
.............................................................................................................................................