合肥的网站建设剂屏,百度如何网站,网站搭建在线支付,wordpress基本设置mt 内存监控#xff1a;mcecheck.pyraid监控#xff1a; check-raidmcelog 是 x86 的 Linux 系统上用来检查硬件错误#xff0c;特别是内存和CPU错误的工具。安装方式yum install mcelog运行mcelog查看日志方式/var/log/mcelogMCE 0HARDWARE ERROR. This is NOT a software …mt 内存监控mcecheck.pyraid监控 check-raidmcelog 是 x86 的 Linux 系统上用来检查硬件错误特别是内存和CPU错误的工具。安装方式yum install mcelog运行mcelog查看日志方式/var/log/mcelogMCE 0HARDWARE ERROR. This is NOT a software problem!Please contact your hardware vendorCPU 1 BANK 8 TSC 1193fd60c6699 [at 2000 Mhz 1 days 18:56:49 uptime (unreliable)]MISC 8f44960800095840 ADDR 4a9f3b1c0MCG status:MCi status:Error overflowMCi_MISC register validMCi_ADDR register validMCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERRTransaction: Memory read errorMemory read ECC errorMemory corrected error count (CORE_ERR_CNT): 18Memory transaction Tracker ID (RTId): 40Memory DIMM ID of error: 1Memory channel ID of error: 0Memory ECC syndrome: f449608STATUS cc0004800001009f MCGSTATUS 0作为一个企业服务器管理员面对服务器莫名宕机或者主动重启历经折磨后判断为内存问题引起可当看到内存多达几十条时难道要单条测试要真这样估计领导也要废了你吧。有没方便有效的方法去速度定位那个DIMM槽内存或者在日常监测内存正常与否呢下面介绍下linux系统下的监控方法--MCElog。What are Machine Check Exceptions (or MCE)?A machine check exception is an error dedected by your systems processor. There are 2 major types of MCE errors, a notice or warning error, and a fatal execption. The warning will be logged by a Machine Check Event logged notice in your system logs, and can be later viewed via some Linux utilities. A fatal MCE will cause the machine to stop responding and the details of the MCE will be printed out to the systems console.What causes MCE errors?There most common reason for MCE events to occur are:1.Memory errors or Error Correction Code (ECC) problems2.Inadequate cooling / processor over-heating3.System bus errors4.Cache errors in the processor or hardware##一般来说当有错误提示时需要优先注意内存问题但由于现在内存控制器是集成在cpu里所以有个别情况是由CPU问题引起的##Installmcelog-1.0_pre3_p20120918.tar.gzMcelog安装#tar -zxvf mcelog-1.0_pre3_p20120918.tar.gz 解压出来#cd andikleen-mcelog-0f5d023 进入解压出来的文件夹#make#make install 编译和安装Mcelog相关文件/dev/mcelog 设备文件/var/log/mcelog messages日志文件/etc/mcelog/mcelog.conf配置文件/var/run/mcelog.pid默认故障日志只记录在/var/log/mcelog并不记录到系统日志中。如果需要在系统日志中也体现需修改/etc/mcelog/mcelog.conf文件将前面#去掉并保存。Mcelog相关设置1.mcelog的随系统启动查看boot下的config文件可以看到mce模块随机启动2.配置mcelog后台运行#mcelog --daemon3.查看mcelog日志文件由于各厂家服务器内存槽位设计可能不同这边关于错误中的cpu0 bank5内存槽位定位不做讨论。