日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 运维知识 > linux >内容正文

linux

Linux监控   Nagios

發布時間:2025/3/18 linux 34 豆豆
生活随笔 收集整理的這篇文章主要介紹了 Linux监控   Nagios 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

Linux監控 ?

Nagios ?

?

1 什么是監控? 監視控制

?

2 監控誰? ???????各種服務器

?

3 監控什么? ????網絡流量(eth0,eth1) ??服務的狀態(運行,停止) ?硬件資源 cpu ?內存 存儲 ???

??????????????????????????系統運行情況(總數,運行,休眠,僵尸) ??

?

?

?

[root@room1pc01 桌面]# uptime

?09:15:02 up ?15min, ?4 users, ?load average: 0.01, 0.10, 0.08

??????????????????(數越大,說明在線時間越長,越好)

?

[root@room1pc01 桌面]# top ?(看cpu)

?

?

?

4 如何監控? ???使用命令 ??編寫腳本 ????監控軟件

?

自動監控:

計劃任務+監控腳本 (chkconfig crond on)

搭建監控服務器(軟件 ?Nagios ?Cacti ?Zabbix)

?

?

5如何接收報警信息(郵件,短信,微信,即時消息)

?

++++++++++++++++++++++++++++++++

配置監控服務器的步驟:

1. ?部署服務運行環境 httpd(nginx或Apache,php) yum -y install httpd php

2. ?安裝提供服務的軟件

2.1 安裝準備

2.2 安裝監控服務軟件 和 監控插件軟件

2.3 修改服務配置文件

2.4 啟動監控服務

3. ?配置監控服務

3.1 配置監控遠端服務器

3.2 配置監控自己(特殊情況,公司小只有一個網站服務器時,網站和監控服務配在一臺上,所以監控本機網站)

不監控本機的交換分區

監控引導分區的使用情況

監控ftp的運行狀態

指定接收報警信息的郵箱地址是nagios@localhost

4 ?配置監控報警

5 ?查看監控信息

?

?

?

?

設置查看訪問監控頁面的認證用戶名和密碼(不能隨便指) nagiosadmin

nagios服務的監控過程:服務運行時調用插件,調用插件時,可以設置插件監控的閥值,nagios服務把監控到的值和插件指定的值比較,根據比較結果顯示監控狀態》

插件監控的閥值種類:警告值 錯誤值(值的多少由運維人員指定)

監控狀態: ?ok正常 ?warning警告狀態 ?critical 嚴重錯誤 ??unknow配置錯誤 ?pending正在監控

監控到的數據小于警告值 ok ??大于 警告值且小于錯誤值顯示warning ???大于錯誤值 顯示critical

nagios服務默認監控本機哪些資源?

cpu負載

登錄系統總用戶數

網站服務運行狀態

主機是否在線

跟分區使用量

sshd服務運行狀態

?

[root@99 libexec]# ./check_users -h (每個插件都會提供幫助)

?

?

?

_____________________________________________________

———————————————————————————————————————————————————-

搭建Nagios服務器 (LAP)環境 99監控服務器 ?真機客戶機

————————————————————————————————————————————————————

_____________________________________________________

[root@99 ~]# yum -y httpd php

?

[root@99 ~]# service httpd restart ,chkconfig httpd on

停止 httpd: ??????????????????????????????????????????????[失敗]

正在啟動 httpd:httpd: Could not reliably determine the server's fully qualified domain name, using 0.0.0.99 for ServerName

???????????????????????????????????????????????????????????[確定]

[root@99 ~]# echo 123 >/var/www/html/index.html

[root@99 ~]# yum -y install elinks

?

[root@99 ~]# elinks --dump http://localhost

???123

?

[root@99 ~]# vim /var/www/html/test.php

<?php

phpinfo();

?>

[root@room1pc01 桌面]# firefox http:192.168.4.99/test.php ?(瀏覽器中輸這個一樣http://192.168.4.99/test.php )

?

?

[root@99 ~]# unzip nagios.zip

[root@99 ~]# cd nagios

[root@99 nagios]# ls

nagios-3.2.1.tar.gz ??????????nrpe-2.12.tar.gz

nagios-plugins-1.4.14.tar.gz ?ntop-3.3.7.tar.gz

?

[root@99 nagios]# yum -y install gcc gcc-c++

?

[root@99 nagios]# useradd nagios ?(默認服務,監控進程的用戶名,組)

[root@99 nagios]# groupadd nagcmd

[root@99 nagios]# usermod -G nagcmd nagios

?

[root@99 nagios-3.2.1]# ./configure --help

?

[root@99 nagios-3.2.1]# ./configure --with-nagios-user=nagios --with-nagios-group=nagcmd ??--with-command-user=nagios ?--with-command-group=nagcmd

[root@99 nagios-3.2.1]# make all

?

[root@99 nagios-3.2.1]# make install

?

[root@99 nagios-3.2.1]# ls /usr/local/nagios

bin ?libexec ?sbin ?share ?var

[root@99 nagios-3.2.1]# make install-init

/usr/bin/install -c -m 755 -d -o root -g root /etc/rc.d/init.d

/usr/bin/install -c -m 755 -o root -g root daemon-init /etc/rc.d/init.d/nagios

?

*** Init script installed ***

?

[root@99 nagios-3.2.1]# ls /etc/rc.d/init.d/nagios ?(啟動腳本安裝在這)

/etc/rc.d/init.d/nagios

[root@99 nagios-3.2.1]# make install-commandmode

/usr/bin/install -c -m 775 -o nagios -g nagcmd -d /usr/local/nagios/var/rw

chmod g+s /usr/local/nagios/var/rw

?

*** External command directory configured ***

?

[root@99 nagios-3.2.1]# make install-config

?

[root@99 nagios-3.2.1]# ls /usr/local/nagios/etc

cgi.cfg ?nagios.cfg ?objects ?resource.cfg

[root@99 nagios-3.2.1]# make install-webconf

?

[root@99 nagios-3.2.1]# ll /etc/rc.d/init.d/nagios (啟動腳本這有)

-rwxr-xr-x. 1 root root 5178 3月 ??9 02:00 /etc/rc.d/init.d/nagios

[root@99 nagios-3.2.1]# ll /etc/init.d/nagios ??(這下面也有啟動腳本)

-rwxr-xr-x. 1 root root 5178 3月 ??9 02:00 /etc/init.d/nagios

?

[root@99 nagios-3.2.1]# /etc/init.d/nagios ??status (查看服務狀態)

No lock file found in /usr/local/nagios/var/nagios.lock

[root@99 nagios-3.2.1]# /etc/init.d/nagios ??start ?(啟動)

Starting nagios: done.

[root@99 nagios-3.2.1]# /etc/init.d/nagios ??status (再查看)

nagios (pid 6807) is running.

?

[root@room1pc01 桌面]# http://192.168.4.99/nagios

[root@99 nagios-3.2.1]# service httpd restart

[root@room1pc01 桌面]# http://192.168.4.99/nagios ?打開的瀏覽器 選擇serves選項

?

?

[root@99 nagios-3.2.1]# vim /etc/httpd/conf.d/nagios.conf

?22 ???AuthUserFile /usr/local/nagios/etc/htpasswd.users (查看認證用戶在的文件)

?

[root@99 nagios-3.2.1]# ls /usr/local/nagios/etc/htpasswd.users (文件不存在 需要創建)

ls: 無法訪問/usr/local/nagios/etc/htpasswd.users: 沒有那個文件

[root@99 nagios-3.2.1]# which htpasswd

/usr/bin/htpasswd

[root@99 nagios-3.2.1]# rpm -qf /usr/bin/htpasswd

httpd-tools-2.2.15-45.el6.x86_64

?

[root@99 nagios-3.2.1]# htpasswd -h (查看幫助)

?

[root@99 nagios-3.2.1]# htpasswd -c /usr/local/nagios/etc/htpasswd.users nagiosadmin (創建監控認證的用戶密碼,寫入到文件)

New password:

Re-type new password:

Adding password for user nagiosadmin

[root@99 nagios-3.2.1]# ls /usr/local/nagios/etc/htpasswd.users (文件創建成功)

/usr/local/nagios/etc/htpasswd.users

?

[root@99 nagios-3.2.1]# cat /usr/local/nagios/etc/htpasswd.users (查看內容 用戶)

nagiosadmin:SdiDPECEPUFkM

?

[root@room1pc01 桌面]# http://192.168.4.99/nagios

輸入帳號 ?密碼

?

?

?

按裝插件:(用圖形界面監控,最上面是命令操作顯示)

[root@99 nagios-3.2.1]# cd /usr/local/nagios/

[root@99 nagios]# tar -zxvf nagios-plugins-1.4.14.tar.gz

?

[root@99 nagios]# cd nagios-plugins-1.4.14

?

[root@99 nagios-plugins-1.4.14]# ./configure && make && make install

?

[root@99 nagios-plugins-1.4.14]# ls /usr/local/nagios/libexec/ (查看插件是否安裝好)

?

[root@room1pc01 桌面] 瀏覽器 刷新 再點serves 等一下

?

?

監控插件的使用

/usr/local/nagios/libexec/插件名 -h

?

[root@99 libexec]# ./check_users -w 1 ?-c 2

USERS CRITICAL - 3 users currently logged in |users=3;1;2;0 (報錯)

?

[root@99 libexec]# ./check_http -h

?

[root@99 libexec]# ./check_http -I 192.168.4.254

HTTP WARNING: HTTP/1.1 403 Forbidden - 5159 bytes in 0.047 second response time |time=0.047352s;;;0.000000 size=5159B;;;0

[root@99 libexec]# ./check_http -I 192.168.4.254 -p 80

HTTP WARNING: HTTP/1.1 403 Forbidden - 5159 bytes in 0.002 second response time |time=0.001934s;;;0.000000 size=5159B;;;0

[root@99 libexec]# ./check_http -I 192.168.4.254 -p 80

拒絕連接

HTTP CRITICAL - Unable to open TCP socket

[root@99 libexec]# ./check_http -I localhost -p 80

HTTP OK: HTTP/1.1 200 OK - 271 bytes in 0.001 second response time |time=0.001469s;;;0.000000 size=271B;;;0

?

[root@99 libexec]# ./check_ping ?-H 192.168.4.253 ?-w 10,50% ???-c 10,60%

PING OK - Packet loss = 0%, RTA = 0.15 ms|rta=0.152000ms;10.000000;10.000000;0.000000 pl=0%;50;60;0

?

?

[root@99 libexec]# ./check_disk -h

[root@99 libexec]# df -h

Filesystem ???????????Size ?Used Avail Use% Mounted on

/dev/mapper/VolGroup-lv_root

???????????????????????47G ?1.6G ??43G ??4% /

tmpfs ????????????????499M ????0 ?499M ??0% /dev/shm

/dev/vda1 ????????????477M ??36M ?416M ??8% /boot

[root@99 libexec]# ./check_disk -w 50% ?-c 40% ??-p /

DISK OK - free space: / 43791 MB (96% inode=98%);| /=1560MB;23893;28671;0;47786

?

[root@99 libexec]# dd if=/dev/zero of=/boot/test.txt bs=1M count=400

記錄了400+0 的讀入

記錄了400+0 的寫出

419430400字節(419 MB)已復制,7.00428 秒,59.9 MB/秒

[root@99 libexec]# ./check_disk -w 50% ?-c 40% ??-p /boot

DISK CRITICAL - free space: /boot 15 MB (3% inode=99%);| /boot=435MB;238;285;0;476

[root@99 libexec]# ./check_disk -w 50% ?-c 40% ??-p /dev/shm

DISK OK - free space: /dev/shm 498 MB (100% inode=99%);| /dev/shm=0MB;249;298;0;498

?

[root@99 libexec]# ./check_ssh -p 22 localhost

SSH OK - OpenSSH_5.3 (protocol 2.0)

[root@99 libexec]# ./check_ssh ?192.168.4.254

SSH OK - OpenSSH_5.3 (protocol 2.0)

?

[root@99 libexec]# ./check_swap

?

[root@99 libexec]# ./check_procs -h

?

?

[root@99 libexec]# ./check_procs -w 100 -c 110

PROCS OK: 100 processes

[root@99 libexec]# ./check_procs -w 90 -c 110

PROCS WARNING: 100 processes

[root@99 libexec]# ./check_procs -w 90 -c 95

PROCS CRITICAL: 100 processes

?

[root@99 libexec]# ./check_procs ?-w 60 -c 65 -s R

PROCS OK: 0 processes with STATE = R

[root@99 libexec]# ./check_procs ?-w 60 -c 65 -s Z

PROCS OK: 0 processes with STATE = Z

[root@99 libexec]# ./check_procs ?-w 60 -c 65 -s ZR

PROCS OK: 0 processes with STATE = ZR

?

有些服務沒有專屬插件,可以根據端口號來指定(都是tcp):

[root@99 libexec]# ./check_tcp -H 192.168.4.254 -p 80

TCP OK - 0.000 second response time on port 80|time=0.000351s;;;0.000000;10.000000

[root@99 libexec]# ./check_tcp -H 192.168.4.254 -p 25

拒絕連接

[root@99 libexec]# ./check_tcp -H 192.168.4.254 -p 22

TCP OK - 0.000 second response time on port 22|time=0.000243s;;;0.000000;10.000000

?

?

?

____修改服務配置文件可以自己增加監控服務 (先增加監控本機服務)_____________________

?

[root@99 nagios-3.2.1]# ls /usr/local/nagios

bin ?etc ?libexec ?sbin ?share ?var

?

libexec:插件目錄

bin:可執行命令

etc:配置文件

sbin:cgi文件(點監控頁面時出現的各種頁面)

var:變化的,日志 緩存

share:配置文件

?

nagios服務配置文件說明?

?

[root@99 nagios]# ?/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg ??(檢查配置文件是否有錯)

?

[root@99 nagios]# vim /usr/local/nagios/etc/nagios.cfg

??19 log_file=/usr/local/nagios/var/nagios.log

??30 cfg_file=/usr/local/nagios/etc/objects/commands.cfg ?(設置服務運行時使用的監控插件)

??31 cfg_file=/usr/local/nagios/etc/objects/contacts.cfg ???(設置接收報警消息的郵箱地址)

??32 cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg ??(定義監控時間模板的文件—)

??33 cfg_file=/usr/local/nagios/etc/objects/templates.cfg ????(定義監控模板的配置文件)

?36 cfg_file=/usr/local/nagios/etc/objects/localhost.cfg ?????(這個文件監控本機的配置文件,是里面的內容容器導致,不是文件名可以隨變)

?

/usr/local/nagios/etc/resource.cfg 宏定義 ??$USER1$(插件定義目錄)

?

_______________________

?

1)

[root@99 ~]# vim ?/usr/local/nagios/etc/objects/commands.cfg ?(設置增加監控服務使用的插件)

?

define command {

????????command_name monitor_localhost_boot

?????????command_line ?/usr/local/nagios/libexec/check_disk -w 20% -c 10% ?-p ?/boot

} (這四行和下面四行只要寫一個就行。這用的是常量,下面變量)

?

define command {

????????command_name monitor_localhost_boot

?????????command_line ?/usr/local/nagios/libexec/check_disk -w $ARG1$ -c $ARG2$ ?-p ?$ARG3$

}

?

define command {

?????command_name monitor_localhost_ftp

?????command_line $USER1$/check_ftp -H localhost -p 21

?

}

?

2)設置接收報警消息的郵箱地址

[root@99 nagios]# /etc/init.d/postfix status

master (pid ?1766) 正在運行...

?

[root@99 ~]# vim /usr/local/nagios/etc/objects/contacts.cfg (設置接收報警消息的郵箱地址,先郵件服務開啟,如果設置的是163,一定要先能163能接收到)

?

3)定義監控時間模板的文件

[root@99 ~]# vim /usr/local/nagios/etc/objects/timeperiods.cfg

?28 define timeperiod{

?29 ????????timeperiod_name 24x7

?30 ????????alias ??????????24 Hours A Day, 7 Days A Week

?31 ????????sunday ?????????00:00-24:00

?32 ????????monday ?????????00:00-24:00

?33 ????????tuesday ????????00:00-24:00

?34 ????????wednesday ??????00:00-24:00

?35 ????????thursday ???????00:00-24:00

?36 ????????friday ?????????00:00-24:00

?37 ????????saturday ???????00:00-24:00

?38 ????????}

?39

?40

?41 # 'workhours' timeperiod definition

?42 define timeperiod{

?43 ????????timeperiod_name workhours

?44 ????????alias ??????????Normal Work Hours

?45 ????????monday ?????????09:00-17:00

?46 ????????tuesday ????????09:00-17:00

?47 ????????wednesday ??????09:00-17:00

?48 ????????thursday ???????09:00-17:00

?49 ????????friday ?????????09:00-17:00

?50 ????????}

?

?

4)定義監控模板的配置文件

[root@99 ~]# vim /usr/local/nagios/etc/objects/templates.cfg

?

5)(這個文件監控本機的配置文件,是里面的內容容器導致,不是文件名,文件名可以隨變起)把增加的監控本機的服務寫在監控本機配置文件中(在底下寫)

[root@99 ~]# vim /usr/local/nagios/etc/objects/localhost.cfg

?

define host{

????????use ????????????????????linux-server ???????????; Name of host template to use

; This host definition will inherit all variables that are defined

; in (or inherited by) the linux-server host template definition.

????????host_name ??????????????localhost

????????alias ??????????????????localhost

????????address ????????????????127.0.0.1

????????}

?

define service{

????????use ????????????????????????????local-service ????????; Name of service template to use

????????host_name ??????????????????????localhost

????????service_description ????????????PING

check_commandcheck_ping!100.0,20%!500.0,60%

????????}

?

.........................

.........................

?

##################################myset################################333

?

define service{

????????use ????????????????????????????local-service ????????; Name of service template to use

????????host_name ??????????????????????localhost

????????service_description ????????????boot

????????check_command ??????????????????monitor_localhost_boot!10%!15%!/boot

????????notifications_enabled ??????????0

????????}

?

define service{

????????use ????????????????????????????local-service ????????; Name of service template to use

????????host_name ??????????????????????localhost

????????service_description ????????????ftp

????????check_command ??????????????????monitor_localhost_ftp

????????notifications_enabled ??????????0

????????}

?

[root@99 nagios]# ?/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg 測試配置文件語法是否有錯

?

6)設置訪問cgi配置用戶

[root@99 nagios]# cd /usr/local/nagios/sbin/

[root@99 sbin]# ls

avail.cgi ??extinfo.cgi ???????outages.cgi ?statuswml.cgi ?tac.cgi

cmd.cgi ????history.cgi ???????showlog.cgi ?statuswrl.cgi

config.cgi ?notifications.cgi ?status.cgi ??summary.cgi

?

[root@99 etc]# vim /usr/local/nagios/etc/cgi.cfg ??設置訪問cgi配置用戶

119 authorized_for_system_information=nagiosadmin ?(登錄的用戶名)

?

?

[root@room1pc01 桌面]# http://192.168.4.99/nagios 在瀏覽器中再點server刷新一下 等幾分鐘會再增加監控的服務

?

————————————————————————————————————————————————————————————————

—————————修改配置文件,增加一個監控遠程主機———————————————————————————————————————

監控遠端主機:

監控遠端主機的公有數據192.168.4.98

監控遠端主機的私有數據(服務狀態)

網站 ftp 數據庫 sshd ?

?

?

?

監控遠端公有數據:

?

?

[root@99 sbin]# /usr/local/nagios/libexec/check_http -H 192.168.4.98 -p 80

拒絕連接

HTTP CRITICAL - Unable to open TCP socket

?

[root@99 sbin]# vim /usr/local/nagios/etc/nagios.cfg ??????(1.在nagios主配置文件中,寫上監控服務器的文件名,起服務會自動加載這)

??36 cfg_file=/usr/local/nagios/etc/objects/localhost.cfg (監控本機)

??37 cfg_file=/usr/local/nagios/etc/objects/otherser.cfg ?(監控遠程的主機)

?

[root@99 objects]# vim /usr/local/nagios/etc/objects/otherser.sh ?(2.寫容器定義監控遠端主機的哪些服務)

?

define host{

????????use ????????????????????linux-server

????????host_name ??????????????server98

????????address ????????????????192.168.4.98

????????}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????httpd

????????check_command ???????????????monitor_server98_httpd

????????}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????sshd

????????check_command ???????????????monitor_server98_sshd

?

[root@99 objects]# vim /usr/local/nagios/etc/objects/commands.cfg ???????(3.設置監控遠端服務使用的插件,) ?

..................................

..................................

######################monitor##################################################

?

define command {

?????command_name monitor_server98_httpd

?????command_line $USER1$/check_httpd -H 192.168.4.98 -p 80

}

?

define command {

?????command_name monitor_server98_ftp

?????command_line $USER1$/check_ftp -H 192.168.4.98 -p 21

}

?

?

測試:會查到除了監控自己也會監控遠端。

[root@room1pc01 桌面]# firefox http://192.168.4.99/nagios

?

?

[root@99 objects]# hostname localhost (退出再進,收郵件時,主機名如果是數字會有影響)

[root@localhost ~]# /etc/init.d/nagios restart

?

[root@localhost ~]# mail -u nagios

"/var/mail/nagios": 2 messages 2 unread

>U ?1 nagios@localhost.loc ?Thu Mar ?9 08:44 ?32/924 ??"** PROBLEM Service Alert: server98/httpd is CRITICAL **"

?U ?2 nagios@localhost.loc ?Thu Mar ?9 08:47 ?32/894 ??"** PROBLEM Service Alert: server98/sshd is CRITICAL **"

?

————————————————————————————————————————————————————————————————————————————————

?

監控遠端主機的私有數據(磁盤 進程 用戶)

總數,運行,休眠,僵尸

1 ?在被監控的主機上安裝 監控插件 ?(192.168.4.98)

#yum ?-y install gcc ??gcc-c++

250 ?tar -zxvf nagios-plugins-1.4.14.tar.gz

cd nagios-plugins-1.4.14

??257 ?./configure

??258 ?make

??259 ?make install

262 ?ls /usr/local/nagios/libexec/

?

[root@stu ~]# /usr/local/nagios/libexec/check_users -w 3 -c 5

USERS OK - 2 users currently logged in |users=2;3;5;0

?

[root@stu ~]#/usr/local/nagios/libexec/check_procs -w 50 -c 60 ?-s ?Z

PROCS OK: 0 processes with STATE = Z

?

[root@stu ~]# /usr/local/nagios/libexec/check_procs -w 50 -c 60 ?-s ?R

PROCS OK: 0 processes with STATE = R

?

[root@stu ~]# /usr/local/nagios/libexec/check_procs -w 50 -c 60 ??(總進程)

PROCS CRITICAL: 103 processes

?

[root@stu ~]# /usr/local/nagios/libexec/check_disk -w 50% -c30% ?-p ?/

DISK OK - free space: / 42137 MB (92% inode=97%);| /=3215MB;23893;33450;0;47786

?

2 ?在被監控的主機上運行nrpe服務

#useradd ?nagios

#rpm ?-q openssl ??openssl-devel

#tar -zxvf nrpe-2.12.tar.gz

#cd nrpe-2.12

#./configure

#make all

#make install-plugin

#make install-daemon

#make install-daemon-config

#make install-xinetd

#vim /etc/xinetd.d/nrpe

only_from ??????= 127.0.0.1 ?172.40.50.99

:wq

?

#vim /etc/services

nrpe ???????????5666/tcp# NRPE

:wq

?

#yum -y ?install xinetd

#service ?xinetd start

# chkconfig xinetd on

#netstat -utnalp ?| grep :5666

?

3.改nrpe服務的主配置文件,設置獲取本地的私有數據(192.168.4.98)

?

?

vim /usr/local/nagios/etc/nrpe.cfg

#command[命令名]=本機使用的插件

199 command[check_nrpe_users]=/usr/local/nagios/libexec/check_users -w 2 -c 5

200 command[check_nrpe_load]=/usr/local/nagios/libexec/check_load -w 15,10,5 -c ????30,25,20

201 command[check_nrpe_hda1]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% ????-p /dev/vda1

202 command[check_nrpe_root]=/usr/local/nagios/libexec/check_disk -w 20% -c 10% ????-p /

204 command[check_nrpe_zombie_procs]=/usr/local/nagios/libexec/check_procs -w 5 ????-c 10 -s Z

205 command[check_nrpe_total_procs]=/usr/local/nagios/libexec/check_procs -w 150 ????-c 200

?

:wq

?

# /etc/init.d/xinetd restart

?

?

?

驗證nrpe命令(192.168.4.98)

/usr/local/nagios/libexec/check_nrpe -H localhost -c ?check_nrpe_root

?

?

4.在監控服務器上配置(192.168.4.99)

?

#/usr/local/nagios/libexec/check_nrpe -H 192.168.4.98 ??-p 5666 ?-c ?check_nrpe_users ?(報錯,沒那個目錄)

?

1)安裝依賴包

# yum -y install openssl openssl-devel

2)

# tar -zxvf ?nrpe-2.12.tar.gz

?

# cd nrpe-2.12

#./configure

?

?#make all

#make install-plugin

?

?#ls /usr/local/nagios/libexec/check_nrpe

?

?

?

3)在命令行下測試

?

?

# /usr/local/nagios/libexec/check_nrpe -H 192.168.4.98 -p 5666 -c check_nrpe_users

USERS OK - 2 users currently logged in |users=2;2;5;0

?

# /usr/local/nagios/libexec/check_nrpe -H 192.168.4.98 -p 5666 -c check_nrpe_load

OK - load average: 0.00, 0.00, 0.00|load1=0.000;15.000;30.000;0; load5=0.000;10.000;25.000;0; load15=0.000;5.000;20.000;0;

?

?

4)把連接nrpe服務的插件定義成nagios服務可以使用的監控命令

?

# vim /usr/local/nagios/etc/objects/commands.cfg

...............................

?

define command {

?????command_name monitor_server98_boot

?????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c check_nrpe_boot

}

?

define command {

?????command_name monitor_server98_load

?????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c ?check_nrpe_load

}

?

define command {

?????command_name monitor_server98_users

?????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c ?check_nrpe_load

}

?

define command {

?????command_name monitor_server98_root

?????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c ?check_nrpe_root

}

?

define command {

?????command_name monitor_server98_zombie

?????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c ?check_nrpe_zombie_proc

}

?

define command {

?????command_name monitor_server98_total_procs

????command_line $USER1$/check_nrpe -H 192.168.4.98 -p 5666 -c ?check_nrpe_total_procs

}

?

5)在監控主機配置文件中調用定義的監控命令

?

# vim /usr/local/nagios/etc/objects/otherser.cfg

?

#################################private###########################

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

??????service_description ????????????users

????????check_command ??????????????????monitor_server98_users

}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????root

????????check_command ??????????????????monitor_server98_root

}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????total_procs

????????check_command ??????????????????monitor_server98_total_procs

}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????boot

????????check_command ??????????????????monitor_server98_boot

}

?

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????load

????????check_command ??????????????????monitor_server98_load

}

define service{

????????use ????????????????????????????local-service

????????host_name ??????????????????????server98

????????service_description ????????????zombie

????????check_command ??????????????????monitor_server98_zombie

}

?

?

?

6)測試配置文件是否有語法錯誤

?

# alias plj='/usr/local/nagios/bin/nagios -v /usr/local/nagios/etc/nagios.cfg'

# plj

?

?

7)重啟監控服務

?

# /etc/init.d/nagios restart

?

8)查看監控信息

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?

?


轉載于:https://blog.51cto.com/12346621/1905292

總結

以上是生活随笔為你收集整理的Linux监控   Nagios的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。