Bonjour,

Je pense avoir un problème de configuration sur ma machine qui me la fait tombé, mais je ne sais pas du tout d'ou cela peux venir j'ai cependant 2 pistes que je peux vous proposer pour m'aider .

La première:

- J'ai programmé 1 cron qui tourne toutes les 15 minutes et peux renvoyer entre 10 et 50 lignes de donnée qui sont généralement envoyé par mail au root... Pour évité cela j'ai rajouté dans mes cron en fin de commande "MA COMMANDE >/dev/null 2>&1" afin de ne plus recevoir de mail...

Pensez-vous que ce problème pourrais être lié à celui-ci trouvé sur comment ca marche :

et surtout comment cela peux ce produire et ce corriger ?

La seconde:

J'ai analyser les logs mais je ne sais pas si le problème scsi est lié au problème d'écriture ou si c'est lui qui en est la cause...

MESSAGE.LOG
Apr 1 22:15:26 rxxxxx kernel: scsi 2:0:0:0: Direct-Access IET VIRTUAL-DISK 0 PQ: 0 ANSI: 4
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] 20971520 512-byte hardware sectors (10737 MB)
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] Write Protect is off
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] 20971520 512-byte hardware sectors (10737 MB)
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] Write Protect is off
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
Apr 1 22:15:26 rxxxxx kernel: sda: sda1 sda2
Apr 1 22:15:26 rxxxxx kernel: sd 2:0:0:0: [sda] Attached SCSI disk
Apr 1 22:15:26 rxxxxx kernel: ReiserFS: sda1: warning: sh-2021: reiserfs_fill_super: can not find reiserfs on sda1
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: INFO: recovery required on readonly filesystem.
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: write access will be enabled during recovery.
Apr 1 22:15:26 rxxxxx kernel: kjournald starting. Commit interval 5 seconds
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: sda1: orphan cleanup on readonly fs
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: sda1: 23 orphan inodes deleted
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: recovery complete.
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: mounted filesystem with ordered data mode.
Apr 1 22:15:26 rxxxxx kernel: EXT3 FS on sda1, internal journal
Apr 1 22:15:26 rxxxxx kernel: kjournald starting. Commit interval 5 seconds
Apr 1 22:15:26 rxxxxx kernel: EXT3 FS on sda2, internal journal
Apr 1 22:15:26 rxxxxx kernel: EXT3-fs: mounted filesystem with ordered data mode.
...

Apr 2 08:25:43 rxxxxx kernel: connection1:0: iscsi: detected conn error (1011)
Apr 2 08:26:11 rxxxxx nagios2: Warning: A system time change of 6965 seconds (forwards in time) has been detected. Compensating...
Apr 2 08:26:11 rxxxxx kernel: iscsi: host reset succeeded
Apr 2 08:27:20 rxxxxx kernel: apache2 invoked oom-killer: gfp_mask=0x1201d2, order=0, oomkilladj=0
Apr 2 08:27:21 rxxxxx nagios2: HOST ALERT: localhost;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
Apr 2 08:27:22 rxxxxx kernel: Pid: 3013, comm: apache2 Not tainted 2.6.24.2-xxxx-std-ipv6-32 #4
Apr 2 08:27:22 rxxxxx nagios2: HOST ALERT: localhost;UP;SOFT;2;PING OK - Packet loss = 0%, RTA = 0.09 ms
Apr 2 08:27:22 rxxxxx kernel: [<c0142d98>] oom_kill_process+0x108/0x120
Apr 2 08:27:22 rxxxxx nagios2: SERVICE ALERT: localhost;Total Processes;CRITICAL;SOFT;1;PROCS CRITICAL: 1026 processes
Apr 2 08:27:22 rxxxxx kernel: [<c0142f56>] out_of_memory+0xd6/0x110
Apr 2 08:27:22 rxxxxx kernel: [<c0144790>] __alloc_pages+0x250/0x340
Apr 2 08:27:22 rxxxxx kernel: [<c0114dcf>] try_to_wake_up+0x1bf/0x2e0
Apr 2 08:27:22 rxxxxx kernel: [<c01471df>] __do_page_cache_readahead+0xff/0x150
Apr 2 08:27:22 rxxxxx kernel: [<c01472f6>] do_page_cache_readahead+0x46/0x70
Apr 2 08:27:22 rxxxxx kernel: [<c0140726>] filemap_fault+0x266/0x330
Apr 2 08:27:22 rxxxxx kernel: [<c0116b2e>] __wake_up+0x3e/0x60
Apr 2 08:27:22 rxxxxx kernel: [<c014f8ce>] __do_fault+0x7e/0x3c0
Apr 2 08:27:22 rxxxxx kernel: [<c01cebd0>] ext3_file_write+0x30/0xc0
Apr 2 08:27:22 rxxxxx kernel: [<c014ffa0>] handle_mm_fault+0x130/0x300
Apr 2 08:27:22 rxxxxx kernel: [<c012ece0>] autoremove_wake_function+0x0/0x50
Apr 2 08:27:22 rxxxxx kernel: [<c0111afb>] do_page_fault+0x13b/0x790
Apr 2 08:27:22 rxxxxx kernel: [<c049ac56>] net_tx_action+0xa6/0xf0
Apr 2 08:27:22 rxxxxx kernel: [<c0160a1b>] vfs_write+0xeb/0x110
Apr 2 08:27:22 rxxxxx kernel: [<c016da54>] sys_poll+0x34/0x80
Apr 2 08:27:22 rxxxxx kernel: [<c01119c0>] do_page_fault+0x0/0x790
Apr 2 08:27:22 rxxxxx kernel: [<c058e372>] error_code+0x72/0x78
Apr 2 08:27:22 rxxxxx kernel: [<c0580000>] sctp_ulpq_order+0x140/0x1b0
Apr 2 08:27:22 rxxxxx kernel: =======================
Apr 2 08:27:22 rxxxxx kernel: Mem-info:
Apr 2 08:27:22 rxxxxx kernel: DMA per-cpu:
Apr 2 08:27:22 rxxxxx kernel: CPU 0: Hot: hi: 0, btch: 1 usd: 0 Cold: hi: 0, btch: 1 usd: 0
Apr 2 08:27:22 rxxxxx kernel: Normal per-cpu:
Apr 2 08:27:22 rxxxxx kernel: CPU 0: Hot: hi: 186, btch: 31 usd: 124 Cold: hi: 62, btch: 15 usd: 13
Apr 2 08:27:22 rxxxxx kernel: Active:95886 inactive:723 dirty:0 writeback:0 unstable:0
Apr 2 08:27:22 rxxxxx kernel: free:1131 slab:9103 mapped:440 pagetables:6059 bounce:0
Apr 2 08:27:22 rxxxxx kernel: DMA free:1920kB min:92kB low:112kB high:136kB active:6412kB inactive:0kB present:16256kB pages_scanned:14982 all_unreclaimable? yes
...

Apr 2 08:27:22 rxxxxx kernel: 9103 pages slab
Apr 2 08:27:22 rxxxxx kernel: 6059 pages pagetables
Apr 2 08:27:42 rxxxxx nagios2: SERVICE ALERT: localhost;Total Processes;OK;SOFT;2;PROCS OK: 120 processes
Apr 2 08:29:32 rxxxxx nagios2: SERVICE ALERT: localhost;Current Load;CRITICAL;SOFT;1;CRITICAL - load average: 26.82, 109.85, 115.41
Apr 2 08:30:32 rxxxxx nagios2: SERVICE ALERT: localhost;Current Load;CRITICAL;SOFT;2;CRITICAL - load average: 10.11, 89.92, 108.21
Apr 2 08:31:32 rxxxxx nagios2: SERVICE ALERT: localhost;Current Load;CRITICAL;SOFT;3;CRITICAL - load average: 3.78, 73.57, 101.44
Apr 2 09:05:43 rxxxxx nagios2: SERVICE ALERT: localhost;Current Load;CRITICAL;HARD;4;CRITICAL - load average: 1.95, 60.31, 95.13
Apr 2 09:05:43 rxxxxx nagios2: SERVICE NOTIFICATION: root;localhost;Current Load;CRITICAL;notify-by-email;CRITICAL - load average: 1.95, 60.31, 95.13
Apr 2 09:06:02 rxxxxx nagios2: Warning: A system time change of 1988 seconds (forwards in time) has been detected. Compensating...
Apr 2 09:09:39 rxxxxx nagios2: Auto-save of retention data completed successfully.
Apr 2 09:17:39 rxxxxx nagios2: Warning: The check of service 'PING' on host 'gateway' looks like it was orphaned (results never came back). I'm scheduling
an immediate check of the service...
KERN.LOG
Apr 1 22:40:07 rxxxxx kernel: ReiserFS: sda1: warning: sh-2021: reiserfs_fill_super: can not find reiserfs on sda1
Apr 1 22:40:07 rxxxxx kernel: kjournald starting. Commit interval 5 seconds
Apr 1 22:40:07 rxxxxx kernel: EXT3-fs: mounted filesystem with ordered data mode.
Apr 1 22:40:07 rxxxxx kernel: EXT3 FS on sda1, internal journal
Apr 1 22:40:07 rxxxxx kernel: eth0: no IPv6 routers present
Apr 1 22:40:07 rxxxxx kernel: kjournald starting. Commit interval 5 seconds
Apr 1 22:40:07 rxxxxx kernel: EXT3 FS on sda2, internal journal
Apr 1 22:40:07 rxxxxx kernel: EXT3-fs: mounted filesystem with ordered data mode.
Apr 2 08:25:43 rxxxxx kernel: connection1:0: iscsi: detected conn error (1011)
Apr 2 08:26:11 rxxxxx kernel: iscsi: host reset succeeded
Apr 2 08:27:20 rxxxxx kernel: apache2 invoked oom-killer: gfp_mask=0x1201d2, order=0, oomkilladj=0
Apr 2 08:27:22 rxxxxx kernel: Pid: 3013, comm: apache2 Not tainted 2.6.24.2-xxxx-std-ipv6-32 #4
Apr 2 08:27:22 rxxxxx kernel: [<c0142d98>] oom_kill_process+0x108/0x120
Apr 2 08:27:22 rxxxxx kernel: [<c0142f56>] out_of_memory+0xd6/0x110
Apr 2 08:27:22 rxxxxx kernel: [<c0144790>] __alloc_pages+0x250/0x340
Apr 2 08:27:22 rxxxxx kernel: [<c0114dcf>] try_to_wake_up+0x1bf/0x2e0
Apr 2 08:27:22 rxxxxx kernel: [<c01471df>] __do_page_cache_readahead+0xff/0x150
Apr 2 08:27:22 rxxxxx kernel: [<c01472f6>] do_page_cache_readahead+0x46/0x70
Apr 2 08:27:22 rxxxxx kernel: [<c0140726>] filemap_fault+0x266/0x330
Apr 2 08:27:22 rxxxxx kernel: [<c0116b2e>] __wake_up+0x3e/0x60
Apr 2 08:27:22 rxxxxx kernel: [<c014f8ce>] __do_fault+0x7e/0x3c0
Apr 2 08:27:22 rxxxxx kernel: [<c01cebd0>] ext3_file_write+0x30/0xc0
Apr 2 08:27:22 rxxxxx kernel: [<c014ffa0>] handle_mm_fault+0x130/0x300
Apr 2 08:27:22 rxxxxx kernel: [<c012ece0>] autoremove_wake_function+0x0/0x50
Apr 2 08:27:22 rxxxxx kernel: [<c0111afb>] do_page_fault+0x13b/0x790
Apr 2 08:27:22 rxxxxx kernel: [<c049ac56>] net_tx_action+0xa6/0xf0
Apr 2 08:27:22 rxxxxx kernel: [<c0160a1b>] vfs_write+0xeb/0x110
Apr 2 08:27:22 rxxxxx kernel: [<c016da54>] sys_poll+0x34/0x80
Apr 2 08:27:22 rxxxxx kernel: [<c01119c0>] do_page_fault+0x0/0x790
Apr 2 08:27:22 rxxxxx kernel: [<c058e372>] error_code+0x72/0x78
Apr 2 08:27:22 rxxxxx kernel: [<c0580000>] sctp_ulpq_order+0x140/0x1b0
SYSLOG

Apr 2 08:25:43 rxxxxx kernel: connection1:0: iscsi: detected conn error (1011)
Apr 2 08:25:43 rxxxxx iscsid: Kernel reported iSCSI connection 1:0 error (1011) state (3)
Apr 2 08:25:43 rxxxxx iscsid: connect failed (113)
Apr 2 08:25:51 rxxxxx named[2440]: Cleaned cache of 59 RRsets
Apr 2 08:26:11 rxxxxx nagios2: Warning: A system time change of 6965 seconds (forwards in time) has been detected. Compensating...
Apr 2 08:26:11 rxxxxx /USR/SBIN/CRON[3342]: (root) CMD (run-parts /usr/local/oco/bin/60sec >/dev/null 2>/dev/null)
Apr 2 08:26:11 rxxxxx /USR/SBIN/CRON[3341]: (root) CMD (/usr/local/rtm/bin/rtm 2 > /dev/null 2> /dev/null)
Apr 2 08:26:11 rxxxxx /USR/SBIN/CRON[3344]: (root) CMD (run-parts /usr/local/oco/bin/120sec >/dev/null 2>/dev/null)
Apr 2 08:26:11 rxxxxx /USR/SBIN/CRON[3343]: (root) CMD (/usr/local/rtm/bin/rtm 2 > /dev/null 2> /dev/null)
Apr 2 08:26:11 rxxxxx kernel: iscsi: host reset succeeded
Apr 2 08:26:11 rxxxxx /USR/SBIN/CRON[3345]: (root) CMD (run-parts /usr/local/oco/bin/60sec >/dev/null 2>/dev/null)
Apr 2 08:26:11 rxxxxx iscsid: connect failed (113)
....

Apr 2 08:27:13 rxxxxx iscsid: connect failed (113)
Apr 2 08:27:20 rxxxxx kernel: apache2 invoked oom-killer: gfp_mask=0x1201d2, order=0, oomkilladj=0
Apr 2 08:27:20 rxxxxx mysqld_safe[4453]: Number of processes running now: 0
Apr 2 08:27:20 rxxxxx iscsid: connect failed (113)
Apr 2 08:27:21 rxxxxx nagios2: HOST ALERT: localhost;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
Apr 2 08:27:22 rxxxxx postfix/pickup[3189]: 0365A1FF29: uid=0 from=<root>
Apr 2 08:27:22 rxxxxx kernel: Pid: 3013, comm: apache2 Not tainted 2.6.24.2-xxxx-std-ipv6-32 #4
Apr 2 08:27:22 rxxxxx mysqld_safe[5840]: restarted
Apr 2 08:27:22 rxxxxx iscsid: connect failed (113)
Apr 2 08:27:22 rxxxxx nagios2: HOST ALERT: localhost;UP;SOFT;2;PING OK - Packet loss = 0%, RTA = 0.09 ms
Apr 2 08:27:22 rxxxxx /USR/SBIN/CRON[5844]: (root) CMD (/usr/local/rtm/bin/rtm 2 > /dev/null 2> /dev/null)
Apr 2 08:27:22 rxxxxx /USR/SBIN/CRON[5845]: (root) CMD (run-parts /usr/local/oco/bin/60sec >/dev/null 2>/dev/null)
Apr 2 08:27:22 rxxxxx kernel: [<c0142d98>] oom_kill_process+0x108/0x120
Apr 2 08:27:22 rxxxxx iscsid: connect failed (113)
Apr 2 08:27:22 rxxxxx nagios2: SERVICE ALERT: localhost;Total Processes;CRITICAL;SOFT;1;PROCS CRITICAL: 1026 processes
Apr 2 08:27:22 rxxxxx kernel: [<c0142f56>] out_of_memory+0xd6/0x110
Apr 2 08:27:22 rxxxxx postfix/cleanup[5838]: warning: connect to mysql server 127.0.0.1: Can't connect to MySQL server on '127.0.0.1' (111)