[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Kernel/Memory problem.



Hi list.
I just had my P233 512 MB RAM 2 9.1GB 10K RPM SCSI disk box crushed !.

In the message log i got the following lines :

Jun 24 03:03:00 proxyint kernel: Uhhuh. NMI received. Dazed and confused,
but trying to continue
Jun 24 03:03:00 proxyint kernel: You probably have a hardware problem with
your RAM chips
Jun 24 03:03:00 proxyint kernel: Unable to handle kernel paging request at
virtual address 99ce6fc8
Jun 24 03:03:00 proxyint kernel: current->tss.cr3 = 00101000, %cr3 =
00101000
Jun 24 03:03:00 proxyint kernel: *pde = 00000000
Jun 24 03:03:00 proxyint kernel: Oops: 0000
Jun 24 03:03:00 proxyint kernel: CPU:    0
Jun 24 03:03:00 proxyint kernel: EIP:    0010:[kmem_cache_free+61/353]
Jun 24 03:03:00 proxyint kernel: EFLAGS: 00010082
Jun 24 03:03:00 proxyint kernel: eax: 000000bc   ebx: d9ce69c0   ecx:
99ce6fc0   edx: d9ce6a7c
Jun 24 03:03:00 proxyint kernel: esi: dff9b980   edi: 00000286   ebp:
00000000   esp: d9c79e38
Jun 24 03:03:00 proxyint kernel: ds: 0018   es: 0018   ss: 0018
Jun 24 03:03:00 proxyint kernel: Process squid (pid: 17995, process nr: 32,
stackpage=d9c79000)
Jun 24 03:03:00 proxyint kernel: Stack: d9ce6a1c 00000000 d9ce6a7c d16e562c
c014c21d dff9b980 d9ce69c0 d9ce69c0
Jun 24 03:03:00 proxyint kernel:        c014c2c9 d9ce69c0 df478444 c8ad80c0
c0160e08 d9ce69c0 df478400 d4e957fc
Jun 24 03:03:00 proxyint kernel:        c8ad80c0 00000003 00008218 d4e957fc
df478400 d4e957fc c016e0c1 df478400
Jun 24 03:03:00 proxyint kernel: Call Trace: [kfree_skbmem+50/61]
[__kfree_skb+161/167] [tcp_close+208/601] [inet_release+122/130]
[sock_release+31/80] [sock_close+50/57] [__fput+31/69]
Jun 24 03:03:00 proxyint kernel:        [fput+23/68] [filp_close+80/89]
[do_exit+288/616] [do_signal+487/601] [force_sig_info+121/129]
[force_sig+17/21] [do_page_fault+807/883] [error_code+45/52]
Jun 24 03:03:00 proxyint kernel:        [signal_return+20/24]
Jun 24 03:03:00 proxyint kernel: Code: 8b 69 08 81 fd 2b 2f c3 a5 0f 85 d0
00 00 00 8b 69 0c 85 ed
Jun 24 03:03:00 proxyint kernel: Unable to handle kernel paging request at
virtual address 00005d9c
Jun 24 03:03:00 proxyint kernel: current->tss.cr3 = 0ca43000, %cr3 =
0ca43000
Jun 24 03:03:00 proxyint kernel: *pde = 00000000
Jun 24 03:03:00 proxyint kernel: Oops: 0000
Jun 24 03:03:00 proxyint kernel: CPU:    0
Jun 24 03:03:00 proxyint kernel: EIP:    0010:[fput+5/68]
Jun 24 03:03:00 proxyint kernel: EFLAGS: 00010282
Jun 24 03:03:00 proxyint kernel: eax: 00000000   ebx: 00005d80   ecx:
00000000   edx: 00000000
Jun 24 03:03:00 proxyint kernel: esi: 00000000   edi: 00000000   ebp:
00001000   esp: dab3dfac
Jun 24 03:03:00 proxyint kernel: ds: 0018   es: 0018   ss: 0018
Jun 24 03:03:00 proxyint kernel: Process dnsserver (pid: 18003, process nr:
36, stackpage=dab3d000)
Jun 24 03:03:00 proxyint kernel: Stack: 00005d80 dab3c000 401498c0 bffffb58
bffffa64 c010a0d4 00000000 40015000
Jun 24 03:03:00 proxyint kernel:        00001000 401498c0 bffffb58 bffffa64
00000003 0000002b 0000002b 00000003
Jun 24 03:03:00 proxyint kernel:        40101ad4 00000023 00000202 bffffa4c
0000002b
Jun 24 03:03:00 proxyint kernel: Call Trace: [system_call+52/56]
Jun 24 03:03:00 proxyint kernel: Code: 8b 43 1c 48 75 34 53 e8 8b 96 00 00
53 e8 8b ef ff ff c7 43
Jun 24 03:03:00 proxyint kernel: swap_duplicate: entry 40070000, offset
exceeds max
Jun 24 03:03:37 proxyint squid[458]: Squid Parent: child process 18062
started
Jun 24 03:03:00 proxyint kernel: VM: killing process dnsserver
Jun 24 03:03:37 proxyint kernel: swap_free: offset exceeds max
Jun 24 03:03:37 proxyint kernel: swap_free: swap-space map bad (entry
00070000)
Jun 24 03:03:37 proxyint kernel: swap_free: offset exceeds max
Jun 24 03:03:37 proxyint kernel: free_one_pmd: bad directory entry 40140000
Jun 24 03:03:37 proxyint kernel: free_one_pmd: bad directory entry bfff0000
Jun 24 03:03:37 proxyint kernel: free_one_pmd: bad directory entry 100f0000
Jun 24 03:03:37 proxyint kernel: magic (corrupt) (name=size-128)
Jun 24 03:03:37 proxyint kernel: kmem_alloc: Bad slab magic (corrupt)
(name=size-128)
Jun 24 03:05:08 proxyint last message repeated 292 times
Jun 24 03:05:37 proxyint kernel: magic (corrupt) (name=size-128)
Jun 24 03:05:37 proxyint kernel: kmem_alloc: Bad slab magic (corrupt)
(name=size-128)
Jun 24 03:06:37 proxyint last message repeated 71 times
Jun 24 03:06:37 proxyint kernel: kmem_alloc: Bad slab magic (magic (corrupt)
(name=size-128)
Jun 24 03:06:37 proxyint kernel: kmem_alloc: Bad slab magic (corrupt)
(name=size-128)
Jun 24 03:06:50 proxyint last message repeated 292 times
Jun 24 03:12:37 proxyint kernel: d slab magic (corrupt) (name=size-128)
Jun 24 03:14:37 proxyint kernel: kmem_alloc: Bad slab magic (corrupt)
(name=size-128)

I notice that there is a message there about maybe a problem with my RAM.
Is there any tests that i can run in order to verify that the RAM is really
the thing that made the kernel to crush ?

I also got some screen messages that said that the swap/paging mechanism
stop working and that for some reason the system killed the idle process.

Few days earlier to this error i fixed my swap and i saw that it did in fact
used it.
(i used cat /proc/swaps, free and top) I have 4 partitions of 512 MB each, 1
primary and 3 logical.

I will send the other errors as soon as i will be at the office. (tomorrow
hopefully)


Thanks,

Mike



=================================================================
To unsubscribe, send mail to linux-il-request@linux.org.il with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail linux-il-request@linux.org.il