 SCSI disk error : host 1 channel 0 id 4 - L440GX , Adaptec AIC7xx and RedHat SCSI Errors

I've been having ongoing problems with an Intel L440GX (Lancewood)
board, with both the RAID controller and the onboard Adaptec aic7896
controller.

We worked through the RAID problem by chucking out the Intel SRCU21
RAID controller and replacing it with an Adaptec 2100S, which so far
seems to have sorted that problem out (the SRCU was randomly {*filter*},
dumping running processes into a 'D' state, forcing a power cycle to
reboot).

The problem now is with the onboard Adaptec controller, which is hooked
up to an IBM Ultrastar. Having got past all of the install problems with
RH7.3 (had to use the dd and apic options to prevent the installer
{*filter*}), I'm seeing errors in /var/log/messages relating to the
controller (it's also the root drive).

For example :

--------------------- sample /var/log/messages ---------

Jun  3 14:43:57 bashir proftpd[12455]: 192.168.0.28
(192.168.0.11[192.168.0.11]) - FTP session opened.
Jun  3 14:52:03 bashir kernel: loop: loaded (max 8 devices)
Jun  3 15:02:36 bashir kernel: (scsi1:A:4:0): Unexpected busfree in
Data-out phase
Jun  3 15:02:36 bashir kernel: SEQADDR == 0x54
Jun  3 15:02:37 bashir kernel: SCSI disk error : host 1 channel 0 id 4
lun 0 return code = 10000
Jun  3 15:02:37 bashir kernel:  I/O error: dev 08:01, sector 40208
Jun  3 15:02:37 bashir kernel: SCSI disk error : host 1 channel 0 id 4
lun 0 return code = 10000
Jun  3 15:02:37 bashir kernel:  I/O error: dev 08:01, sector 40216
Jun  3 15:03:36 bashir kernel: scsi1:0:4:0: Attempting to queue an
ABORT message
Jun  3 15:03:36 bashir kernel: scsi1: Dumping Card State while idle,
at SEQADDR 0x9
Jun  3 15:03:36 bashir kernel: ACCUM = 0x0, SINDEX = 0x6f, DINDEX =
0xe4, ARG_2 = 0x0
Jun  3 15:03:36 bashir kernel: HCNT = 0x0 SCBPTR = 0xc
Jun  3 15:03:36 bashir kernel: SCSISEQ = 0x12, SBLKCTL = 0x6
Jun  3 15:03:36 bashir kernel:  DFCNTRL = 0x0, DFSTATUS = 0x89
Jun  3 15:03:36 bashir kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0x80
Jun  3 15:03:38 bashir kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Jun  3 15:03:38 bashir kernel: STACK == 0x3, 0x107, 0x15f, 0x8a
Jun  3 15:03:38 bashir kernel: SCB count = 140
Jun  3 15:03:39 bashir kernel: Kernel NEXTQSCB = 56
Jun  3 15:03:39 bashir kernel: Card NEXTQSCB = 56
Jun  3 15:03:39 bashir kernel: QINFIFO entries:
Jun  3 15:03:39 bashir kernel: Waiting Queue entries:
Jun  3 15:03:39 bashir kernel: Disconnected Queue entries: 10:4 16:119
9:91 25:19 22:32
Jun  3 15:03:39 bashir kernel: QOUTFIFO entries:
Jun  3 15:03:40 bashir kernel: Sequencer Free SCB List: 12 20 29 2 4 3
5 24 0 1 31 21 30 11 8 14 15 28 18 17 7 6 13 27 26 23 19
Jun  3 15:03:40 bashir kernel: Sequencer SCB Info: 0(c 0x60, s 0x47, l
0, t 0xff) 1(c 0x60, s 0x47, l 0, t 0xff) 2(c 0x60, s 0x47, l 0, t
0xff) 3(c 0x60, s 0x47, l 0, t 0xff) 4(c 0x60, s 0x47, l 0, t 0xff)
5(c 0x60, s <<ETC ETC>>
Jun  3 15:03:40 bashir kernel: xff) 31(c 0x60, s 0x47, l 0, t 0xff)
Jun  3 15:03:40 bashir kernel: Pending list: 4(c 0x60, s 0x47, l 0),
119(c 0x60, s 0x47, l 0), 91(c 0x60, s 0x47, l 0), 19(c 0x60, s 0x47,
l 0), 32(c 0x60, s 0x47, l 0)
Jun  3 15:03:40 bashir kernel: Kernel Free SCB list: 111 138 67 76 92
39 6 13 81 20 108 30 105 93 35 64 122 118 116 44 78 36 98 96 107 62 29
97 0 75 5 34 17 43 7 90 55 2 53 61 99 70 38 125 120 28 106 12 123 87
110  <<ETC ETC>>
Jun  3 15:03:40 bashir kernel: DevQ(0:4:0): 0 waiting
Jun  3 15:03:40 bashir kernel: (scsi1:A:4:0): Queuing a recovery SCB
Jun  3 15:03:40 bashir kernel: scsi1:0:4:0: Device is disconnected,
re-queuing SCB
Jun  3 15:03:40 bashir kernel: Recovery code sleeping
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): Abort Tag Message Sent
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): SCB 32 - Abort Tag
Completed.
Jun  3 15:03:41 bashir kernel: Recovery SCB completes
Jun  3 15:03:41 bashir kernel: Recovery code awake
Jun  3 15:03:41 bashir kernel: aic7xxx_abort returns 0x2002
Jun  3 15:03:41 bashir kernel: scsi1:0:4:0: Attempting to queue an
ABORT message
Jun  3 15:03:41 bashir kernel: scsi1: Dumping Card State while idle,
at SEQADDR 0x8
Jun  3 15:03:41 bashir kernel: ACCUM = 0x0, SINDEX = 0x20, DINDEX =
0xe4, ARG_2 = 0x0
Jun  3 15:03:41 bashir kernel: HCNT = 0x0 SCBPTR = 0x16
Jun  3 15:03:41 bashir kernel: SCSISEQ = 0x12, SBLKCTL = 0x6
Jun  3 15:03:41 bashir kernel:  DFCNTRL = 0x0, DFSTATUS = 0x89
Jun  3 15:03:41 bashir kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0x80
Jun  3 15:03:41 bashir kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Jun  3 15:03:41 bashir kernel: STACK == 0x3, 0x107, 0x15f, 0xe6
Jun  3 15:03:41 bashir kernel: SCB count = 140
Jun  3 15:03:41 bashir kernel: Kernel NEXTQSCB = 56
Jun  3 15:03:41 bashir kernel: Card NEXTQSCB = 56
Jun  3 15:03:41 bashir kernel: QINFIFO entries:
Jun  3 15:03:41 bashir kernel: Waiting Queue entries:
Jun  3 15:03:41 bashir kernel: Disconnected Queue entries: 10:4 16:119
9:91 25:19
Jun  3 15:03:41 bashir kernel: QOUTFIFO entries:
Jun  3 15:03:41 bashir kernel: Sequencer Free SCB List: 22 12 20 29 2
4 3 5 24 0 1 31 21 30 11 8 14 15 28 18 17 7 6 13 27 26 23 19
Jun  3 15:03:41 bashir kernel: Sequencer SCB Info: 0(c 0x60, s 0x47, l
0, t 0xff) 1(c 0x60, s 0x47, l 0, t 0xff) 2(c 0x60, s 0x47, l 0, t
0xff) 3(c 0x60, s 0x47, l 0, t 0xff) 4(c 0x60, s 0x47, l 0, t 0xff)
5(c 0x60, s 0x47,  <<ETC ETC>>
Jun  3 15:03:41 bashir kernel: xff) 31(c 0x60, s 0x47, l 0, t 0xff)
Jun  3 15:03:41 bashir kernel: Pending list: 4(c 0x60, s 0x47, l 0),
119(c 0x60, s 0x47, l 0), 91(c 0x60, s 0x47, l 0), 19(c 0x60, s 0x47,
l 0)
Jun  3 15:03:41 bashir kernel: Kernel Free SCB list: 32 111 138 67 76
92 39 6 13 81 20 108 30 105 93 35 64 122 118 116 44 78 36 98 96 107 62
29 97 0 75 5 34 17 43 7 90 55 2 53 61 99 70 38 125 120 28 106 12 123
87 110 3 48 52 132 112  <<ETC ETC>>
Jun  3 15:03:41 bashir kernel: DevQ(0:4:0): 0 waiting
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): Queuing a recovery SCB
Jun  3 15:03:41 bashir kernel: scsi1:0:4:0: Device is disconnected,
re-queuing SCB
Jun  3 15:03:41 bashir kernel: Recovery code sleeping
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): Abort Tag Message Sent
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): SCB 19 - Abort Tag
Completed.
Jun  3 15:03:41 bashir kernel: Recovery SCB completes
Jun  3 15:03:41 bashir kernel: Recovery code awake
Jun  3 15:03:41 bashir kernel: aic7xxx_abort returns 0x2002
Jun  3 15:03:41 bashir kernel: scsi1:0:4:0: Attempting to queue an
ABORT message
Jun  3 15:03:41 bashir kernel: scsi1: Dumping Card State while idle,
at SEQADDR 0x8
Jun  3 15:03:41 bashir kernel: ACCUM = 0x0, SINDEX = 0x13, DINDEX =
0xe4, ARG_2 = 0x0
Jun  3 15:03:41 bashir kernel: HCNT = 0x0 SCBPTR = 0x19
Jun  3 15:03:41 bashir kernel: SCSISEQ = 0x12, SBLKCTL = 0x6
Jun  3 15:03:41 bashir kernel:  DFCNTRL = 0x0, DFSTATUS = 0x89
Jun  3 15:03:41 bashir kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0x80
Jun  3 15:03:41 bashir kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Jun  3 15:03:41 bashir kernel: STACK == 0x3, 0x107, 0x15f, 0xe6
Jun  3 15:03:41 bashir kernel: SCB count = 140
Jun  3 15:03:41 bashir kernel: Kernel NEXTQSCB = 56
Jun  3 15:03:41 bashir kernel: Card NEXTQSCB = 56
Jun  3 15:03:41 bashir kernel: QINFIFO entries:
Jun  3 15:03:41 bashir kernel: Waiting Queue entries:
Jun  3 15:03:41 bashir kernel: Disconnected Queue entries: 10:4 16:119
9:91
Jun  3 15:03:41 bashir kernel: QOUTFIFO entries:
Jun  3 15:03:41 bashir kernel: Sequencer Free SCB List: 25 22 12 20 29
2 4 3 5 24 0 1 31 21 30 11 8 14 15 28 18 17 7 6 13 27 26 23 19
Jun  3 15:03:41 bashir kernel: Sequencer SCB Info: 0(c 0x60, s 0x47, l
0, t 0xff) 1(c 0x60, s 0x47, l 0, t 0xff) 2(c 0x60, s 0x47, l 0, t
0xff) 3(c 0x60, s 0x47, l 0, t 0xff) 4(c 0x60, s 0x47, l 0, t 0xff)
5(c 0x60, s 0x47, l 0, t 0xff) 6(c 0x60 <<ETC ETC>>
Jun  3 15:03:41 bashir kernel: Pending list: 4(c 0x60, s 0x47, l 0),
119(c 0x60, s 0x47, l 0), 91(c 0x60, s 0x47, l 0)
Jun  3 15:03:41 bashir kernel: Kernel Free SCB list: 19 32 111 138 67
76 92 39 6 13 81 20 108 30 105 93 35 64 122 118 116 44 78 36 98 96 107
62 29 97 0 75 5 34 17 43 7 90 55 2 53 61 99 70 38 125 120 28 106 12
123 87 110 3 48 52 132 112 77 68 72 14 102  <<ETC ETC>>
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): Queuing a recovery SCB
Jun  3 15:03:41 bashir kernel: scsi1:0:4:0: Device is disconnected,
re-queuing SCB
Jun  3 15:03:41 bashir kernel: Recovery code sleeping
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): Abort Tag Message Sent
Jun  3 15:03:41 bashir kernel: (scsi1:A:4:0): SCB 91 - Abort Tag
Completed.
Jun  3 15:03:41 bashir kernel: Recovery SCB completes
Jun  3 15:03:41 bashir kernel: Recovery code awake
Jun  3 15:03:41 bashir kernel: aic7xxx_abort returns 0x2002
Jun  3 15:03:41 bashir kernel: scsi1:0:4:0: Attempting to queue an
ABORT message
Jun  3 15:03:41 bashir kernel: scsi1: Dumping Card State while idle,
at SEQADDR 0x9
Jun  3 15:03:41 bashir kernel: ACCUM = 0x0, SINDEX = 0x5b, DINDEX =
0xe4, ARG_2 = 0x0
Jun  3 15:03:41 bashir kernel: HCNT = 0x0 SCBPTR = 0x9
Jun  3 15:03:41 bashir kernel: SCSISEQ = 0x12, SBLKCTL = 0x6
Jun  3 15:03:41 bashir kernel:  DFCNTRL = 0x0, DFSTATUS = 0x89
Jun  3 15:03:41 bashir kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0x80
Jun  3 15:03:41 bashir kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Jun  3 15:03:42 bashir kernel: STACK == 0x3, 0x107, 0x15f, 0xe6
Jun  3 15:03:42 bashir kernel: SCB count = 140
Jun  3 15:03:42 bashir kernel: Kernel NEXTQSCB = 56
Jun  3 15:03:42 bashir kernel: Card NEXTQSCB = 56
Jun  3 15:03:42 bashir kernel: QINFIFO entries:
Jun  3 15:03:42 bashir kernel: Waiting Queue entries:
Jun  3 15:03:42 bashir kernel: Disconnected Queue entries: 10:4 16:119
Jun  3 15:03:42 bashir kernel: QOUTFIFO entries:
Jun  3 15:03:42 bashir kernel: Sequencer Free SCB List: 9 25 22 12 20
29 2 4 3 5 24 0 1 31 21 30 11 8 14 15 28 18 17 7 6 13 27 26 23 19
Jun  3 15:03:42 bashir kernel: Sequencer SCB Info: 0(c 0x60, s 0x47, l
0, t 0xff) 1(c 0x60, s 0x47, l 0, t 0xff) 2(c 0x60, s 0x47, l 0, t
0xff) 3(c 0x60, s 0x47, l 0, t 0xff) 4(c 0x60, s 0x47, l 0, t 0xff)
5(c 0x60, s 0x47, l 0, t 0xff) 6(c 0x60, s 0x47, <<ETC ETC>>
Jun  3 15:03:42 bashir kernel: Pending list: 4(c 0x60, s 0x47, l 0),
119(c 0x60, s 0x47, l 0)
Jun  3 15:03:42 bashir kernel: Kernel Free SCB list: 91 19 32 111 138
67 76 92 39 6 13 81 20 108 30 105 93 35 64 122 118 116 44 78 36 98 96
107 62 29 97 0 75 5 34 17 43 7 90 55 2 53 61 99 70 38 125 120 28 106
12 123 87 110 3 48 52 132 112 77 68 72 14 102 15 8 11  <<ETC ETC>>
Jun  3 15:03:42 bashir kernel: (scsi1:A:4:0): Queuing a recovery SCB
Jun  3 15:03:42 bashir kernel: scsi1:0:4:0: Device is disconnected,
re-queuing SCB
Jun  3 15:03:42 bashir kernel: Recovery code sleeping
Jun  3 15:03:42 bashir kernel: (scsi1:A:4:0): Abort Tag Message Sent
Jun  3 15:03:42 bashir kernel: (scsi1:A:4:0): SCB 119 - Abort Tag
Completed.
Jun  3 15:03:42 bashir kernel: Recovery SCB completes
Jun  3 15:03:42 bashir kernel: Recovery code awake
Jun  3 15:03:42 bashir kernel: aic7xxx_abort returns 0x2002
Jun  3 15:03:42 bashir kernel: scsi1:0:4:0: Attempting to queue an
ABORT message
Jun  3 15:03:42 bashir kernel: scsi1: Dumping Card State while idle,
at SEQADDR 0x8
Jun  3 15:03:42 bashir kernel: ACCUM = 0x0, SINDEX = 0x77, DINDEX =
0xe4, ARG_2 = 0x0
Jun  3 15:03:42 bashir kernel: HCNT = 0x0 SCBPTR = 0x10
Jun  3 15:03:42 bashir kernel: SCSISEQ = 0x12, SBLKCTL = 0x6
Jun  3 15:03:42 bashir kernel:  DFCNTRL = 0x0, DFSTATUS = 0x89
Jun  3 15:03:42 bashir kernel: LASTPHASE = 0x1, SCSISIGI = 0x0,
SXFRCTL0 = 0x80
Jun  3 15:03:42 bashir kernel: SSTAT0 = 0x0, SSTAT1 = 0xa
Jun  3 15:03:42 bashir kernel: STACK == 0x3, 0x107, 0x15f, 0xe6
Jun  3 15:03:42 bashir kernel: SCB count = 140
Jun  3 15:03:42 bashir kernel: Kernel NEXTQSCB = 56
Jun  3 15:03:42 bashir kernel: Card NEXTQSCB = 56
Jun  3 15:03:42 bashir kernel: QINFIFO entries:
Jun  3 15:03:42 bashir kernel: Waiting Queue entries:
Jun  3 15:03:43 bashir kernel: Disconnected Queue entries: 10:4
Jun  3 15:03:43 bashir kernel: QOUTFIFO entries:
Jun  3 15:03:43 bashir kernel: Sequencer Free SCB List: 16 9 25 22 12
20 29 2 4 3 5 24 0 1 31 21 30 11 8 14 15 28 18 17 7 6 13 27 26 23 19
Jun  3 15:03:43 bashir kernel: Sequencer SCB Info: 0(c 0x60, s 0x47, l
0, t 0xff) 1(c 0x60, s 0x47, l 0, t 0xff) 2(c 0x60, s 0x47, l 0, t
0xff) 3(c 0x60, s 0x47, l 0, t 0xff) 4(c 0x60, s 0x47, l 0, t 0xff)
5(c 0x60, s 0x47, l 0, t 0xff) 6(c 0x60, s 0x47, l 0, t 0xff) 7 <<ETC
ETC>>
Jun  3 15:03:43 bashir kernel: Pending list: 4(c 0x60, s 0x47, l 0)
Jun  3 15:03:43 bashir kernel: Kernel Free SCB list: 119 91 19 32 111
138 67 76 92 39 6 13 81 20 108 30 105 93 35 64 122 118 116 44 78 36 98
96 107 62 29 97 0 75 5 34 17 43 7 90 55 2 53 61 99 70 38 125 120 28
106 12 123 87 110 3 48 52 132 112 77 68 72 14 102 15 8 11 114 126 22
<<ETC ETC>>
Jun  3 15:03:43 bashir kernel: DevQ(0:4:0): 0 waiting
Jun  3 15:03:43 bashir kernel: (scsi1:A:4:0): Queuing a recovery SCB
Jun  3 15:03:43 bashir kernel: scsi1:0:4:0: Device is disconnected,
re-queuing SCB
Jun  3 15:03:43 bashir kernel: Recovery code sleeping
Jun  3 15:03:43 bashir kernel: (scsi1:A:4:0): Abort Tag Message Sent
Jun  3 15:03:43 bashir kernel: (scsi1:A:4:0): SCB 4 - Abort Tag
Completed.
Jun  3 15:03:43 bashir kernel: Recovery SCB completes
Jun  3 15:03:43 bashir kernel: Recovery code awake
Jun  3 15:03:43 bashir kernel: aic7xxx_abort returns 0x2002
Jun  3 15:03:43 bashir kernel: (scsi1:A:4:0): Locking max tag count at
128
Jun  3 15:04:07 bashir kernel: (scsi1:A:4:0): Unexpected busfree in
Data-out phase
Jun  3 15:04:07 bashir kernel: SEQADDR == 0x1a0
Jun  3 15:04:07 bashir kernel: SCSI disk error : host 1 channel 0 id 4
lun 0 return code = 10000
Jun  3 15:04:07 bashir kernel:  I/O error: dev 08:01, sector 59952
Jun  3 15:04:07 bashir kernel: SCSI disk error : host 1 channel 0 id 4
lun 0 return code = 10000
Jun  3 15:04:07 bashir kernel:  I/O error: dev 08:01, sector 59960
Jun  3 15:14:36 bashir proftpd[12455]: 192.168.0.28
(192.168.0.11[192.168.0.11]) - FTP session closed

--------------------------------------------------------------------------------
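
For anyone wanting to pull the failing device and sectors out of a log like the one above, here is a minimal Python sketch. It assumes the wrapped lines have been rejoined, that the return code is printed in hex, and that it follows the usual Linux SCSI result layout (driver, host, message, status bytes from high to low). The names `split_result` and `summarise` are made up for illustration. Under those assumptions, `10000` has a host byte of 0x01, which the kernel's scsi.h names DID_NO_CONNECT.

```python
import re

# Lines copied from the sample above, with the wrapped lines rejoined.
SAMPLE = """\
Jun  3 15:02:36 bashir kernel: (scsi1:A:4:0): Unexpected busfree in Data-out phase
Jun  3 15:02:37 bashir kernel: SCSI disk error : host 1 channel 0 id 4 lun 0 return code = 10000
Jun  3 15:02:37 bashir kernel:  I/O error: dev 08:01, sector 40208
Jun  3 15:02:37 bashir kernel: SCSI disk error : host 1 channel 0 id 4 lun 0 return code = 10000
Jun  3 15:02:37 bashir kernel:  I/O error: dev 08:01, sector 40216
"""

DISK_ERR = re.compile(
    r"SCSI disk error : host (\d+) channel (\d+) id (\d+) lun (\d+) "
    r"return code = ([0-9a-fA-F]+)"
)
IO_ERR = re.compile(r"I/O error: dev ([0-9a-f]{2}):([0-9a-f]{2}), sector (\d+)")


def split_result(code_hex):
    """Split a SCSI result word into its four bytes.

    Assumption: the value is hex and laid out as driver/host/msg/status
    from the highest byte to the lowest.
    """
    value = int(code_hex, 16)
    return {
        "driver": (value >> 24) & 0xFF,
        "host": (value >> 16) & 0xFF,
        "msg": (value >> 8) & 0xFF,
        "status": value & 0xFF,
    }


def summarise(text):
    """Return (disk_errors, io_errors) found in a block of syslog text."""
    disk_errors, io_errors = [], []
    for line in text.splitlines():
        m = DISK_ERR.search(line)
        if m:
            host, channel, target, lun = (int(g) for g in m.groups()[:4])
            disk_errors.append((host, channel, target, lun, split_result(m.group(5))))
            continue
        m = IO_ERR.search(line)
        if m:
            # Major and minor are printed in hex, the sector in decimal.
            io_errors.append((int(m.group(1), 16), int(m.group(2), 16), int(m.group(3))))
    return disk_errors, io_errors


if __name__ == "__main__":
    disk_errors, io_errors = summarise(SAMPLE)
    print(disk_errors)
    print(io_errors)
```

Feeding the whole of /var/log/messages through `summarise` would give a quick list of affected sectors to compare between incidents.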

I don't know what other information may be relevant, but the basic
configuration is as follows:

System is an Intel L440GX (latest BIOS) with 1 GB RAM, 2x Intel PIII
(256 KB L2), and an Adaptec 2100S configured for RAID 1 with 3x IBM
Ultrastar, plus 1x Ultrastar as the boot drive.

O/S is Red Hat 7.3 (binary kernel) with the latest updates applied.

[root@bashir log]# uname -a
Linux bashir.fcpl.com 2.4.18-4smp #1 SMP Thu May 2 18:32:34 EDT 2002
i686 unknown

File system is ext3

[root@bashir log]# df -H
Filesystem            Size  Used Avail Use% Mounted on
/dev/sda1             9.0G  1.5G  7.1G  17% /
/dev/hda1             502M   24M  452M   5% /boot
/dev/hda3              37G   34M   35G   1% /ide
none                  528M     0  527M   0% /dev/shm
/dev/sdb1              18G   39M   17G   1% /raid
/dev/sda2             9.0G  1.3G  7.3G  15% /usr
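
The `dev 08:01` in the I/O errors can be tied back to the df listing above. A minimal sketch, assuming the pair is major:minor in hex, that major 8 is the first SCSI disk major, and that each disk owns 16 consecutive minors (the helper names are hypothetical):

```python
def parse_dev(text):
    """Parse the 'MM:mm' hex pair as printed in 'I/O error: dev 08:01'."""
    major, minor = text.split(":")
    return int(major, 16), int(minor, 16)


def sd_name(major, minor):
    """Map a (major, minor) pair to a /dev/sdXN style name.

    Assumptions: major 8 covers the first 16 SCSI disks (sda..sdp),
    each disk has 16 minors, and a minor that is a multiple of 16
    refers to the whole disk rather than a partition.
    """
    if major != 8:
        raise ValueError("only the first SCSI disk major (8) is handled")
    if not 0 <= minor <= 255:
        raise ValueError("minor out of range")
    disk, part = divmod(minor, 16)
    name = "/dev/sd" + chr(ord("a") + disk)
    return name + (str(part) if part else "")


if __name__ == "__main__":
    print(sd_name(*parse_dev("08:01")))
```

Under these assumptions `08:01` comes out as /dev/sda1, the root filesystem in the listing, and the pair (8, 17) comes out as /dev/sdb1.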

I have tried booting with APIC on and off, as well as adding
append=apic to the boot parameters, with no change (I have also
enabled verbose output on aic7xxx).

Output of dmesg is as follows :

Linux version 2.4.18-4smp (bhcomp...@stripples.devel.redhat.com) (gcc
version 2.96 20000731 (Red Hat Linux 7.3 2.96-110)) #1 SMP Thu May 2
18:32:34 EDT 2002
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 000000000009f400 (usable)
 BIOS-e820: 000000000009f400 - 00000000000a0000 (reserved)
 BIOS-e820: 00000000000e8000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000003fff0000 (usable)
 BIOS-e820: 000000003fff0000 - 000000003ffffc00 (ACPI data)
 BIOS-e820: 000000003ffffc00 - 0000000040000000 (ACPI NVS)
 BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved)
127MB HIGHMEM available.
found SMP MP-table at 000f6ab0
hm, page 000f6000 reserved twice.
hm, page 000f7000 reserved twice.
hm, page 0009f000 reserved twice.
hm, page 000a0000 reserved twice.
On node 0 totalpages: 262128
zone(0): 4096 pages.
zone(1): 225280 pages.
zone(2): 32752 pages.
Intel MultiProcessor Specification v1.4
    Virtual Wire compatibility mode.
OEM ID: INTEL    Product ID: Lancewood    APIC at: 0xFEE00000
Processor #1 Pentium(tm) Pro APIC version 17
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Processors: 2
Kernel command line: auto BOOT_IMAGE=2.418-4smp ro root=801
BOOT_FILE=/boot/vmlinuz-2.4.18-4smp apic aic7xxx=verbose
Initializing CPU#0
Detected 995.697 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1985.74 BogoMIPS
Memory: 1030440k/1048512k available (1232k kernel code, 17684k
reserved, 842k data, 304k init, 131008k highmem)
Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes)
Inode cache hash table entries: 65536 (order: 7, 524288 bytes)
Mount-cache hash table entries: 16384 (order: 5, 131072 bytes)
Buffer cache hash table entries: 65536 (order: 6, 262144 bytes)
Page-cache hash table entries: 262144 (order: 8, 1048576 bytes)
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.40 (20010327) Richard Gooch (rgo...@atnf.csiro.au)
mtrr: detected mtrr type: Intel
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
Intel machine check reporting enabled on CPU#0.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
CPU0: Intel Pentium III (Coppermine) stepping 06
per-CPU timeslice cutoff: 731.66 usecs.
task migration cache decay timeout: 10 msecs.
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Booting processor 1/0 eip 2000
Initializing CPU#1
masked ExtINT on CPU#1
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Calibrating delay loop... 1985.74 BogoMIPS
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
Intel machine check reporting enabled on CPU#1.
CPU:     After generic, caps: 0383fbff 00000000 00000000 00000000
CPU:             Common caps: 0383fbff 00000000 00000000 00000000
CPU1: Intel Pentium III (Coppermine) stepping 06
Total of 2 processors activated (3971.48 BogoMIPS).
ENABLING IO-APIC IRQs
Setting 2 in the phys_id_present_map
...changing IO-APIC physical APIC ID to 2 ... ok.
init IO_APIC IRQs
 IO-APIC (apicid-pin) 2-0, 2-7, 2-9, 2-10, 2-11, 2-16, 2-17, 2-18,
2-22, 2-23 not connected.
..TIMER: vector=0x31 pin1=2 pin2=0
number of MP IRQ sources: 17.
number of IO-APIC #2 registers: 24.
testing the IO APIC.......................

IO APIC #2......
.... register #00: 02000000
.......    : physical APIC id: 02
.... register #01: 00170011
.......     : max redirection entries: 0017
.......     : PRQ implemented: 0
.......     : IO APIC version: 0011
.... register #02: 00000000
.......     : arbitration: 00
.... IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:  
 00 000 00  1    0    0   0   0    0    0    00
 01 003 03  0    0    0   0   0    1    1    39
 02 003 03  0    0    0   0   0    1    1    31
 03 003 03  0    0    0   0   0    1    1    41
 04 003 03  0    0    0   0   0    1    1    49
 05 003 03  0    0    0   0   0    1    1    51
 06 003 03  0    0    0   0   0    1    1    59
 07 000 00  1    0    0   0   0    0    0    00
 08 003 03  0    0    0   0   0    1    1    61
 09 000 00  1    0    0   0   0    0    0    00
 0a 000 00  1    0    0   0   0    0    0    00
 0b 000 00  1    0    0   0   0    0    0    00
 0c 003 03  0    0    0   0   0    1    1    69
 0d 003 03  0    0    0   0   0    1    1    71
 0e 003 03  0    0    0   0   0    1    1    79
 0f 003 03  0    0    0   0   0    1    1    81
 10 000 00  1    0    0   0   0    0    0    00
 11 000 00  1    0    0   0   0    0    0    00
 12 000 00  1    0    0   0   0    0    0    00
 13 003 03  1    1    0   1   0    1    1    89
 14 003 03  1    1    0   1   0    1    1    91
 15 003 03  1    1    0   1   0    1    1    99
 16 000 00  1    0    0   0   0    0    0    00
 17 000 00  1    0    0   0   0    0    0    00
IRQ to pin mappings:
IRQ0 -> 0:2
IRQ1 -> 0:1
IRQ3 -> 0:3
IRQ4 -> 0:4
IRQ5 -> 0:5
IRQ6 -> 0:6
IRQ8 -> 0:8
IRQ12 -> 0:12
IRQ13 -> 0:13
IRQ14 -> 0:14
IRQ15 -> 0:15
IRQ19 -> 0:19
IRQ20 -> 0:20
IRQ21 -> 0:21
.................................... done.
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 995.6857 MHz.
..... host bus clock speed is 99.5684 MHz.
cpu: 0, clocks: 995684, slice: 331894
CPU0<T0:995680,T1:663776,D:10,S:331894,C:995684>
cpu: 1, clocks: 995684, slice: 331894
CPU1<T0:995680,T1:331888,D:4,S:331894,C:995684>
checking TSC synchronization across CPUs: passed.
PCI: PCI BIOS revision 2.10 entry at 0xfdab0, last bus=3
PCI: Using configuration type 1
PCI: Probing PCI hardware
Unknown bridge resource 0: assuming transparent
Unknown bridge resource 1: assuming transparent
Unknown bridge resource 0: assuming transparent
Unknown bridge resource 1: assuming transparent
Unknown bridge resource 0: assuming transparent
Unknown bridge resource 1: assuming transparent
Unknown bridge resource 2: assuming transparent
PCI: Discovered primary peer bus ff [IRQ]
PCI: Using IRQ router PIIX [8086/7110] at 00:12.0
PCI->APIC IRQ transform: (B0,I12,P0) -> 19
PCI->APIC IRQ transform: (B0,I12,P0) -> 19
PCI->APIC IRQ transform: (B0,I14,P0) -> 21
PCI->APIC IRQ transform: (B0,I18,P3) -> 21
PCI->APIC IRQ transform: (B2,I4,P0) -> 20
isapnp: Scanning for PnP cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
apm: BIOS not found.
Starting kswapd
allocated 64 pages and 64 bhs reserved for the highmem bounces
VFS: Diskquotas version dquot_6.5.0 initialized
pty: 2048 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS MULTIPORT
SHARE_IRQ SERIAL_PCI ISAPNP enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
Real Time Clock Driver v1.10e
block: 1024 slots per queue, batch=256
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with
idebus=xx
PIIX4: IDE controller on PCI bus 00 dev 91
PIIX4: chipset revision 1
PIIX4: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0x2860-0x2867, BIOS settings: hda:DMA, hdb:DMA
    ide1: BM-DMA at 0x2868-0x286f, BIOS settings: hdc:DMA, hdd:DMA
hda: ST340823A, ATA DISK drive
hdb: CD-540E, ATAPI CD/DVD-ROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
blk: queue c03cbd44, I/O limit 4095Mb (mask 0xffffffff)
hda: 78165360 sectors (40021 MB) w/1024KiB Cache, CHS=4865/255/63,
UDMA(33)
ide-floppy driver 0.99.newide
Partition check:
 hda: hda1 hda2 hda3
Floppy drive(s): fd0 is 1.44M
FDC 0 is a National Semiconductor PC87306
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
ide-floppy driver 0.99.newide
md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27
md: Autodetecting RAID arrays.
md: autorun ...
md: ... autorun DONE.
pci_hotplug: PCI Hot Plug PCI Core version: 0.4
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 8192 buckets, 64Kbytes
TCP: Hash tables configured (established 262144 bind 65536)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
RAMDISK: Compressed image found at block 0
Freeing initrd memory: 258k freed
VFS: Mounted root (ext2 filesystem).
SCSI subsystem driver Revision: 1.00
kmod: failed to exec /sbin/modprobe -s -k scsi_hostadapter, errno = 2
scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.5
        <Adaptec aic7896/97 Ultra2 SCSI adapter>
        aic7896/97: Ultra2 Wide Channel B, SCSI Id=7, 32/253 SCBs

scsi1 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.5
        <Adaptec aic7896/97 Ultra2 SCSI adapter>
        aic7896/97: Ultra2 Wide Channel A, SCSI Id=7, 32/253 SCBs

  Vendor: IBM       Model: IC35L018UCD210-0  Rev: S5BS
  Type:   Direct-Access                      ANSI SCSI revision: 03
scsi0:A:4:0: Tagged Queuing enabled.  Depth 253
Attached scsi disk sda at scsi0, channel 0, id 4, lun 0
(scsi0:A:4): 40.000MB/s transfers (20.000MHz, offset 63, 16bit)
SCSI device sda: 35843670 512-byte hdwr sectors (18352 MB)
 sda: sda1 sda2
Loading Adaptec I2O RAID: Version 2.4 Build 5
Detecting Adaptec I2O RAID controllers...
Adaptec I2O RAID controller 0 at f885c000 size=100000 irq=20
dpti: If you have a lot of devices this could take a few minutes.
dpti0: Reading the hardware resource table.
TID 008  Vendor: ADAPTEC      Device: AIC-7899     Rev: 00000001    
TID 519  Vendor: ADAPTEC      Device: RAID-1       Rev: 370F        
scsi2 : Vendor: Adaptec  Model: 2100S            FW:370F
  Vendor: ADAPTEC   Model: RAID-1            Rev: 370F
  Type:   Direct-Access                      ANSI SCSI revision: 02
Attached scsi disk sdb at scsi2, channel 0, id 1, lun 0
SCSI device sdb: 35843584 512-byte hdwr sectors (18352 MB)
 sdb: sdb1
Journalled Block Device driver loaded
kjournald starting.  Commit interval 5 seconds
EXT3-fs: mounted filesystem with ordered data mode.
Freeing unused kernel memory: 304k freed
Adding Swap: 1534196k swap-space (priority -1)
EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,1), internal journal
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.17, 10 Jan 2002 on ide0(3,1), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.17, 10 Jan 2002 on ide0(3,3), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,17), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
kjournald starting.  Commit interval 5 seconds
EXT3 FS 2.4-0.9.17, 10 Jan 2002 on sd(8,2), internal journal
EXT3-fs: mounted filesystem with ordered data mode.
ide-floppy driver 0.99.newide
hdb: ATAPI 40X CD-ROM drive, 128kB Cache, UDMA(33)
Uniform CD-ROM driver Revision: 3.12
hdb: DMA disabled

----------------End dmesg--------------------
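
To compare the host numbers in the dmesg output with those in the error messages, the "Attached scsi disk" lines can be collected into a small table. A rough sketch, assuming the line format shown in the dmesg above (the function name `attached_disks` is made up for illustration):

```python
import re

# The two "Attached scsi disk" lines from the dmesg output above.
DMESG_SAMPLE = """\
Attached scsi disk sda at scsi0, channel 0, id 4, lun 0
Attached scsi disk sdb at scsi2, channel 0, id 1, lun 0
"""

ATTACHED = re.compile(
    r"Attached scsi disk (\w+) at scsi(\d+), channel (\d+), id (\d+), lun (\d+)"
)


def attached_disks(text):
    """Return {disk name: (host, channel, id, lun)} from dmesg text."""
    disks = {}
    for m in ATTACHED.finditer(text):
        disks[m.group(1)] = tuple(int(g) for g in m.groups()[1:])
    return disks


if __name__ == "__main__":
    for name, address in sorted(attached_disks(DMESG_SAMPLE).items()):
        print(name, address)
```

The resulting tuples have the same host/channel/id/lun order as the "SCSI disk error" lines, which makes the two easy to line up.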

From reading around, it seems that loads of people have had similar (if
not identical) problems with this board, particularly with regard to
interrupt problems and apic (a definition of what apic does would be
nice!). Predictably, Intel didn't even want to know (although they
charged me $25 for the privilege of telling me so). Red Hat themselves
mention as much on their forum, talking specifically about the apic
problems on boot. I'll never buy another Intel board, that's for sure.

The trouble is that I'm kinda out of my depth with this. My inclination
is to junk the whole thing and buy another board from somebody else
(suggestions welcome!), although I would really appreciate an expert
opinion from somebody who knows more than me. I'm even now wondering
whether the original RAID problems that we were having were related.

Any suggestions/help would be most welcome. Let me know if there's
any other information that would help.

Regards,

Ben Stoneham, FCP Internet LTD
ben_NOSPAM_@fcpl_NOSPAM_.com

terminators, no change though!



 Wed, 24 Nov 2004 05:01:21 GMT   
 