Message-ID: <12bfabe40812071521j2f668cf8v96655e572af5c514@mail.gmail.com>
Date:	Mon, 8 Dec 2008 00:21:02 +0100
From:	"Giangiacomo Mariotti" <gg.mariotti@...il.com>
To:	"Arjan van de Ven" <arjan@...radead.org>
Cc:	"Robert Hancock" <hancockr@...w.ca>, linux-kernel@...r.kernel.org
Subject: Re: [HW PROBLEM] Intel I7 MCE. Erratum or not?

On Sun, Dec 7, 2008 at 11:13 PM, Arjan van de Ven <arjan@...radead.org> wrote:
> On Sun, 7 Dec 2008 22:55:10 +0100
> "Giangiacomo Mariotti" <gg.mariotti@...il.com> wrote:
>
>> On Sun, Dec 7, 2008 at 4:25 AM, Robert Hancock <hancockr@...w.ca>
>> wrote:
>> >
>> > If it only happened once then who knows, could be a cosmic ray or
>> > something.. but if it happens again it sounds like you likely have
>> > a bad CPU.
>> >
>> This same MCE happens every time I boot the 2.6.27.8 kernel, but it
>> never happens with kernel 2.6.26.8. It appears every time at log time
>> [  301.7320xx] (where the x's mark the digits that vary). In the
>> previous log it appeared at [  301.732037]; now it appeared at
>> [  301.732042]. It happens just once at that moment and then never
>> again until I reboot. No MCE gets logged with kernel 2.6.26.8 or with
>> Windows Vista 32-bit, not even under very heavy memory and CPU load
>> (like 3D games or 3DMark Vantage). Is this really a hardware
>> problem? Why does it get logged only on kernel 2.6.27.8, at that exact
>> moment? Now I'm going to compile kernel 2.6.28-rc7 and see whether it
>> happens there too. Any suggestions would be appreciated because,
>> depending on the final result, I may be forced to replace the
>> hardware (it's still under warranty, but it'd mean some months
>> without a PC).
>
> which video driver are you using?
> While your HW might be defective (MCE's tend to indicate that), if it's
> software dependent it might also indicate that something, usually a
> video driver, gets the system in a really bad state.
> Also, make sure that you have MCE enabled in both kernels (I'm sure you
> do, but it's one of those things worth double checking)
>
> Arjan van de Ven        Intel Open Source Technology Centre
> For development, discussion and tips for power savings,
> visit http://www.lesswatts.org
>

About the .config files:

config-2.6.26-1-amd64 (unaffected/silent, Debian unstable, based on
2.6.26.8):
cat config-2.6.26-1-amd64 | grep MCE
CONFIG_X86_MCE=y
CONFIG_X86_MCE_INTEL=y
CONFIG_X86_MCE_AMD=y

config-2.6.27.8-nodbg001 (affected, built by me):
cat config-2.6.27.8-nodbg001 | grep MCE
CONFIG_X86_MCE=y
CONFIG_X86_MCE_INTEL=y
CONFIG_X86_MCE_AMD=y
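
A minimal sketch of how the same comparison could be done in one step,
instead of reading the two grep outputs side by side. The function name
compare_config is hypothetical (not an existing tool); it assumes grep,
sort, diff and mktemp are available, and that both config files are plain
text like the ones above.

```shell
# Hypothetical helper: compare the options matching PATTERN in two kernel
# config files.
# Usage: compare_config PATTERN CONFIG_A CONFIG_B
# Returns 0 if the matching lines are identical, 1 if they differ.
compare_config() {
    pattern=$1
    config_a=$2
    config_b=$3
    tmp_a=$(mktemp) || return 2
    tmp_b=$(mktemp) || { rm -f "$tmp_a"; return 2; }
    # Keep every matching line (including "# CONFIG_... is not set"),
    # sorted so that ordering differences do not show up in the diff.
    grep -- "$pattern" "$config_a" | sort > "$tmp_a"
    grep -- "$pattern" "$config_b" | sort > "$tmp_b"
    if diff "$tmp_a" "$tmp_b"; then
        echo "no difference for $pattern"
        status=0
    else
        status=1
    fi
    rm -f "$tmp_a" "$tmp_b"
    return $status
}
```

For the two files in this message it would be called as
compare_config MCE config-2.6.26-1-amd64 config-2.6.27.8-nodbg001.
Since the two grep outputs shown here are identical, the diff would be
empty for the MCE options.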

I have an ATI HD 4850 1GB. It's not supported by the current X.org
server on my distro (7.3; I tried 7.4 from experimental, but it caused
a hard lockup), so X uses the vesa driver. On 2.6.27.8 (affected) I get
this:
cat config-2.6.27.8-nodbg001 | grep RADEON
CONFIG_DRM_RADEON=m
CONFIG_FB_RADEON=m
CONFIG_FB_RADEON_I2C=y
CONFIG_FB_RADEON_BACKLIGHT=y
# CONFIG_FB_RADEON_DEBUG is not set

But lsmod gives this:
Module                  Size  Used by
binfmt_misc            13836  1
kvm_intel              51136  0
kvm                   149272  1 kvm_intel
acpi_cpufreq           12176  0
cpufreq_userspace       8324  0
cpufreq_stats           9160  0
fuse                   57552  3
loop                   20124  0
snd_pcm_oss            41648  0
snd_mixer_oss          18952  1 snd_pcm_oss
snd_seq_dummy           7300  0
snd_seq_oss            34640  0
snd_seq_midi_event     12168  1 snd_seq_oss
snd_hda_intel         447988  1
snd_seq                57400  5 snd_seq_dummy,snd_seq_oss,snd_seq_midi_event
i2c_i801               14492  0
i2c_core               29592  1 i2c_i801
snd_seq_device         12188  3 snd_seq_dummy,snd_seq_oss,snd_seq
pcspkr                  7040  0
evdev                  15520  3
snd_pcm                84016  2 snd_pcm_oss,snd_hda_intel
snd_timer              26784  2 snd_seq,snd_pcm
snd                    67240  10 snd_pcm_oss,snd_mixer_oss,snd_seq_oss,snd_hda_intel,snd_seq,snd_seq_device,snd_pcm,snd_timer
wmi                    11712  0
button                 11680  0
soundcore              12176  1 snd
snd_page_alloc         13328  2 snd_hda_intel,snd_pcm
firewire_sbp2          20144  7
sg                     37304  0
sr_mod                 20004  0
cdrom                  37928  1 sr_mod
jmicron                 7040  0
ide_pci_generic         8708  0
ide_core              101336  2 jmicron,ide_pci_generic
pata_acpi               9344  0
pata_jmicron            8192  0
sd_mod                 31368  14
usbhid                 47488  0
hid                    44320  1 usbhid
ff_memless              9488  1 usbhid
usb_storage           100000  0
ata_generic            10116  0
firewire_ohci          25996  0
ahci                   36620  5
firewire_core          44976  2 firewire_sbp2,firewire_ohci
crc_itu_t               6400  1 firewire_core
libata                176352  4 pata_acpi,pata_jmicron,ata_generic,ahci
r8169                  33436  0
mii                     9984  1 r8169
scsi_mod              166184  6 firewire_sbp2,sg,sr_mod,sd_mod,usb_storage,libata
dock                   14576  1 libata
ehci_hcd               38552  0
uhci_hcd               26408  0
thermal                23208  0
processor              44468  2 acpi_cpufreq,thermal
fan                     8192  0
thermal_sys            18064  3 thermal,processor,fan

I can't find anything related to radeon or ATI in here, so I guess
the kernel has loaded a vesa driver too?
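
A minimal sketch of how the module list above could be searched instead of
read by eye. The function name loaded_modules_matching and the list of
module names in the usage line are assumptions for illustration; the
function only looks at the first column of lsmod-style output read from
stdin.

```shell
# Hypothetical helper: read lsmod-style output on stdin and print the names
# of loaded modules whose name matches the given extended regex.
# The first line ("Module  Size  Used by") is skipped as a header.
loaded_modules_matching() {
    awk -v re="$1" 'NR > 1 && $1 ~ re { print $1 }'
}
```

It could be used as
lsmod | loaded_modules_matching '^(radeon|radeonfb|fglrx)$'.
Empty output only means that none of those names is loaded as a module:
drivers built into the kernel image (=y in the config) never show up in
lsmod. The framebuffer devices the kernel has actually registered, if any,
are listed in /proc/fb.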

Giangiacomo
