Diagnosing hardware crashing problem
FedoraForum.org - Fedora Support Forums and Community
Results 1 to 6 of 6
  1. #1
    Join Date
    Jul 2012
    Location
    Chicago
    Posts
    73
    Linux (Fedora) Safari 11.0

    Diagnosing hardware crashing problem

    Since I installed my new motherboard and processor:

    • ASUS Prime B350-Plus motherboard
      AMD Ryzen 5 1600, 6 core/12 thread Processor



    I have been having spontaneous reboots (maybe every other day).
    When I log on after one of these reboots, there is a
    dialog box on the desktop that offers to mail some information
    regarding this crash (presumably to the Fedora organization)
    but when I try to initiate this procedure, I get the message
    that the crash was caused because of a hardware problem.


    After each of these spontaneous crashes and reboots,
    I have been saving the following information:

    • /var/log/messages
      /var/log/boot.log
      output of dmesg
      output of journalctl


    so that I can try to diagnose this issue.

    Now, when one of these crash/reboots occurs, during the reboot
    some messages about a "Hardware Error" are displayed in
    the left hand corner of the screen. These messages
    appear and then disappear too quickly for me to write
    then down, but in the output of dmesg just after one
    of the crash/reboots are the following lines:

    [message]
    [ 0.031021] mce: [Hardware Error]: Machine check events logged
    [ 0.031023] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 0: baa0000000060165
    [ 0.031027] mce: [Hardware Error]: TSC 0 MISC d012000101000000 SYND 2d032500 IPID b000000000
    [ 0.031031] mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1540830475 SOCKET 0 APIC 3 microcode 8001129
    [/message]

    The above lines _only_ occur in the dmesg output _after_ one of these
    spontaneous crash/reboots, _not_ when I reboot with the command

    shutdown -r now

    And so now, I wonder, are these lines in the dmesg output the key?
    Are these lines indicating that my processor is bad?


    Also:

    - In the information I have been saving after these crash/reboots,
    what else might I be checking for?

    - What other information should I possibly look at and/or save
    after one of these crash/reboots?

    - Do people have other ideas?


    Thank you very much for your help.

  2. #2
    Join Date
    Jul 2012
    Location
    Chicago
    Posts
    73
    Linux (Fedora) Safari 11.0

    Re: Diagnosing hardware crashing problem

    My "inxi -Fxmz" output:

    Code:
    System:    Host: computer-room01 Kernel: 4.18.16-100.fc27.x86_64 x86_64 bits: 64 compiler: gcc v: 7.3.1 
               Desktop: Xfce 4.12.4 Distro: Fedora release 27 (Twenty Seven) 
    Machine:   Type: Desktop Mobo: ASUSTeK model: PRIME B350-PLUS v: Rev X.0x serial: <filter> 
               UEFI: American Megatrends v: 0902 date: 09/08/2017 
    Memory:    RAM: total: 15.66 GiB used: 3.03 GiB (19.4%) 
               Array-1: capacity: 64 GiB slots: 4 EC: None max module size: 16 GiB note: est. 
               Device-1: DIMM_A1 size: No Module Installed 
               Device-2: DIMM_A2 size: 8 GiB speed: 2133 MT/s type: DDR4 
               Device-3: DIMM_B1 size: No Module Installed 
               Device-4: DIMM_B2 size: 8 GiB speed: 2133 MT/s type: DDR4 
    CPU:       Topology: 6-Core model: AMD Ryzen 5 1600 bits: 64 type: MT MCP arch: Zen rev: 1 L2 cache: 3072 KiB 
               flags: lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm bogomips: 76647 
               Speed: 1375 MHz min/max: 1550/3200 MHz Core speeds (MHz): 1: 1375 2: 1371 3: 1373 4: 1372 5: 1375 
               6: 1375 7: 1374 8: 1375 9: 1374 10: 1373 11: 1374 12: 1373 
    Graphics:  Device-1: NVIDIA GT218 [GeForce 210] driver: nouveau v: kernel bus ID: 22:00.0 
               Display: server: Fedora Project X.org 1.19.6 driver: modesetting unloaded: fbdev,vesa 
               resolution: 1920x1080~60Hz 
               OpenGL: renderer: NVA8 v: 3.3 Mesa 17.3.9 direct render: Yes 
    Audio:     Device-1: VIA ICE1712 [Envy24] PCI Multi-Channel I/O driver: snd_ice1712 v: kernel bus ID: 20:00.0 
               Device-2: NVIDIA High Definition Audio driver: snd_hda_intel v: kernel bus ID: 22:00.1 
               Device-3: Advanced Micro Devices [AMD] Family 17h HD Audio driver: snd_hda_intel v: kernel 
               bus ID: 24:00.3 
               Device-4: Creative Live! Cam Chat HD [VF0700] type: USB 
               driver: hid-generic,snd-usb-audio,usbhid,uvcvideo bus ID: 1-7:2 
               Sound Server: ALSA v: k4.18.16-100.fc27.x86_64 
    Network:   Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet driver: r8169 v: 2.3LK-NAPI 
               port: e000 bus ID: 1e:00.0 
               IF: enp30s0 state: up speed: 1000 Mbps duplex: full mac: <filter> 
    Drives:    Local Storage: total: 931.51 GiB used: 494.52 GiB (53.1%) 
               ID-1: /dev/sda vendor: Western Digital model: WD1002FAEX-007BA0 size: 931.51 GiB temp: 35 C 
    Partition: ID-1: / size: 72.83 GiB used: 12.55 GiB (17.2%) fs: ext4 dev: /dev/sda6 
               ID-2: /boot size: 3.60 GiB used: 192.5 MiB (5.2%) fs: ext4 dev: /dev/sda2 
               ID-3: swap-1 size: 29.80 GiB used: 0 KiB (0.0%) fs: swap dev: /dev/sda7 
    Sensors:   System Temperatures: cpu: 28.4 C mobo: N/A gpu: nouveau temp: 58 C 
               Fan Speeds (RPM): cpu: 0 
    Info:      Processes: 297 Uptime: 1h 09m Init: systemd runlevel: 5 Compilers: gcc: 7.3.1 Shell: bash v: 4.4.23 
               inxi: 3.0.26

  3. #3
    Join Date
    Jun 2005
    Location
    Montreal, Que, Canada
    Posts
    5,477
    Linux Firefox 63.0

    Re: Diagnosing hardware crashing problem

    Is your CPU fan running? Your printout shows zero
    Sensors: System Temperatures: cpu: 28.4 C mobo: N/A gpu: nouveau temp: 58 C
    Fan Speeds (RPM): cpu: 0
    Leslie in Montreal

    Interesting web sites list
    http://forums.fedoraforum.org/showth...40#post1697840

  4. #4
    Join Date
    Dec 2013
    Location
    United Kingdom
    Posts
    6,485
    Linux (Ubuntu) Firefox 63.0

    Re: Diagnosing hardware crashing problem

    refer to this thread about c-state and see if the fixes work for you. apart from that though it wouldn't do any harm to do the following. F27 will be end of life in near enough 30 days time anyway.

    update to the latest available firmware build for the motherboard which will contain the newest agesa code for the ryzen cpu internals but also try using Fedora 29 with the newer builds of hardware stacks, drivers, kernel 4.19, firmware modules and the latest amd-ucode

    Version 4023 2018/09/148.01 MBytesPRIME B350-PLUS BIOS 4023
    1. Improve system compatibility

    as for the zero fan speed reading, check what the UEFI firmware fan control readings are and ignore inxi for now. it may not be able to properly detect the CPU fan. the temperature itself indicates heat is not an issue, that's the normal operating window for that CPU. I have the same model CPU and chipset combination although mine's an MSI motherboard and I don't yet run linux with it

  5. #5
    Join Date
    Jul 2012
    Location
    Chicago
    Posts
    73
    Linux (Fedora) Firefox 63.0

    Re: Diagnosing hardware crashing problem

    Thank you very much for your response. Sorry I was
    away for a while. antikythera writes:

    refer to this thread
    [https://forums.fedoraforum.org/showt...t=ryzen+crash]
    about c-state and see if the fixes work for you.
    I did not actually see any suggested fixes in that thread.
    Did I miss something?

    update to the latest available firmware build for the motherboard which will
    contain the newest agesa code for the ryzen cpu internals but also try using
    Fedora 29 with the newer builds of hardware stacks, drivers, kernel 4.19,
    firmware modules and the latest amd-ucode
    Ok, I am making plans to upgrade to Fedora 29. As far as the lastest firmware
    for the motherboard, I found an update of the UEFI bios at Asus, but
    nothing else.

    And the dmesg output lines after a crash that read:

    Code:
      [    0.031021] mce: [Hardware Error]: Machine check events logged
      [    0.031023] mce: [Hardware Error]: CPU 3: Machine Check: 0 Bank 0: baa0000000060165
      [    0.031027] mce: [Hardware Error]: TSC 0 MISC d012000101000000 SYND 2d032500 IPID b000000000
      [    0.031031] mce: [Hardware Error]: PROCESSOR 2:800f11 TIME 1540830475 SOCKET 0 APIC 3 microcode 8001129
    Those are not an indication of a faulty processor ?

    Thanks again.

  6. #6
    Join Date
    Dec 2013
    Location
    United Kingdom
    Posts
    6,485
    Linux Chrome 70.0.3538.77

    Re: Diagnosing hardware crashing problem

    Did I miss something? set the global c-state to disabled instead of auto

    I found an update of the UEFI bios - yes that is what I am referring to as the firmware build

Similar Threads

  1. Fedora 19 - Hardware Crashing!!
    By craigfedora in forum Hardware & Laptops
    Replies: 2
    Last Post: 12th September 2013, 08:25 AM
  2. [SOLVED]
    Help needed in diagnosing a login problem
    By rajat152 in forum Using Fedora
    Replies: 5
    Last Post: 26th February 2013, 06:04 PM
  3. Need help diagnosing hardware issue
    By jdelisle in forum Hardware & Laptops
    Replies: 8
    Last Post: 1st March 2009, 12:58 AM
  4. Diagnosing Crashes
    By kurtg in forum Using Fedora
    Replies: 1
    Last Post: 27th March 2007, 06:04 PM
  5. Replies: 1
    Last Post: 1st August 2006, 11:12 AM

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •