Crash Troubleshooting
FedoraForum.org - Fedora Support Forums and Community
Results 1 to 6 of 6
  1. #1
    Join Date
    Jan 2007
    Location
    Charlotte, NC, USA
    Posts
    66
    Linux Firefox 64.0

    Crash Troubleshooting

    I thought I remember a sticky in the old forum with instructions on 1st-steps and what to provide when asking for help with a crash.

    I tried searching "crash" and got "Sorry - no matches. Please try some different terms. "
    Here's the proof https://forums.fedoraforum.org/faq.p...xt=1&match=all

    I tried a different search which took me to a fedora forum post on windows crashes--appropriately ironic given that's where I'd expect a crash :-)

    I'm running Fedora 28. Can someone please either tell me what I can provide or point me to the list of requirements so I can get things rolling?

    I have a new Dell computer. I reformatted and install Fedora 28. It crashes every 3 to 7 days and requires BRS (nothing else works).

    I can start by providing the output of journalctl -b -1 but I didn't see anything obvious:
    http://paste.scsys.co.uk/582904
    ThePowerTool
    Bigger, Faster, More Power!!!

  2. #2
    Join Date
    Jan 2007
    Location
    Charlotte, NC, USA
    Posts
    66
    Linux Firefox 64.0

    Re: Crash Troubleshooting

    My system crashed again and I took a picture which I am attaching.

    Click image for larger version. 

Name:	20190126_222556-redux.jpg 
Views:	11 
Size:	205.4 KB 
ID:	30018

    As the picture shows "ext4" which is a reference to my 2nd HD (sdb) I ran both the short (#2) and long (#1) smartctl tests (details immediately below).

    Any help sincerely appreciated.

    ]# smartctl -a /dev/sdb
    smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.19.16-200.fc28.x86_64] (local build)
    Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org

    === START OF INFORMATION SECTION ===
    Model Family: Seagate Archive HDD
    Device Model: ST8000AS0002-1NA17Z
    Serial Number: *******
    LU WWN Device Id: 5 000c50 087765095
    Firmware Version: AR17
    User Capacity: 8,001,563,222,016 bytes [8.00 TB]
    Sector Sizes: 512 bytes logical, 4096 bytes physical
    Rotation Rate: 5980 rpm
    Device is: In smartctl database [for details use: -P show]
    ATA Version is: ACS-2, ACS-3 T13/2161-D revision 3b
    SATA Version is: SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is: Mon Jan 28 20:40:37 2019 EST
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled

    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED

    General SMART Values:
    Offline data collection status: (0x82) Offline data collection activity
    was completed without error.
    Auto Offline Data Collection: Enabled.
    Self-test execution status: ( 0) The previous self-test routine completed
    without error or no self-test has ever
    been run.
    Total time to complete Offline
    data collection: ( 0) seconds.
    Offline data collection
    capabilities: (0x7b) SMART execute Offline immediate.
    Auto Offline data collection on/off support.
    Suspend Offline collection upon new
    command.
    Offline surface scan supported.
    Self-test supported.
    Conveyance Self-test supported.
    Selective Self-test supported.
    SMART capabilities: (0x0003) Saves SMART data before entering
    power-saving mode.
    Supports SMART auto save timer.
    Error logging capability: (0x01) Error logging supported.
    General Purpose Logging supported.
    Short self-test routine
    recommended polling time: ( 1) minutes.
    Extended self-test routine
    recommended polling time: ( 962) minutes.
    Conveyance self-test routine
    recommended polling time: ( 2) minutes.
    SCT capabilities: (0x30b5) SCT Status supported.
    SCT Feature Control supported.
    SCT Data Table supported.

    SMART Attributes Data Structure revision number: 10
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
    1 Raw_Read_Error_Rate 0x000f 119 099 006 Pre-fail Always - 204505208
    3 Spin_Up_Time 0x0003 091 090 000 Pre-fail Always - 0
    4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 81
    5 Reallocated_Sector_Ct 0x0033 100 100 010 Pre-fail Always - 0
    7 Seek_Error_Rate 0x000f 070 060 030 Pre-fail Always - 8614212409
    9 Power_On_Hours 0x0032 097 097 000 Old_age Always - 2816
    10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
    12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 75
    183 Runtime_Bad_Block 0x0032 100 100 000 Old_age Always - 0
    184 End-to-End_Error 0x0032 100 100 099 Old_age Always - 0
    187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
    188 Command_Timeout 0x0032 100 100 000 Old_age Always - 0
    189 High_Fly_Writes 0x003a 100 100 000 Old_age Always - 0
    190 Airflow_Temperature_Cel 0x0022 061 052 045 Old_age Always - 39 (Min/Max 38/45)
    191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
    192 Power-Off_Retract_Count 0x0032 098 098 000 Old_age Always - 5429
    193 Load_Cycle_Count 0x0032 098 098 000 Old_age Always - 5730
    194 Temperature_Celsius 0x0022 039 048 000 Old_age Always - 39 (0 22 0 0 0)
    195 Hardware_ECC_Recovered 0x001a 119 099 000 Old_age Always - 204505208
    197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
    198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
    199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
    240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 1641 (138 11 0)
    241 Total_LBAs_Written 0x0000 100 253 000 Old_age Offline - 10698602167
    242 Total_LBAs_Read 0x0000 100 253 000 Old_age Offline - 3630283409

    SMART Error Log Version: 1
    No Errors Logged

    SMART Self-test log structure revision number 1
    Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
    # 1 Extended offline Completed without error 00% 2810 -
    # 2 Short offline Completed without error 00% 2788 -
    # 3 Short offline Completed without error 00% 687 -

    SMART Selective self-test log data structure revision number 1
    SPAN MIN_LBA MAX_LBA CURRENT_TEST_STATUS
    1 0 0 Not_testing
    2 0 0 Not_testing
    3 0 0 Not_testing
    4 0 0 Not_testing
    5 0 0 Not_testing
    Selective self-test flags (0x0):
    After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
    ThePowerTool
    Bigger, Faster, More Power!!!

  3. #3
    Join Date
    Dec 2013
    Location
    United Kingdom
    Posts
    6,830
    Linux Firefox 64.0

    Re: Crash Troubleshooting

    a hardware configuration dump might be handy but from the I/O error message, the system is not able to read from sda which is the primary internal storage device, nothing to do with the second drive

    https://forums.fedoraforum.org/showt...kmark-it-first

  4. #4
    Join Date
    Jan 2007
    Location
    Charlotte, NC, USA
    Posts
    66
    Linux Firefox 64.0

    Re: Crash Troubleshooting

    Hi antikythera and thank you very much!

    HW Dump:
    Code:
    ~]# inxi -Fxmz
    System:    Host: theater Kernel: 4.19.16-200.fc28.x86_64 x86_64 bits: 64 compiler: gcc v: 8.2.1 Console: tty 0 
               Distro: Fedora release 28 (Twenty Eight) 
    Machine:   Type: Desktop System: Dell product: XPS 8930 v: 1.0.13 serial: <filter> 
               Mobo: Dell model: 0DF42J v: A00 serial: <filter> UEFI: Dell v: 1.0.13 date: 07/20/2018 
    Memory:    RAM: total: 15.38 GiB used: 2.33 GiB (15.2%) 
               Array-1: capacity: 64 GiB slots: 4 EC: None max module size: 16 GiB note: est. 
               Device-1: ChannelA-DIMM0 size: No Module Installed 
               Device-2: ChannelA-DIMM1 size: 8 GiB speed: 2666 MT/s type: DDR4 
               Device-3: ChannelB-DIMM0 size: No Module Installed 
               Device-4: ChannelB-DIMM1 size: 8 GiB speed: 2666 MT/s type: DDR4 
    CPU:       Topology: 6-Core model: Intel Core i7-8700 bits: 64 type: MT MCP arch: Kaby Lake rev: A L2 cache: 12.0 MiB 
               flags: lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx bogomips: 76608 
               Speed: 900 MHz min/max: 800/4600 MHz Core speeds (MHz): 1: 900 2: 900 3: 900 4: 900 5: 900 6: 901 7: 900 8: 900 
               9: 900 10: 900 11: 900 12: 900 
    Graphics:  Device-1: Intel UHD Graphics 630 vendor: Dell driver: i915 v: kernel bus ID: 00:02.0 
               Device-2: NVIDIA GP107 [GeForce GTX 1050 Ti] vendor: Dell driver: nvidia v: 410.73 bus ID: 01:00.0 
               Display: server: X.org 1.19.6 driver: nvidia tty: 211x61 
               Message: Advanced graphics data unavailable in console for root. 
    Audio:     Device-1: Intel 200 Series PCH HD Audio vendor: Dell driver: snd_hda_intel v: kernel bus ID: 00:1f.3 
               Device-2: NVIDIA GP107GL High Definition Audio vendor: Dell driver: snd_hda_intel v: kernel bus ID: 01:00.1 
               Sound Server: ALSA v: k4.19.16-200.fc28.x86_64 
    Network:   Device-1: Qualcomm Atheros QCA9377 802.11ac Wireless Network Adapter vendor: Dell driver: ath10k_pci v: kernel 
               port: e000 bus ID: 03:00.0 
               IF: wlp3s0 state: down mac: <filter> 
               Device-2: Qualcomm Atheros Killer E2400 Gigabit Ethernet vendor: Dell driver: alx v: kernel port: d000 
               bus ID: 04:00.0 
               IF: enp4s0 state: up speed: 1000 Mbps duplex: full mac: <filter> 
               Device-3: Qualcomm Atheros type: USB driver: btusb bus ID: 1-8:2 
               IF-ID-1: virbr0 state: down mac: <filter> 
               IF-ID-2: virbr0-nic state: down mac: <filter> 
    Drives:    Local Storage: total: 7.51 TiB used: 4.57 TiB (60.9%) 
               ID-1: /dev/sda model: SPCC Solid State Disk size: 238.47 GiB temp: 33 C 
               ID-2: /dev/sdb vendor: Seagate model: ST8000AS0002-1NA17Z size: 7.28 TiB 
    RAID:      Hardware-1: Intel SATA Controller [RAID mode] driver: ahci v: 3.0 bus ID: 00:17.0 
    Partition: ID-1: / size: 216.81 GiB used: 11.05 GiB (5.1%) fs: ext4 dev: /dev/dm-0 
               ID-2: /boot size: 975.9 MiB used: 222.3 MiB (22.8%) fs: ext4 dev: /dev/sda2 
               ID-3: swap-1 size: 16.00 GiB used: 1.8 MiB (0.0%) fs: swap dev: /dev/dm-1 
    Sensors:   System Temperatures: cpu: 36.0 C mobo: N/A 
               Fan Speeds (RPM): N/A 
    Info:      Processes: 314 Uptime: 2d 13h 39m Init: systemd runlevel: 5 Compilers: gcc: 8.2.1 Shell: bash v: 4.4.23 
               inxi: 3.0.29
    smartctl for /dev/sda
    Code:
    ~]# smartctl -a /dev/sda
    smartctl 6.6 2017-11-05 r4594 [x86_64-linux-4.19.16-200.fc28.x86_64] (local build)
    Copyright (C) 2002-17, Bruce Allen, Christian Franke, www.smartmontools.org
    
    === START OF INFORMATION SECTION ===
    Device Model:     SPCC Solid State Disk
    Serial Number:    ****************
    LU WWN Device Id: 0 000000 000000000
    Firmware Version: SBFM61.2
    User Capacity:    256,060,514,304 bytes [256 GB]
    Sector Size:      512 bytes logical/physical
    Rotation Rate:    Solid State Device
    Form Factor:      2.5 inches
    Device is:        Not in smartctl database [for details use: -P showall]
    ATA Version is:   ACS-4 (minor revision not indicated)
    SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
    Local Time is:    Tue Jan 29 16:07:55 2019 EST
    SMART support is: Available - device has SMART capability.
    SMART support is: Enabled
    
    === START OF READ SMART DATA SECTION ===
    SMART overall-health self-assessment test result: PASSED
    
    General SMART Values:
    Offline data collection status:  (0x00)	Offline data collection activity
    					was never started.
    					Auto Offline Data Collection: Disabled.
    Self-test execution status:      (   0)	The previous self-test routine completed
    					without error or no self-test has ever 
    					been run.
    Total time to complete Offline 
    data collection: 		(65535) seconds.
    Offline data collection
    capabilities: 			 (0x79) SMART execute Offline immediate.
    					No Auto Offline data collection support.
    					Suspend Offline collection upon new
    					command.
    					Offline surface scan supported.
    					Self-test supported.
    					Conveyance Self-test supported.
    					Selective Self-test supported.
    SMART capabilities:            (0x0003)	Saves SMART data before entering
    					power-saving mode.
    					Supports SMART auto save timer.
    Error logging capability:        (0x01)	Error logging supported.
    					General Purpose Logging supported.
    Short self-test routine 
    recommended polling time: 	 (   2) minutes.
    Extended self-test routine
    recommended polling time: 	 (  30) minutes.
    Conveyance self-test routine
    recommended polling time: 	 (   6) minutes.
    
    SMART Attributes Data Structure revision number: 16
    Vendor Specific SMART Attributes with Thresholds:
    ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
      1 Raw_Read_Error_Rate     0x000b   100   100   050    Pre-fail  Always       -       0
      9 Power_On_Hours          0x0012   100   100   000    Old_age   Always       -       2706
     12 Power_Cycle_Count       0x0012   100   100   000    Old_age   Always       -       44
    168 Unknown_Attribute       0x0012   100   100   000    Old_age   Always       -       0
    170 Unknown_Attribute       0x0003   081   081   010    Pre-fail  Always       -       179
    173 Unknown_Attribute       0x0012   100   100   000    Old_age   Always       -       1
    192 Power-Off_Retract_Count 0x0012   100   100   000    Old_age   Always       -       34
    194 Temperature_Celsius     0x0023   067   067   000    Pre-fail  Always       -       33 (Min/Max 33/33)
    218 Unknown_Attribute       0x000b   100   100   050    Pre-fail  Always       -       0
    231 Temperature_Celsius     0x0013   100   100   000    Pre-fail  Always       -       100
    241 Total_LBAs_Written      0x0012   100   100   000    Old_age   Always       -       155
    
    SMART Error Log Version: 1
    No Errors Logged
    
    SMART Self-test log structure revision number 1
    Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
    # 1  Extended offline    Completed without error       00%      2703         -
    # 2  Short offline       Completed without error       00%      2702         -
    # 3  Extended offline    Completed without error       00%      1716         -
    # 4  Short offline       Completed without error       00%      1715         -
    # 5  Short offline       Completed without error       00%      1715         -
    # 6  Short offline       Completed without error       00%      1715         -
    # 7  Short offline       Completed without error       00%       616         -
    # 8  Short offline       Completed without error       00%         1         -
    # 9  Short offline       Completed without error       00%         0         -
    
    SMART Selective self-test log data structure revision number 0
    Note: revision number not 1 implies that no selective self-test has ever been run
     SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
        1        0        0  Not_testing
        2        0        0  Not_testing
        3        0        0  Not_testing
        4        0        0  Not_testing
        5        0        0  Not_testing
    Selective self-test flags (0x0):
      After scanning selected spans, do NOT read-scan remainder of disk.
    If Selective self-test is pending on power-up, resume after 0 minute delay.
    I hope this meets your expectations.

    As you can see I ran a short (#2) and long (#1) test on sda which both found no errors. I guess I noticed, earlier, that the error was with sda because you can also see I ran the same tests 985 lifetime hours earlier.

    Any thoughts on a next-step?

    Thank you, again.
    ThePowerTool
    Bigger, Faster, More Power!!!

  5. #5
    Join Date
    Dec 2013
    Location
    United Kingdom
    Posts
    6,830
    Linux (Android 6.0) Android Browser 4.0

    Re: Crash Troubleshooting

    Check the sata cable isn't loose either end or kinked. Maybe replace that anyway because they can degrade.

    Also check the power cable is connected to the drive properly.

  6. #6
    Join Date
    Jan 2007
    Location
    Charlotte, NC, USA
    Posts
    66
    Linux Firefox 65.0

    Re: Crash Troubleshooting

    antikythera,

    You nailed it in one!

    Over this past weekend I opened the case and as I pulled the cover back the cable actually fell away from the HD as the SATA2 connector on the HD was broken and would not hold the cable in-place. I'm surprised the system booted at all seeing the broken plastic edge of the connector (or socket?).

    Thank you very much!
    ThePowerTool
    Bigger, Faster, More Power!!!

Similar Threads

  1. NFS Troubleshooting
    By radl33t in forum Using Fedora
    Replies: 8
    Last Post: 9th August 2005, 12:16 PM
  2. Troubleshooting X.org my way
    By ilja in forum Guides & Solutions (Not For Questions)
    Replies: 3
    Last Post: 26th November 2004, 10:35 AM

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •