Closure On the Linux Lockup Bug

Posted by Soulskill on Friday January 09, 2015 @10:01PM from the it-was-dead-the-whole-time dept.

jones_supa writes: Dave Jones from Red Hat has written a wrap-up of the strange bug that has made some machines running Linux to freeze. (Previous discussion.) Right down to his final week at Red Hat before Dave gave all his hardware back, Linus Torvalds managed to reproduce similar symptoms, by scribbling directly to the HPET timer. He came up with a hack that at least made the kernel survive for him. When Dave tried the same patch, the machine ran for three days before he interrupted it, which was a promising result. The question remains, what was scribbling over the HPET in his case? The only two plausible scenarios Dave could think of were that Trinity generated 0xFED000F0 as a random address and passed that to a syscall which wrote to it, or a hardware bug. That's where the story ends for now. Linus' hacky workaround didn't get committed, but him and John Stultz continue to back and forth on hardening the clock management code in the face of screwed up hardware, so maybe soon we'll see something real get committed on that area.
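As a rough illustration of the kind of hardening discussed above (this is not Linus's actual patch, which the summary does not show), a timekeeping routine can treat a counter that appears to have gone backwards as "no time elapsed" instead of as an enormous forward jump. The name `safe_delta`, the `COUNTER_MASK` constant, and the 32-bit counter width are assumptions made for this sketch.

```c
#include <stdint.h>

/* Hypothetical mask for a 32-bit free-running counter (assumption). */
#define COUNTER_MASK 0xFFFFFFFFu

/*
 * Return the cycles elapsed between two counter reads.
 * If the masked difference has its top bit set, the counter looks like it
 * jumped backwards (or forwards by more than half its range), so report 0.
 */
static uint32_t safe_delta(uint32_t now, uint32_t last)
{
    uint32_t delta = (now - last) & COUNTER_MASK;

    return (delta & 0x80000000u) ? 0u : delta;
}
```

With this sketch, `safe_delta(1000, 900)` yields 100, while `safe_delta(900, 1000)` yields 0 rather than a value close to 2^32. A normal wraparound such as `safe_delta(5, 0xFFFFFFFB)` still yields 10.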

This discussion has been archived. No new comments can be posted.

does not sound like closure to me (Score:4, Informative)

by Narcocide ( 102829 ) writes: on Friday January 09, 2015 @10:12PM (#48778913) Homepage

"probably a hardware bug" is code for "well, we bought new hardware and threw out all the old stuff, sorry"

- Re:does not sound like closure to me (Score:5, Informative)
  
  by thegarbz ( 1787294 ) writes: on Friday January 09, 2015 @10:34PM (#48779019)
  
  Re-read the summary. They know what is causing the lockup, they don't know what is making the system call which is triggering the bug. Once you know what is causing the lockup it can be fixed, and the hack that was written made the lock-ups stop. At no point did anyone throw out or try new hardware, though one thought is everything is originating from a hardware bug.
  
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  "probably firmware SMM code messing with the HPET counter behind our back" != "probably a hardware bug"
  - Re: (Score:1)
    
    by GlowingCat ( 2459788 ) writes:
    
    Maybe kernel or driver code is writing to the HPET counter accidentally. The kernel and drivers both have access to the same unrestricted memory space, right?
    - Re: (Score:2)
      
      by TechyImmigrant ( 175943 ) writes:
      
      Someone with the right equipment should be able to do a hardware trace and catch the culprit.
- Re:does not sound like closure to me (Score:5, Interesting)
  
  by sjames ( 1099 ) writes: on Friday January 09, 2015 @11:06PM (#48779163) Homepage Journal
  
  RTFA, they have good reason to point at the hardware. Then there's the bazillions of servers running on different hardware that have never seen the bug.
  Many teams would have written it off as a hardware bug a long time ago, but the linux kernel team was willing to consider and investigate the possibility that it was a rarely triggered bug in the software before they passed the buck.
  Sometimes it really is a hardware bug.
  
  - plus don't crash on bad hardware. Hotplugged CPU (Score:3)
    
    by raymorris ( 2726007 ) writes:
    
    > Many teams would have written it off as a hardware bug a long time ago, but the linux kernel team was willing to consider and investigate the possibility that it was a rarely triggered bug in the software before they passed the buck.
    And try to avoid crashing due to hardware bugs, if possible.
    A contractor once hotplugged one of the CPUs in one of my servers. That's right, they took the processor out and replaced it with the machine running. The box did not crash. It kept running at least for the f
    - Re: (Score:2)
      
      by sjames ( 1099 ) writes:
      
      Hot swapping the CPU without an immediate crash had to be a million to one shot!
      But yes, resilient software is always a good thing.
      I do hope Linus's patch goes in in some form to at least make it clear what the problem is if someone with similarly borked hardware sees the problem.
      - Re:plus don't crash on bad hardware. Hotplugged CP (Score:4, Informative)
        
        by TechyImmigrant ( 175943 ) writes: on Saturday January 10, 2015 @01:19AM (#48779629) Homepage Journal
        
        >Hot swapping the CPU without an immediate crash had to be a million to one shot!
        With QPI interconnect and the voltage and temp supervisory circuits on chip, it's not such a long shot these days, especially on Xeons with failover support that is explicitly intended to cope with a neighbor CPU going down.
        
        
        Re: (Score:3)
        
        by pasamio ( 737659 ) writes:
        
        Yes it's great to support hotplugged CPUs! 1969 called and they want to let you know they supported online reconfiguration back then too: http://en.wikipedia.org/wiki/M... [wikipedia.org]
        
        Re: (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        That's interesting. Apparently it was supported well enough that they actually did hotplug CPUs regularly, as standard practice. I wonder if they "unmounted" the components before removal and "mounted" them upon insertion. That's a much easier approach, especially for CPUs, than handling a CPU suddenly going AWOL.
        
        Re: (Score:2)
        
        by hitmark ( 640295 ) writes:
        
        USB has slightly longer contacts on the power pins for much the same reason.
        
        Re: (Score:2)
        
        by TechyImmigrant ( 175943 ) writes:
        
        Yes. Exactly this. Pulling the latches on the card generates an interrupt. In the systems I designed (for a mainframe raid disk system in this case), a little green light would light up when it was ready. So pull the latches out, wait for green light, pull the card out. The light generally lit up in a few milliseconds, so you could just rip the card out.
        I presume this is how it worked for all products from this (very large, well known) manufacturer, because that's what the spec required.
        
        Linux CPU hotplug support link (Score:4, Informative)
        
        by raymorris ( 2726007 ) writes: on Saturday January 10, 2015 @04:05AM (#48779977) Journal
        
        Replying to myself, but I figured someone reading this might be interested. Linux does support CPU hotplug where you disable the CPU before removing it. Your motherboard might get mad about it if it's not supported by the board, though.
        http://www.cyberciti.biz/faq/d... [cyberciti.biz]
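        For readers who want to see what "disable the CPU before removing it" looks like in practice, here is a minimal sketch. It assumes the standard sysfs control file `/sys/devices/system/cpu/cpuN/online`; the helper names `cpu_online_path` and `set_cpu_online` are made up for this example. Writing to the file needs root and a kernel built with CPU hotplug support, and some CPUs (often cpu0) may not expose it.

        ```c
        #include <stdio.h>

        /*
         * Build the sysfs path that controls whether a logical CPU is online.
         * Returns the number of characters written, or a negative value on error.
         */
        static int cpu_online_path(char *buf, size_t len, unsigned int cpu)
        {
            return snprintf(buf, len, "/sys/devices/system/cpu/cpu%u/online", cpu);
        }

        /*
         * Take a logical CPU offline (online = 0) or bring it back (online != 0).
         * Returns 0 on success and -1 on any failure (no such CPU, no permission).
         */
        static int set_cpu_online(unsigned int cpu, int online)
        {
            char path[64];
            FILE *f;
            int ok;

            if (cpu_online_path(path, sizeof(path), cpu) < 0)
                return -1;
            f = fopen(path, "w");
            if (f == NULL)
                return -1;
            ok = fputs(online ? "1" : "0", f) >= 0;
            if (fclose(f) != 0)
                ok = 0;
            return ok ? 0 : -1;
        }
        ```

        Calling `set_cpu_online(3, 0)` as root would ask the kernel to migrate work off logical CPU 3 and stop scheduling on it. Whether the board then tolerates physically pulling the processor is a separate question, as noted above.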
        
        
        Re: (Score:3)
        
        by sjames ( 1099 ) writes:
        
        Yes. It's mostly used for reconfiguring VMs, but it is possible to do it with real hardware if the board supports it.
        It's interesting how as time goes on, PC hardware is slowly coming to resemble an affordable version of the mainframes they replaced.
        
        Re: (Score:2)
        
        by hitmark ( 640295 ) writes:
        
        Was not one reason why mainframes were so highly valued that one could hotswap virtually anything without interrupting workflow?
        
        Re: (Score:2)
        
        by sjames ( 1099 ) writes:
        
        Yes, I can see that would limit the damage, but it still leaves the OS surprised to have running tasks just go away.
        It would likely work less well with AMD processors since a chunk of memory would also go away.
    - Re:plus don't crash on bad hardware. Hotplugged CP (Score:5, Funny)
      
      by cerberusss ( 660701 ) writes: on Saturday January 10, 2015 @02:37AM (#48779815) Journal
      
      Sometimes it
      Sometimes it -- what? Did someone attempt to hot-swap your CPU again? (-:
      
      - Re: (Score:2)
        
        by raymorris ( 2726007 ) writes:
        
        Sometimes it screws up the post, where "it" is the Android browser.
    - Re: (Score:2)
      
      by the_B0fh ( 208483 ) writes:
      
      Solaris supported hot pluggable CPUs in the last century!
  - Re: (Score:2)
    
    by PoochieReds ( 4973 ) writes:
    
    It's still not a given that it's the hardware. It's likely that something is scribbling over the HPET timer. As to whether that's due to faulty hardware or a software bug is still undetermined.
    Random memory corruption is oh so painful. :(
  - Re:does not sound like closure to me (Score:4, Funny)
    
    by tippen ( 704534 ) writes: on Saturday January 10, 2015 @12:47PM (#48781427)
    
    One of the more memorable quotes I heard while developing embedded systems: if you can fix it in software, it isn't a hardware bug
    Annoying as hell to the software team when it is clearly a bug in the hardware, but very true at a practical level for the engineering team trying to get product out the door.
    
    - Re: (Score:2)
      
      by sjames ( 1099 ) writes:
      
      I'm familiar with that one. Same thing happens in boot ROMs.
    - Re: (Score:1)
      
      by Anonymous Coward writes:
      
      if you can fix it in software, it isn't a hardware bug
      I'm a hardware and software guy, and I can tell you that is entirely bullshit. While I understand it may seem this way because sometimes software guys can't write a driver to save their lives, there are many bugs in hardware which are actual hardware bugs (race conditions, dropped interrupts, whatever) that have workarounds in software.
      I've seen buggy hardware NAND flash ECC units "fixed" by doing ECC entirely in software, leaving the hardware unit unused, and taking a big throughput hit.
      I also seem to reca
- - - - Re: does not sound like closure to me (Score:1)
        
        by Anonymous Coward writes:
        
        My windows servers have an uptime of 49 years, 31 days, 22 hrs, 15 mins and 4539 ms. No Linux server can beat that
        
        Re: (Score:2)
        
        by paulatz ( 744216 ) writes:
        
        Did you add up the uptime of all the 4096 servers?
      - Re: does not sound like closure to me (Score:1)
        
        by chentiangemalc ( 1710624 ) writes:
        
        If you are using a GUI desktop, Windows 7 or 8.1 is way more stable than many popular Linux GUIs, unless you load up your Windows machine with crapware / adware. Unfortunately most Windows machines come preloaded with crap.
        
        Re: (Score:1)
        
        by nobodie ( 1555367 ) writes:
        
        Everyone else? Like all hardware is OSX certified? Try putting any old HDD or SSD into a macbook and see how that works.
  - Re: (Score:2)
    
    by the_B0fh ( 208483 ) writes:
    
    bwahahahahahaha, come on, we need sarcasm font here!!
  - Re: (Score:1)
    
    by fidelleon ( 3533731 ) writes:
    
    Nice try, you troll.
In other words.. (Score:2, Funny)

by Anonymous Coward writes:

Closed NOTABUG?
Editors, edit! (Score:3)

by msauve ( 701917 ) writes: on Friday January 09, 2015 @10:25PM (#48778973)

"has made some machines running Linux to freeze... but him and John Stultz continue to back and forth"

Really?

- Re: (Score:3)
  
  by SeaFox ( 739806 ) writes:
  
  The second sentence isn't much better:
  Right down to his final week at Red Hat before Dave gave all his hardware back, Linus Torvalds managed to reproduce similar symptoms, by scribbling directly to the HPET timer.
  Was Linus at Dave's place working on the issue? Is the first part a sentence fragment and Dave did something before he gave his hardware back we aren't being told? Or is the first part really a continuation of the first sentence, and Dave was working on his writeup all the way until the deadline for returning his hardware?
him? (Score:1)

by Anonymous Coward writes:

him and John Stultz

Hey youse editors, you want I should take the mug out?
hardening is NOT blaming the hardware (Score:5, Interesting)

by dltaylor ( 7510 ) writes: on Friday January 09, 2015 @10:33PM (#48779011)

Too many clueless comments already that don't understand the difference between "blaming the hardware" and hardening to deal with demonstrably-broken hardware (and/or firmware for devices). I've spent years writing drivers for various OS', including Windows and Linux. It is rare for any complex device to be bug-free at the hardware level (look how many patches are BIOS-applied to CPUs, for example). Sometimes, under NDA, of course, the Windows driver writers are apprised of the deficiencies, or, at least, get better response from the vendor when an anomaly appears. Linux rarely gets that same assistance.
My favorite example, though, is all-IBM. We were porting AIX to the PS/2s and 370s. We consistently had problems with the diskette interface under AIX and the response from Boca Raton was always "it works in MS-DOS, so it's your code, not our hardware". When OS/2 came around, they ran into exactly the same problem in the hardware. By then, we had a work-around (slower, more locks, but no more glitches) which was how OS/2 got around it, as well.

- Re: (Score:2)
  
  by thegarbz ( 1787294 ) writes:
  
  Too many clueless comments already
  Not bad given you were the ~4th poster and 2 of them didn't mention the hardware.
  - Re: (Score:3, Funny)
    
    by kad77 ( 805601 ) writes:
    
    What you posted about his being the 4th post struck me as wrong, given how far it was down the page. I'm bored, so I took a moment to look at how many posts have an earlier timestamp than the one you are slamming (at least 8), and 2 make dismissive statements about hardware, including the first comment of article at 8:12, and another at 8:19 seemingly dismissing hardware as a possibility.
    So your snide comment is not based in fact. It's like you are reading a different page. Maybe you need glasses. An attitu
    - Re: (Score:2)
      
      by Dog-Cow ( 21281 ) writes:
      
      The other posts were, in fact, made later, but someone was messing around with the HPET timer and, well, bugs.
- "friend" and "foe", but no "neckbeard" (Score:1)
  
  by raymorris ( 2726007 ) writes:
  
  I wish Slashdot would allow me to mark users not just as "friend" or "foe", but as "neckbeard". :) That must have been 1986 or 1987?
  - Re: (Score:3)
    
    by dltaylor ( 7510 ) writes:
    
    0: I do shave my neck. :) In fact, the beard has been gone for more than a year.
    1: a bit later, early 1990; we all got a big laugh out of the 486SX/487 when those came out. https://en.wikipedia.org/wiki/Intel_80486SX [wikipedia.org]
    - meant in the best possible way. Gray beard. (Score:2)
      
      by raymorris ( 2726007 ) writes:
      
      PS I meant that in the best possible way. I didn't really think through the connotations of "neck beard" before posting. I was really thinking more "gray beard" , including wizardly connotations.
  - Re: (Score:2, Funny)
    
    by Anonymous Coward writes:
    
    AC here, no longer posting as myself since I've long lost my SO account, can't be bothered to find the password for the ancient yahoo email address, and after working on the inside in finance will probably never post an opinion (as my own) again. (Yes, that was a run on sentence.)
    If 1986 qualifies as a "neckbeard" you missed the mark, unless he's a Berkeley neckbeard. The '80s were a magical time when power ties, very bad print shirts, and driving your overpriced car with women and blow was available to an
    - - Re: (Score:2)
        
        by sound+vision ( 884283 ) writes:
        
        No, I think he's implying that coding has gone out of fashion (or at least no longer guarantees a high-paying job.)
        
        Re: (Score:2)
        
        by Lunix Nutcase ( 1092239 ) writes:
        
        No, I think he's implying that coding has gone out of fashion (or at least no longer guarantees a high-paying job.)
        Coding going out of fashion? Have you been living in a cave these last few years?
  - Re: (Score:2)
    
    by Lehk228 ( 705449 ) writes:
    
    Marking users as "neckbeard" on Slashdot has been available since the beginning. All you need to do is check if the user has an account on Slashdot; if so, neckbeard is present.
- Folds in space time continuum (Score:1)
  
  by Anonymous Coward writes:
  
  Obviously, it's folds in the space time continuum that are causing the HPET (the high precision hardware timer) to jump backwards, causing negative deltas and lockups.
  Perhaps a future version of ourselves has transcended space-time and is trying to contact us to help us with our bad harvests? Did Linus try to determine any kind of co-ordinates from the glitch?
  Has NASA seen any kind of weird portholes near Jupiter?
  - - Re: (Score:2)
      
      by thephydes ( 727739 ) writes:
      
      To understand that joke you need to be aware that in some places Uranus is pronounced your-anus (here in Oz, for example). The old 9th grade joke - "Mr R, can you see uranus with a telescope?" "yes if you use a mirror lens" ....
      - Re: (Score:2)
        
        by Teun ( 17872 ) writes:
        
        You should for once get out of your English-centric world and use the languages of the people who named the planet.
In the mean time... (Score:1)

by Anonymous Coward writes:

Windows still BSOD's and always will.
- No it doesn't (Score:2)
  
  by johncandale ( 1430587 ) writes:
  
  No it doesn't. Maybe you should upgrade past XP already and use a windows made in this century
  - - Re: (Score:2)
      
      by fnj ( 64210 ) writes:
      
      Whether or not you see a blue screen with a lot of text on it is beside the point. Every OS can potentially panic. Even if it's configured to paper over the problem by doing it quietly and rebooting, the system has gone tits up.
      - Re: (Score:2)
        
        by drinkypoo ( 153816 ) writes:
        
        How much would it cost to have a computer which could leave a trace of the cause of a lockup, even if the machine exploded?
        You would have to have double your main memory, basically. Not really that expensive.
        
        Re: No it doesn't (Score:2)
        
        by corychristison ( 951993 ) writes:
        
        The problem is that when the kernel panics, everything grinds to a standstill. More specifically: the hard drive controller/driver. How are you going to write the data if you don't have access to the disks?
        This is by design, as the disk controller could be the reason for the lockup, and you would potentially corrupt your entire disk by trying to write to it.
        I'm sure it's been thought of before, but my first thought is to include a very small chunk of memory on the motherboard, with a stupidly simple api that is
  - Re: (Score:2)
    
    by Osgeld ( 1900440 ) writes:
    
    Hell, I can't recall the last time I saw XP BSOD.
"closure" (Score:1)

by Anonymous Coward writes:

About as much as this year being the year of the linux desktop... no really, it's gonna be THIS year... promise.
"him and John Stultz continue ..." (Score:3)

by seyyah ( 986027 ) writes: on Friday January 09, 2015 @11:00PM (#48779135)

"... him and John Stultz continue to back and forth ..."
What in the world is happening, editors?

- Re: (Score:2)
  
  by Rick Zeman ( 15628 ) writes:
  
  "... him and John Stultz continue to back and forth ..."
  What in the world is happening, editors?
  The only editors on slashdot are some vi's, some pines, and a couple of notepads and textedit. Certainly, no human editors....
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  They have obviously outsourced the editing to India.
  - Re: (Score:2)
    
    by dwye ( 1127395 ) writes:
    
    They have obviously outsourced the editing to India.
    Or New Jersey
Call me crazy (Score:5, Interesting)

by Nyall ( 646782 ) writes: on Friday January 09, 2015 @11:55PM (#48779371) Homepage

Sorry if I've found the wrong stuff. I'm doing this via a quick googling...
Is this really the code for reading and writing the HPET?
http://www.cs.fsu.edu/~baker/d... [fsu.edu]
I've been a powerpc programmer in aviation for a while. If you need to read the time base register (also a 64 bit up counter) you have to be aware that your read might coincide with the lower 32 bits incrementing and carrying into the upper 32 bits. So you read the upper 32 bits, read the lower 32 bits, then re-read the upper bits and make sure the upper bits didn't change. If they did, repeat this process. But if they are the same then you combine the 32 bit halves into a 64 bit time and call it good.
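The read-upper, read-lower, re-read-upper loop described above can be sketched as follows. This is an illustration only: `read_counter64`, the `read32_fn` accessor type, and the simulated counter are invented for the example and are not taken from the kernel's HPET driver.

```c
#include <stdint.h>

/* Hypothetical accessor type: returns one 32-bit half of a 64-bit counter. */
typedef uint32_t (*read32_fn)(void);

/*
 * Read a 64-bit up counter exposed as two 32-bit halves.
 * If the upper half changed between the two reads of it, the lower half
 * carried into the upper half in the meantime, so try again.
 */
static uint64_t read_counter64(read32_fn read_hi, read32_fn read_lo)
{
    uint32_t hi, lo, hi2;

    do {
        hi = read_hi();
        lo = read_lo();
        hi2 = read_hi();
    } while (hi != hi2);

    return ((uint64_t)hi << 32) | lo;
}

/* Simulated counter for demonstration: it advances by one on every access. */
static uint64_t sim_counter = 0xFFFFFFFFull;

static uint32_t sim_read_hi(void)
{
    uint32_t v = (uint32_t)(sim_counter >> 32);

    sim_counter++;
    return v;
}

static uint32_t sim_read_lo(void)
{
    uint32_t v = (uint32_t)sim_counter;

    sim_counter++;
    return v;
}
```

Starting the simulated counter at 0xFFFFFFFF forces the carry to land between the first upper read and the lower read. A naive hi-then-lo read would return 0 here, while `read_counter64(sim_read_hi, sim_read_lo)` retries once and returns 0x100000003.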

- Re: (Score:1)
  
  by myforwik ( 1465003 ) writes:
  
  And what does writel do?
- Re: (Score:1)
  
  by Anonymous Coward writes:
  
  Is this really the code for reading and writing the HPET?
  Yup.
  I've been a powerpc programmer in aviation for a while. If you need to read the time base register (also a 64 bit up counter) you have to be aware that your read might coincide with the lower 32 bits incrementing and carrying into the upper 32 bits. So you read the upper 32 bits, read the lower 32 bits, then re-read the upper bits and make sure the upper bits didn't change. If they did, repeat this process. But if they are the same then you combine the 32 bit halves into a 64 bit time and call it good.
  That would be entirely wrong here.
  The upper 32 bits of the current timer value are latched into the register at the upper address when the lower 32 bits are read from the lower address.
  - Re: (Score:2)
    
    by Nyall ( 646782 ) writes:
    
    OK then. Where in this return statement are the lower 32 bits read first? I don't believe the bitwise or operator is a sequence point. (The logical one is)
    return readl(addr) | (((unsigned long long)readl(addr + 4)) << 32LL);
    http://www.intel.com/hardwared [intel.com]...
    but I did find the following, which documents the race condition I explained above.
    http://www.intel.com/content/d... [intel.com]
    I will search for newer documentation than a 1.0a.
    - Re: (Score:2)
      
      by WinstonWolfIT ( 1550079 ) writes:
      
      Might want to check your first link.
      - Re: (Score:2)
        
        by Nyall ( 646782 ) writes:
        
        Sorry for the bad post. Yes, the first link does not work, but it is what is documented in hpet.c as the reference. A sentence went missing somewhere saying that I couldn't find it. The second link, which does work, is what I've found so far. I have yet to find something newer which documents the latching behavior that was claimed.
        Sorry again for the bad post.
        -Nyall
  - Re: (Score:2)
    
    by _merlin ( 160982 ) writes:
    
    The upper 32 bits of the current timer value are latched into the register at the upper address when the lower 32 bits are read from the lower address.
    Well in that case, you'd need to ensure the lower 32 bits are read first so you're reading the upper 32 bits that you latched this time through, not last time through. And if that's the case, the code is still wrong because there's nothing to force a sequence point between the two reads. The compiler is free to re-order the two reads in that expression.
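    To make the ordering concern concrete, here is a minimal sketch of forcing the low read to happen before the high read by putting them in separate statements. `mmio_read32` and `read64_lo_then_hi` are hypothetical stand-ins and not the kernel's `readl`/`readq`. The sketch only addresses compiler ordering of the two accesses, and says nothing about whether the hardware latches the upper half.

    ```c
    #include <stdint.h>

    /* Hypothetical stand-in for an MMIO read; volatile keeps the access in place. */
    static inline uint32_t mmio_read32(const volatile uint32_t *reg)
    {
        return *reg;
    }

    /*
     * Read the low half first, then the high half, in separate statements.
     * Each statement ends at a sequence point, and the compiler may not
     * reorder volatile accesses across sequence points. Written as a single
     * expression "read(lo) | read(hi) << 32", the order would be unspecified.
     */
    static uint64_t read64_lo_then_hi(const volatile uint32_t *base)
    {
        uint32_t lo = mmio_read32(&base[0]); /* low 32 bits, byte offset 0 */
        uint32_t hi = mmio_read32(&base[1]); /* high 32 bits, byte offset 4 */

        return ((uint64_t)hi << 32) | lo;
    }
    ```

    If the device really does latch the upper half on a read of the lower half, this ordering is what makes the pair of reads consistent. If it does not, a retry loop on the upper half is still needed.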
- Re: (Score:3)
  
  by DamnOregonian ( 963763 ) writes:
  
  That code doesn't suffer from the problem you think it does.
  
  readq is only defined in that code if undefined elsewhere, and is only used to read counters on 64-bit architectures.
  
  on 32-bit architectures, that code uses readl to read the counter.
  
  readq is undefined in some 32-bit architectures, so is defined there- but only used there to read the configuration register (not likely to roll over ;)
  
  Also, the actual reading of the counter is done indirectly: it's returned from the IRQ handler for the HPET.
  - Re: (Score:1)
    
    by hendric ( 30596 ) * writes:
    
    http://www.cs.fsu.edu/~baker/devices/lxr/http/source/linux/arch/x86/include/asm/io.h#L49
    Line 49 looks like where readq is defined for x64 architecture.
Freezes on Mac under Parallels (Score:1)

by iMactheKnife ( 2556934 ) writes:

I had the freeze bug in a VM system on a Mac running Parallels. I downloaded Ubuntu 14.04 from Parallels and could not get around it. Then I downloaded directly from Canonical and it worked just fine. I assumed it was a bad download from Parallels, but perhaps it is more subtle. The virtual machine has the same vulnerabilities - is that a clue?
How to Follow this Bug (Score:1)

by 4rest ( 725123 ) writes:

I am affected by this bug, but can't seem to find any real place to follow it. I searched https://bugzilla.kernel.org/ [kernel.org] but that didn't turn up anything. Anyone know where the source of truth for tracking this issue might be located?
- Re: (Score:2)
  
  by Hognoxious ( 631665 ) writes:
  
  Was it caused by Monkeeing around?
