Next.js App Router + React Server Components Demo

NHacker Next

new
past
show
ask
show
jobs
submit

▲Finding a CPU Design Bug in the Xbox 360 (2018) (randomascii.wordpress.com)

152 points by mariuz 5 days ago | 53 comments

chasil 1 days ago [-]

It is interesting that IBM dominated this generation of consoles, and was vanquished in the next.

The high failure rates of the Xbox 360 did not help.

https://en.wikipedia.org/wiki/Xbox_360_technical_problems

vondur 1 days ago [-]

I thought the design flaws of the Xbox 360 cooling system had more to do with Microsoft than any inherent design flaw by IBM. I assumed that switching to x86 processors let Microsoft leverage their native developer tools from Windows which helped developers.

chasil 1 days ago [-]

The main issue was revealed to be solder.

"Microsoft did not reveal the cause of the issues publicly until 2021, when a 6-part documentary on the history of Xbox was released. The Red Ring issue was caused by the cracking of solder joints inside the GPU flip chip package, connecting the GPU to the substrate interposer, as a result of thermal stress from heating up and cooling back down when the system is power cycled."

timw4mail 1 days ago [-]

And there was the same problem with early PS3s, on Nvidia's GPU package...it was a fairly widespread problem at the time.

rwmj 1 days ago [-]

And Apple iBook G3s too. There's a whole thing with owners reflowing the GPU: https://www.instructables.com/Fixing-the-infamous-iBook-scre...

keyringlight 1 days ago [-]

I seem to recall baking PC nvidia GPU boards in your oven was a reasonably common out-of-warranty fix around that era.

Moosdijk 1 days ago [-]

I had to do this with my MacBook Pro models early 2015 and late 2017.

It seems like there was a period in time when solder just wasn’t done well, it seems like.

rangestransform 1 days ago [-]

IIRC this is to do with the phase in of RoHS and bad lead free solder

hbn 1 days ago [-]

I don't have any solid numbers on me, but I believe early 360s failing wasn't just widespread; it was straight up most of them dying within the first couple years. It's honestly insane they more or less got away with that. And I guess also speaks to how much Microsoft was killing it in that era that people were willing to go through multiple console RMAs (which I heard was a terrible, slow, and unreliable process) to play 360 games. How far they've fallen.

Aarostotle 1 days ago [-]

Simple answer: Halo 3.

mrguyorama 1 days ago [-]

It was something like 25% - 50% of all first version 360s died.

Microsoft spent over a billion dollars replacing and repairing consoles to maintain the good brand name of Xbox.

https://en.wikipedia.org/wiki/Xbox_360_technical_problems

rustystump 1 days ago [-]

Family got first gen 360. Still works to this day. We hit the jackpot with that console. It out lasted 2 wiis and a ps2

kjkjadksj 1 days ago [-]

Whenever we lost a 360 we got a pre owned 360 from gamestop. I think they went for like $70 for one without any hdd.

bombcar 1 days ago [-]

That was the real story, by the time they started dying you could just grab a working one for "decently cheap" if you still cared.

However, I wonder how many people got "burned" by it and swore off Xbox consoles going forward.

I know that era we got a lot more use out of the Xbox (original) and the Wii.

kjkjadksj 4 hours ago [-]

I knew plenty of people who had rrod no one swore off xbox though. You were still in the ecosystem through games, controllers, xbox live membership, and in my case network effects since we all played halo.

thenthenthen 1 days ago [-]

Sounds like the 2012(?) Macbook Pro after the switch to leadless solder (?). I had to cook my motherboard 3 times in the oven to revive it.

chasil 1 days ago [-]

Funny!

I've heard that flash memory can also be revived with heat, either long duration or high intensity.

https://www.extremetech.com/science/142096-self-healing-self...

thenthenthen 1 days ago [-]

Some macbook hacks involved disabling sleepmode, running a benchmark and putting it in a pile of blankets for a few hours

esaym 1 days ago [-]

When did the industry transition to different/lead free solders? Wonder if that was part of the issue?

monocasa 1 days ago [-]

Yeah, it was the transition to RoHS.

hrmtst93837 1 days ago [-]

Sony stuck with Cell even longer and it locked them into a decade of weird ports and missed optimization tricks, so "vanquished" cuts both ways.

brucedawson 24 hours ago [-]

It did not help that the performance of the Xbox 360 CPU (and Cell) was terrible. In-order execution without the crazy-high frequencies the CPU was designed for was a failure.

cptskippy 1 days ago [-]

> It is interesting that IBM dominated this generation of consoles, and was vanquished in the next.

IBM's Power was the only logical option at the time.

These consoles were being designed around 2000. Intel and AMD weren't partnering on bespoke CPUs at that time. I don't even think AMD would have been considered a viable partner. Neither had viable 64 bit options and part of console marketing at the time was the ever increasing bit depths.

Prior console generations had use MIPS which wasn't keeping up with ever increasing performance expectations and players like Toshiba and Sony were looking for a higher performance CPU architecture. IBM's Power architecture was really the only option. Sony, Toshiba, and IBM partnered to develop their a new 64 bit microarchitecture called Cell.

Microsoft's first console was basically a PC and that's how everyone saw it. The 360 was an opportunity for Microsoft to show that it could compete with the big boys. It was also an opportunity to keep a toe dipped in RISC, because it had dropped support for RISC CPUs with Windows 2000.

Grazester 1 days ago [-]

By the way, the AMD athlon 64-bit launched 2003. The PS3 launched in 2006. I had an AMD64 bit process in my laptop in 2005.

What wasn't viable?

Narishma 1 days ago [-]

Yeah that part didn't make sense, not to mention that neither the PS3 nor the 360 were running 64-bit software. They didn't have enough memory for it to be worth it.

sidewndr46 1 days ago [-]

you don't need memory to make 64 bit software worth it. Just 64 bit mathematics requirements. Which basically no video game console uses as from what I understand 32-bit floating point continue to be state of the art in video game simulations

duped 1 days ago [-]

Fundamentally it's still a memory limitation, just in terms of memory latency/cache misses instead of capacity. If you double the size of your numbers you're doubling the space it takes up and all the problems that come with it.

sidewndr46 1 days ago [-]

No it isn't. The 64-bit capabilities of modern CPUs have almost nothing to do with memory. The address space is rarely 64 bits of physical address space anyways. A "64-bit" computer doesn't actually have the ability to deal with 64 bits of memory.

If you double the size of numbers, sure it takes up twice the space. If the total size is still less that one page it isn't likely to make a big difference anyways. What really makes a difference is trying to do 64-bit mathematics with 32-bit hardware. This implies some degree of emulation with a series of instructions, whereas a 64-bit CPU could execute that in 1 instruction. That 1 instruction very likely executes in less cycles than a series of other instructions. Otherwise no one would have bothered with it

duped 44 minutes ago [-]

You misread my comment. I'm not saying that it limits the amount of memory, I'm saying that _using more memory has cost_.

> If the total size is still less that one page it isn't likely to make a big difference anyways

It makes a significant difference when you're optimizing around cache behavior and SIMD lanes.

bombcar 1 days ago [-]

"Bitness" of a CPU almost always refers to memory addressing.

Now you could build a weird CPU that has "more memory" than it has addressable width (the 8086 is kind of like this with segmentation and 8/16 bit) but if your CPU is 64 bit you're likely not to use anything less than 64 bit math in general (though you can get some tricks with multiple adds of 32 bit numbers packed).

But a 32 bit CPU can do all sorts of things with larger numbers, it's just that moving them around may be more time-consuming. After all, that's basically what MMX and friends are.

chasil 1 days ago [-]

The original 8087 implemented 80-bit operands in its stack.

It would also process binary-coded decimal integers, as well as floating point.

"The two came up with a revolutionary design with 64 bits of mantissa and 16 bits of exponent for the longest-format real number, with a stack architecture CPU and eight 80-bit stack registers, with a computationally rich instruction set."

https://en.wikipedia.org/wiki/Intel_8087

comex 24 hours ago [-]

Typically, it doesn't have the ability to deal with a full 64 bits of memory, but it does have the ability to deal with more than 32 bits of memory, and all pointers are 64 bits long for alignment reasons.

It's possible but rare for systems to have 64-bit GPRs but a 32-bit address space. Examples I can think of include the Nintendo 64 (MIPS; apparently commercial games rarely actually used the 64-bit instructions, so the console's name was pretty much a misnomer), some Apple Watch models (standard 64-bit ARM but with a compiler ABI that made pointers 32 bits to save memory), and the ill-fated x32 ABI on Linux (same thing but on x86-64).

That said, even "32-bit" CPUs usually have some kind of support for 64-bit floats (except for tiny embedded CPUs).

monocasa 22 hours ago [-]

The 360 and PS3 also ran like the N64. On PowerPC, 32 bit mode on a 64 bit processor just enables a 32 bit mask on effective addresses. All of the rest is still there line the upper halves of GPRs and the instructions like ldd.

monocasa 1 days ago [-]

Parts of the 360 did. The hypervisor ran in 64bit mode, and use multiple simultaneous mirrors of physical address space with different security properties as part of its security model.

phire 20 hours ago [-]

It's not like the games weren't running in 64 bit mode too (on both consoles)

They had full access to the 64 bit GPRs. There wasn't anything technically stopping game code from accessing the 64 bit address space by reinterpreting a 64 bit int as a pointer (except that nothing was mapped there).

It's only the pointers that were 32 bit, and that was nothing more than a compiler modification (like the linux x32 ABI).

They did it to minimise memory space/bandwidth. With only 512 MB of memory, it made zero sense to waste the full 8 bytes per pointer. The savings quickly add up for pointer heavy structures.

I remember this being a pain point for early PS3 homebrew. Stock gcc was missing the compiler modifications, and you had a choice between compiling 32 bit code (which couldn't use the 64bit GPRs) or wasting bandwidth on 64 bit pointers (with a bunch of hacky adapter code for dealing 32 bit pointers from Sony libraries)

monocasa 4 hours ago [-]

Games themselves ran in 32 bit mode.

The difference is that on PowerPC, 32bit mode on 64bit processors (clearing the SF bit in the MSR) is just enabling a hardware 32bit mask on the effective address before it gets translated into a virtual address.

Unlike on x86-64 and arm64, there's no free (or even that cheap) way to do an ILP32 abi purely in software. x86 and arm allow encodings for memory reference instructions that only use the bottom half of the registers (the E* registers on x86, and the W* registers on arm64). No such encoding exists on PowerPC for memory reference instructions, so you'd be stuck manually masking each generated pointer.

Because of that, the compiler hacks you're talking about are kind of the opposite from what you're describing. The hacks are because on the upstream gcc PowerPC backend, having a 32bit pointers in hardware and having operations that operate on 64bit quantities had the same feature flag despite technically being able to be separately enabled on actual hardware. It was just very rare to do so. So the goal of the hacks was to describe to the compiler that the target has 32 hardware pointers, but still can issue instructions like ldd to operate on the full 64bit GPRs.

chasil 1 days ago [-]

I have some confidence that AMD's acquisition of ATI had a huge impact.

That allowed both a CPU and an advanced GPU to be on the same die.

They also wisely sold Global Foundries, and were able to scale with TSMC.

0x457 22 hours ago [-]

Because consoles don't use off-the-shelf CPUs for many reasons. Neither Intel nor AMD of that time would even consider making a bespoke CPU for Sony or MS.

Even they could use off-the-shelf SKU it wouldn't be viable - neither one had one that fits in power envelope (not that it helped xbox...)

pezezin 20 hours ago [-]

Consoles used off-the-shelf CPUs until the 6th generation. Even the Dreamcast and the first Xbox used off-the-shelf CPUs, it was only the PS2 and the GameCube that started the trend of using custom-made CPUs.

phire 19 hours ago [-]

Not entirely accurate.

The PSX's CPU is semi-custom. The core is a reasonably stock R3000 CPU, but the MMU is slightly modified and they attached a custom GTE coprocessor.... I guess you can debate if attaching a co-processor counts as custom or not (but then the ps4/xbone/ps5/xbs use unmodified AMD jaguar/zen2 cores)

IMO, the N64's CPU counts as off-the-shelf... however the requirements of the N64 (especially cost requirements) might have slightly leaked into the design of the R4300i. But the N64's RSP is a custom CPU, a from scratch MIPS design that doesn't share DNA with anything else.

But the Dreamcast's CPU is actually the result of a joint venture between Hitachi and Sega. There are actually two variants of the SH4, the SH4 and SH4a. The Dreamcast uses the SH4a (despite half the documentation on the internet saying it uses the SH4), which adds a 4-way SIMD unit that's absolutely essential for processing vertices.

We don't know how much influence Sega's needs had over the whole SH4 design, but the SIMD unit is absolutely there for the Dreamcast, I'm pretty sure it's the first 4-way floating point SIMD on the market. The fact that both the SH4/SH4a were then sold to everyone else, doesn't mean they were off the shelf.

Really, the original Xbox using an off-the-shelf CPU is an outlier (technically it's a custom SKU, but really it's just a binned die with half the cache disabled).

monocasa 22 hours ago [-]

They would have started designing the systems in 2003, and one of the first choices is CPU partner.

Do you trust the new line of CPUs that just launched that year?

cptskippy 1 days ago [-]

You have to remember that the AMD and Intel of today are very different companies than they were 20-25 years ago. AMD split off it's fab capabilities, acquired ATI, adopted TSMC as a fab, and developed a custom silicon business.

At that time AMD wasn't in the custom CPU business, AMD64 was a new unproven ISA, and x86 based CPUs of that time were notoriously hot for a console. These were also some of the reasons why Microsoft moved away from the Pentium III it had used in the original Xbox.

The PS3 was launched in 2006 but the hardware design was decided years earlier to provide a reference platform for the software.

TazeTSchnitzel 1 days ago [-]

This article is from 2018.

Previously:

- https://news.ycombinator.com/item?id=16094925

- https://news.ycombinator.com/item?id=27480448

dev-ns8 1 days ago [-]

Wow... A speculative branch prediction path actually get's preemptively executed despite the branch outcome? No matter if the execution has side-affects??? That's quite amazing. Are modern CPUs doing speculative execution like this and just put extra safeguards around affects or do they just prefetch / decode instructions now-a-days?

brucedawson 24 hours ago [-]

Author here: This is not a common problem. I think I was told that Alpha had basically the same bug but it is a bug, for sure. Speculative execution causing problematic side effects is a deal killer.

Speculative execution, however, can cause less problematic side effects. For instance, a speculatively executed load or prefetch will usually actually prefetch which will pollute the cache, TLB, etc., and reveal side-band information, but that is a performance problem and perhaps a subtle security flaw, not a correctness bug like this was.

MBCook 23 hours ago [-]

I remember reading this many years ago, it was great.

In the last week Raymond Chen on his The Old New Thing mentioned the concept of delay slots on some CPUs.

It sounds like a similar thing, just formalized so it’s not a bug. Knowing that the instruction after a branch always executes, regardless of if the branch is taken.

Do I have that right?

cumshitpiss 1 days ago [-]

[dead]

jszymborski 1 days ago [-]

Would need "(2018)" in the title.

NooneAtAll3 1 days ago [-]

unrelated, but recently XBox One was hacked for the first time

https://www.youtube.com/watch?v=FTFn4UZsA5U

brcmthrowaway 1 days ago [-]

How does XBox get hacked when it uses Secure Boot?

Tuna-Fish 1 days ago [-]

Voltage glitching. An outside attacker who has direct, extremely fine-grained control over the power supply to the chip can cause it to brown out for one instruction cycle, preventing a result of an instruction from being written.

With enough sophistication, physical access is more powerful than root access, no exceptions.

23 hours ago [-]

1 days ago [-]

alumno_007 1 days ago [-]

Sobre calentamiento

Rendered at 20:26:25 GMT+0000 (Coordinated Universal Time) with Vercel.