if Nvidia is so bad, then what does that make the other graphic cards?
Well, after reading this really long article about how lousy nvidia is, all I can say is that I have have tried about every brand of video card out there before switching to (and sticking with, for about 7 years now) and I have never had a software compatability issue with a nvidia video card.
Wish I could say the same for the other brands of video cards. Quite a few of them are really bad about this too, like brands such as Raedon (don't know if I spelled that righ).
A few of the more modern pc games I have now are really graphic intensive, and I had to upgrade from a nvidia geoforce 5200 to a nvidiea geoforce fx 9100, but as long as I have always made sure that I had the required or better video card for the software I get, I have never had any such issues with a nvidia card.
Other brands I have tried in the past were always buggy with some software or another, even though the software was within the capabilities of the video card.
The graphics either didn't run smoothly, no matter how you tweaked the settings, or the software needed to be patched for that specific card, or the software just plain didn't run at all, even though it was suppose to with the video card in question.
So my question is, if nvidia makes such a bad video cards, defects and all, then what does that make all of these other brands? Something much worse than just 'bad' I suppose.
Wanted to correct some statements on the nVidia article, at a risk of getting too technical.
1. High lead bumps (typically 95/5 Pb/Sn) are more ductile than "eutectic" (~ 63/37 Sn/Pb). Eutectic bumps are stronger and more resistant to fatigue ('broken fork' model) but this increased stiffness translates more stress to the GPU and can lead to delamination in the multi-layer stack of dielectric and copper that interconnects all the circuits on the GPU. Intel has moved to plating copper columns that are capped with a lead-free solder on recent 45 nm processors. The stiff copper column transfers a lot of stress to the column - processor interface, so Intel had to do a lot of engineering work to make this interface reliable.
2. I believe nVidia used eutectic on older technology GPUs, and changed to high lead flip chip bumps when they went to 90 nm and 65 nm technology nodes for the GPUs. This was done specifically to try to avoid cracking / delamination between the bump and the interconnect stack on the GPU. Bonding a high lead bump to the eutectic solder cladding on the substrate bond pad results in the molten eutectic solder dissolving some of the high lead bump, creating a tin-rich phase bonding the remaining unaffected high lead bump volume to the bond pad on the substrate. There is nothing wrong with this connection.
3. Flip chip bumps arrayed on a GPU on a pitch of ~180 - 200 micrometers, are typically sized such that they can handle ~100 milli-amps per bump with a junction temperature of ~100 Celsius for about 10 - 20 years of use. 
4. One aspect that is not noted in the story is that the thermal management solution (heatsink, how the heat sink is pressed against the back side of the GPU, how much pressure is applied to the GPU back side, how the GPU package is supported on the board, etc., has a tremendous impact on the integrity of the flip chip bump interconnect. The greater the pressure on the GPU, the greater the potential damage to the interfaces of the bump to the GPU and the bump to the substrate pad. 
5. The stated risk of the polyIMIDE (not polyamide)coating on the GPU being torn from the die is not the risk imposed by a high Tg / high stiffness (high modulus). If the underfill is too stiff, it may protect the bumps from any form of degradation quite nicely, but it may succeed in causing the multilayer stack of insulating layers and copper to delaminate from the GPU, typically starting in the corners of the GPU. So it is correct that the underfill must be carefully chosen based upon the material properties of thermal expansion (above and below the Tg where it softens a lot), the modulus of elasticity (stiffness) over the intended temperature range of use, and Tg. 
6. So since the thermal solutions in laptops and desktops are so different, while the normal (as supplied) junction temperatures can be expected to be about the same, the use of high pressure on the die backside, with a junction temperature higher than the Tg (underfill softens a lot), with a lot of power cycling, aggravated by electromigration (atoms diffuse in the direction of the electron flow), can lead to accelerated degradation of the flip chip bump bond interfaces.
7. Flux is used to help the flip chip solder bumps flow and bond well, and the residue of this flux is often not removed before the underfill is applied, or is the residue is removed with cleaning processes, some residue is commonly trapped betwee the solder connection and the green solder mask material on the surface of the substrate. This residue can melt at low temperatures and further aggravate the reliability of the interconnection.

Flip chip packaging is complicated....
eutectic isn't the way to go then... neither is high-lead on eutectic...
why didn't they just switch to high-lead organic substrates? That seems like it would solve the current density, stress, and MTBF problems all at once.

Let me guess, RoHS considerations?
You mention 40% of specific parts have early failure rates. Which specific parts, and what is considered early failure? This would be good info to give everyone a baseline.
Nice article, but let me add the good reason for high lead bumps. Eutectic solder has a much lower melting temperature, and that is the traditional alloy for attaching the package board to the mother board. You want high lead for attaching the die to the package so that those bumps don't melt when the package is attached to the mother board. Of course this is all getting thrown out the window in the face of Rohs compliance, so be ready for more problems in the future as well as higher costs.
Heat raises resistance in most materials. True, there are certain materials, like semiconductors, with a negative temperature coefficient (NTC), but the vast majority are the opposite. Bill Hewlett (of Hewlett-Packard) first exploited tungsten's POSITIVE temperature coefficient to stabilize an oscillator by putting a light bulb in the feedback loop. That was in Hewlett-Packard's very first product: an audio oscillator. The rest is history.

The rest of your post is pretty much gibberish. High-K gates have lower leakage due to better oxide performance; they don't handle heat better, they dissipate less heat at idle, is all.
Thanks for another worthless opinion piece! Ever gonna put some substance into these fluffy articles???
"Mac - I'm not a Mac! Why doncha turn your lights on, ya mo-ron"
Heat Lowers Resistance, As light Bulb has very high resistance, yet when it approachs red hot, resistance goes down & final white hot tungstun has nearly NO resistance, few Ohms. Yet if item has 2 ohms resistance, then made to 4 ohm, twice heat is generated at 4 ohm than two in same circumstance. Maybe HIGH K gates are solution. Made for Heat.

Cyandie is Metal, yet if Manufacturing Process is flawed, resistance of Metal greatly increases or worse goes down to nil, allowing way too much current to pass. 
I am just speculating that somewhere in RED Zone in photo, transistors have been corrupted in Manufacturer & it takes but few to screw up transfer to internal chip memory & become much hindered by inherent defects.Gasified Metal recondensed into aminoliquid slime, changing Metal characteristics, at Gate. Over Blasting Next Gates with Flawed Signal.

Plug in Video memory was standard, then advances in manufacturing made it hardwired again. Funny that CPU isn't advanced back to hardwired, except 1,300 connection points is bit much to solder. 
Perhaps snap in, grip in, slot chips have something to offer in this problem, to avoid manufacturering heat.Basic first thoughts, Tam.
drashek
There's a good reason why desktop parts are not *yet* showing up in large quantities: they aren't as stressed as the laptop parts.

The desktop parts aren't cycled as often and are generally more effectively cooled. It doesn't make them any less likely to fail - it simply means that the NV desktop chips are on a longer fuse before the bomb detonates, that's all.
if you lot can remember as far back as 2003, there was a large scale replacement of the white G3/G4 iBook due to the GPU BGA bump detachment over time. I personally got Apple to change a 600MHz and a 1GHz.
Apparently a lot of people who went through the effort of commenting on this article failed to read the previous articles by chucky on this topic.

His last article claimed that more chips are defective due to the exact same reason as the previous chips ... that 'same' reason being related to the replacement of the solder even though the original reason was the very different relation to the replacement of the underfill material.

Chucky also has not acknowledged that Nvidia must use low lead materials to be RoHS compliant, simply to be allowed to sell Nvidia parts into some markets.

Chucky really has no idea if the bumps or underfill are causing the chip failures, he has just observed the blind correlation between the material switching and shipping of supposedly fixed products. 

This is no different than the natural health product scams or the general scam of religion, in that they all use coincidence as proof of effectiveness (or defectiveness).

Readers should also be reminded that by Chucky not only kept writing about Vista, but kept using Vista long long after his long brutal rant about how he would never use or write about Vista ever again.

Chucky is just trying to hide the shame of his Vista lies with the shame of pretending he knows anything at all about the high failure rates of recent Nvidia products.

It is quite apparent that Chucky's biggest experience with any lead containing product was the paint chips he gnawed of the walls of his childhood home.
I don't see any firms stating 40% failure rates on any cards (AMD or NV). Surely that would be HUGE news.

Nothing you've explained proves anything about NV's parts. Still just opinion crap that isn't provable. List sources at companies. Or better yet post their emails to you that are complaints about NV failures so we can read them for ourselves.

Ohh, that's right, you have no evidence, just more whining. Go back to handing out those flyers again...LOL. Get a few friends, your 3 person protest didn't work...ROFL. You're not the only one with connections in this business and NONE of them are writing this stuff or saying it. IF NV had 40% failures on their cards we'd all know, and on top of that they would have been taking a BILLION charge, not 196mil (confirmed as a ONE TIME ONLY charge during conference call - or did you miss that?). :)
Great article. You stuck to the hard facts and not even the hardest NV fanboy can argue the truth. NV messed up, and now we know why. Know, if only they knew.....lol

Seriously though, I think Charlie here knows more about chips than the current Supervisor at NV. Go apply for a job and save them!
I don't know if anyone else is noticing this, but if Charlie wrote this article, he must have slept at a Holiday Inn the night before. This writing is clearly not representative of his style or content. Not only that, but suddenly he's an engineer...?
It's a plausible flaw, but I don't see the credible evidence.
Without a lot of measuring and testing, one can't be sure Charlie is correct. However, the two verifiable facts - the $200 million USD charge and the PCNs - do fit his explanation nicely.

I'm not sure what joke drashek found - no resistance equals no heat, increasing resistance or voltage means increased heat. This isn't your "Warning: 10,000 Ohms" sniggering opportunity.

And these fanbois - classic case studies in deviant psych. Shouldn't you all be at your religion classes discussing how we should all think about more important issues?
Charlie, This article along with part two was great. Thanks for the technical explanation of what is going on. One question for the general public, though... Why not just use Liquid Cooling and lessen your chances of thermal failure?
Nice article, but please, when trying to give evidence of the size of something, could you use a non nation specific scale.

I have no idea what size a quarter is, except it's going to be in the coin size region. If it's the size of a UK 5p, or a UK £2 coin is unknown to me.

Could you try a universal scale, say a ruler. I know there are two scales for this, metric and imperial, luckily most rulers I have seen cater for this, showing both scales.

If the reporter responsible is in the US and unable to obtain such a multi-scale ruler, I will happily send him one!
"But, but the AMD add that is (or isn't in my case) at the top of the page means that the Inq is obviously biased'.

Give me a break. The Inq is so, so kind to AMD and it's Phenomenal failure. nVidia made a poor engineering decision. Period. They have a lot of bad chips out there, you're just seeing the laptop stuff die first because it's a much more stressful environment with a lot more heat cycling.

As for the 4850, that's more a case of trying to cool a card that should have had a dual slot cooler with a single slot cooler. That, should it become a major problem, could be remedied a hell of a lot easier than nVidia's current mea culpa.
First of all, nice explanation of the tech obstacles and compromises of CPU/GPU engeneering.

Yhe proble is with all your hand waving and prancing around angrily I am still not seeing ANY significant failure rates in nVidia video cards.

I have over 100+ desktops all running high performance nVidia GPUs and many with nVidia mainboard chipsets all running hi res graphic workstations.

Where are the failures? Why am I not seeing increased failure rates?

This is the problem with your entire campaign against nVidia and their engineering choices. Engeneering choices that every high current/temp chip manufacturer must face.

So listen Chuck, where's the meat?

CD Baric
This is one of better articles Charles has written on Complex subject. Simple area is that chip only does one of four things. It inverts voltage, it passes voltage, it converts voltage to only high or only low string. thats it, it depends on instruction to comparrison unit for each blast.

How to keep chip stable to perform those simple functions that then go to output of memory or device is question. Chip isn't so new to need deep understanding of physics to make it work, yet it helps in knowing why it dosn't work.

You mention high voltage & high resistance, good. Both Culprits, Maybe in with Mike Magee. I don't know. Gags are only funny when things are right.

Third culprit in this story is unmentioned in detail, although mentioned. Beyond temperature of cystalization of solder, theres heat itself migrating to new transistors during Manufacturer. that is reason Modern CPU has mechanical connection by grippers for Pins.

Maybe at such smaller scale, some critical transistors are being preburnt, forcing higher voltage problems, leakage El Maximo & just warmed up like toast, crispy edges of nodules of baked metal & substrate cyanide mixed. Making Voltage passage irregular due to Resistance variations thruout on transistor by transistor basis. Tin melts about 490F, Lead melts at 620F, So poor little Dralins' are Being Fried before warm mellow constant cooking. By being too close to solder point, or solder point too Hot.Or cooling of solder NOT deep enough Nor Strong enough. Say after few good hacks machinery holding metal contacts gets too hot & too Much heat passses into gpu transistors. 

Reverand Tom States: Ashes to Ashes & Defectives to Trash.
drashek
Well well....

Thanks Charlie for giving us the technical reasons for how and why NV 55nm and 65nm chips are failing. It should be pretty obvious by now to NV fanboys that these reports are based on facts and science, not "NV Hate". Bad chip designs from the beginning. It's all going to come out, no matter how NV tries to spin it. Watch and see.
Meanwhile people discovered that furmark which is used to test graphics cards suddenly runs slow on ATI 5850/70 cards with the latests drivers and then they found out that those drivers specifically recognise the furmark exe and slows it down, because apparantely the 4850 overheats and according to some actually dies sometimes when running it.
In other word the ATI cards cannot run at 100% it seems, now when will we read about that on the inq? Perhaps in a article named 'why ATI cards are defective'?
And why didn't that bit of news show up yet on the inq? At least charlie has the excuse of sleeplessness and jetlag and distraction from the Intel IDF thing, but not everybody at the inq does surely.
wow!, I am surprised at the technical know-how of Charlie in packaging technology.

Has intel started giving prepared articles also to publish, in addition to lots of money to Inquirer???
"whine/moan Charlie is biased against nVidia blah blah..."

... thank you, once again, for a clearly reasoned and well thought out look at the reality of the situation, which will inevitably be cited by ignorant fanbois as blatant bias.
I work at a UK based computer hardware supplier and we see roughly the same amount of RMA's for Nvidia and ATI cards. I don't doubt that Nvidia might have cut corners/made bad engineering choices, but is there any firm evidence that ATI are not doing the same thing themselves?

Looking around various hardware forums i see no evidence of mass failure of Nvidia cards. Laptops are obviously seeing a problem but i am not sure that the same thing can be said for desktop parts.
Well, after reading this really long article about how lousy nvidia is, all I can say is that I have have tried about every brand of video card out there before switching to (and sticking with, for about 7 years now) and I have never had a software compatability issue with a nvidia video card.
Wish I could say the same for the other brands of video cards. Quite a few of them are really bad about this too, like brands such as Raedon (don't know if I spelled that righ).
A few of the more modern pc games I have now are really graphic intensive, and I had to upgrade from a nvidia geoforce 5200 to a nvidiea geoforce fx 9100, but as long as I have always made sure that I had the required or better video card for the software I get, I have never had any such issues with a nvidia card.
Other brands I have tried in the past were always buggy with some software or another, even though the software was within the capabilities of the video card.
The graphics either didn't run smoothly, no matter how you tweaked the settings, or the software needed to be patched for that specific card, or the software just plain didn't run at all, even though it was suppose to with the video card in question.
So my question is, if nvidia makes such a bad video cards, defects and all, then what does that make all of these other brands? Something much worse than just 'bad' I suppose.
Just check the Hp forums, thousands of Nvidia GPU Laptops DIE after about 1 year of usage.
Just search over the net these words " HP defective GPU"
http://forums11.itrc.hp.com/service/forums/bizsupport/questionanswer.do?admit=109447626+1245333842556+28353475&threadId=1274587
If your HP laptop is showing signs of the above mentioned defect, there is a website that has a lot of useful information to help.
www.hplies.com
Wanted to correct some statements on the nVidia article, at a risk of getting too technical.
1. High lead bumps (typically 95/5 Pb/Sn) are more ductile than "eutectic" (~ 63/37 Sn/Pb). Eutectic bumps are stronger and more resistant to fatigue ('broken fork' model) but this increased stiffness translates more stress to the GPU and can lead to delamination in the multi-layer stack of dielectric and copper that interconnects all the circuits on the GPU. Intel has moved to plating copper columns that are capped with a lead-free solder on recent 45 nm processors. The stiff copper column transfers a lot of stress to the column - processor interface, so Intel had to do a lot of engineering work to make this interface reliable.
2. I believe nVidia used eutectic on older technology GPUs, and changed to high lead flip chip bumps when they went to 90 nm and 65 nm technology nodes for the GPUs. This was done specifically to try to avoid cracking / delamination between the bump and the interconnect stack on the GPU. Bonding a high lead bump to the eutectic solder cladding on the substrate bond pad results in the molten eutectic solder dissolving some of the high lead bump, creating a tin-rich phase bonding the remaining unaffected high lead bump volume to the bond pad on the substrate. There is nothing wrong with this connection.
3. Flip chip bumps arrayed on a GPU on a pitch of ~180 - 200 micrometers, are typically sized such that they can handle ~100 milli-amps per bump with a junction temperature of ~100 Celsius for about 10 - 20 years of use. 
4. One aspect that is not noted in the story is that the thermal management solution (heatsink, how the heat sink is pressed against the back side of the GPU, how much pressure is applied to the GPU back side, how the GPU package is supported on the board, etc., has a tremendous impact on the integrity of the flip chip bump interconnect. The greater the pressure on the GPU, the greater the potential damage to the interfaces of the bump to the GPU and the bump to the substrate pad. 
5. The stated risk of the polyIMIDE (not polyamide)coating on the GPU being torn from the die is not the risk imposed by a high Tg / high stiffness (high modulus). If the underfill is too stiff, it may protect the bumps from any form of degradation quite nicely, but it may succeed in causing the multilayer stack of insulating layers and copper to delaminate from the GPU, typically starting in the corners of the GPU. So it is correct that the underfill must be carefully chosen based upon the material properties of thermal expansion (above and below the Tg where it softens a lot), the modulus of elasticity (stiffness) over the intended temperature range of use, and Tg. 
6. So since the thermal solutions in laptops and desktops are so different, while the normal (as supplied) junction temperatures can be expected to be about the same, the use of high pressure on the die backside, with a junction temperature higher than the Tg (underfill softens a lot), with a lot of power cycling, aggravated by electromigration (atoms diffuse in the direction of the electron flow), can lead to accelerated degradation of the flip chip bump bond interfaces.
7. Flux is used to help the flip chip solder bumps flow and bond well, and the residue of this flux is often not removed before the underfill is applied, or is the residue is removed with cleaning processes, some residue is commonly trapped betwee the solder connection and the green solder mask material on the surface of the substrate. This residue can melt at low temperatures and further aggravate the reliability of the interconnection.

Flip chip packaging is complicated....
eutectic isn't the way to go then... neither is high-lead on eutectic...
why didn't they just switch to high-lead organic substrates? That seems like it would solve the current density, stress, and MTBF problems all at once.

Let me guess, RoHS considerations?
You mention 40% of specific parts have early failure rates. Which specific parts, and what is considered early failure? This would be good info to give everyone a baseline.
Nice article, but let me add the good reason for high lead bumps. Eutectic solder has a much lower melting temperature, and that is the traditional alloy for attaching the package board to the mother board. You want high lead for attaching the die to the package so that those bumps don't melt when the package is attached to the mother board. Of course this is all getting thrown out the window in the face of Rohs compliance, so be ready for more problems in the future as well as higher costs.
A very long and detailed story but could be explained easier - bad choice of solder make joints break early.



Heat raises resistance in most materials. True, there are certain materials, like semiconductors, with a negative temperature coefficient (NTC), but the vast majority are the opposite. Bill Hewlett (of Hewlett-Packard) first exploited tungsten's POSITIVE temperature coefficient to stabilize an oscillator by putting a light bulb in the feedback loop. That was in Hewlett-Packard's very first product: an audio oscillator. The rest is history.

The rest of your post is pretty much gibberish. High-K gates have lower leakage due to better oxide performance; they don't handle heat better, they dissipate less heat at idle, is all.
Thanks for another worthless opinion piece! Ever gonna put some substance into these fluffy articles???
"Mac - I'm not a Mac! Why doncha turn your lights on, ya mo-ron"
Just had to share an "attaboy" for the more detail and less flame thrower.
Heat Lowers Resistance, As light Bulb has very high resistance, yet when it approachs red hot, resistance goes down & final white hot tungstun has nearly NO resistance, few Ohms. Yet if item has 2 ohms resistance, then made to 4 ohm, twice heat is generated at 4 ohm than two in same circumstance. Maybe HIGH K gates are solution. Made for Heat.

Cyandie is Metal, yet if Manufacturing Process is flawed, resistance of Metal greatly increases or worse goes down to nil, allowing way too much current to pass. 
I am just speculating that somewhere in RED Zone in photo, transistors have been corrupted in Manufacturer & it takes but few to screw up transfer to internal chip memory & become much hindered by inherent defects.Gasified Metal recondensed into aminoliquid slime, changing Metal characteristics, at Gate. Over Blasting Next Gates with Flawed Signal.

Plug in Video memory was standard, then advances in manufacturing made it hardwired again. Funny that CPU isn't advanced back to hardwired, except 1,300 connection points is bit much to solder. 
Perhaps snap in, grip in, slot chips have something to offer in this problem, to avoid manufacturering heat.Basic first thoughts, Tam.
drashek
There's a good reason why desktop parts are not *yet* showing up in large quantities: they aren't as stressed as the laptop parts.

The desktop parts aren't cycled as often and are generally more effectively cooled. It doesn't make them any less likely to fail - it simply means that the NV desktop chips are on a longer fuse before the bomb detonates, that's all.
if you lot can remember as far back as 2003, there was a large scale replacement of the white G3/G4 iBook due to the GPU BGA bump detachment over time. I personally got Apple to change a 600MHz and a 1GHz.
Apparently a lot of people who went through the effort of commenting on this article failed to read the previous articles by chucky on this topic.

His last article claimed that more chips are defective due to the exact same reason as the previous chips ... that 'same' reason being related to the replacement of the solder even though the original reason was the very different relation to the replacement of the underfill material.

Chucky also has not acknowledged that Nvidia must use low lead materials to be RoHS compliant, simply to be allowed to sell Nvidia parts into some markets.

Chucky really has no idea if the bumps or underfill are causing the chip failures, he has just observed the blind correlation between the material switching and shipping of supposedly fixed products. 

This is no different than the natural health product scams or the general scam of religion, in that they all use coincidence as proof of effectiveness (or defectiveness).

Readers should also be reminded that by Chucky not only kept writing about Vista, but kept using Vista long long after his long brutal rant about how he would never use or write about Vista ever again.

Chucky is just trying to hide the shame of his Vista lies with the shame of pretending he knows anything at all about the high failure rates of recent Nvidia products.

It is quite apparent that Chucky's biggest experience with any lead containing product was the paint chips he gnawed of the walls of his childhood home.
I don't see any firms stating 40% failure rates on any cards (AMD or NV). Surely that would be HUGE news.

Nothing you've explained proves anything about NV's parts. Still just opinion crap that isn't provable. List sources at companies. Or better yet post their emails to you that are complaints about NV failures so we can read them for ourselves.

Ohh, that's right, you have no evidence, just more whining. Go back to handing out those flyers again...LOL. Get a few friends, your 3 person protest didn't work...ROFL. You're not the only one with connections in this business and NONE of them are writing this stuff or saying it. IF NV had 40% failures on their cards we'd all know, and on top of that they would have been taking a BILLION charge, not 196mil (confirmed as a ONE TIME ONLY charge during conference call - or did you miss that?). :)
Great article. You stuck to the hard facts and not even the hardest NV fanboy can argue the truth. NV messed up, and now we know why. Know, if only they knew.....lol

Seriously though, I think Charlie here knows more about chips than the current Supervisor at NV. Go apply for a job and save them!
I don't know if anyone else is noticing this, but if Charlie wrote this article, he must have slept at a Holiday Inn the night before. This writing is clearly not representative of his style or content. Not only that, but suddenly he's an engineer...?
It's a plausible flaw, but I don't see the credible evidence.
Without a lot of measuring and testing, one can't be sure Charlie is correct. However, the two verifiable facts - the $200 million USD charge and the PCNs - do fit his explanation nicely.

I'm not sure what joke drashek found - no resistance equals no heat, increasing resistance or voltage means increased heat. This isn't your "Warning: 10,000 Ohms" sniggering opportunity.

And these fanbois - classic case studies in deviant psych. Shouldn't you all be at your religion classes discussing how we should all think about more important issues?
Charlie, This article along with part two was great. Thanks for the technical explanation of what is going on. One question for the general public, though... Why not just use Liquid Cooling and lessen your chances of thermal failure?
Nice article, but please, when trying to give evidence of the size of something, could you use a non nation specific scale.

I have no idea what size a quarter is, except it's going to be in the coin size region. If it's the size of a UK 5p, or a UK £2 coin is unknown to me.

Could you try a universal scale, say a ruler. I know there are two scales for this, metric and imperial, luckily most rulers I have seen cater for this, showing both scales.

If the reporter responsible is in the US and unable to obtain such a multi-scale ruler, I will happily send him one!
"But, but the AMD add that is (or isn't in my case) at the top of the page means that the Inq is obviously biased'.

Give me a break. The Inq is so, so kind to AMD and it's Phenomenal failure. nVidia made a poor engineering decision. Period. They have a lot of bad chips out there, you're just seeing the laptop stuff die first because it's a much more stressful environment with a lot more heat cycling.

As for the 4850, that's more a case of trying to cool a card that should have had a dual slot cooler with a single slot cooler. That, should it become a major problem, could be remedied a hell of a lot easier than nVidia's current mea culpa.
First of all, nice explanation of the tech obstacles and compromises of CPU/GPU engeneering.

Yhe proble is with all your hand waving and prancing around angrily I am still not seeing ANY significant failure rates in nVidia video cards.

I have over 100+ desktops all running high performance nVidia GPUs and many with nVidia mainboard chipsets all running hi res graphic workstations.

Where are the failures? Why am I not seeing increased failure rates?

This is the problem with your entire campaign against nVidia and their engineering choices. Engeneering choices that every high current/temp chip manufacturer must face.

So listen Chuck, where's the meat?

CD Baric
The AMD advert on these pages speaks volumes.
This is one of better articles Charles has written on Complex subject. Simple area is that chip only does one of four things. It inverts voltage, it passes voltage, it converts voltage to only high or only low string. thats it, it depends on instruction to comparrison unit for each blast.

How to keep chip stable to perform those simple functions that then go to output of memory or device is question. Chip isn't so new to need deep understanding of physics to make it work, yet it helps in knowing why it dosn't work.

You mention high voltage & high resistance, good. Both Culprits, Maybe in with Mike Magee. I don't know. Gags are only funny when things are right.

Third culprit in this story is unmentioned in detail, although mentioned. Beyond temperature of cystalization of solder, theres heat itself migrating to new transistors during Manufacturer. that is reason Modern CPU has mechanical connection by grippers for Pins.

Maybe at such smaller scale, some critical transistors are being preburnt, forcing higher voltage problems, leakage El Maximo & just warmed up like toast, crispy edges of nodules of baked metal & substrate cyanide mixed. Making Voltage passage irregular due to Resistance variations thruout on transistor by transistor basis. Tin melts about 490F, Lead melts at 620F, So poor little Dralins' are Being Fried before warm mellow constant cooking. By being too close to solder point, or solder point too Hot.Or cooling of solder NOT deep enough Nor Strong enough. Say after few good hacks machinery holding metal contacts gets too hot & too Much heat passses into gpu transistors. 

Reverand Tom States: Ashes to Ashes & Defectives to Trash.
drashek
Finally some detailed explanation as to why you've been ranting about Nvidia soo much.
Well well....

Thanks Charlie for giving us the technical reasons for how and why NV 55nm and 65nm chips are failing. It should be pretty obvious by now to NV fanboys that these reports are based on facts and science, not "NV Hate". Bad chip designs from the beginning. It's all going to come out, no matter how NV tries to spin it. Watch and see.
Meanwhile people discovered that furmark which is used to test graphics cards suddenly runs slow on ATI 5850/70 cards with the latests drivers and then they found out that those drivers specifically recognise the furmark exe and slows it down, because apparantely the 4850 overheats and according to some actually dies sometimes when running it.
In other word the ATI cards cannot run at 100% it seems, now when will we read about that on the inq? Perhaps in a article named 'why ATI cards are defective'?
And why didn't that bit of news show up yet on the inq? At least charlie has the excuse of sleeplessness and jetlag and distraction from the Intel IDF thing, but not everybody at the inq does surely.
wow!, I am surprised at the technical know-how of Charlie in packaging technology.

Has intel started giving prepared articles also to publish, in addition to lots of money to Inquirer???
Great article Charlie!
I like such details!
"whine/moan Charlie is biased against nVidia blah blah..."

... thank you, once again, for a clearly reasoned and well thought out look at the reality of the situation, which will inevitably be cited by ignorant fanbois as blatant bias.
i like the way u wrote this article without being too sarcastic or critical of Nvidia, and instead sticking to explaining the facts.
I work at a UK based computer hardware supplier and we see roughly the same amount of RMA's for Nvidia and ATI cards. I don't doubt that Nvidia might have cut corners/made bad engineering choices, but is there any firm evidence that ATI are not doing the same thing themselves?

Looking around various hardware forums i see no evidence of mass failure of Nvidia cards. Laptops are obviously seeing a problem but i am not sure that the same thing can be said for desktop parts.
Good article,