The Inquirer-Home

High end system vendors get the blues

Analysis: As servers get more generic, it's back to workstations
Tue Feb 16 2010, 15:39

ONCE UPON A TIME, say 10 years ago, the PC hierarchy was very clear. At the bottom level, client PCs - desktop and laptop alike - were mainstream systems with similar performance and features, and correspondingly low margins for both the manufacturers making them and the system integrators installing them. Then there were professional 3D graphics, desktop publishing and electronic design automation workstations: powerful, usually dual-socket, deskside machines with high-priced OpenGL graphics cards. At the very top were servers, usually monsters with two or four CPUs, larger memory and I/O, and custom, expensive system-level design.

As you climbed each level, the volumes diminished, but both the unit prices and the achievable percentage sales margins jumped massively. So, a quad CPU server would cost as much as four dual CPU workstations or 40 PC clients, even though its plain bill of materials might only have been three and 15 times more costly, respectively. The added value, service and support margins made up the difference.
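As a rough sketch of what those ratios imply, divide the price ratio by the bill of materials ratio to see how much more revenue per unit of parts cost the bigger machine earned. The function name is hypothetical, and reading the figures as server-to-workstation and server-to-PC ratios is an assumption made for illustration.

```python
def relative_markup(price_ratio, bom_ratio):
    """Return how many times more revenue per unit of BOM cost the pricier system earns."""
    if price_ratio <= 0 or bom_ratio <= 0:
        raise ValueError("ratios must be positive")
    return price_ratio / bom_ratio


# Ratios taken from the paragraph above: quad CPU server versus the lower tiers.
vs_workstation = relative_markup(price_ratio=4, bom_ratio=3)
vs_client_pc = relative_markup(price_ratio=40, bom_ratio=15)

print(f"server vs workstation: {vs_workstation:.2f}x revenue per unit of BOM cost")
print(f"server vs client PC:   {vs_client_pc:.2f}x revenue per unit of BOM cost")
```

On these numbers the server earned roughly a third more per unit of parts cost than a workstation, and well over twice as much as a client PC.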

And that was a time when most of these big machines were still designed, and often made, in the US or other Western countries. A decade ago, the Taiwanese were still novices at workstation and server board design, and the cost savings and quick turnarounds that the little Chinese island could bring to the table didn't mean that much yet in this exclusive marketplace. It was dominated then by DEC, Compaq, HP, Sun, IBM and Silicon Graphics, with what was then a healthy mix of various 64-bit RISC architectures as well as the ascendant but still 32-bit X86 at the low end.

Now, things look quite a bit different.

The RISC platforms with their extra performance and price niches are mostly gone. None remain in workstations, and only very high-end servers are still around, with IBM's POWER7 and Fujitsu's SPARC64 VIII coming this year. No RISC in workstations means a far smaller software developer base, as having a machine on the desk can never be fully replaced by access to a remote big machine somewhere far away. Even Intel's Itanium, the supposed RISC-killer, seems destined to meet the same fate in due course. Too bad Nvidia didn't decide to get hold of the Alpha platform and its fast X86 binary translators since, if it had, maybe we'd now have some real competition there.

The 64-bit X86 has fully taken over the workstation and all classes of servers up to eight sockets now, with only a limited few RISC boxen still around in this mainstream server category. And, yes, it's mostly the Taiwanese firms like Supermicro, Asus, Gigabyte and Tyan - increasingly through their mainland China bureaus - churning out the actual designs for companies like HP, Dell, Fujitsu and others. Quick turnaround, low price and pretty much generic standard feature sets on, say, 1U dual processor (DP) server platforms now make them just as ubiquitous, and with similar zero value add, as a typical client PC.

As an example, a typical Intel dual processor Nehalem or Westmere Xeon platform, expected to account for about 90 per cent of all DP server shipments this year, has the memory controller and QPI interconnect already integrated in the CPU, and only Intel's Tylersburg chipset is available to choose from. Everyone even uses the same optional LSI SAS storage controller. So, how much 'added value' is there to add at the hardware level? It seems even less than on a high-end gaming desktop board. No value add means no margin to add.

Coupled with the tough competition among both these Taiwanese firms and the big US vendors using their designs, this means rock bottom margins for the generic 1U and 2U DP server platforms, not that different percentage-wise from those on, say, a desktop.

With the arrival of Intel's Nehalem-EX quad socket platform late next month, a similar 'standardisation' will happen in the higher end space. Except for very big boxes that scale to eight sockets and more - mind you, eight sockets means 64 cores and 128 threads - the base four socket platform will, sooner or later, become as generic as the DP one. One CPU socket type, one I/O chipset, one memory buffer chip, and you can play around with the rest, but there's not much to play with at the base hardware level. Why should there be, when Intel did a great job with all these chosen components anyway? As you'll see at the end of next month, Nehalem-EX is expected to scale very well, with the best ever multiprocessor scaling for any X86 platform.

Let's see what tricks and treats the OEMs will come up with for the new platform, but in the meantime, is there any hope for more varied and more value added configurations in the mainstream and still predominant dual socket platform, Intel and AMD alike? And that's putting aside the more proprietary vendor-specific stuff like blades or other non standard formats.

The answer is back in the past - graphics workstations, a market with less pomp and glamour, but one that has sustained its growth, unique value add and, therefore, reasonable margins for many years already.

Workstations are more demanding than generic servers, with the exceptions of HPC supercomputing nodes or I/O intensive machines. A high-end 3D workstation needs the best of everything, from the fastest CPU with the most cores - remember Intel's workstation-only W-series Nehalem Xeons - to the fastest and often very large memory system, where capacity, bandwidth and latency all matter. Then comes a fast multi-slot PCIe I/O system to feed one or two fast GPUs and, often, a RAID or fast SSD array. Finally, networking and other I/O are there, but not before we throw in an expensive display, a powerful PSU or two and sometimes even an unusual 3D pointing device such as the spaceball.

Where can Taiwan vendors make that difference felt, to justify higher prices and therefore margins? Workstation users would love performance tuning just as much as PC overclockers, as long as it, well, works, and brings an added - and here very productivity-linked - performance benefit. After all, finishing a render in four days instead of five takes 20 per cent less time, which might mean 25 per cent more money earned per day. So, what Asus did with its Z7S-WS dual Xeon mainboard two years ago, and what EVGA intends to do with its dual Xeon Classified board in April, might apply well here: reasonable CPU and memory performance optimisation options for frequency, bandwidth and latency. Of course, that requires an improved power system, good component choice and demanding overall board design.
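The render arithmetic is easy to get wrong, so here is a minimal sketch of it. Cutting a job from five days to four saves 20 per cent of the time, but the number of jobs finished per day rises by 25 per cent. The function names are hypothetical, and the sketch assumes income scales directly with jobs completed.

```python
def time_saved(old_days, new_days):
    """Fraction of the original job time that is saved."""
    if old_days <= 0 or new_days <= 0:
        raise ValueError("durations must be positive")
    return 1.0 - new_days / old_days


def throughput_gain(old_days, new_days):
    """Fractional increase in jobs completed per day."""
    if old_days <= 0 or new_days <= 0:
        raise ValueError("durations must be positive")
    return old_days / new_days - 1.0


saved = time_saved(old_days=5, new_days=4)
gain = throughput_gain(old_days=5, new_days=4)

print(f"time saved per render: {saved:.0%}")
print(f"extra renders per day: {gain:.0%}")
```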

Then, a choice of I/O options: a single Intel Tylersburg 5520 chipset has 36 PCIe v2 lanes, and these can be handled in many ways. One way, for those happy with this number of lanes and wanting minimum latency, as in GPGPU work where the GPU to main memory round trip time is important, is to just optimise the traces and latencies on the 2 x PCIe x16 v2 slots, and leave the x4 v2 slot open for, say, a high-speed PCIe SSD card like the one Intel showed at the last IDF.

Another is for those requiring three or more GPGPU cards in one system, all hanging from one chipset. In that case, as on the EVGA Classified, the mainboard designers would add two hot - beware of heat handling here - Nvidia Nforce 200 PCIe bridges to provide four full PCIe x16 v2 slots. An alternative, as on the Supermicro boards, is to use two Tylersburg 5520 bridges, one on each CPU, connected together via an additional QPI link. This way, you'll also get four full PCIe x16 slots, plus extra x4 slots for I/O expansion.
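The lane budgets behind these layouts can be sketched as simple arithmetic. The 36 lanes per 5520 come from the text above. The function name is hypothetical, and the sketch assumes lanes can be pooled freely across chipsets, ignoring how real boards split ports and how bridge chips share upstream bandwidth.

```python
LANES_PER_5520 = 36  # PCIe v2 lanes per Tylersburg 5520, as stated in the article


def spare_lanes(chipsets, slot_widths, lanes_per_chipset=LANES_PER_5520):
    """Return the lanes left over after wiring every slot directly to the chipsets."""
    total = chipsets * lanes_per_chipset
    used = sum(slot_widths)
    if used > total:
        raise ValueError(f"{used} lanes requested but only {total} available")
    return total - used


# One chipset: two x16 GPU slots plus one x4 slot uses every lane.
single = spare_lanes(chipsets=1, slot_widths=[16, 16, 4])

# Two chipsets: four x16 slots leave room for extra x4 slots.
dual = spare_lanes(chipsets=2, slot_widths=[16, 16, 16, 16])

print(f"single 5520, 2 x16 + 1 x4: {single} lanes spare")
print(f"dual 5520, 4 x16: {dual} lanes spare, enough for {dual // 4} x4 slots")
```

Asking for four directly attached x16 slots on a single chipset raises an error in this sketch, which is the gap that the bridge chips or a second chipset are there to fill.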

Another add-on category that makes sense is better I/O: for instance, using a more intelligent Gigabit Ethernet controller such as the Intel 82576, which takes on more TCP/IP protocol stack processing to offload work from the CPU. The same goes for USB3 and, why not, hardware-assisted SATA3 RAID capability. And, did I forget easy BIOS updates from Flash within the BIOS itself, just like on most desktop boards?

There won't be a shortage of CPUs to fill these boards. After all, Intel's Xeon 5600 series of six-core Westmere CPUs will come out next month, and the high performance top bins are expected to set new speed records. Also, the vendors could consider workstation flavours of dual socket and quad socket Nehalem-EX boards, even with multiple QPI links between each CPU pair for higher bandwidth, as well as massive memory for EDA chip design and computational science simulations, which are the main targets of a possible Nehalem-EX workstation niche.

In summary, the hardware value add side of generic servers looks just as bleak for any Taiwan mainboard maker as that of a, well, generic client PC. But custom servers aren't the only way out. Good old workstations might reward those vendors who dare to make the right moves. But let's not only wait for Gigabyte, Asus and Supermicro. It's also up to us, the users, to voice our opinions and make our needs known, too. µ

 


Comments
Options on high-end systems

If we, the users, have more input into what they put on the mainboards, how about making ultra-overkill mainboards suitable for gamers?

We all know people who would, if they could, use a four-socket, 32-core board if they knew it would do one thing alone: reduce latency to nearly nothing.

One of the main beefs I have with system specs that people advertise after testing a system with 4DMark 2096 or whatever is that they never show anything but averages.

In a game, getting killed or surviving in many situations is not about your average frame rate. Anyone can get 400 fps in the middle of a small dark room with no features and no enemies nearby.

What is needed is a rating system that shows bottlenecks, such as when you are in a huge field battle, lots of enemies, explosions, and graphic-intensive effects in play, where you momentarily dip to 4 frames per hour.

It isn't average or peak framerates that gamers need to know about. It's having very high framerates in the WORST situations that determines the usefulness of a system for gaming.

Two things are needed: a 3dMark or whatever system that can identify framerate bottlenecks as they occur, thus showing which computer setup can get you killed just when you need the speed the most, and the availability of systems based on very-high-end motherboards that can dramatically raise the floor of how bad your framerates GET in those critical situations.

With a recognized benchmark, people would have a new thing to target, brag about, use in gaming, and buy hardware to achieve: the High Minimum Framerate system.
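A minimal sketch of such a rating, assuming the benchmark logs one frame time in milliseconds per rendered frame: report the average alongside the single worst frame and the framerate over the slowest one per cent of frames. The function name, the log format and the one per cent cut-off are all assumptions for illustration, not part of any existing benchmark.

```python
def frame_rate_summary(frame_times_ms, worst_percent=1):
    """Summarise a frame-time log (milliseconds per frame) as fps figures."""
    if not frame_times_ms:
        raise ValueError("need at least one frame time")
    if any(t <= 0 for t in frame_times_ms):
        raise ValueError("frame times must be positive")

    frame_count = len(frame_times_ms)
    average_fps = 1000.0 * frame_count / sum(frame_times_ms)
    minimum_fps = 1000.0 / max(frame_times_ms)

    # Framerate over the slowest frames only: the "floor" the gamer actually feels.
    worst_count = max(1, frame_count * worst_percent // 100)
    slowest = sorted(frame_times_ms, reverse=True)[:worst_count]
    floor_fps = 1000.0 * worst_count / sum(slowest)

    return {
        "average_fps": average_fps,
        "minimum_fps": minimum_fps,
        "floor_fps": floor_fps,
    }


# 99 quick frames in the dark room, then one very long frame in the big battle.
log = [5.0] * 99 + [250.0]
summary = frame_rate_summary(log)
for name, value in summary.items():
    print(f"{name}: {value:.1f}")
```

In this made-up log the average stays well above 100 fps even though one frame took a quarter of a second, which is exactly the dip an average hides.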

I have played quite a few games. I find the same problem in every one. Even my top-of-the-line video card-equipped system bogs to hell RIGHT when I most need it to be working smoothly.

I would pay good money for a motherboard that had, as its main feature, a very high base framerate capability because things wouldn't get bottlenecked at the worst time, which is usually in the middle of a battle with lots of very hostile enemies nearby.

By introducing hardware for the server market that had a few tweaks or modifications to also make it very suitable for gaming, the manufacturers could appeal to those who will have speed at any cost.

By introducing benchmarks that more accurately measure the most critical performance of all: How badly the system slows down right in the middle of a very intense battle, there could be an increased awareness of the need for such boards.

Measuring peak or average speeds, but not mentioning that this high-average fps board bogs to 4 frames per fortnight right at the worst possible moments is not giving a full measure of what is needed in the board or system.

With such a benchmark, I think even the most basic server board might be found to kick the ass of the so-called best gaming board, or be designed to do just that.

posted by : The Nesh, 09 August 2010
Shortage of Competition

There is one link in the chain still making a massive profit from all this, and that’s Intel.

What we need is more competition. Intel shouldn’t be allowed to dictate what chipsets are available for its CPUs.

And the customers shouldn’t be so fixated on x86 compatibility, which you only need to run Windows anyway. And since when did “Windows” and “high-performance” go together?

posted by : Lawrence D'Oliveiro, 18 February 2010
Bottom Up

ARM will do to x86 what x86 did to specialty workstations that specialty workstations did to minicomputers that minicomputers did to mainframes that mainframes did to offices full of human computers.

posted by : Ugly American, 18 February 2010
Why not POWER7 on the workstation?

To counteract the x86 marching into server space. I would love to have a POWER7 in my house :)

posted by : aNewbie, 17 February 2010
GRAPHENE GOING TO FAB PRODUCTION, DOPING FIGURED OUT....

Instead of more ramble on semiconductors, something younger than self, here's today's 5PM EE:

http://www.eetimes.com/news/latest/showArticle.jhtml;jsessionid=YQ54NTHO21APTQE1GHPSKH4ATMY32JVN?articleID=222900570

Signed: Glad that's done.

posted by : CARBON ARCS, 17 February 2010
More news on HKMG

Last Thuuurrrrsday, AMD Stated NO More Contracts for HKMG on 40/45nm will be taken. Here: Globalfoundries: No AMD 45nm Microprocessors with HKMG Incoming. Globalfoundries Dismisses Rumours Regarding 45nm SOI + HKMG Microprocessors. Mess has migrated to 28 & 32 nm process.

“HKMG is focused on 32nm/28nm generations,” said Jon Carvill

Vat Does change mean to theINQ Reefer, errrr, READER. Well, Meaning is Simple 45/40 nm thurban & Magny coups upswing will incorporate added bonus of Lower Power Gates,less heat,less leakage due to Low K metal gates needing less voltage to flip III-V Reaction. Fits Pattern explained so very long ago by Same As Self.

Same with graphene, if it becomes true, Ultimate low K gate as Finisher.

Signed:Ultee' K Coups & Dakotian Prophet.

posted by : Bi Ke, 16 February 2010
Ummmm

Why did you even bother mentioning AMD? They do have multiple SR5690 boards out and more coming with MagnyCours.

They have plenty of bandwidth, plenty of cores and I believe 42 lanes. I guess they aren't trying hard enough to monopolize all PC parts for you, huhn?

A company is selling an MC 4P system on eBay at 2.2GHz. That's 12 cores mind you. With 2 of those and 2 Evergreen-based FirePros......

InWho.

posted by : BaronMatrix, 16 February 2010
hot swappable

hot swappable redundant PSUs and drives in a RAID array. Not every server has them, and they come with a premium.

posted by : mike, 16 February 2010
Metal Schmetal

Medal-Schmetal, I like to pedal.

Power-Schmower, I don't need to shower.

Signed: theRealSwedishChef

posted by : hekvondreshdork, 16 February 2010
Graphene WorkStations & Complexing Thought....

Way Back When Mike Was editor, Drashek Claimed High metal K Gates Evolve to Low K Metal gates As Standard Practice of Stepping UP. ?Was editor....

Now Community of FABS have Skipped Low K Metal Gates For Generation & Hangs Start to Appear, At least at 45nm level. So Is D wrong. NO.
Graphene is next Low K Transistor 'acomin. Once again SongBirds Will Roost in Yard & Happy Campers, EveryOne,EveryWhere.

Ahso, HRM STeWie Stated Long, Long AGO That World needs 32X Pci-e Slot. Well got Hz in Effect With pci-e 2.0.
actual 32X slot Specs are still in Pci-e standards manual, with NO takers, To Date. NEED: 32X Pci-e 2.0
tehREADER, in ALL Mighty Wisdom, Knows that combining Multi socket or Multi slotted graphics really CUTS BAck On Total Performance. 50% less at first cut. So Neb, in detailing need for Workstation May mean 32X pci-e Card Slots, to Give GPU Room to Move & Develop.

Now Wasn't That Tsatee'.

In Power7 Cell Environment with 32X pci-e Flame Broiling Whopper. King of Machines Develop. Real Killer Machine. Then Add graphene, IBM. or is it just bee 'mmmmm? M.

Obviously What World truly Needs is More Drash.

Candidates:EveryWhere, Sylv,Eva, Botox ETA, Maureen O'Hara, Scarlet Be Thy Name, ladies.
Calling:Copper Harding...LunchTime,Honey.

Signed:theDRASH

posted by : Thomas Stewart, 16 February 2010