Jump to content
The Inquirer-Home

Azul launches Vega2 Java machines

768 cores and 768GB
Monday, 4 December 2006, 16:41
AZUL UNWRAPPED its newest chip the 48-core Vega2 today. It is the successor to last year's Vega1. This Java processor triples performance from the year-old part and has an attractive feature set.

Let's start out with the systems. Azul does not sell chips, only complete machines. There is a two-chip, 96 core, 48GB model 3210 and a four-chip, 96-192GB model 3220. Both are 5U machines, with the 3210 pulling about 580W max and the 3220 pulling about 1000W. There will be eight and 16 socket machines coming in 2007.

The machines are all SSI or Single System Image, all cores can see all the memory. This may not sound like much of a feat, but remember that each chip is 48 cores. This means the biggest box has 768 cores and up to 768GB of memory, every last bit accessible to each CPU.

The way it works is that each core is a simple CPU that runs one thread. The cores have 16K L1 cache, and 8 of these are connected to a single L2 cache. Those L2 caches are connected to an internal mesh, and a Vega2 is 6 of these clusters. There are 4 on chip memory controllers, supporting 12 DIMMs per CPU. The mesh connects them all in a non-blocking way.

The CPUs externally are connected via another mesh, 16 sockets means 15 links per CPU. The link is a SERDES link similar to PCIe2 in that it has an embedded clock. If you are not getting the picture, everything is connected to everything else as directly as possible.

What you end up with is a monster compute engine that plugs into a host PC. The cores themselves are tuned to run Java and other languages that live in a VM. The ISA for the Vega2 cores does not directly map to Java, but they map pretty well to the concepts in VM based languages.

One interesting bit is that the Vega2 ISA is different from the Vega1 ISA. Between the generations, a lot was learned about what works, what does not work, and how things can be sped up. Azul is claiming that Vega2 is 3x faster than Vega1, and if 2x of that came from the core doubling, the other 1x probably came from the new ISA. Since the user never sees, touches or cares about the bare metal, Azul can tweak away with impunity, and they do.

In the end, you get a box. Said box plugs into your rack and runs Java really stinking fast. The numbers they showed were SPECjbb2005, and they scored a little over 850K, beating out a Fujitsu 128 chip SPARC64 that scored over 800K. The Azul box starts at $49,995 for a 2-way and probably around $500K for the 16-way, so it is notably cheaper than the Fujitsu machine.

For perspective, the 16 chip 384-core Azul 3840 (Vega1) scored about 325K, a 2-way Core2 at 3.0GHz hits about 100K, and an Opteron takes up the rear at closer to 50K. Azul holds the absolute record and probably the perf/watt record too.

One very nice partnership is BEA, they have ported the entire portfolio of Java software to the Vega based machines, so you have a decent set of apps right off the bat. Since Java is portable (no snickering people!), theoretically you can put an Azul box on almost any machine and offload the Java processing to it.

This brings up the most important question. We asked a fiend with a long history of Java coding experience. He has written good portions of several engines that run on the Azul boxes, and also has a Vega1 box at his beck and call. He gives it a thumbs up with a few caveats.

The main one is your code must be able to utilise multiple CPUs. If each Vega2 core is equivalent to a 100MHz Intel core, and your software can only use one thread, you have a very slow box with 767 idle cores. If your code can use multiple cores, you have a machine that can truly fly through code.

Since most server code is very amenable to threading, he thinks this is not a problem for the majority of applications out there. They will just run fast on the big box humming away in the corner. One nice feature of this type of setup is consistency. Since you have so many cores, chances are they are not all working at the same time, a percentage of them will be idle at any given time.

This means that any task you fire off will be immediately serviced leading to a fairly predictable execution time. If you only have four vastly faster cores, each one will be heavily loaded, so a task might finish quickly or it might take a bit. Each Vega2 core may be slower, but it is more 'reliable' in a timing sense.

The down side to this type of box is there is no local storage, so anything persistent needs to be offloaded back to the host and saved. For transactional systems, this type of logging is critical and imposes a huge overhead on Azul boxes. They still are faster than most equivalent machines, but with the overhead end up with less of a crushing victory.

A CPU launch would not be a launch without the requisite list of partners declaring their unrequited love for the machine. Azul lined up some big ones, with BEA an obvious believer for porting the code, and BT another big one. BT has ported their Openreach B2B gateway to Azul, and since it is the world's largest gateway, yours won't be nearly as big or demanding.

In the end, the user sees a box that runs code faster, and it runs the software they want or it does not. If it does, it can be a very clear winner for a number of scenarios but not all. The geeks see an interesting architecture that is actually commercially viable instead of simply a curiosity. Whatever you see it as, you can buy one now from Azul. ยต

Share this:

Comments

There are no comments submitted yet. Do you have an interesting opinion? Then be the first to post a comment.

Advertisement
Subscribe to the INQ Newsletter
Sign-up for the INQBot weekly newsletter
Click here to sign up Existing user
Advertisement
INQ Poll

Christmas computer sales

Will you be buying a new computer this Christmas?