10/2/2002: Ed retired after 1.5 years of service. All 96 working and non-working DS10s removed from Ed.
Our second LinuxBIOS/BProc cluster is a 128-node Alpha cluster composed of 104 single-processor Compaq DS10s, 16 dual-processor API NetWorks CS20s, and 8 four-processor SMP Compaq ES40s.
Ed retired! All 96 DS10s and Myrinet hardware removed from Ed. The remainder of the cluster is set up as a simple 10/100 Ethernet connected cluster for 64-bit development.
Because of the previous day's fire drill, it was decided that we should remove three racks of DS10s, 96 nodes in total (not all working). The Myrinet cards were also removed from all 96 nodes. The nodes will be shipped to Sandia California to live out the rest of their lives as part of their LinuxBIOS cluster. The Myrinet cards will be used in pinkish.
A couple more DS10s overheat and cause a burning smell that concerns our staff. After consulting the fire department, it is decided to pull the fire alarm and bring in the troops. It turns out that the heatsinks got so hot that they burnt the paint off.
The Quadrics hardware is returned to Myricom as a trade-in.
The purchase order for the Quadrics/Myrinet 2000 trade-in is faxed to Myricom.
- We still cannot get working Linux drivers for Elan 3 under BProc. As far as we can tell, you must be running the Quadrics RMS scheduler for even IP to work, and BProc and RMS are fundamentally incompatible. Also, much of the software we need to work with is not Open Source, which makes it difficult for us to test new ideas. That being the case, we have received a quote from Myricom for a trade-in for Myrinet 2000. The quote is currently being approved for purchase.
- The information needed to get LinuxBIOS working on the ES40s, so that the front end and 5 compute nodes could run LinuxBIOS, was never forthcoming from Compaq.
- Charlie Strauss (from LANL's Biology division) has been running his CPU-intensive protein folding codes on Ed continuously since about day one.
Affectionately named Ed, the current cluster consists of the following:
- 104 DS10s booting Linux out of flash (yes, the SRM is gone!)
- ES40 front end, running BProc (but not LinuxBIOS, yet).
- No Quadrics support, yet. Quadrics is working with us on this and we hope to have it soon.
- 104 DS10 nodes with Quadrics interface delivered
- No switches
- Power in machine room not ready
- Minion sacrifice
Notice no CD ROM drive, no floppy drive.