Alpha LinuxBIOS cluster (Ed)
Ed retired after 1.5 years of service. All 96 working and non-working DS10s removed from Ed. Our second LinuxBIOS/BProc cluster is a 128 node Alpha cluster comprised of 104 single processor Compaq DS10s, 16 dual processor API networks CS20s, and 8 four processor SMP Compaq ES40s.
Ed retired! All 96 DS10s and Myrinet hardware removed from Ed. The remainder of the cluster is set up as a simple 10/100 Ethernet connected cluster for 64-bit development.
The Cluster Team has a retirement party for Ed!
DS10s in the ACL Lobby
Due to the previous day's firedrill, it was decided that we should remove three racks of DS10s -- 96 total nodes (not all working). The Myrinet cards were also removed from all 96 nodes. The nodes will be shipped to Sandia California to live out the rest of their lives as part of their LinuxBIOS cluster. The Myrinet cards will be used in pinkish.
DS10s in the ACL Lobby
Firedrill at the ACL
A couple more DS10s overheat and cause a burning smell that concerns our staff. After consulting the fire department, it is decided to pull the fire alarm and bring in the troops.
It turns out that the heatsinks got so hot it burnt the paint off.
Quadrics is returned to Myricom for in-trade.
Nice, Myrinet fiber Myrinet installed, up, and working.
Cables are finally out Quadrics un-cabling day!
The purchase order for the Quadrics/Myrinet2000 trade-in faxed to Myricom.
We still cannot get working Linux drivers for Elan 3 under bproc. As far as we can tell you must be running the Quadrics RMS scheduler for even IP to work, and bproc and RMS are fundamentally incompatible. Also, much of the software we need to work with is not Open Source, which makes it difficult for us to test new ideas. That being the case, we have received a quote from Myricom for a trade-in for Myrinet 2000. The quote is currently being approved for purchase.
The information to get LinuxBIOS working on the ES40s, so that the fontend and 5 compute nodes could run LinuxBIOS, was never forthcoming from Compaq.
Charlie Strauss (from LANL's Biology division) has been running his CPU-intensive protein folding codes on Ed continuously since about day one.
50 GFlops alpha LinuxBIOS cluster up! Affectionately named ed, the current cluster is comprised of the following:
104 DS10's booting Linux out of flash -- yes, the SRM is gone! ES-40 front end, running BProc (but not LinuxBIOS, yet). No Quadrics support, yet. Quadrics is working with us on this and we hope to have it soon.
104 DS10 nodes with Quadrics interface delivered No switches Power in machine room not ready Minion sacrifice
Our intern/minion took one for the team while unpacking the racks of DS10s (left). Full-time employees attempted to administer first aid, but failed as they are not up to date with the proper training (right). Died happy ..at least he died happy.
104 racked DS10 nodes
1 rack of DS10 nodes
The 3 racks of 104 DS10 nodes -- 35, 34, and 35 each (left). One of the cabinets of 35 DS10s (right). Notice no CD ROM drive, no floppy drive.