Installing, Running and Maintaining Large Linux Clusters
Found via SlashDork is this piece on building Linux clusters up to more than 1000 nodes: experience confronting some of the LHC-scale computing challenges, including scalability, automation, hardware diversity, security, and rolling OS upgrades. Looks like a must-read (must try to understand!). 1K nodes would be a good start in SPTRTW.