Hey Cryptic, here is a suggestion. How about you cut the shard in half? Run half the servers, and when they crash (not IF, but WHEN), quickly turn on the other half. Then fix/reboot the broken ones and put them back on standby. Keep repeating until you get a real IT team to manage things.