JHU ACM Systems’ Documentation: Sysadmin Edition¶
Note
If you’re looking for more end-user documentation, please see JHU ACM Systems’ Documentation: End User Edition and/or Group Services in the Brave New World.
Here you will find, hopefully, everything we know about running a state-of-the-art computing infrastructure. You can see how well we’re doing by looking at our nagios instance. The authoritative version of the source of this collection of documents is here and read-only replicas exist even if the read-write volume is offline.
- Basics of Being a Sysadmin
- Working with Deez Notes
- Structure of the Cluster
- Common Installation Steps
- Random Simple Things that have Worked in the Past and are Likely to Work Again
- A Sunfire Goes Down
- The Website is Reachable, but everything 403’s or 404’s
- Mail server fails IMAP requests
- Echidna’s AFS servers died
- You can’t do ceph things with cinder (like create/delete volumes)
- Ceph won’t start on a sunfire due to permission errors
- A ceph mon is down after a restart
- OpenStack VMs won’t be deleted, and they just hang
- You Can’t Delete OpenStack VMs (they’re stuck in the deleting state)
- Service Wishlist
- Core ACM Systems
- Accounts on ACM Systems
- Services for Admins and Others
- Website
- Mail Services
- Webserver Configuration
- Configuring a New Shell Server
- Egg Shell (JHED AD Integration)
- Janus: The God Of Doorways, or The ACM Door Lock Controller
- Git with Gitlab
- Quassel
- Bigbrother
- Ebola
- Logging Aggregation
- LXC and Docker DIY
- Mirrors
- Metapackages and Apt Repository
- Steam
- Office Printers
- Chicago: The ACM In A Box
- Outside the Cluster
- Out of Date