Dapper Cluster Documentation
This documentation covers the architecture, configuration, and operations of the Dapper Kubernetes cluster, a home lab built from three control plane nodes, two general-purpose worker nodes, and one GPU node with four NVIDIA Tesla P100 GPUs.
Cluster Overview
graph TD
    subgraph Control Plane
        CP1[Control Plane 1<br>4 CPU, 16GB]
        CP2[Control Plane 2<br>4 CPU, 16GB]
        CP3[Control Plane 3<br>4 CPU, 16GB]
    end
    subgraph Worker Nodes
        W1[Worker 1<br>16 CPU, 128GB]
        W2[Worker 2<br>16 CPU, 128GB]
        GPU[GPU Node<br>16 CPU, 128GB<br>4x Tesla P100]
    end
    CP1 --- CP2
    CP2 --- CP3
    CP3 --- CP1
    Control Plane --> Worker Nodes
Hardware Specifications
Control Plane
- 3 nodes for high availability
- 4 CPU cores per node
- 16GB RAM per node
- Dedicated to cluster control plane operations
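Three nodes is the smallest control plane that survives the loss of one node, assuming the etcd members run on the control plane nodes themselves (a common layout, but not confirmed in this documentation). A minimal sketch of the majority-quorum arithmetic behind that choice; the function names are illustrative:

```python
def quorum(members: int) -> int:
    """Smallest majority of a Raft-style cluster such as etcd."""
    if members < 1:
        raise ValueError("a cluster needs at least one member")
    return members // 2 + 1


def tolerated_failures(members: int) -> int:
    """Members that can fail while a majority is still reachable."""
    return members - quorum(members)


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(
            f"{n} members: quorum={quorum(n)}, "
            f"tolerated failures={tolerated_failures(n)}"
        )
```

Two members tolerate no failures at all, which is why the step from one node to high availability goes straight to three.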
Worker Nodes
- 2 general-purpose worker nodes
- 16 CPU cores per node
- 128GB RAM per node
- Handle general workloads and applications
GPU Node
- Specialized GPU worker node
- 16 CPU cores
- 128GB RAM
- 4x NVIDIA Tesla P100 GPUs
- Handles ML/AI and GPU-accelerated workloads
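For capacity planning, the figures above can be added up per node group. The sketch below is illustrative only: the group names are labels chosen for this example, and the totals are raw hardware numbers, not the allocatable resources Kubernetes reports after system reservations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeGroup:
    name: str        # illustrative label, not a real node name
    count: int
    cpu_cores: int   # per node
    ram_gb: int      # per node
    gpus: int = 0    # per node


# Figures copied from the hardware specifications above.
NODE_GROUPS = [
    NodeGroup("control-plane", count=3, cpu_cores=4, ram_gb=16),
    NodeGroup("worker", count=2, cpu_cores=16, ram_gb=128),
    NodeGroup("gpu", count=1, cpu_cores=16, ram_gb=128, gpus=4),
]


def totals(groups):
    """Sum CPU cores, RAM, and GPUs across the given node groups."""
    return {
        "cpu_cores": sum(g.count * g.cpu_cores for g in groups),
        "ram_gb": sum(g.count * g.ram_gb for g in groups),
        "gpus": sum(g.count * g.gpus for g in groups),
    }


if __name__ == "__main__":
    # The control plane is dedicated to cluster operations, so it is
    # left out of the capacity available to workloads.
    workload_groups = [g for g in NODE_GROUPS if g.name != "control-plane"]
    print("whole cluster:", totals(NODE_GROUPS))
    print("workers + GPU node:", totals(workload_groups))
```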
Key Features
- High-availability Kubernetes cluster
- GPU acceleration support
- Automated deployment using Flux CD
- Secure secrets management with SOPS
- NFS and OpenEBS storage integration
- Comprehensive monitoring and observability
- Media services automation
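As an illustration of the GPU acceleration feature, the sketch below builds a Pod manifest that requests GPUs. It assumes the GPUs are exposed through the `nvidia.com/gpu` extended resource (as the NVIDIA device plugin does), which this documentation does not confirm; the pod name and image are hypothetical placeholders.

```python
import json

GPUS_ON_NODE = 4  # the GPU node carries 4x Tesla P100


def gpu_pod_manifest(name: str, image: str, gpus: int = 1) -> dict:
    """Build a Pod manifest that asks for `gpus` GPUs on a single node."""
    if not 1 <= gpus <= GPUS_ON_NODE:
        raise ValueError(f"gpus must be between 1 and {GPUS_ON_NODE}")
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # Assumption: GPUs are advertised as nvidia.com/gpu.
                    "resources": {"limits": {"nvidia.com/gpu": str(gpus)}},
                }
            ],
        },
    }


if __name__ == "__main__":
    # Hypothetical pod name and image, for illustration only.
    manifest = gpu_pod_manifest("cuda-job", "example.com/cuda-job:latest", gpus=2)
    print(json.dumps(manifest, indent=2))
```

Because only the GPU node advertises GPUs, a pod with this limit can only be scheduled there, and a request above four GPUs could never be satisfied by a single pod.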
Infrastructure Components
graph TD
    subgraph Core Services
        Flux[Flux CD]
        Storage[Storage Layer]
        Network[Network Layer]
    end
    subgraph Applications
        Media[Media Stack]
        Monitor[Monitoring]
        GPU[GPU Workloads]
    end
    Core Services --> Applications
    Storage --> |NFS/OpenEBS| Applications
    Network --> |Ingress/DNS| Applications
Documentation Structure
- Architecture: Detailed technical documentation about cluster design and components
  - High-availability control plane design
  - Storage architecture and configuration
  - Network topology and policies
  - GPU integration and management
- Applications: Information about deployed applications and their configurations
  - Media services stack
  - Monitoring and observability
  - GPU-accelerated applications
- Operations: Guides for installation, maintenance, and troubleshooting
  - Cluster setup procedures
  - Node management
  - GPU configuration
  - Maintenance tasks
Getting Started
For new users, we recommend starting with:
- Architecture Overview - Understanding the cluster design
- Installation Guide - Setting up the cluster
- Application Stack - Deploying applications