I design, build, and maintain large-scale Linux clusters, optimize distributed file systems, and develop automated pipeline architectures for massively parallel workloads.
Architected and deployed a multi-tenant Slurm compute environment governed by detailed job-priority matrices, preemption policies, and GPU allocation rules.
Designed and fine-tuned a Lustre file system layered over ZFS pools, managing high-throughput I/O pipelines tailored for parallel AI/ML workloads.
Engineered an end-to-end PXE-boot server provisioning pipeline using Terraform and Packer to rapidly spin up stateless compute nodes.