HPC Node Communication Healthcheck
MPI+Slurm script that verifies node-to-node connectivity and surfaces failures fast.
I design, build, and care for reliable compute infrastructure: from HPC clusters (Slurm, MPI, InfiniBand) to Kubernetes platforms and secure Linux servers. I like turning messy systems into fast, well-documented, and automated ones.
• Slurm • MPI • Kubernetes
• Active Directory • Windows Server • Troubleshooting
• Cisco • Mellanox • VLAN
Junior Computer Science – Information Technology major focused on systems and infrastructure. I've worked as an HPC Cluster Technician and Student Information Technician, supporting cluster expansion, job scheduling, and end-user workflows. I document thoroughly and prefer scripts for repeatability.
MPI+Slurm script that verifies node-to-node connectivity and surfaces failures fast.
Hardened dashboard with RBAC, service accounts, and namespaced access.
Documented and deployed secure DNS with SELinux, views, and zone hygiene.
Please let me know if you have any questions.