GPU ML Pipeline on AWS with Terraform
December 2025 6 min read
I built a Terraform module for running ML workloads on AWS GPU instances because spinning up infrastructure manually for each experiment was getting old. This post walks through the architecture decisions, the gotchas I hit with AWS Batch and Docker, and how it all fits together into something actually usable.