Infrastructure Engineer
Maxana
Job Description
Job Description Job Description Maxana is seeking an experienced Infrastructure Engineer for a confidential client โ a fast-growing AI company. In this role you will build and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection of cloud infrastructure and ML systems.
Key Responsibilities Build and maintain infrastructure supporting large-scale ML training and inference workloads Work with GPU and compute infrastructure, distributed systems, and cloud-native platforms Improve reliability, observability, and performance across the platform layer Collaborate directly with senior engineers and product teams on architecture decisions Own production reliability โ monitoring, incident response, and proactive risk reduction Develop and maintain internal tooling and automation to support engineering operations Requirements 5+ years of infrastructure or platform engineering experience in a production environment Strong distributed systems background โ experience with large-scale compute workloads preferred Cloud-native infrastructure experience โ AWS, GCP, or Azure; Docker and Kubernetes required Familiarity with ML infrastructure a strong plus โ training pipelines, inference serving, GPU workloads Experience owning production reliability end to end Benefits Competitive base salary ($130,000-$240,000) + equity Medical, dental, and vision Flexible paid time off Learning and development stipend Working at the forefront of AI infrastructure at scale