โšก New

Infrastructure Engineer

Maxana

San FranciscoFull-timeMid LevelOn-site

Job Description

Job Description Job Description Maxana is seeking an experienced Infrastructure Engineer for a confidential client โ€” a fast-growing AI company. In this role you will build and maintain the platform layer supporting large-scale ML training, inference, and deployment. This is a high-impact role at the intersection of cloud infrastructure and ML systems.

Key Responsibilities Build and maintain infrastructure supporting large-scale ML training and inference workloads Work with GPU and compute infrastructure, distributed systems, and cloud-native platforms Improve reliability, observability, and performance across the platform layer Collaborate directly with senior engineers and product teams on architecture decisions Own production reliability โ€” monitoring, incident response, and proactive risk reduction Develop and maintain internal tooling and automation to support engineering operations Requirements 5+ years of infrastructure or platform engineering experience in a production environment Strong distributed systems background โ€” experience with large-scale compute workloads preferred Cloud-native infrastructure experience โ€” AWS, GCP, or Azure; Docker and Kubernetes required Familiarity with ML infrastructure a strong plus โ€” training pipelines, inference serving, GPU workloads Experience owning production reliability end to end Benefits Competitive base salary ($130,000-$240,000) + equity Medical, dental, and vision Flexible paid time off Learning and development stipend Working at the forefront of AI infrastructure at scale

Posted Yesterday

Related Jobs

Related Searches

Apply Now