🕐 Posted 6d ago

Principal Engineer, Federal Cloud Platform

Saviynt

RemoteFull-timeMid LevelRemote

Job Description

Job Description

Job Description

About Saviynt


Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for some of the world’s largest enterprises and government institutions. Built for the AI era, Saviynt enables organizations to move faster—securely and compliantly.

 


Why This Role Matters


Saviynt’s platform is mission-critical for our customers. As we scale globally, reliability, availability, and performance are not optional—they are core product features.


As a Principal  Engineer, you will define and drive the reliability strategy for our SaaS platform. This is a high-impact, hands-on engineering role with broad influence across infrastructure, platform, and application teams. You will shape how Saviynt designs, operates, and measures reliability at scale.


This role is ideal for engineers who want to work on hard reliability problems, influence architecture across teams, and leave a lasting mark on a growing SaaS platform in Federal.

 


What You’ll Do


In this pivotal role, you will be instrumental in designing, building, and maintaining the shared infrastructure services and platforms that our product and application teams will depend on


Vulnerability management, holding teams accountable to meet customer facing Service Level Agreements (SLAs)


Designing Continuous Delivery (CD) processes for government deployments that will eventually be used commercially.


Develop robust, internal-facing tools and automation for infrastructure provisioning and management primarily using Go (Golang) or Python


Architect and optimize foundational solutions within Cloud environments (AWS, Azure, etc.), focusing on creating reusable patterns and modules for other teams


Design and implement shared Event-Driven Architecture components and messaging platforms using technologies like Kafka or Google Pub/Sub that product teams can easily utilize


Design and build resilient Distributed Systems components that serve as building blocks for other applications, focusing on reliability, fault tolerance, and performance


Manage and optimize our shared infrastructure across Multi-Region Cloud Environments, ensuring that platform services are globally available and performant for all consumers


Establish and enhance centralized Observability and Monitoring platforms and tools that provide self-service insights for consuming teams


Define and implement clear, well-documented RESTful API designs for the infrastructure services you build, ensuring ease of integration for internal clients


Implement and manage Service Mesh (e.g., Envoy, Istio) capabilities, providing traffic management, security, and policy enforcement as a shared platform for services


Design, implement, and optimize highly available Relational Database services or shared data platforms for broad organizational use


Collaborate closely with product development teams to understand their infrastructure needs and pain points, providing technical guidance and support


Participate in on-call rotations to support the critical shared infrastructure you build


 

What We’re Looking For


9+ years of experience in an Infrastructure Development, Platform Engineering, or Site Reliability Engineering role, with a strong focus on building tools and services for other engineers


Deep expertise with Kubernetes in production environments, particularly in providing it as a platform(i.e single tenant and multi-tenant deployment architectures)


Strong programming skills in Go (Golang) and Python, with experience building robust, maintainable backend services and automation


Extensive hands-on experience with at least one major Cloud Provider (AWS, GCP, or Azure); multi-cloud experience is a strong plus, especially in building abstractions over them


Proven experience designing and implementing Event-Driven Architecture and message queuing systems (e.g., Kafka, RMQ, NATS) as shared services


Solid understanding and practical experience with CI/CD pipeline tools (especially GitLab CI) and experience establishing automated delivery processes for other teams


Demonstrable experience designing and operating Distributed Systems, with an understanding of patterns for creating reliable, shared components


Familiarity with Multi-Region Cloud Environments and strategies for building globally distributed and highly available platform


Proficiency in establishing and utilizing comprehensive Observability and Monitoring platforms (e.g., Prometheus, Grafana, ELK stack, Datadog) for shared infrastructure


Strong experience with RESTful API design principles and building well-documented, consumable APIs


Knowledge of Service Mesh concepts and practical experience with solutions like Istio in a platform context


Hands-on experience with Relational Databases (e.g., MySQL, PostgresSQL), ideally in managing them as a service


Excellent communication skills and the ability to clearly articulate complex technical concepts to both technical and non-technical audiences


A strong customer-centric mindset, treating internal development teams as your primary customers


Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience required



Nice-to-Have Qualifications


Experience with FedRAMP compliance and government security requirements


Track record of implementing secure CI/CD pipelines in restricted or regulated environments

 


Why Join Saviynt


•        Work on a mission-critical SaaS platform used by global enterprises

•        Solve complex reliability challenges at scale

•        Influence architecture and engineering culture at a company level

•        Competitive compensation, benefits, and growth opportunities

 


Security & Compliance


This role requires compliance with Saviynt’s information security and privacy policies, including annual security training


We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Posted 6 days ago

Related Jobs

Related Searches

Apply Now