SRE Engineer
Idemia
Job Description
Since our founding, IDEMIA has been on a mission to unlock the world and make it safer through our cutting-edge identity technologies. Our technology leadership makes us the partner of choice for hundreds of governments and thousands of enterprises in over 180 countries, including some of the biggest and most influential brands in the world. In applying our unique expertise in biometrics and cryptography , we enable our clients to unlock simpler and safer ways to pay, connect, access, identify, travel and protect public places โ at scale and in total security.
Our teams work from 5 continents and speak 100+ different languages. We strongly believe that our diversity is a key driver of innovation and performance. Purpose This role is responsible for maintaining the service-level agreement critical production platforms or products and providing automated operations to ensure the service to our clients is always of the best quality.
Key Missions We are hiring for Site Reliability Engineer role based at Noida. Key Responsibilities: Monitor end-to-end performance and health of both applications and Azure cloud infrastructure. Develop and implement dashboards, alerting mechanisms, and continuous monitoring improvements.
Respond to incidents and tickets, perform root cause analysis (RCA), and provide detailed RCA reports. Work closely with development teams to provide reliability input during product design and development phases. Maintain and operate Docker containers and Kubernetes clusters to ensure application reliability and performance.
Collaborate with stakeholders during planned changes, deployments, and outages with effective communication and planning. Support deployment pipelines and contribute to infrastructure-as-code (IaC) and automation practices. Manage and automate SSL certificate renewals, including tracking expiration and coordinating updates across environments.
Mandatory Skills: Between 4 to 8 Years with Kubernetes in production environments. Hands-on experience with Microsoft Azure services and infrastructure monitoring. Experience working with Linux and MySQL databases.
Proven ability to monitor and maintain highly available, distributed applications. Proficient in creating and managing dashboards and alerts using tools such as Prometheus, Grafana, Azure Monitor, etc. Familiarity with incident management, ticket resolution, and RCA documentation.
Understanding of CI/CD pipelines and version control (e.g., bitbucket, Git, sourcetree). Understanding of mutual SSL (mSSL) concepts and configuration for secure service-to-service communication. Strong communication and collaboration skills.
Profile & Other Information