Site Reliability Engineer
Tata Consultancy Services
Job Description
Role : DevOps & Site Reliability Lead - Retail Job Description Containers & Orchestration (AKS, Docker) Networking (VNETs, Private Endpoints, Application Gateway, Load Balancers) Proven experience designing enterprise‑grade, highly available cloud platforms Strong understanding of hybrid and multi‑cloud architectures (AWS / GCP exposure preferred) Advanced experience with Azure DevOps and CI/CD pipeline architecture GitOps concepts, branching strategies, release orchestration Site Reliability Engineering (Leadership Level) Ownership of platform reliability, resiliency, and performance Definition and governance of: SLIs, SLOs, SLAs Error budgets and reliability metrics Advanced observability strategy, designing and implementation: Incident response leadership, RCA facilitation, and long‑term remediation planning Experience operating 99.9%–99.99% availability systems Containers, APIs & Integration Leadership‑level experience with AKS‑based platforms, ingress, and scaling strategies Understanding of microservices, API‑led and event‑driven architectures Familiarity with Azure Integration Services (Service Bus, Event Hub, API Management) Security, Compliance & Cost Secure cloud design using Key Vault, managed identities, RBAC Act as Lead SRE for Retail platforms, owning reliability and stability outcomes Define and enforce SRE standards, best practices, and operating models Architect and govern highly available, scalable cloud platforms Lead the design and implementation of CI/CD and IaC strategies Establish proactive monitoring, alerting, and incident prevention mechanisms Own major incident leadership, RCA execution, and corrective action tracking Partner with application, security, and architecture teams to build reliability by design Drive automation to reduce toil and improve operational efficiency Mentor and coach SRE and DevOps engineers across teams Influence roadmap decisions with a reliability, scalability, and cost lens #J-18808-Ljbffr