Where is this Site Reliability Engineer (SRE) job located?

This position is located in Deerfield.

⚡ New

Site Reliability Engineer (SRE)

Veriipro

DeerfieldFull-timeMid LevelOn-site

Job Description

We are looking for an experienced Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of Azure-based services in a large-scale enterprise environment. This role involves managing cloud infrastructure, enhancing observability, implementing disaster recovery strategies, and driving reliability improvements through SLOs/SLIs and automation. Key Responsibilities Define and manage SLOs, SLIs, and Error Budgets for Azure-hosted services, reporting SLA compliance to stakeholders.

Lead architectural reviews, ensuring reliability targets (availability, RTO/RPO) are met from design to production. Implement chaos engineering practices and conduct disaster recovery drills across Azure regions. Serve as Incident Commander for P1/P2 incidents, owning the incident lifecycle and post-mortem actions.

Design and operate enterprise observability using Azure Monitor, Log Analytics, Application Insights, and Grafana. Develop alerting frameworks and automate self-healing operations with Azure Automation and scripting (Python/PowerShell). Embed reliability gates in CI/CD pipelines and manage AKS cluster reliability (scaling, upgrades, security).

Enforce infrastructure-as-code best practices with Terraform/Bicep for Azure Landing Zones. Required Qualifications 7+ years in SRE, platform engineering, or cloud infrastructure in large-scale environments. 4+ years of hands-on Azure experience with AKS and cloud engineering. Expertise in Terraform (required), Bicep, and managing Azure Landing Zones.

Proficiency in Python, Go, or PowerShell scripting. Experience with Azure observability tools (Monitor, Log Analytics, Application Insights). Proven track record of owning SLOs/SLIs and improving production reliability. #J-18808-Ljbffr

Posted Yesterday

Related Jobs

Software Engineers

L3Harris Technologies

Garland Today

Full-time

Sr. Software Development Engineer, Support Engineering Tech

Amazon Web Services, Inc.

Atlanta Today

Full-time

Senior Software Engineer Staff - Clearance Required

Lockheed Martin

Lanham Today

Full-time

Senior Software Development Engineer, AWS Infrastructure Supply Chain Automation

Amazon Web Services, Inc.

Seattle Today

Full-time

Business Developer II- Infrastructure (Hiring Immediately)

Walter P Moore

Austin Today

Full-time

Software Engineer Mid-level (Java Full Stack)

USAA

Phoenix Today

Full-time

SAP iXp Intern - Mobile Developer

SAP

Naperville Today

Full-time

Software Quality Assurance Analyst

Belvedere Trading

Chicago Today

Full-time

Explore More Jobs

Browse jobs by category, type, and pay level

Job Search High Paying Jobs Remote Jobs Part Time Jobs Full Time Jobs No Experience Jobs Entry Level Jobs Tech Jobs Healthcare Jobs Finance Jobs Marketing Jobs Engineering Jobs Hiring Near Me Local Jobs Salary Jobs Weekend Jobs Best Part Time Jobs Data & Analytics Jobs Sales Jobs

Browse all jobs »

Apply Now

Site Reliability Engineer (SRE)

Job Description

Related Jobs

Related Searches

Explore More Jobs