⚡ New

Site Reliability/Observability Engineer (SREs)

E-Solutions

Phoenix, ArizonaFull-timeMid LevelOn-site

Job Description

Hello, Job Title: (Site Reliability/Observability Engineer (SREs).) Phoenix, AZ Job Description: Objectives of this role: Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions.

Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement. Provide primary operational support and engineering for multiple large-scale distributed software applications. Responsibilities: Gather and analyse metrics from operating systems as well as applications to assist in performance tuning and fault finding.

Partner with development team, Data Scientist, MLOps Architect/Engineers to improve services through rigorous testing and release procedures. Participate in system design consulting, platform management, Troubleshooting production issues and capacity planning. Create/manage sustainable systems and services through automation and uplifts.

Balance feature development speed and reliability with well-defined service-level objectives Required skills and qualifications: Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript Experience in working with such as Amazon S3, Sagemaker, Amazon Bedrock Excellent knowledge working with cloud-native infrastructure, such as AWS Lambda, OpenShift Good understanding of API management and should be able to troubleshoot API related issues. Automation Mindset to manage cloud infrastructure using AWS CloudFormation/Terraform Impeccable creative and communication skills. Ability to problem solve in a fast-paced, high-stakes environment.

Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

Posted Yesterday

Related Jobs

Risk Adjustment Coding Specialist II (South Bay/LA/OC)

Astrana Health, Inc.

Villa Park, California Today 2 views

Risk Adjustment Coding Specialist II (South Bay/LA/OC) Department: Quality - Risk Adjustment Employment Type: Full Time Location: 600 City Parkway West 10th Floor, Orange, CA 92868 Reporting To:...

Full-time On-site Mid Level

IRT Manager

Sumitomo Pharma

Blue Hills, Connecticut Today 2 views

Sumitomo Pharma Co., Ltd., is a global pharmaceutical company based in Japan with operations in the U.S. (Sumitomo Pharma America, Inc.), focused on addressing patient needs in oncology, urology,...

Full-time On-site Mid Level

Assistant Manager-Product Engineering-Fuses

S&C

Schiller Park, Illinois Today 2 views

Job Description As an S able to liaise with internal and external stakeholders at and present compellingly. Rounded project management skills with the ability to lead projects and product...

Full-time On-site Mid Level

Data Center Build Engineer

Oracle

South Des Moines, Iowa Today 2 views

Job Description The Data Center Build Engineering Team designs and builds physical data center infrastructure to create capacity that supports Oracle Cloud Infrastructure in different regions around...

Full-time On-site Mid Level

Cvent Technologist

American Express Global Business Travel

South Des Moines, Iowa Today 2 views

Amex GBT is a place where colleagues find inspiration in travel as a force for good and - through their work - can make an impact on our industry. We're here to help our colleagues achieve success...

Full-time On-site Mid Level

Professor, Internal Medicine-Geriatrics

UTMB Health

Tiki Island, Texas Today 2 views

Professor, Internal Medicine-Geriatrics Galveston, Texas, United States Faculty UTMB Health Requisition 2600355 POSITION OVERVIEW The University of Texas Medical Branch (UTMB) seeks a distinguished...

Full-time On-site Mid Level

Scrum Master

Akima

Fort Sam Houston, Texas Today 2 views

The Scrum Master will lead and facilitate Scaled Agile Framework (SAFe) planning activities across the enterprise, ensuring alignment between business strategy and delivery execution. This role...

Full-time On-site Mid Level

Related Searches

Apply Now