Lead SRE Engineer
Relativity
Job Description
Posting Type Hybrid Job Overview Lead Site Reliability Engineer (SRE) is responsible for driving customer confidence by assuring the quality of Relativity's current and future software products and to contribute to a motivating environment that empowers teams and individuals to engineer performant and reliable software through SRE best practices and principles. At it's core the Lead Site Reliability Engineer is responsible for the system availability and the reliability of key platform services and applications, with central focus on cross-service reliability of RelativityOne. Job Description and Requirements Role Responsibilities : Proactivelymonitoringand actioning found items related to baseline performance and reliability ofRelativityOne'score cloud product.
Providing feedback to Engineering teamsregardingareas of the software that require increased reliabilitymonitoringand alerting when gaps areidentified. Plan andCoordinatemid-size software projects that provide SRE enhanced tools and systems to improve SRE operational challenges. Develop software and tools with mastery of Agile software development best practices.
Exhibit technical leadership throughsystem design and implementation,in order tominimize assessed risk. Lead and organize team members through onboarding and coachingactivities. Work with SRE leadershipin order toidentifyskillsand training opportunities for other SRE team members Represent SRE team goalsand progress on key results, throughengagement with key stakeholders within engineering, service delivery, and productverticals.
Coordinates with product management, management, and team members on projects Lead customer consultation engagementprocesses related tolarge and complex workspace challenges and workflows that help drive the way we scale andmonitorwithinRelativityOne'score products. Support Client Services by offeringexpertiseto customers on performance related incidents and requests, including internal SRE escalations forourmost complexcustomer challenges. Support Problem and IncidentManagers by providing informationregardingtrends of recurring issues within the application and cloud infrastructure.
Planning, developing, and delivering tools and systems to improve SRE operational challenges through Agile software development best practices. Takes Technical leadership and coaching of SRE Team members. Lead the team in application of ablameless postmortem culture; responsible for any post-incident actions that involve the development or optimization of any part of the software development lifecycle or the incident lifecycle.
ProactivelyIdentify areas for improvement,leadpost-incident reviews, and driving initiatives to enhance system reliability, performance, and operational efficiency. Documentingthe teamknowledge, continuously improving processes and infrastructure to enhance platform reliability and performance. Active participation in the recruitment and onboarding process of new team members.
Demonstrating consistent commitment tocompanycorevalues. Performingadditionalduties as assigned. Preferredqualifications: 7+ years of experience in Site Reliability Engineering or DevOps.
Experience with other tools such as SQL and NoSQL databases and orchestration services. Experience in dealing with MS Azure(AS-900, AS-104, PowerShell). Good knowledge ofasoftwarearchitecture anddesignpatterns (Kubernetes).
Experience ina systemmonitoringand alerting. Experience in usingJIRA,New Relic, Jenkins, Tableau. Experience with Relativity Server orRelativityOnesoftware product (RCA Preferred) Good knowledge ofagile methodologies and a rapid development cycle.
Experience with DevOps practices, including CI/CD. Very goodknowledge of English (spoken & written). Excellent problem-solving and communication skills.
Excellent analytical skills. Meticulous attention to detail. An eagerness to learn, explore, and introducenew technologies.
Ability to workindependently andefficiently under pressure, drive projects to completion and meet deadlines. Ability to work in a fast-paced and dynamic environment. Ability to work as a team player in a cross-functional team to develop practical solutions and ensure positive user experiences.
Soft skills : Strong problem-solving and decision-making skills. Willingness to takeownership and drive topics end-to-end. Personalinitiative, commitment, perseverance, and resilience.
Well-developedcommunication andteamwork skills. An innovative approach and a well-founded way of working. Aspiration forDevOps principles and SRE engineering excellence Drive to empower your colleagues.
Relativity is committed to competitive, fair, and equitable compensation practices. This position is eligible for total compensation which includes a competitive base salary, an annual performance bonus, and long-term incentives. The expected salary range for this role is between following values: $150,000 and $224,000 The final offered salary will be based on several factors, including but not limited to the candidate's depth of experience, skill set, qualifications, and internal pay equity.
Hiring at the top end of the range would not be typical, to allow for future meaningful salary growth in this position. Required Skills: Documentations, Innovation, Leadership, Problem Solving, Process Improvements, Project Management, Quality Assurance (QA), Risk Management, Technical Knowledge, Troubleshooting