Site Reliability Engineer II
Todyl
Job Description
About The Role
At Todyl, our Application Platform Engineering team is dedicated to building infrastructure, services and patterns that enable our application development teams to quickly and safely deploy services at the core of our security offering. As a member of this innovative team, you will play a pivotal role in designing and engineering cutting-edge solutions that are highly performant, highly resilient and low-maintenance. Your work will not only directly impact the reliability and security of our platform but also empower the engineering team to continuously push the boundaries of what's possible in the security space.
Responsibilities
As a Platform SRE (Site Reliability Engineer) at Todyl, you will develop tools and services that support Todyl's application hosting infrastructure, including but not limited to Kubernetes-based environments. Build automation to improve reliability and reduce human interaction for Day-2 operations, with an emphasis on infrastructure-as-code practices. Implement and enforce security policies, access controls, and system patching, treating security hygiene as a first-class operational responsibility.
Own attack-surface management for production infrastructure: identify exposure, prioritize remediation, and drive CVE resolution to completion rather than leaving findings unactioned. Operationalize security tooling by building integrations, establishing remediation workflows, and ensuring findings are consistently acted upon. Own features and services through deployment and stabilization: work isn't done until it's stable in production and documented.
Collaborate with product and engineering teams to deliver solutions that meet the needs of stakeholders and the business. Improve application monitoring and alerting to minimize time to detect and time to restore, and review dashboards and logs to verify that deployments succeeded. Identify and drive cost-optimization opportunities, including resource labeling, right-sizing, and efficiency improvements, to reduce COGS. Participate in a weekly on-call rotation, resolve most issues independently, and update runbooks and documentation after incidents.
Requirements
Must have: experience managing Kubernetes and applications running on Kubernetes.
Must have: general competency in one or more scripting or programming languages, including Python or Bash.
Must have: demonstrated experience identifying and remediating vulnerabilities in production infrastructure, including CVE triage and remediation workflows.
Experience managing production Linux systems at scale. Working knowledge of REST APIs. Familiarity with networking fundamentals and common attack-surface concepts (exposed services, misconfigured access controls, unpatched dependencies).
Comfort with cloud security tooling and the ability to operationalize findings into actionable remediation work. Comfort with cloud cost management concepts, including resource tagging and cost attribution strategies. Breaks work into incremental deliverables; communicates delays early and tracks progress against estimates.
Writes and maintains tests for the automation and tooling they build; proactively considers edge cases and failure conditions. Ability to quickly learn new concepts, frameworks, and technologies, including AI-assisted tools to accelerate development and reduce toil. Comfortable building and maintaining production services with a strong sense of ownership from build through stabilization.
Production experience using CI/CD for code deployment. Experience with on-call rotations and incident response processes.
What we Offer
Health & Wellbeing
Medical, dental, and vision coverage for you and your family
HSA/FSA options
Life insurance and short- and long-term disability coverage
Financial & Future
Competitive 401(k) to invest in your future
Short- and long-term disability coverage for when life gets unpredictable
Flexibility & Time Off
Hybrid work schedule
Flexible PTO + 13 company holidays
Generous parental leave
Todyl provides equal employment opportunities to all employees and applicants for employment without regard to race, color, religion, gender, sexual orientation, transgender status, gender identity or expression, national origin, age, disability, marital status, genetic information, military status or any other status protected by applicable federal, state or local laws.
Compensation Range: $130K - $160K