Senior Software Engineer (Production Engineering)
Pure Storage
Job Description
Requirements Strong ownership and accountability, with the ability to collaborate and influence across teams and time zones, while continuously learning and helping others level up on CI and release operations, and keeping mainline branches healthy , Calm under pressure, leading high-impact incidents while maintaining focus and clarity , 8+ years of experience in software engineering, production engineering, SRE, or DevOps, including designing, building, and owning backend services or tooling (not just ad-hoc scripts), with proven experience leading technical projects or initiatives that span multiple teams or services, especially in CI, test, or release domains , Strong software engineering fundamentals and distributed systems understanding, with a track record of improving reliability, performance, or overall product quality , Proficiency in at least one modern programming language (e.g. Python, Go) and strong scripting skills (e.g. Bash, Python) for automation and diagnostics , Hands-on experience with CI concepts and tooling (e.g.
Jenkins), including pipeline-as-code, job configuration (often via Groovy or equivalent DSL), managing multi-job CI workflows, and using database/analytics tools (e.g. SQL) and metrics to analyze and improve CI and release processes , Solid Linux/Unix fundamentals and demonstrated ability to debug issues across OS, network, storage, and application layers in production or CI environments What the job involves As a Senior Software Engineer in Production Engineering, you will own the continuous integration (CI) and test-orchestration platform that enables FlashArray and FlashBlade teams to ship high-quality products at predictable velocity , This is a developer-first, highly visible role with real operational responsibility: you will design, build, and run the systems behind code merges and release operations, with a strong focus on developer experience , Youโll turn quality gates into automation instead of process, and use data and observability to continually improve reliability, throughput, and time to signal , You will work closely with engineering teams across the organization to accelerate Everpure innovation , Design and implement CI and test-orchestration services, tools, and pipelines that provide fast, reliable, high-quality feedback for engineers across FlashArray and FlashBlade , Build and extend developer-facing automation (services, CLIs, APIs, scripts) that simplify onboarding, configuration, and debugging of CI and release workflows , Define and encode quality gates and signals (run rates, pass rates, flakiness thresholds, coverage, escape metrics) directly into CI and release systems , Develop and maintain metrics, queries, and dashboards that give clear visibility into CI and release health, and use them to drive targeted improvements , Integrate and evolve observability (metrics, logs, traces, alerts) for CI and test-orchestration services to reduce time-to-detect and time-to-resolve incidents , Participate in on-call duty rotations for CI and test-orchestration infrastructure; lead incident triage and mitigation, bring order to chaos, and ensure thorough follow-through on root-cause fixes , Lead or contribute to cross-functional projects with engineering teams and TPMs to improve CI and release quality, developer experience, and overall reliability #J-18808-Ljbffr