Elastic Site Reliability Engineer (SRE)
Zachary Piper Solutions
Job Description
Zachary Piper Solutions is seeking an experienced Elastic Site Reliability Engineer (SRE) to support a highâvisibility federal engagement focused on observability, platform reliability, and security operations across classified environments. This position will support missionâcritical Elastic infrastructure deployments at either Hanscom AFB, MA or Langley AFB, VA. The ideal candidate will have deep expertise supporting enterprise Elastic Stack environments, Kubernetesâbased deployments, and production SRE operations within secure or regulated infrastructure environments.
Key Responsibilities Operate, maintain, and optimize largeâscale Elastic Stack environments supporting logging, search, observability, and telemetry operations. Ensure platform reliability, uptime, scalability, and performance across production mission systems. Manage Kubernetesâbased Elastic deployments, including ECK operator environments.
Develop and maintain automation for deployment workflows, monitoring, alerting, and incident response processes. Integrate Elastic infrastructure with SIEM and security tooling including Splunk, EDR platforms, and telemetry systems. Troubleshoot complex issues across distributed systems, infrastructure, and application environments.
Implement and support observability frameworks including logging, metrics, tracing, and monitoring solutions. Support CI/CD pipelines and infrastructureâasâcode initiatives within DevOps environments. Maintain operational runbooks, escalation procedures, and technical documentation.
Participate in onâcall support rotations and incident response activities. Qualifications 5+ years of experience supporting Site Reliability Engineering, DevOps, or infrastructure operations environments. Strong handsâon experience with Elastic Stack in enterprise production environments.
Advanced Kubernetes experience, including ECK operator deployments. Strong Linux/Unix administration and networking fundamentals. Experience supporting observability, telemetry, logging, and monitoring platforms.
Experience working within secure, classified, federal, or highly regulated environments. Ability to work onsite at Hanscom AFB (MA) or Langley AFB (VA). NiceâtoâHaves Elastic certifications including Elastic Engineer, Security, or Observability.
Experience with Terraform, Ansible, and CI/CD pipeline automation. Exposure to SIEM and EDR technologies including Splunk, CrowdStrike, or Trellix. Experience supporting GovCloud, DoD, or federal infrastructure environments.
Prior experience supporting distributed logging or telemetry platforms. Soft Skills Strong incident response and operational troubleshooting mindset. Ability to remain calm and effective during production outages or highâpressure situations.
Strong collaboration skills across security, infrastructure, DevOps, and operations teams. Excellent communication skills for escalation and operational coordination environments. Selfâsufficient and capable of operating independently within classified environments.
Compensation & Benefits Target compensation: $180,000 â $200,000 annually. Longâterm federal engagement supporting missionâcritical infrastructure initiatives. Opportunity to support advanced observability and security operations within classified environments. #J-18808-Ljbffr