Site Reliability Engineer (Senior / Staff)

  • Satine Technologies
  • Atlanta, Georgia
  • Full Time
The WorkYou'll build and operate a cloud platform supporting mission-critical software. That means Kubernetes clusters, CI/CD pipelines, observability systems, and infrastructure-as-code - owned end to end, not handed off. You'll work alongside software engineers and security engineers who are building real capabilities. Your job is to make sure what they build actually runs.This is not a ticket-queue SRE role at either level. We want engineers who notice things that need improving and fix them, not engineers who wait to be assigned work.What You'll DoBuild and operate Kubernetes clusters and cloud infrastructureOwn and improve CI/CD pipelines - reliability, deployment safety, rollback capabilityImplement and maintain observability: metrics, logging, alerting, dashboardsWrite Terraform and IaC to provision and manage cloud resourcesParticipate in on-call rotation and lead incident response for platform issuesIdentify and drive reliability improvements - SLO gaps, toil reduction, capacity issuesDocument what you build so the team can operate and extend itLevelsWe're hiring at two levels. Read both - if you're on the boundary, apply and we'll figure it out together.Senior Site Reliability EngineerSalary: $130,000 - $150,000This is the right level if you have solid SRE fundamentals and want to keep deepening them. You'll own real platform components, participate in on-call, and have a clear growth path toward Staff scope. The team is small enough that your contributions are visible from day one.What We're Looking For:5+ years of SRE, DevOps, or systems/infrastructure engineering experienceWorking proficiency with Kubernetes - you understand how it works, not just how to use kubectlHands-on Terraform experienceSolid cloud fundamentals on at least one major platform (AWS, Azure, or GCP)Strong Linux command line - comfortable debugging system-level issuesUS citizenship or Lawful Permanent Resident status (Public Trust eligibility required)Paths In - You Might Be a Fit If You:Are a DevOps or infrastructure engineer who has been doing SRE work without the title and wants to make it officialCome from a development background and have been drawn toward the platform and operations sideHave been at a larger company where the SRE work felt like ticket-pushing, and want real ownership on a smaller teamAre earlier in your SRE career but have done real work - you can point to systems you've built or incidents you've ownedStaff Site Reliability EngineerSalary: $145,000 - $165,000This is the right level if you've been doing serious SRE work for several years and want to own large platform components, not just contribute to them. You'll work directly alongside the Sr Staff SRE who sets technical direction and own significant portions of its implementation. You're expected to bring your own judgment about what needs improving - not wait for it to be assigned.What We're Looking For:7+ years of SRE, platform engineering, or DevOps experienceStrong Kubernetes - you've operated clusters in production, not just deployed workloads to themTerraform or equivalent IaC at real scaleSolid Linux fundamentals - you can debug a system-level issue, understand network behavior, read a flame graphExperience with at least one major cloud platform (AWS, Azure, or GCP)US citizenship or Lawful Permanent Resident status (Public Trust eligibility required)Paths In - You Might Be a Fit If You:Have been doing solid SRE work at a startup, product company, or agency and want to work on systems that matter beyond the business metricHave been the SRE generalist on a small team - you've done everything and want to go deeper on platform reliability specificallyAre a strong infrastructure engineer who has been growing into SRE responsibilities and wants to formalize that transitionHave commercial cloud experience and want to bring those skills somewhere the work has real stakesHelpful but Not Required (Both Levels)Experience with Kafka or event-driven architecturesObservability stack experience: Prometheus, Grafana, ELK, or similarFamiliarity with security or compliance frameworks (FedRAMP, NIST 800-53, SOC 2, or similar)GitOps experience with tools like ArgoCD or FluxExperience with scripting in Python or Bash for automationAbout Satine TechnologiesOur mission is to protect the institutions that underpin free society from cyber threats. We're a small, mission-driven team that works on problems that matter - from offensive security testing for hospitals and banks to building capabilities for national security missions.We invest in people who invest in themselves. This isn't a body shop. You'll work with a team that takes pride in technical craft and cares about developing the people who join us.BenefitsHealth insurance with vision, dental, and HSALife insurance (100% employer-funded)401(k) with 4% matchFlexible PTOTo all recruitment agencies: Satine Technologies does not accept agency resumes. recblid xicoeelhl2mimlel12k3fjvfibshqc Not Specified
Job ID: 522142786
Originally Posted on: 5/22/2026

Want to find more Engineering opportunities?

Check out the 141,924 verified Engineering jobs on iHireEngineering