Senior Site Reliability Engineer

I4D*** · United States · Remote
Remuneration
Not specified
Location
United States · Remote
Eastern Daylight Time (UTC-4)
Visa sponsorship
Sponsors visa

Job summary

About Our Team

Our employees thrive in a culture that is fast-paced, collaborative, and ego-free, where innovation and teamwork are encouraged at every level. We provide Federal agencies with immediate access to highly skilled professionals who understand complex mission challenges and deliver efficient, scalable solutions.

Qualifications

  • Veterans and military spouses are strongly encouraged to apply and bring their valuable experience to our team.
  • including secure configuration, least privilege, vulnerability remediation, and policy-based controls.
  • Partner with cybersecurity and engineering teams to support secure-by-design infrastructure and application delivery practices.
  • Help ensure operational processes and automation align with compliance expectations for Federal and VA environments.
  Cross-Functional Collaboration:
  • Collaborate with development, platform, operations, monitoring, incident management, and architecture teams to improve service reliability and deployment outcomes.
  • Work closely with the Technical Director and team leads to translate technical direction into actionable engineering improvements and operational standards.
  • Support Agile and SAFe delivery practices by helping teams adopt reliable release processes, operational readiness checks, and continuous improvement measures.
  Incident Support & Continuous Improvement:
  • Participate in incident response, service restoration, root cause analysis, and post-incident reviews for critical systems and services.
  • Identify recurring issues, reliability gaps, and failure patterns, and drive corrective actions through automation, architectural improvements, and process refinement.
  • Contribute to on-call readiness, operational documentation, and blameless continuous improvement practices that improve resilience and reduce mean time to recovery.

Responsibilities

  • If you enjoy expanding your technical expertise while supporting impactful Federal initiatives, you will thrive within our organization.
  Site Reliability Engineering & Service Ownership:
  • Partner with the Technical Director to implement and mature Site Reliability Engineering (SRE) practices across platform services and hosted applications.
  • Improve the full service lifecycle from design and deployment through operation and continuous refinement, with a focus on availability, latency, performance, efficiency, and capacity.
  • Define, track, and report service level indicators (SLIs), service level objectives (SLOs), and error budgets to guide engineering decisions and service improvements.
  Automation, CI/CD & Infrastructure as Code:
  • Build, enhance, and maintain CI/CD pipelines that enable secure, automated, and repeatable application and infrastructure delivery.
  • Develop and support Infrastructure as Code (IaC) and configuration automation using

Skills

Teamwork

Certifications

AWS Certified · AWS Certified Solutions Architect · CKA · Certified Kubernetes Administrator · Terraform Associate

Degrees

Associate Degree

Languages

Arabic

Work schedule

24/7 · Night · On-call

Industry

Automotive · Defense · Energy · Healthcare · Media · Oil and gas · Public sector

Company size

Enterprise · SMB

Security clearance

Public trust · Security check