All roles

Site Reliability Engineer lll

Remote · USA Full-time New today

# Site Reliability Engineer III ## About AbsenceSoft At AbsenceSoft, we're transforming the employee experience. Our secure, intuitive technology helps employers bring humanity, certainty, and efficiency to some of the most complex moments in the workplace. Built by HR professionals for HR professionals, we're proud of where we've been and even more excited about where we're going. We're looking for a senior Site Reliability Engineer to join our small, high-ownership SRE team. In this hands-on individual contributor role, you'll own the reliability, scalability, and security of AbsenceSoft's production infrastructure on AWS — supporting a B2B SaaS platform that processes sensitive employee leave data for enterprise customers. You'll work closely with infrastructure, application engineering, product leadership, and cross-functional partners in Security and Compliance, with a clear path to grow toward a Tech Lead opportunity as our team and platform continue to mature. ## What You'll Do - Architect, implement, and operate scalable, resilient, and secure AWS infrastructure — including GuardDuty, Lambda, EventBridge, SNS, SES, S3, ALB, and ECS container workloads. - Lead infrastructure-as-code initiatives to ensure all environments are reproducible, auditable, and consistently configured in support of SOC 2 change management controls. - Design, maintain, and improve CI/CD pipelines using Jenkins and GitHub to enable reliable, repeatable software delivery — partnering with application engineering to reduce release risk and increase deployment frequency. - Own the Datadog observability platform, including dashboards, monitors, alerting thresholds, and log management; define and maintain SLOs, SLIs, and error budgets to guide reliability investment and reduce alert fatigue. - Serve as a senior technical responder across the full incident lifecycle — detection, containment, resolution, and postmortem — within a shared on-call rotation, and lead blameless postmortems to drive down incident frequency and MTTR. - Refine, implement, and test disaster recovery plans to meet RTO/RPO objectives, while contributing to SOC 2 audit readiness with a focus on access controls, incident response, and risk mitigation. - Mentor junior SREs through code reviews, incident pairing, and documentation of runbooks and engineering standards. - Participate in a highly compliant environment and assist in maintaining company security and compliance controls. - Other duties as assigned. ## What You'll Bring - 5+ years of experience in SRE, DevOps, or a related engineering role, with advanced hands-on expertise in AWS production environments and core services including Lambda, ECS, S3, ALB, and GuardDuty. - Strong proficiency in infrastructure-as-code tooling such as Terraform, CloudFormation, or CDK, paired with experience building and operating CI/CD pipelines using Jenkins and GitHub. - Proficiency in Python, Go, or Bash for automation, alongside hands-on experience with Datadog or a comparable observability platform for monitoring, alerting, and log management. - Demonstrated experience leading incident response in complex, distributed systems, with working knowledge of SLO/SLI frameworks, error budgets, and disaster recovery planning against defined RTO/RPO objectives. - Familiarity with SOC 2 compliance frameworks and experience contributing to audit readiness, access controls, and security control evidence collection. - A collaborative, ownership-driven mindset with strong communication skills, a passion for mentoring junior engineers, and a commitment to reducing toil through automation and AI-assisted tooling. ## Company Values

  • *Lead with Innovation** - We create meaningful change through intelligence, focus and passion. We embrace curiosity, data, and insight to shape the future of our industry. Always innovating, learning and evolving.
  • *Elevate Every Voice** - Every perspective matters. We listen, learn, and build a culture where diversity of thought and experience drives better solutions and smarter decisions.
  • *Achieve Together** - The customer fuels everything we do. We share knowledge, collaborate, celebrate wins, and face challenges as one team because success is always a collective achievement.
  • *Drive Outcome** - Every action we take delivers measurable value to our teams, our customers, and the employees they support. Accountability is non-negotiable. We honor our commitments, take responsibility for results, and see every success and setback as a chance to grow stronger.

## What We Offer -

Impact that matters

— You'll do work that shapes the future of the modern workplace. -

Flexibility and trust

— We're remote-first and results driven. You'll have the freedom and flexibility to do your best work, wherever you do it best. -

Growth and development

— We believe the best work happens when people are growing. You'll have access to learning resources, leadership programs, and real opportunities to ta Apply To This Job

Related roles

Senior Site Reliability Engineer - Remote EST

Remote · USA Full-time

Senior DevOps Engineer/Site Reliability Engineer-East Coast

Remote · USA Full-time

Distinguished Site Reliability Engineer - Cloud

Remote · USA Full-time

Senior Site Reliability Engineer, APAC

Remote · USA Full-time

(Senior) Site Reliability Engineer (m/f/d) – Platform & Agentic Operations

Remote · USA Full-time

Senior Site Reliability Engineer (SRE) - (GCP)

Remote · USA Full-time

Kubernetes Engineer - Remote

Remote · USA Full-time

Senior Kubernetes Engineer – Secret Eligible

Remote · USA Full-time

Principal Customer Engineer, Openstack, Kubernetes

Remote · USA Full-time

Sr. Kubernetes Engineer (Secret Eligible)

Remote · USA Full-time

Risk Control Consultant (SRT) (Northern California)

Remote · USA Full-time

Experienced Customer Experience Representative - Mom & Baby: Join arenaflex in Revolutionizing Home Health Products and Equipment Industry

Remote · USA Full-time

Experienced Full Stack Security Specialist – Bug Bounty and Vulnerability Management

Remote · USA Full-time

Application System Administrator

Remote · USA Full-time

Remote Chat Moderator Roles - Entry-Level Opportunities Earning $25-$35 Per Hour

Remote · USA Full-time

Clinical Pharmacist for Medicare STARS - Remote

Remote · USA Full-time

Experienced Full Stack Sales Agent – Data Entry & Customer Engagement Specialist – Work From Home Opportunity

Remote · USA Full-time

Brand Manager - Client Facing - Fully Remote

Remote · USA Full-time

Experienced Chat Support Associate – Veterinary Professionals Community Engagement

Remote · USA Full-time

Licensing Specialist (Sync)

Remote · USA Full-time