Staff Site Reliability Engineer

United States – Remote | Full-Time | $164k - $226k per year

Job Description

- Collaborate with engineering and product teams to architect and implement the infrastructure and services necessary for cloud-native, event-driven feature delivery.
- Refine and advance Infrastructure as Code (Terraform) and Configuration Management (Helm) strategies to optimize scaling and promote self-service capabilities for engineering teams.
- Identify and resolve system bottlenecks within AWS and the Kubernetes platform.
- Maintain customer-facing uptime of 99.99%.
- Continuously enhance platform monitoring and alerting for proactive issue resolution.

Qualifications

1. 8+ years of professional SRE/DevOps experience with a proven track record on high-volume production systems.
2. Expertise in systems architecture, demonstrated through the resolution of complex technical challenges and the implementation of company-wide solutions.
3. Advanced knowledge of AWS services and technologies (ALB/ELB, IAM permissions, DynamoDB, SNS, EKS/Fargate, etc.).
4. Experience with infrastructure as code and configuration management (Terraform and Helm charts) for designing and provisioning new services.
5. Proficiency in Python, Bash, or other scripting languages; Ruby or Golang experience is a plus.
6. Strong ownership and drive to collaborate and implement improvements in production.

Benefits

- Health insurance
- Pharmacy benefits
- Optical care benefits
- Dental care benefits
- Paid time off
- Sick time off
- Short-term disability coverage
- Long-term disability coverage
- Life insurance
- 401(k) contribution

