Femploy

New York City, NY • Posted 1 week ago • $205,000 - $225,000 per year

Hybrid Full Time Not specified Level general

Location: New York City (Hybrid)
Salary: $205,000 - $225,000 annual + Equity
Position Type: Full-time

Femploy by Infinite Code is recruiting for two Senior Site Reliability Engineers (SRE) to build the reliability foundations for a high-impact, fast-scaling platform in New York City (Hybrid). This is a premier "build-from-scratch" opportunity for an infrastructure specialist to define the SRE culture of a rapidly growing firm. We are looking for AWS experts with 5+ years of experience who can transition a platform from thousands to millions of users by implementing sophisticated Observability, Terraform-based IaC, and blameless incident cultures.

What You’ll Do

Reliability & Incident Management
• Lead incident response and establish sustainable on-call practices
• Create runbooks and drive blameless postmortems
• Reduce MTTR through systematic improvements

Observability & Monitoring
• Build and maintain self-service observability systems
• Implement monitoring solutions that provide actionable insights
• Enable faster debugging and performance optimization

Infrastructure & Scalability
• Design and manage infrastructure-as-code (Terraform, CloudFormation)
• Architect scalable, secure AWS environments
• Improve reliability of databases, async workflows, and data pipelines

CI/CD & Deployment
• Partner with DevX to build robust CI/CD pipelines
• Implement advanced deployment strategies (blue/green, canary)
• Enable fast, safe, and reliable releases

Cross-Team Collaboration
• Work closely with engineering teams to embed reliability early in design
• Advocate for SRE best practices across the organization

Core Requirements
• 5+ years in SRE/DevOps OR 7+ years in Software Engineering (infrastructure-focused)
• Strong experience leading incident response & root cause analysis
• Expertise in designing high-availability systems
• Deep knowledge of AWS and infrastructure-as-code (Terraform preferred)
• Hands-on experience with CI/CD pipelines and automation

Key Skills
• Experience with tools like Datadog, Prometheus, ELK
• Ability to design monitoring systems that drive actionable insights

Nice to Have
• Strong communication and documentation skills
• Experience working in fast-scaling, high-growth environments
• Background in high-performance engineering cultures
• Evidence of initiative (side projects, startups, rapid career growth)

Why Join Us
• Opportunity to build SRE practices from the ground up
• Work on real scaling challenges (infra, data, reliability)
• High ownership and impact on system architecture
• Fast-paced, engineering-driven environment

How to Apply
Send your CV and a short summary of your experience to:

Only shortlisted candidates who meet the requirements and are available immediately will be contacted.


Senior SRE, Software Engineering (2)