Femploy
Senior SRE, Software Engineering (2 )
Location: New York City (Hybrid)
Salary: $205,000 $225,000 annual + Equity
Position Type: Full-time
Femploy by Infinite Code is recruiting for two Senior Site Reliability Engineers (SRE) to build the reliability foundations for a high-impact, fast-scaling platform in New York City (Hybrid).
This is a premier "build-from-scratch" opportunity for an infrastructure specialist to define the SRE culture of a rapidly growing firm. We are looking for AWS experts with 5+ years of experience who can transition a platform from thousands to millions of users by implementing sophisticated
Observability, Terraform-based IaC, and blameless incident cultures.
What You’ll Do
Reliability & Incident Management
• Lead incident response and establish sustainable on-call practices
• Create runbooks and drive blameless postmortems
• Reduce MTTR through systematic improvements
Observability & Monitoring
• Build and maintain self-service observability systems
• Implement monitoring solutions that provide actionable insights
• Enable faster debugging and performance optimization
Infrastructure & Scalability
• Design and manage infrastructure-as-code (Terraform, CloudFormation)
• Architect scalable, secure AWS environments
• Improve reliability of databases, async workflows, and data pipelines
CI/CD & Deployment
• Partner with DevX to build robust CI/CD pipelines
• Implement advanced deployment strategies (blue/green, canary)
• Enable fast, safe, and reliable releases
Cross-Team Collaboration
• Work closely with engineering teams to embed reliability early in design
• Advocate for SRE best practices across the organization
Core Requirements
• 5+ years in SRE/DevOps OR 7+ years in Software Engineering (infrastructure-focused)
• Strong experience leading incident response & root cause analysis
• Expertise in designing high-availability systems
• Deep knowledge of AWS and infrastructure-as-code (Terraform preferred)
• Hands-on experience with CI/CD pipelines and automation
Key Skills
• Experience with tools like Datadog, Prometheus, ELK
• Ability to design monitoring systems that drive actionable insights
Nice to Have
• Strong communication and documentation skills
• Experience working in fast-scaling, high-growth environments
• Background in high-performance engineering cultures
• Evidence of initiative (side projects, startups, rapid career growth)
Why Join Us
• Opportunity to build SRE practices from the ground up
• Work on real scaling challenges (infra, data, reliability)
• High ownership and impact on system architecture
• Fast-paced, engineering-driven environment
How to Apply
Send your CV and a short summary of your experience to:
Only shortlisted candidates who meet the requirements and are available immediately will be contacted.