The right talent can transform your business—and we make that happen. At Collabera, we go beyond staffing to deliver strategic workforce solutions that drive growth, innovation, and agility. With deep industry expertise, a global talent network, and a people-first approach, we connect you with professionals who don’t just fit the role but elevate your business. Partner with us and build a workforce that powers success.
Site Reliability Engineer
Contract: Charlotte, North Carolina, US
Salary Range: 65.00 - 68.00 per hour
Job Code: 368000
End Date: 2026-04-05
Work Arrangement: 3 days in office, 2 days WFH
Client Industry: Banking
Duration: 12-18 month contract
Schedule: Monday to Friday
- Partner with Application Development and Production Support teams to implement reliability and stability measures defined by senior SRE leadership.
- Ensure key services have proper monitoring, instrumentation, alerting, ticketing workflows, and on-call procedures in place.
- Define and maintain a multi-year platform stability roadmap aligned with business and technology strategy.
- Identify critical system dependencies, risks, and mitigation strategies across infrastructure, applications, and services.
- Collaborate with architecture teams to ensure systems follow enterprise architectural patterns that promote reliability and fault tolerance.
- Lead post-incident reviews and root cause analysis to identify improvements and prevent recurring issues.
- Work with infrastructure and application teams to implement monitoring capabilities and reliability tooling.
- Develop and maintain reusable reliability scripts, automation tools, and libraries to support monitoring, instrumentation, and operational efficiency.
- Participate as a subject matter expert in major incident triage and failure scenario modeling.
- Identify reliability gaps, monitoring noise, and performance issues, and implement improvements to reduce manual support and increase system stability.
- Partner with engineering, operations, and product teams to embed reliability principles into the software development lifecycle.
- 8+ years of experience in technology architecture, site reliability engineering, or infrastructure strategy.
- Strong knowledge of distributed systems and microservices architecture.
- Experience with cloud platforms such as AWS, Azure, or GCP.
- Experience implementing observability, monitoring, and reliability engineering practices.
- Proven experience improving system stability and performance in large-scale enterprise environments.
- Strong understanding of high availability, disaster recovery, and performance optimization strategies.
- Experience participating in major incident management and root cause analysis.
- Ability to collaborate with development, infrastructure, and operations teams.
- Strong communication skills with the ability to explain complex technical concepts to non-technical stakeholders.
The salary range above reflects base compensation and may vary based on location, market conditions, experience, and candidate qualifications.
Job Requirement
- SRE
Reach Out to a Recruiter
- Recruiter: Ashwini Pawar
- Email: ashwini.pawar@collabera.com