Back to Jobs

Site Reliability Engineer/ Chaos Engineer Remote to start

Remote, USA Full-time Posted 2026-06-21

Site Reliability Engineer/ Chaos Engineer – Remote to Start Location Malvern, PA Position type 06 months contract Start date Immediately Rate DOE US Citizens and Green Card, GC-EAD and TN VISA accepted.

Job Description

Candidate will be part of the SRE team and lead technical role to determine Reliability Chaos Engineering needs of mission critical systems and business processes. Candidate will assess high level architecture and design issues relating to platform enterprise software interactions with other systems. Application development infrastructure database and middleware teams to ensure stability and reliability of the system. Chaos Engineering will proactive detect issues within the applications platform network and databases in a controlled way using Chaos tools like Chaos Monkey Gremlin Simian Army. Candidate should have familiarity with Internet protocols such as HTTP DNS TCP and UDP and Linux development environment and well versed with DevOps. Candidate will identify anti patterns optimization and support development of self-healing capabilities.

Responsibilities

Create operational tooling for monitoring self-healing infrastructures and chaos testing. Design and create controlled chaos in production systems. Create Python and Terraform scripts. Work across teams identify and fix issues that affect systems reliability and performance. Guide and design architectural decisions and direct solutions that will enhance our client’s product reliability. Dive into system and latent reliability issues service performance and capacity modeling of distributed systems at scale. Partner with development team to identify anti patterns and optimization strategies create fallback options and help develop self-healing capabilities across the enterprise in a sustainable manner. Apply tot his job Apply To this Job Apply To This Job Apply tot his job Apply To this Job

Similar Jobs

Site Reliability Engineer, Core Streaming (Remote - United States)

Remote, USA Full-time

Staff Site Reliability Engineer — Project Volcano [Remote]

Remote, USA Full-time

Site Reliability Engineer (Remote + Travel)

Remote, USA Full-time

Immediate Hiring: Remote - Site Reliability Engineer/Production

Remote, USA Full-time

Site Reliability Engineer; DevOps; Remote

Remote, USA Full-time

Senior Site Reliability Engineer (Remote USA)

Remote, USA Full-time

Site Reliability Engineer (FULLY REMOTE-Graveyard Shift)

Remote, USA Full-time

Principal Site Reliability Engineer - ARINCDirect (Remote)

Remote, USA Full-time

[Remote] Senior Site Reliability Engineer — Government & Sovereign Cloud

Remote, USA Full-time

Urgently Need Site Reliability Engineer (Remote) in Saint Paul, MN

Remote, USA Full-time

Remote Live Chat Customer Service Representative – Full‑Time, Flexible Scheduling, Sales‑Driven Customer Retention at arenaflex

Remote, USA Full-time

Senior Project Manager

Remote, USA Full-time

Join Today: [Fully Remote] Amazon Data Entry Jobs (URGENT)

Remote, USA Full-time

Sales Operations Analyst

Remote, USA Full-time

Remote Amazon Flex Delivery Specialist – Flexib...

Remote, USA Full-time

Engineering Project Manager - Remote Operation Center Projects

Remote, USA Full-time

Android Developer (Jetpack Compose Specialist) 100% REMOTE

Remote, USA Full-time

Financial Analyst II, Corporate Finance

Remote, USA Full-time

Part-Time Remote Data Entry Specialist for arenaflex: A Dynamic and Innovative Opportunity

Remote, USA Full-time

Community Engagement Manager

Remote, USA Full-time