Site Reliability Engineer
Rhapsody

About Rhapsody
Healthcare is innovating, and you can be a part of it. Getting data from one provider to another, or from a provider to a health insurance company, is harder than it should be. Our mission is to change this – to accelerate innovation by easing the data access burden. Imagine developing solutions that accelerate digital transformation. This is what we do at Rhapsody: we provide data exchange and data quality solutions that enable information such as patient visit details, lab results, and billing balances to move seamlessly from one system to another. Whether building an application or using one, every piece of the health ecosystem needs Rhapsody as a foundation.
Most people will never see our products and services during a medical visit (that's how infrastructure works). Our solutions run behind the scenes, and you can think of them as the central nervous system helping to move data to accelerate innovation and improve outcomes. If using your knowledge to help solve this important problem sounds rewarding, apply today at rhapsody.health.
Position Summary
As a Site Reliability Engineer (SRE), you will play a pivotal role in shaping the future of Rhapsody Health’s cloud-native infrastructure and operations. You’ll be part of a high-impact team driving the reliability, scalability, and performance of our internal and customer-facing platforms. With a strong DevOps and AI-first mindset, you’ll lead initiatives that automate, optimize, and secure our cloud environments, enabling rapid innovation and resilient service delivery.
You will collaborate across engineering, product, and compliance teams to embed reliability into every layer of our software lifecycle. Your expertise in cloud architecture, observability, and infrastructure-as-code will help us scale intelligently while maintaining operational excellence. Hands-on development experience in cloud environments is required.
Key Responsibilities
- Ensure the availability, performance, and resilience of Rhapsody Health’s cloud platforms, meeting or exceeding defined SLAs.
- Architect and implement scalable, self-healing infrastructure using modern cloud-native patterns and automation.
- Lead the design and evolution of CI/CD pipelines, infrastructure-as-code, and observability frameworks.
- Champion AI-driven monitoring and anomaly detection to proactively identify and resolve issues.
- Optimize cloud spend through cost-aware architecture and usage analytics.
- Collaborate with engineering teams to embed SRE principles into the software development lifecycle.
- Partner with Compliance to ensure security, privacy, and regulatory alignment across all environments (HIPAA, SOC2, ISO 27001, HITRUST, etc.).
- Act as a technical evangelist for cloud and AI-first practices—internally and externally.
- Support the live environment and be prepared to participate in occasional on-call rotations.
Qualifications
Education:
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- AWS and/or Azure certification preferred.
Experience:
- 5+ years in SRE, DevOps, or Cloud Engineering roles.
- Proven hands-on development experience in cloud environments.
- Proven experience with AWS (preferred) and Azure.
- Deep knowledge of Linux systems administration and software development, and familiarity with Kubernetes, ECS, and EKS.
- Proficiency in infrastructure-as-code tools (e.g., Terraform, CloudFormation, Ansible).
- Experience with observability and security monitoring stacks such as Coralogix, SentinelOne, and Upwind.
- Programming/scripting in Python and Terraform.
- Familiarity with serverless architectures (e.g., Lambda, Fargate) and distributed systems.
- Strong understanding of networking, security, and compliance in cloud environments.
Preferred Skills:
- Experience with AI/ML operations or integrating AI into development and automation workflows.
- Exposure to threat modeling, SIEM tools, and security hardening (e.g., CIS Benchmarks).
- Experience in regulated industries (healthcare, finance, etc.).
- Hands-on experience with Terraform, CloudFormation, and Python development.
- One or more AWS certifications.
Core Competencies
- Strategic thinker with an AI-first, cloud-first, automation-first mindset.
- Strong communicator and cross-functional collaborator.
- Passion for mentorship, continuous learning, and driving cultural change.
- Comfortable navigating ambiguity and complexity in a fast-paced environment.
Rhapsody provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local law.