Site Reliability Engineer

The Wormhole Foundation

Our mission is to empower passionate people in the research and development of blockchain interoperability technologies. We support teams building secure, open-source, and decentralized products within the Wormhole ecosystem.

About the Role: Site Reliability Engineer

The Wormhole Foundation is seeking an experienced Site Reliability Engineer (SRE) to improve the reliability, security, and operational excellence of Wormhole’s production infrastructure. The role focuses on uptime, observability, deployment workflows, and incident response across critical blockchain and networking services. The SRE will work closely with engineering, DevOps, and validator partners to ensure Wormhole services maintain at least 99.99% uptime, excluding scheduled maintenance windows.

Responsibilities

  • Act as first responder and incident commander during production incidents
  • Lead incident triage, root cause analysis, and retrospective documentation
  • Build detailed incident timelines and preventative runbooks
  • Respond to incidents involving performance issues, CCQ failures or degraded throughput, observability pipeline outages, and core Wormhole products
  • Deliver remediation recommendations and implement approved fixes
  • Improve reliability and uptime across all Wormhole services
  • Strengthen observability, monitoring, and alerting systems
  • Harden infrastructure for security and operational resiliency
  • Enhance deployment workflows and reduce operational friction
  • Lead incident response, analysis, and continuous improvement
  • Support operational tooling used by engineering, DevOps, and validator partners

Requirements

  • Relevant tertiary qualifications (bachelor's or master's) in computer science or a closely related field, and/or relevant work experience
