Alice: DevOps Engineer
Headquarters: BR URL: http://alice.com.br
Why this isn't "just another job"
At Alice, we don't offer a place to watch from the stands — you'll enter the arena of one of the most daring growth journeys in Latin America in technology and health.
As a Software Engineer, you will build cutting-edge technology with a direct impact on improving people's lives. If you are hungry to generate results, learn quickly, and be part of the most innovative company in Latin America, keep reading.
Attention: At Alice, Engineering operates 100% in the Agentic Development model. If you do not have experience and enthusiasm in acting as an orchestrator of AI agents, this position is not for you.
About Alice
Our mission is to make the world healthier. To get there, we are building something rare: a health experience that people really trust, engage with, and even love. We are a health plan for companies powered by technology — uniting care and health insurance — and we deliver superior health results at a lower cost for more than 90,000 members (and growing).
- Institutional Website: https://alice.com.br/
- Institutional Blog: https://alice.com.br/blog/
- Tech Blog: https://alice.com.br/tech/
About the Position
DevOps Engineer at Alice is the person who goes beyond keeping the infrastructure up! In this position, we expect you to act in several crucial dimensions for our mission to make the world healthier.
Platform and Infrastructure:
- Build and evolve our platform in AWS and Kubernetes, with sustainable and scalable solutions.
- Design and maintain Infrastructure as Code (Terraform, Helm) with versioning, review, and testing.
- Decompose platform changes into tasks that make sense for agents — order, dependencies, parallelism, level of autonomy.
- Collaborate with cross-squad architecture decisions.
- Maintain our ADRs, documentation, rules, and skills updated.
Reliability, Delivery, and Operation:
- Build and evolve our CI/CD pipelines and release strategies (canary, blue/green, rollback) via agents.
- Work spec-first throughout the cycle — plan, code, debug, test, document, review diffs, and iterate.
- Be responsible for the platform lifecycle (deploy, metrics, alarms, SLOs) and for the guardrails that maintain quality (infrastructure tests, linting, policy checks, security scans).
- Use agents also in operation, investigating incidents via MCPs (Datadog, logs, traces, GitHub).
- Participate in on-call, evolving runbooks and automations that reduce noise and MTTR.
- Ensure high performance, high availability, and security of systems.
Technical Leadership:
- Contribute to platform and technology team challenges, sharing ideas, solutions, IaC modules, architectures, skills, subagents, rules, and reusable workflows.
- Evolve the context files of your domain to elevate the quality of what agents produce in infrastructure changes.
- Publish and maintain skills in our internal marketplace.
- Improve the development experience at Alice (platform tooling, ephemeral environments, self-service deploys, observability) and mentor colleagues in the orchestrator mindset.
Product / Business:
- Work with Software Engineers, Designers, and Product Managers to understand the pain points of users and internal teams, creating the best platform solutions.
- Help the team decide where agentic automation fits and where we want human judgment, especially in critical production changes and in flows with care, clinical data, and regulatory decisions.
- Collaborate to strengthen the team culture, actively participating in the rituals and processes of the squad.
About the Engineering Team at Alice
Being part of the Alice Engineering team means having the responsibility and privilege of working to make the world healthier. Our mission is enormous and we are looking for people aligned with our purpose and with high competence and resilience to transform the health system in Brazil. It's a huge challenge, but very rewarding, and that brings countless growth opportunities.
We are a collaborative team that seeks technical excellence, without losing sight of the real objective of each of the lines we write. You will work with the business and product areas, with many initiatives already matured and many others to be started from scratch.
We focus on quality and diligent decisions. At the same time that we seek to evolve every day and reinvent ourselves, we are a team that focuses on consistent, rational, and well-documented decisions.
We are remote. If you prefer to work in the office (like some people on our team), our address is Avenida Rebouças, 3535. Other Alice teams work in a hybrid way (product, design, business, operations).
What makes you a strong candidate
Platform and Infrastructure Fundamentals
- Experience operating infrastructure in AWS (VPC, IAM, EC2, EKS/ECS, RDS, S3, networking, costs).
- Proficiency with Kubernetes in production (deployments, autoscaling, networking, RBAC, observability, troubleshooting).
- Experience with Infrastructure as Code (Terraform, Helm) — modularization, versioning, testing, and change review.
- Experience with CI/CD pipelines (GitHub Actions, Jenkins or equivalent) and release strategies (blue/green, canary, rollback, feature flags).
- Experience with distributed systems / micro-services and their operational pitfalls.
- Strong practice with observability and monitoring (Datadog, Prometheus, logs, traces, definition of SLIs/SLOs and alarms).
- Experience with the best practices of security and data privacy (secrets management, least privilege, hardening, vulnerability scans, compliance).
- Ability to code when necessary (Python, Go, Bash or similar) to build platform tooling and automations.
- Experience as a leader of technical platform projects (migrations, cross-squad standardization, internal tools, architectural decisions).
Agentic development
- Ability to adapt your way of working continuously to extract the maximum benefit from the best AI models / harnesses / frameworks.
- Regular use of agentic tools in agent mode, not just autocomplete (Claude Code, Cursor, Augment Code, Codex or equivalent).
- Spec-driven development throughout the cycle: plans before generating, reviews diffs, uses tests as verification, and applies the same rigor in debugging, documentation, and incident investigation.
- Context engineering: configures and maintains CLAUDE.md, AGENTS.md, .cursorrules and equivalents; knows how to divide tasks into large codebases.
- Failure literacy: knows failure modes (hallucination, context rot, loops, sycophancy) and adjusts the process from them.
- Multiplication: has already created a skill, subagent, rule, prompt template, or workflow that the team has adopted.
- Guardrails and autonomy: decides what to delegate to the agent and what requires human review.
- Differentials: orchestrate multiple instances/worktrees in parallel; operate long-running agents; write evals for skills, prompts, and workflows (measure output quality and detect regression); critically evaluate tools (cost, quality, security, privacy); lead the adoption of agentic development in a team or organization.
Mentality
- Strong sense of commitment and focus on results.
- Passionate about technology and health.
- Genuine enthusiasm for agentic development, with a desire to push the frontier of what is possible today.
Compensation
- We do our best to compensate competitively.
- We always make our best offer.
- We do not negotiate (this is good for you).
To apply: https://weworkremotely.com/remote-jobs/alice-devops-engineer