← All roles
Alchemy logoAlchemyManufacturing and Robotics

Cloud Infrastructure Engineer

KubernetesTerraformSF · Mid · Seed

About the Role

As an engineer in the Infrastructure department at Alchemy, you will design, deploy, and continuously improve the infrastructure powering our blockchain developer platform — serving 100+ chains, billions of daily requests, and over $150B in annual transactions.

The Infrastructure team provides the infrastructure, tooling, and expertise needed to allow Alchemy engineers to ship, scale, and operate high-quality products in a fast, safe, and cost-efficient manner.

What You'll Do

  • Architect and operate scalable, self-healing infrastructure leveraging Kubernetes, Terraform, and cloud-native tools across multi-region deployments.

  • Drive AI enablement across engineering — ensuring repos, tooling, and workflows are optimized for agentic development with tools like Claude Code, Cursor, and Codex.

  • Build AI-powered infrastructure tooling and automation (e.g., automated K8s upgrades, IaC plan analysis, cost optimization advisors, MCP servers, n8n workflows).

  • Build and maintain internal developer platform (IDP) capabilities for self-service deployments, observability, and reliability.

  • Develop observability frameworks using Prometheus and Grafana for metrics, dashboards, and alerting.

  • Lead incident management with blameless post-mortems; define and enforce SLIs, SLOs, and error budgets across services.

  • Design and manage multi-cloud, multi-region network architecture — VPC design, IPAM, DNS (Cloudflare), cross-cloud connectivity, security groups, and edge-proxy/istio gateway configuration.

  • Collaborate with security teams to embed compliance into infrastructure, including IaC scanning and runtime protection.

  • Provide technical leadership and mentorship to elevate the team's operational capabilities.

What We're Looking For

  • 5+ years as an Infrastructure Engineer focused on reliability (SRE, Production Engineer, Platform Engineer).

  • Experience driving company-wide reliability efforts, including SLO frameworks and error budget policies.

  • Strong proficiency with observability stacks: OpenTelemetry, Prometheus/Grafana.

  • Deep experience with cloud infrastructure (AWS/GCP), Kubernetes, and multi-region architectures.

  • Skilled with Terraform, Helm, and GitOps workflows (e.g., ArgoCD) with an automation-first mindset.

  • Experience leveraging agentic development tools (Claude Code, Cursor, Codex) and workflow automation (n8n) to accelerate IaC and build internal tooling is a strong plus.

  • Solid networking fundamentals — VPC design, DNS, IPAM, security groups, cross-cloud connectivity, and service mesh (e.g., Istio) experience is a plus.

  • Strong cross-functional communicator across SRE, security, and product engineering.

  • Blockchain infrastructure, distributed systems, or high-throughput RPC experience — not required but a plus.

Benefits and Perks

🩺 Medical, Dental, & Vision

💪 Gym Reimbursement

🖥️ Home Office Build-out Budget

🥙 In-Office Group Meals

🧘‍♂️ Wellbeing & Mental Health Perks

📚 Learning & Development Stipend

🎉 Company Sponsored Conferences & Events

💸 HSA and FSA Plans

🧬 Fertility Benefits

More on the Role

Alchemy is committed to offering competitive compensation, including base salary as well as equity. Additionally, Alchemy offers comprehensive medical, dental, and vision coverage, as well as other benefits such as 401k and unlimited flexible time off.

AI

Check your CV against this role

Drop your CV. You get a 0-100 fit score against the actual job description, plus the read a senior engineering lead would write. Private to you.

Score this once, or every future role

Start the candidate journey and every new role on the board gets scored against you.

Five minutes. Tell us what you’re after, drop your CV once, pick how we should reach out. You get a candid read back and you only hear from us when a role actually fits.

More at Alchemy