Senior Platform Software Engineer
About the Role
Our team is growing and we're hiring a Senior Software Engineer to join our engineering team and enable our next phase of growth. Canary's engineering team is fully remote!
This role focuses on building and scaling the cloud infrastructure that powers our platform. You will own the strategy for our AWS infrastructure, automation, and reliability practices while collaborating closely with engineering teams across the organization to keep our platform both secure and scalable.
Responsibilities
- Architect and manage AWS infrastructure, balancing performance, cost, and reliability
- Build and maintain infrastructure-as-code using Terraform to ensure repeatable, version-controlled deployments
- Design and operate CI/CD and GitOps pipelines on Kubernetes, including Helm-based deployments managed through ArgoCD, that enable teams to ship safely and frequently
- Manage service-to-service traffic, observability, and security using a service mesh such as Istio
- Lead capacity planning, performance tuning, and cost-optimization efforts across the AWS footprint
- Implement observability, alerting, and incident-response practices to meet reliability and uptime targets
- Drive security standards into infrastructure, including IAM, network segmentation, and secrets management
- Mentor engineers, conduct design and code reviews, and document architecture decisions and runbooks
Qualifications
- 5+ years of professional software engineering experience, including substantial hands-on time with AWS infrastructure
- Deep expertise with infrastructure-as-code using Terraform
- Strong working knowledge of Kubernetes and container runtimes (e.g., containerd) in production environments
- Experience packaging and deploying Kubernetes applications with Helm charts
- Experience with GitOps continuous delivery using ArgoCD (or similar tools like Flux)
- Familiarity with service mesh technologies, particularly Istio, for traffic management and observability
- Proficiency in at least one scripting or programming language (Python, Go, or Bash) for automation and tooling
- Solid understanding of networking, Linux systems, distributed systems, and cloud security principles
- Experience designing for high availability, disaster recovery, and multi-region or multi-account architectures
- Strong communication skills and a track record of mentoring engineers and influencing technical direction
Check your CV against this role
Drop your CV. You get a 0-100 fit score against the actual job description, plus the read a senior engineering lead would write. Private to you.
Score this once, or every future role
Start the candidate journey and every new role on the board gets scored against you.
Five minutes. Tell us what you’re after, drop your CV once, pick how we should reach out. You get a candid read back and you only hear from us when a role actually fits.