EVA is in the business of acquiring enterprise software companies and transforming them into AI-native, profitable, scalable businesses. By prioritising Customer Success, we enable companies with strong product-market fit to emerge as market leaders through sustained product innovation and high-quality professional services delivered via our global team. We are a remote-first company with a team spread across the globe.
WHY JOIN EVA
We offer:
- High ownership, flexibility, and impact in a fast-moving environment
- Opportunity to design cloud governance across a growing portfolio of acquired SaaS products
- Growth pathways across engineering, platform, operations, and leadership
- Remote-first flexibility
ROLE OVERVIEW
We are hiring a Chief Agentic Cloud Architect to design, govern, and continuously optimise the AWS cloud estate that underpins EVA’s entire organisation — from internal development tooling to a portfolio of customer-facing SaaS products acquired from multiple companies.
This is not a single-product infrastructure role. You will define the AWS Landing Zone architecture that governs how every environment — development, QA, non-production, internal IT, and customer-facing SaaS — is structured, secured, and operated. You will set the Service Control Policies that enforce compliance and cost boundaries across the organisation, own the observability and APM platform, author the runbooks that make our L1 operations team effective, and introduce AI agents that monitor, detect, and resolve infrastructure issues before they become incidents.
WHAT YOU’LL DO
1. Cloud Foundation & Governance
- Design and own EVA’s AWS Landing Zone — multi-account architecture, OU hierarchy, and account vending standards that bring every acquired product into a consistent, governed baseline
- Author and enforce Service Control Policies (SCPs) across all environment tiers: development, QA, non-production, internal IT, and customer-facing SaaS, balancing the right level of velocity, security, and cost control at each layer
- Own the end-to-end infrastructure architecture across the full product portfolio (compute, storage, networking, and database layers), with high availability, fault tolerance, and disaster recovery built in across all production workloads
- Define Infrastructure-as-Code standards (Terraform or AWS CDK) so all environments are reproducible, auditable, and version-controlled; drive cost optimisation through right-sizing, reserved capacity planning, and agentic usage analysis
2. Security Architecture
- Own network, application, and data security across the entire cloud estate: VPC segmentation, WAF, IAM least-privilege, secrets management, encryption standards, and service-to-service authentication
- Maintain a continuous compliance posture across all accounts through automated policy enforcement, drift detection, and audit logging, with product-level account isolation for all customer-facing SaaS deployments
3. Observability, APM & Operational Runbooks
- Build and own the observability stack on AWS using Prometheus and Grafana: metrics, alerting, SLO/SLA dashboards, and deployment health monitoring across all products and environments
- Author precise, unambiguous runbooks for every recurring operational scenario so the L1 operations team can execute them without interpretation, each with a clear trigger, action sequence, escalation path, and resolution confirmation
4. Agentic Operations & Proactive Reliability
- Deploy AI agents for APM monitoring, deployment health checks, anomaly detection, auto-scaling pattern analysis, and cost anomaly alerts, shifting the operations model from reactive incident response to proactive control
- Own the four reliability pillars across the full cloud estate (availability, security, cost, and performance) as continuously monitored, agent-managed outcomes, and build the feedback loop that returns learnings into the runbook library
WHAT WE’RE LOOKING FOR
- 12+ years in cloud infrastructure, platform engineering, SRE, or DevOps
- AWS Solutions Architect Associate certification is a minimum requirement; Professional is strongly preferred
- Proven experience designing and operating AWS Landing Zones using AWS Organizations, including OU hierarchy design and Service Control Policy authoring
- Deep, hands-on expertise across core AWS services: EC2, EKS, RDS, Lambda, S3, CloudFront, Route 53, CloudWatch, IAM, VPC, Transit Gateway, AWS Control Tower, and more
- Strong command of network security, application security, and data security in a multi-account AWS context
- Experience managing cloud environments across diverse stages (development, QA, staging, and production), with appropriate governance at each level
- Experience building and operating observability stacks using Prometheus, Grafana, or equivalent tooling
- Proven ability to author operational runbooks that non-specialist teams can follow without ambiguity
- Practical exposure to agentic AI, LLM-based automation, or AI-assisted infrastructure operations
NICE TO HAVE
- Additional AWS certifications: DevOps Engineer Professional, Security Specialty, or Advanced Networking Specialty
- Experience with Terraform or AWS CDK for Infrastructure-as-Code at scale
- Background in multi-product SaaS environments, PE-backed technology companies, or post-acquisition platform consolidation
- Familiarity with FinOps principles and cloud cost governance frameworks
- Experience with AWS Control Tower or similar Landing Zone automation tooling
WHY THIS ROLE MATTERS
- You are not managing a single product’s infrastructure.
- You are designing and governing the cloud foundation that every EVA product runs on — across every environment, every acquired company, and every team — and building the intelligent operations layer that keeps it all reliable, secure, and cost-efficient without waiting for an incident to tell you something is wrong.
- This is a career-defining role for cloud architects who want to operate at organisational scale and define what agentic cloud operations looks like in practice.