Description
About the Team
The Cloud Platform Engineering team builds and operates the highly available, active-active cloud infrastructure that powers Salesforce at scale. We treat our internal platform as a product — obsessing over developer velocity, automation-first design, and a strict "No Ticket-Ops" philosophy.
We are a lean team at the leading edge of platform engineering — AI tools in every workflow, Slack as our enterprise WorkOS, and agents wired into our GitOps pipelines. We move fast, we automate everything, and we expect you to do the same.
The Shared Team DNA
We are "T-shaped" engineers with distinct focus areas who learn from one another. Every member shares a passion for:
Platform-as-a-Product: Internal engineers are our customers — their DX and velocity come first.
Eradicating Toil: Recurring issue? Write the automation. Ship code, don't file tickets.
Shift-Left Security: Compliance and guardrails baked into the code by default — never bolted on.
SRE Mindset: Engineer for failure. Self-healing systems. 99.999% availability.
Deep Observability: Rich telemetry, distributed tracing, automated alerting — proactive, not reactive.
AI-Native & Agentic: AI tools in every workflow, agents in every pipeline. Non-negotiable.
About the Role
You are the engineer who builds the cloud infrastructure engine. Partnering across platform teams, you design and deliver the infrastructure vending machines and reusable IaC modules that empower internal product teams to provision fully governed, multi-account AWS environments in 15 minutes, not two weeks. You own your domain like a product — understanding customer pain, reducing friction, and shipping solutions.
Your Impact – Responsibilities
AWS Infrastructure Vending Machine: Own AFT as Terraform Enterprise SME; automate multi-region, multi-account provisioning via AWS Organizations and Control Tower.
Enterprise Golden Modules: Author, test, and version-control a library of secure, pre-approved Terraform modules; translate compliance requirements into Sentinel/SCP policies.
CI/CD & Guardrail Integration: Embed automated security scanning, Policy-as-Code enforcement, and cost guardrails directly into developer PRs — leveraging AI to intelligently surface risk, suggest remediations, and evolve the ruleset continuously.
Resilience & Active-Active Delivery: Implement automated failover across AZs and regions using Cloud WAN, ARC, and FIS; lead Game Days simulating catastrophic failure.
Agentic ChatOps & Slack Integration: Wire provisioning pipelines and AI agents into Slack for real-time intelligence, ChatOps control, and zero-touch operations.
AI-Augmented IaC: Use AI coding assistants (Claude Code, Codex, Cursor) as primary tools to author, review, and validate infrastructure code at speed.
Observability Platform: Build centralized logging, metrics, and tracing (Prometheus, Grafana, ELK, Datadog); define and enforce SLOs/SLIs.
SRE Advisory: Advise internal teams on observability best practices, alerting strategies, and resilience architecture — trusted advisor, not CloudOps.
Minimum Qualifications
7+ years in Software Engineering, SRE, or Platform Engineering in large-scale AWS environments.
Expert-level Terraform: state at scale, module decoupling, continuous IaC delivery.
Strong Python or Go for custom tooling and AWS SDK (Boto3/Go SDK) integrations.
Deep AWS core services: IAM, VPC, Cloud WAN, EKS/ECS, Lambda, DynamoDB — multi-region active-active DR.
GitOps-driven deployments with automated guardrails baked into enterprise pipelines.
Active practitioner of AI-assisted development (Claude Code, Codex, Cursor, Gemini or equivalent) as daily workflow.
Preferred Qualifications
AWS certifications (Solutions Architect Professional, DevOps Engineer Professional, or Security Specialty).
Experience with AWS Control Tower, Account Factory for Terraform (AFT), or custom account vending.
Hands-on chaos engineering: AWS FIS, Gremlin, or equivalent Game Day experience.
Experience building agentic pipelines or AI-integrated developer tooling.
Familiarity with Slack platform development, Bolt SDK, or ChatOps automation.
Background in FinOps, Policy-as-Code (Sentinel, OPA), or secrets management (Vault).
What We Value
Bias for Action: Recurring issue? You write the automation. You ship code, you don't file tickets.
Extreme Ownership: The platform is mission-critical. You own your domain end-to-end, no hand-offs.
Risk Mitigation: Natural skepticism toward complexity; you find single points of failure before they find you.
Documentation as Code: Runbooks, ADRs, and processes live in the repo — always current, never stale.
AI Amplifier: You wire tools together to eliminate toil permanently and bring strong opinions on how AI accelerates platform engineering.
Agentic Mindset: Slack is your command center. You build for it, automate through it, and live in the agentic future of work — today.
Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.
In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com.
