Performance Engineer - Software Engineering, MTS

Airkit

Software Engineering

Bengaluru, Karnataka, India

Posted on Jun 29, 2026

Description

The Performance & Resilience Engineering team specializes in a deep understanding of service architecture, dependency chains, usage patterns, and failure modes. We drive performance and scale insights through synthetic transactions and validate service resilience through systematic chaos engineering, injecting faults into production-like environments to prove that services survive AZ failures, dependency outages, and cascading degradation. We ensure Salesforce's most critical cloud services meet availability SLOs under real-world failure conditions, not just theoretical ones.

This is a great opportunity for engineers seeking to build and grow their careers in performance engineering and chaos/resilience engineering, developing scaling, optimization, and fault-tolerance insights on complex cloud applications.

    Responsibilities

    As a Performance & Resilience Engineer, you will:

    • Work closely with product owners, developers, and architects to understand product requirements, system dependencies, and failure modes, designing appropriate performance/scalability/resilience simulations.

    • Design and implement scalability, stress, reliability, and longevity simulations using industry-standard tools and custom-developed software.

    • Design and execute chaos experiments (AZ failures, dependency blackholes, latency injection, network partitions) using tools like Steadybit or AWS Fault Injection Service to validate service resilience and expected behavior under partial and complete failure of each dependency.

    • Validate circuit breaking, retry/backoff, graceful degradation, and auto-recovery mechanisms under fault conditions, identifying gaps such as missing circuit breakers or unexpected crash loops.

    • Produce chaos test reports with clear findings, bug filings for discovered weaknesses, and actionable remediation recommendations for service teams.

    • Partner with service owners to define and validate availability targets — proving multi-AZ resilience for single-AZ faults and characterizing blast radius and recovery time for full-AZ outages.

    • Influence architecture by offering solutions for performance, scale, and fault-tolerance challenges.

    Required Skills

    • 4+ years of relevant technical experience in performance engineering, reliability engineering, or chaos/resilience engineering.

    • Strong knowledge of system architectures, operating systems, databases, middle-tier components (Java/J2EE, Kafka), and how to optimally utilize such systems for performant, scalable, and fault-tolerant applications.

    • Experience designing and executing chaos engineering experiments in cloud environments (fault injection, network partitions, dependency failures, AZ-level disruptions).

    • Ability to dig deep into complex code to understand system design with an eye for performance, scalability, and resiliency bottlenecks.

    • Ability to automate operations such as simulation execution, monitoring, and analysis.

    • Working knowledge of load generation/measurement software (JMeter or custom tools) for HTTP, REST, RPC, and similar systems.

    • Working knowledge of operating system design/tuning/tools (e.g., Linux).

    • Strong knowledge of Java, Python, Go, or other programming languages.

    • BS/MS in Computer Science or equivalent.

    Desired Skills

    • Experience with chaos engineering platforms (Steadybit, Gremlin, Litmus, or similar).

    • Working knowledge of AWS managed service failure modes and SLAs (DynamoDB, Kinesis, KMS, STS, EKS, Route53, NLB).

    • Familiarity with multi-AZ and multi-region resilience architectures in AWS.

    Benefits & Perks

    • Comprehensive benefits package including well-being reimbursement, generous parental leave, adoption assistance, fertility benefits, and more!

    • World-class enablement and on-demand training at Trailhead.com

    • Exposure to executive thought leaders and regular 1:1 coaching with leadership

    • Volunteer opportunities and participation in our 1:1:1 model for giving back to the community

    • For more details, visit salesforcebenefits.com
