Lead Site Reliability Engineer
Riverbed Technology
Lead Site Reliability Engineer
- Req No.
- 2025-7831
- Category
- Engineering
Riverbed. Empower the Experience
Riverbed, the leader in AI observability, helps organizations optimize their user’s experiences by leveraging AI automation for the prevention, identification, and resolution of IT issues. With over 20 years of experience in data collection and AI and machine learning, Riverbed’s open and AI-powered observability platform and solutions optimize digital experiences and greatly improves IT efficiency. Riverbed also offers industry-leading Acceleration solutions that provide fast, agile, secure acceleration of any app, over any network, to users anywhere. Together with our thousands of market-leading customers globally – including 95% of the FORTUNE 100 – we are empowering next-generation digital experiences.
Position
Position: Lead SRE Engineer
Location: Bangalore
Join Riverbed Technology and be part of shaping the future of digital experience management!
At Riverbed Technology, we are on a mission to help the world’s leading enterprises deliver superior digital experiences. Our Digital Experience Management (DEM) solutions provide deep visibility, AI-driven insights, and performance optimization across complex, global infrastructures.
We are expanding our Site Reliability Engineering (SRE) team and looking for an experienced SRE Lead in India to drive reliability, scalability, and operational excellence across our production environments. This is a unique opportunity to join a global company, lead technical initiatives, mentor engineers across Israel, the US, and beyond, and be instrumental in keeping Riverbed's SaaS solutions reliable and trusted by customers worldwide.
What you will do
- Lead incident response and resolution – coordinate investigations during critical production incidents, drive root cause analysis, and ensure rapid resolution.
- Architect and implement reliability solutions – design and deploy infrastructure improvements, automation frameworks, and observability systems to prevent issues proactively.
- Own production stability initiatives - drive strategic projects that improve system resilience, reduce MTTR, and optimize infrastructure performance
- Mentor and guide SRE team members - provide technical leadership, conduct code/design reviews, and develop team capabilities
- Lead post-incident reviews and blameless postmortems - facilitate learning, document findings, and drive continuous improvement in incident response playbooks
- Collaborate with DevOps and Engineering leadership - partner with cross-functional teams to influence architectural decisions and reliability standards
- Establish and track SLIs/SLOs/SLAs - define reliability metrics, implement monitoring strategies, and drive data-driven operational improvements
- Participate in and help coordinate global on-call rotation - ensure continuous coverage and mentor team members on escalation procedures
What makes you an ideal candidate
- 4+ years of hands-on experience with AWS - expert-level knowledge of EC2, ECS, EKS, RDS, S3, VPC, Load Balancing, CloudFormation, and multi-account strategies
- Strong leadership and mentorship experience - proven track record of leading technical initiatives and developing engineering talent
- Expert-level proficiency in Linux systems administration and performance tuning
- Advanced experience with infrastructure-as-code - Terraform and Ansible in production environments at scale
- Deep expertise in container orchestration - Kubernetes (K8S) and ECS, including cluster management, scaling strategies, and troubleshooting
- Strong CI/CD pipeline design and implementation experience (Jenkins, GitLab CI, or similar)
- Advanced knowledge of observability stack - CloudWatch, Prometheus, Grafana, ELK/EFK, Datadog, or equivalent platforms
- Expert networking skills - DNS, load balancing, TLS/SSL, VPNs, service mesh architectures, and complex connectivity troubleshooting
- Automation and scripting proficiency - Python, Bash, or Go for building tools and automation frameworks
- Excellent communication and technical documentation skills - able to clearly articulate complex technical concepts to both technical and non-technical stakeholders
- Experience with DORA metrics and SRE best practices - understanding of error budgets, toil reduction, and reliability engineering principles
Nice to Have
- Background in security and compliance (SOC2, ISO, FedRAMP)
- Contributions to open-source SRE/DevOps projects
- Experience with multi-region, high-availability architectures
- Knowledge of FinOps and cloud cost optimization at scale
- Familiarity with GitOps practices (ArgoCD, Flux)
What we offer
Our employee benefits including flexible workplace policies, employee resource groups, learning and development resources, career progression pathways, and community engagement initiatives are some of the reasons why we have had great success in bringing in new talent. In addition, our global employee wellness programs are crafted to support the physical, emotional, and financial well-being of our employees.
Benefits & Perks vary by Country.
About Riverbed
With a 20-year history of innovation, Riverbed is agile, yet proven, and we are disrupting the market with differentiated solutions that enable our customers to deliver secure, seamless digital experiences and accelerate enterprise performance While we are a ‘customer-first’ company, we are all about our people with a shared commitment to creating a global impact. We bring our best selves to work and pursue excellence, trust, and respect for one another. We welcome and encourage transparency and open communication throughout the company. We strive to be an inclusive, fair, and enjoyable workplace for our people globally and care about their wellbeing. We are committed to our people, partners, and customers while supporting the communities where we work and live. It’s the Power of WE that binds us together.
Riverbed is an equal employment opportunity/Affirmative Action (EEO/AA) employer and provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, gender, sexual orientation, gender identity or expression, national origin, age, physical disability (including HIV and AIDS), mental disability, medical condition, pregnancy or child birth (including breast feeding), sexual orientation, genetics, genetic information, marital status, veteran status or any other basis protected by and in accordance with applicable federal, state and local laws.
Check us out on:
www.riverbed.com
@LifeAtRiverbed
Tags
#LI-WH1
#LI-Hybrid
