Site Reliability Engineer

Sysdig

Sysdig

Software Engineering
Remote
Posted on Sep 18, 2024
In the cloud, every second counts. On the leading edge of security, Sysdig stops attacks in real-time by instantly detecting changes in cloud security risk with runtime insights and open source Falco. We are passionate open source enthusiasts at heart and problem-solvers who are building and delivering powerful solutions to secure cloud-native applications.
We value diverse opinions and open dialogue to spur ideas. We believe in working together to achieve our goals and we pride ourselves on a flexible work culture. We’re an international company that understands how to cultivate an inclusive environment across remote teams.
And we’re a great place to work too – we’ve been named a “Best Place to Work” by Inc.,the San Francisco Business Times and the Silicon Valley Business Journal, and we won six workplace awards from Comparably this year. We have been recognized by Deloitte as one of the 500 fastest-growing organizations for the last four years.
We are looking for driven team members who want to join us on our mission to lead cloud security globally. Does this sound like the right place for you?

What you will do

  • Reporting to the SRE Manager you will build and manage systems across internal and production Cloud environments with a focus on configuration as code and platform automation
  • Implement reliability improvement initiatives, including capacity planning, performance tuning, load testing and infrastructure optimization
  • Measure KPI via Service Level Indicators (SLIs), Service Level Objectives (SLOs) and Service Level Agreement (SLAs) and help to define them
  • Participate in and contribute to improving our incident response. Perform root cause analysis (RCA), troubleshoot and debug issues across our infrastructure and platform services to identify and fix root causes

What you will bring with you

  • Solid SRE, DevOps or Cloud Infrastructure Engineer experience
  • Solid experience in containerization (kubernetes, docker and helm charts) - all of them
  • Solid understanding of Linux systems and networking
  • Software development skills; Go and Python a big plus

What we look for

  • Familiarity with monitoring tools such as Sysdig, Prometheus, Nagios, Icinga, Zabbix
  • Strong tooling and automations development experience
  • Experience in CI/CD tools such as Harness and/or Jenkins
  • Experience diagnosing and troubleshooting complex problems in high-throughput applications and network services

Why work at Sysdig?

  • We're a well funded startup that already has a large enterprise customer base
  • We have an organizational focus on delivering value to customers
  • Our open source tools (https://sysdig.com/opensource/) are widely used and loved by technologists & developers

When you join Sysdig, you can expect:

  • Great compensation package, including equity opportunities
  • Benefits vary based on location
  • An international culture with employees in more than 40 countries
  • Flexible work arrangement
  • Mental well-being support for you and your family and company-wide recharge days
  • Development opportunities
We would love for you to join us! Please reach out even if your experience doesn't perfectly match the job description. We can always explore other options after starting the conversation. Your background and passion will set you apart, especially if your career path is different.
Some of our Hiring Managers are globally distributed, an English version of your CV will be appreciated.
Sysdig values a diverse workplace and encourages women, people of color, LGBTQIA+ individuals, people with disabilities, members of ethnic minorities, foreign-born residents, and veterans to apply. Sysdig is an equal-opportunity employer. Sysdig does not discriminate on the basis of race, color, religion, sex, national origin, age, disability, genetic information, sexual orientation, gender identity, or any other legally protected status.
#LI- PJ1
#LI-Hybrid