Site Reliability Engineer


over 2 years old

This job is no longer active

GitHub is seeking software engineers to join its SRE team. As a valued member of our close-knit team, you will bring your passion for building fault-tolerant systems and reliable software to help us steward reliability as a feature throughout the organization. Your work will help us scale the world's largest platform for developers.

Our charter is broad, but our focus is to improve the availability, resilience, and sustainability of GitHub's products. We do this by building a portfolio of products, shaping processes that improve organizational resilience, and partnering with product teams.

Our SRE team is fully remote and our day-to-day working environment is based on asynchronous communication, trust, and respect. Through your strong written communication and software skills, you will develop meaningful working relationships with coworkers from around the globe.

The SRE role at GitHub is an opportunity to blend your system design, empathy, and software engineering skills on an ever-changing set of novel reliability challenges. Join us on this journey and have a meaningful impact on how the world builds software.

What you’ll be doing:

  • Develop and maintain a suite of reliability products which influences and improves the global reliability of our products.
  • Improve and influence key processes across the company, such as incident response, durable ownership, and on-call rotations.
  • Actively participate in our weekly post-incident reviews, and help foster an environment of blamelessness, psychological safety, and organizational learning.
  • Work closely with product teams to boost their reliability and train their first responders.
  • Steward reliability as a feature across the organization through concepts such as SLOs and service maturity.

Your experience:

  • Experience with Ruby or Go. Our key products are written in these languages, so experience or a willingness to learn is useful.
  • Familiarity with modern SRE concepts like SLOs and SLIs, incident response, monitoring and alerting, eliminating toil, and post-incident reviews.
  • An interest in distributed systems, especially those with high availability requirements.
  • An ability to troubleshoot and debug issues, especially those that relate to emerging problems on unfamiliar architectures.
  • Experience with being on-call and willingness to represent our team in incidents.
  • Experience with MySQL and with data querying and serialization technologies (such as GraphQL and Protobufs).
  • Great communication and problem-solving skills, coupled with a willingness to collaborate and work with empathy.

Please feel free to apply even if your skills do not match all of the requirements listed above. More than anything, we’re looking for engineers with passion, empathy and a willingness to learn who can help bring diverse perspectives to our team.

The ability to meet GitHub, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Cloud Screen of Microsoft, GitHub’s parent company, upon hire/transfer and every two years thereafter.

Who We Are:

GitHub is the developer company. We make it easier for developers to be developers: to work together, to solve challenging problems, and to create the world’s most important technologies. We foster a collaborative community that can come together—as individuals and in teams—to create the future of software and make a difference in the world.

Leadership Principles:

Customer Obsessed - Trust by Default - Ship to Learn - Own the Outcome - Growth Mindset - Global Product, Global Team - Anything is Possible - Practice Kindness

Why You Should Join:

At GitHub, we constantly strive to create an environment that allows our employees (Hubbers) to do the best work of their lives. We've designed one of the coolest workspaces in San Francisco (HQ), where many Hubbers work, snack, and create daily. The rest of our Hubbers work remotely around the globe. Check out an updated list of where we can hire here: https://github.com/about/careers/remote

We are also committed to keeping Hubbers healthy, motivated, focused, and creative. We've designed our top-notch benefits program with these goals in mind. In a nutshell, we've built a place where we truly love working, and we think you will too.

GitHub is made up of people from a wide variety of backgrounds and lifestyles. We embrace diversity and invite applications from people of all walks of life. We don't discriminate against employees or applicants based on gender identity or expression, sexual orientation, race, religion, age, national origin, citizenship, disability, pregnancy status, veteran status, or any other differences. Also, if you have a disability, please let us know if there's any way we can make the interview process better for you; we're happy to accommodate!

Please note that benefits vary by country. If you have any questions, please don't hesitate to ask your Talent Partner.
