Site Reliability Engineer (Sydney)


almost 4 years old

This job is no longer active

At Okta, our motto is "Always On", and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. We've created an integrated system that securely connects any person, via any device, to the technologies they need to do their most significant work.

Job Overview 

If you like to be challenged and have a passion for solving problems at scale with automation, testing and tuning, then we would love to hear from you. The ideal candidate exemplifies the ethic "If you have to do something more than once, automate it," and can rapidly self-educate on new concepts and tools.

Responsibilities:

  • Design, build, and monitor Okta's production infrastructure
  • Respond to production incidents and determine preventive solutions
  • Troubleshoot complex reliability and performance issues
  • Automate manual processes, evolve our monitoring tools, and develop technical documentation
  • Support a highly available online environment as part of an on-call rotation once per quarter

Qualifications & Requirements

  • Computer Science degree (a plus) or relevant experience
  • Foundation in Linux systems administration, networking concepts and IP protocols
  • DevOps: experience automating and running large-scale production Java/Tomcat services in AWS (EC2, ECS, KMS, Kinesis, RDS) or other cloud providers
  • Experience writing infrastructure as code using tools such as Chef, Ansible and Terraform
  • Able to code to a good standard in any programming language, but especially Bash, Ruby, Python and Go, working in a source-controlled environment (Git)
  • Solid understanding of CI/CD principles & Agile methodologies
  • Good working knowledge of MySQL servers at scale, including configuration and management of replicas and backups (MySQL or related forks)
  • Experience administering NoSQL cluster data stores such as Redis or Elasticsearch
  • Experience using and supporting log and telemetry aggregation services such as Splunk, Zabbix and Wavefront
  • Experience running container technology in production

Our Culture 

Okta is an active, vibrant place that rewards creativity and unconventional thinking. We know that forging new connections between people and technology is no small feat, so we stick together. We work hard and challenge each other. We offer excellent benefits, competitive compensation, career growth opportunities, flexible time-off, catered lunches / free snacks, and much more!

Okta is rethinking the traditional work environment, providing our employees with the flexibility to be their most creative and successful versions of themselves, no matter where they are located. We enable a flexible approach to work, meaning that, for roles where it makes sense, you can work from the office or from home, regardless of where you live. Okta invests in the best technologies and provides flexible benefits and collaborative work environments and experiences, empowering employees to work productively in the setting that best suits their needs. Find your place at Okta: https://www.okta.com/company/careers/

Okta is an Equal Opportunity Employer.
