Site Reliability Engineer, GovCloud (US Remote Central)

See more jobs from Okta Inc

almost 4 years old

This job is no longer active

At Okta our motto is "Always On", and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. We've created an integrated system that securely connects any person via any device to the technologies they need to do their most significant work. 

Okta is rethinking the traditional work environment, providing our employees with the flexibility to be their most creative and successful versions of themselves, no matter where they are located.  We enable a flexible approach to work, meaning you can work from the office, or from home, regardless of where you live.  Okta invests in the best technologies and provides flexible benefits and collaborative work environments/experiences, empowering employees to work productively in a setting that best and uniquely suits their needs.  Find your place at Okta https://www.okta.com/company/careers/.

Job Overview 

If you like to be challenged and have a passion for solving problems at scale with automation, testing and tuning, then we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it,” and who can rapidly self-educate on new concepts and tools. 

Responsibilities:

  • Design, build, and monitor Okta's production infrastructure
  • Respond to production incidents and determine a preventive solution
  • Troubleshoot complex reliability and performance issues
  • Automate manual processes, evolve our monitoring tools, and develop technical documentation
  • Support a highly available online environment as part of an on-call rotation once per quarter

Qualifications & Requirements

  • Due to federal data handling requirements, candidate must be a US Citizen
  • Computer Science (plus) or relevant experience
  • Background with Linux systems administration and strong scripting skills in Bash, Ruby, Python, Go, etc.
  • Experience supporting Docker containers and web applications running on Java / Apache / Tomcat in a live production environment
  • Strong expertise with production services in AWS such as EC2, ECS, KMS, Kinesis, CloudWatch
  • Previous experience with automating systems and infrastructure via Ansible, Chef or Terraform
  • Solid understanding of networking concepts and IP protocols
  • Background using and supporting Splunk, Zabbix, or related tools
  • Experience working in a source controlled environment with Relational Databases, such as MySQL
  • Knowledge of NoSQL systems such as Redis, Cassandra is desired

Our Culture 

Okta is an active, vibrant place that rewards creativity and unconventional thinking. We know that forging new connections between people and technology is no small feat, so we stick together. We work hard and challenge each other. We offer excellent benefits, competitive compensation, career growth opportunities, flexible time-off, catered lunches / free snacks, and much more!

  • We believe that work is a never-ending process of learning and iteration.
  • We work on extremely complex problems.
  • Your colleagues will be really smart (and cool to hang out with).
  • We work on products that make millions of people's work lives better.
  • We're funded by the industry's most respected investors.
  • You'll have the opportunity to change technology forever.

Okta is an Equal Opportunity Employer.

#LI-RA1