Site Reliability Engineer

TableCheck Chuo-ku, Tokyo
  • 💴 No salary range given
  • 🏡 Fully remote (worldwide)
  • 🧪 2+ years experience required
  • 💬 No Japanese required
  • 🌏 Apply from abroad
  • 🧳 Relocate to Japan

About TableCheck

We help diners make restaurant reservations, and we help merchants manage table inventory, enrich dining experiences, and visualize multi-property analytics to gain insights into restaurant performance.

Key benefits

  • Scalable and maintainable
  • Multicultural and full of camaraderie
  • See your work in the wild

About the position

We’re seeking a Site Reliability Engineer. As a member of our SRE team you will own the technology stack and help support our demanding business and developer needs.

We run a robust and fault-tolerant infrastructure built on Amazon Web Services (AWS) with Terraform, Kubernetes, Helm, and an array of tools for CI/CD, logging, and monitoring. We emphasize DevOps best practices such as Agile, Scrum, automation, and customer-centric improvements.

Responsibilities

  • Following SRE principles to maintain a 24/7 production environment running on Kubernetes
  • Implementing DevOps methodologies to improve IT team quality of life
  • Proactive system monitoring and configuration
  • Incident response

Requirements

  • Must have at least 2 years of experience with Amazon Web Services (AWS), with particular focus on EKS, EC2, RDS, Fargate, CloudFront, Lambda, and S3
  • Must have extensive experience using AWS EKS
  • Must have direct software engineering experience following DevOps / SRE practices, with at least 1 year as a technical lead
  • Current ability in at least one of the following languages: Python, Ruby, Elixir, Go, JavaScript, Rust
  • Must understand container and hypervisor fundamentals
  • Configuration management experience (YAML / Bash); experience with Helm and Terraform preferred
  • Experience running production systems at large scale, and an understanding of the kinds of problems that can occur along with likely solutions

Nice to haves

While not specifically required, tell us if you have any of the following.

  • Previous startup experience is highly desired
  • Terraform, Pulumi
  • ArgoCD
  • Prometheus
  • Grafana
  • PostgreSQL
  • MongoDB
  • Kafka
  • Security, PCI-DSS, GDPR, forensics, etc.

Hiring Process

  1. Initial interview

    A one-on-one 30-minute chat over Google Meet to see if we’re the right fit.

  2. Technical interview

    (Virtually) meet the SRE team at TableCheck to evaluate your skills (no whiteboard or materials required).

  3. Take-home project

    We will provide you with a 30-60 minute project, which will evaluate your dev and ops skills.
