Site Reliability Engineer

Micoworks Minato-ku, Tokyo
  • 💴 ¥6M ~ ¥9M annually
  • 🏡 Partially remote
  • 🧪 3+ years experience required
  • 💬 Business Japanese
  • 🗾 Japan residents only
  • 🧳 No relocation support

About Micoworks

We offer multiple products that deliver conversational experiences, including MicoCloud, a platform that is used by over 1,000 clients to reach over 29 million people.

Key benefits

  • A growing company in a growing market
  • Open and inclusive communication
  • Team over self

About the position

You will play a critical role in ensuring the reliability, availability, performance, and scalability of our infrastructure and services. Working closely with development teams, you will help achieve both system stability and a rapid release cycle.

Our company has established a global development structure that accelerates collaboration among engineers based in Japan, India, and the Philippines. Within this environment, bilingual (Japanese and English) SREs play an essential role as a bridge between teams.

You will be assigned to the SRE / Infrastructure team within the Innovation & Engineering Division. The SRE / Infrastructure team currently consists of 8 members and handles SRE/Infrastructure tasks across multiple products, primarily focusing on “MicoCloud”.

Tech Stack

  • Web backend: TypeScript (Nest.js), React
  • Web front end: TypeScript (Next.js), Chakra UI
  • Databases: TiDB, Aurora MySQL, DynamoDB, MemoryDB for Redis, Snowflake
  • Infrastructure: AWS (Cognito, EC2, ECS, Route 53, Lambda, Kinesis Data Streams, Kinesis Firehose, SQS, SES, ElastiCache, RDS, CloudWatch, IAM, Audit, API Gateway, CodeDeploy), IaC (Terraform, AWS CDK)
  • Middleware: Nginx, Supervisor
  • Monitoring: New Relic, Sentry, AWS (CloudWatch)
  • Data analysis: BigQuery, Google Data Studio, Google Analytics, Metabase, Trocco
  • Environment construction: Docker
  • CI: GitHub Actions, Amplify Hosting
  • CDN: CloudFront
  • Source code management: GitHub
  • Communication: Google Meet, Slack, Notion, Redmine, Jira, ClickUp

Responsibilities

  • Design, build, and operate cloud infrastructure.
  • Plan and drive cost optimization and efficiency improvements for cloud resources.
  • Improve operational efficiency through the development and implementation of automation tools.
  • Define and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
  • Monitor systems, configure alerts, and manage and continuously improve incident response.
  • Manage incident resolution and conduct post-mortem analyses.
  • Perform capacity planning and performance tuning.
  • Develop and enhance CI/CD pipelines.
  • Manage identity and access for development tools.

Requirements

  • Experience designing and operating cloud infrastructure (AWS, Google Cloud, Azure, etc.).
  • Practical experience in Linux/Unix system administration.
  • Experience with Infrastructure as Code (Terraform, Ansible, etc.).
  • Hands-on experience with container technologies (Docker, Kubernetes).
  • Experience building and operating monitoring services.
  • Experience building and operating CI/CD pipelines.
  • Basic knowledge of networking.
  • Business-level communication skills in Japanese.

Nice to haves

While not specifically required, tell us if you have any of the following.

  • Experience operating microservice architectures.
  • Experience analyzing cloud resource usage and implementing cost-efficiency improvements.
  • Programming skills (Python, TypeScript, Shell scripting, etc.).
  • Experience with database management.
  • Knowledge of security best practices.

Compensation

6 to 9 million JPY annually.

