You will play a critical role in ensuring the reliability, availability, performance, and scalability of our infrastructure and services. Working closely with development teams, you will help achieve both system stability and a rapid release cycle.
Our company has established a global development structure, accelerating collaboration among engineers based in Japan, India, and the Philippines. Within this environment, bilingual (Japanese and English) SREs serve an essential role, acting as a bridge between teams.
You will be assigned to the SRE / Infrastructure team within the Innovation & Engineering Division. The SRE / Infrastructure team currently consists of 8 members and handles SRE/Infrastructure tasks across multiple products, primarily focusing on “MicoCloud”.
Tech Stack
- Web backend: TypeScript (Nest.js), React
- Web front end: TypeScript (Next.js), Chakra UI
- Databases: TiDB, Aurora MySQL, DynamoDB, MemoryDB for Redis, Snowflake
- Common Infrastructure Infrastructure: AWS (Cognito, EC2, ECS, Route53, Lambda, Kinesis Data Stream, Kinesis Firehose, SQS, SES, Elasticache, RDS, CloudWatch, IAM, Audit, APIGateway CodeDeploy), IaC (Terraform, AWS CDK)
- Middleware: Nginx, Supervisor Monitoring: NewRelic, Sentry, AWS (CloudWatch)
- Data analysis: BigQuery, Google Data Studio, Google Analytics, Metabase, Trocco
- Environment construction: Docker
- CI: GitHub Actions, Amplify Hosting
- CDN: Cloud Front
- Source code management: GitHub
- Communication: Google Meet, Slack, Notion, Redmine, Jira, ClickUp
Responsibilities
- Design, build, and operate cloud infrastructure.
- Plan and drive cost optimization and efficiency improvements for cloud resources.
- Improve operational efficiency through the development and implementation of automation tools.
- Define and measure Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
- Monitor systems, configure alerts, and manage and continuously improve incident response.
- Manage incident resolution and conduct post-mortem analyses.
- Perform capacity planning and performance tuning.
- Develop and enhance CI/CD pipelines.
- Manage identity and access for development tools.
Requirements
- Experience designing and operating cloud infrastructure (AWS, Google Cloud, Azure, etc.).
- Practical experience in Linux/Unix system administration.
- Experience with Infrastructure as Code (Terraform, Ansible, etc.).
- Hands-on experience with container technologies (Docker, Kubernetes).
- Experience building and operating monitoring services.
- Experience building and operating CI/CD pipelines.
- Basic knowledge of networking.
- Business-level communication skills in Japanese
Nice to haves
While not specifically required, tell us if you have any of the following.
- Experience operating microservice architectures.
- Experience analyzing cloud resource usage and implementing cost-efficiency improvements.
- Programming skills (Python, TypeScript, Shell scripting, etc.).
- Experience with database management.
- Knowledge of security best practices.
Compensation
6 to 9 million JPY annually.