As a DevOps Engineer in our Tokyo office, you’ll work across teams to ensure reliability, scalability, security, and performance while driving automation and operational excellence. You will play a critical role in building and maintaining our infrastructure, and you will be the first line of defense for incident response, troubleshooting, monitoring, and optimization.
Responsibilities
- Monitoring & Incident Response: Proactively monitor infrastructure and applications, respond to alerts, and perform root cause analysis to resolve issues effectively. Manage, maintain, and improve monitoring and alerting systems such as Prometheus, Grafana, and Graylog.
- Development & Operations Support: Provide hands-on technical support to both development and operations teams.
- Optimizations: Identify and fix performance bottlenecks to enhance system reliability. Implement automation scripts to streamline routine tasks and improve system performance.
- Security: Regularly audit system logs and library versions for potential vulnerabilities and apply necessary patches. Implement and manage encryption protocols and SSL/TLS configurations, and ensure data security and compliance across systems and applications.
- CI/CD Pipelines: Deploy, manage, and improve CI/CD pipelines to ensure smooth and efficient software delivery.
- Cloud Infrastructure: Manage and scale cloud infrastructure (e.g., AWS, GCP) to ensure reliability and scalability.
- Containerized Environments: Support and optimize containerized environments using Docker and Kubernetes, including orchestration workflows.
- Documentation and Knowledge Sharing: Create and maintain runbooks, incident response guides, and knowledge base documentation. Mentor team members to foster continuous improvement and skill development.
Requirements
- Experience: Minimum of 4 years of experience in DevOps, automation, and monitoring.
- Containerization & Orchestration: Hands-on experience with Docker and Kubernetes.
- Monitoring Tools: Familiarity with monitoring tools such as Prometheus or Datadog.
- Startup Mindset: Experience working in a fast-paced startup environment is ideal.
- Soft Skills: Strong problem-solving abilities, attention to detail, and excellent communication skills.
Nice-to-haves
These are not required, but tell us if you have any of the following.
- Knowledge of infrastructure as code (IaC) frameworks such as Terraform or Ansible.
- Experience optimizing database performance and scaling.
- Familiarity with penetration testing, vulnerability management, system hardening, IDS/IPS tools, and WAFs (a strong advantage).
Compensation
¥6,000,000 to ¥9,000,000 annually.