Under our mission, “Money Forward. Move your life forward”, Money Forward offers a range of services, including the automatic household accounting and asset management service “Money Forward ME” and the business-oriented cloud service “Money Forward Cloud,” which are used by many users. For these services, utilizing various user data collected through a technology called “Account Aggregation” is essential.
The Account Aggregation Team develops systems to collect user data. The collected data becomes more valuable information through our products and is returned to the users.
However, we believe that the collected user data could have more diverse use cases for each user. The data collected from various sources is also connected to the users’ lives. By enabling users to use their data more conveniently, we aim to move their lives forward. With this vision, we are now developing a new product. We are looking for an infrastructure engineer who will work with us to optimize the development and operation of this product.
Technical Stack
- Languages: Kotlin, TypeScript, React
- Infrastructure: AWS, Kubernetes, Terraform
- Database: MySQL
- Middleware: Docker, Nginx, Consul, Redis
- Platform: AWS, On-premises
Tools
- Repository Management: GitHub
- Monitoring: DataDog, Rollbar
- Communication: Slack, Zoom
- Ticket Management: JIRA
Responsibilities
- Designing, building, and operating cloud (AWS) and on-premises infrastructure.
- Establishment of metrics, monitoring, and alerts using tools like Datadog, as well as incident response (including troubleshooting, recovery, incident management, and post-mortems).
- Performance evaluation of applications and infrastructure.
- Performance optimization and the creation of systems to support it.
- Development of systems for incident response, detection, and prevention.
- Enhancement of service reliability, including the definition and management of SLIs/SLOs for continuous performance improvement.
- Security operations and compliance management for the entire infrastructure.
- Designing, building, and operating platforms to maximize the productivity of development teams(CI/CD, development environments, and testing environments).
- Conducting availability and reliability reviews from the design phase onward.
- Addressing financial industry-specific regulatory requirements.
Requirements
- 3+ years of practical experience in design and operation in SRE, DevOps, or infrastructure domains
- Basic knowledge and work experience with Linux, Network, Security, etc.
- Experience in using/designing/operating AWS
- Experience with container orchestration systems like Kubernetes or ECS
- Experience in building and utilizing monitoring environments with tools for monitoring and observability.
- Experience with Infrastructure as Code (IaC) (tools like Terraform, etc.)
- Development experience with peer reviews using Git, such as Pull Requests
Nice to haves
While not specifically required, tell us if you have any of the following.
- Strategic planning skills (Ability to understand company and business challenges, define a clear direction for your organization, and present a rational path to stakeholders.)
- Experience in building and operating Kubernetes in a multi-tenant environment
- Experience in handling failures in web services
- Experience as an SRE in a web service company
- Experience in operating large-scale services on AWS
- Experience in building CI/CD pipelines
- Experience in implementing and operating monitoring for web services
- Experience in capacity planning and tuning for web services
- Experience in operating MySQL (experience with version upgrades, knowledge of replication, etc.)
- Experience in AI development or development using AI tools
- The ability to grasp the gist of technical discussions in Japanese is a plus. Speaking ability is not required.
Compensation
¥6,408,000 ~ ¥11,000,000 annually.