Responsibilities
- Design highly-automated infrastructure and CI/CD pipelines for our internal PaaS on GCP
- Assess current infrastructure and identify areas for improvement
- Help developers build their infrastructure and CI/CD pipelines
- Provide guidance and support for developers on infrastructure design and CI/CD best practices
- Research and evaluate new Kubernetes features, tools, and best practices
- Implement new functionality for our Kubernetes operators
- Set up monitoring for our infrastructure
- Collaborate with development teams to identify and resolve infrastructure and CI/CD pipeline issues
- Participate in on-call rotation for incident response and management
Requirements
- 4+ years of hands-on production experience on public cloud providers, including 2+ years on GCP
- At least 2 years of experience in software development, including Kubernetes operators
- 2+ years of experience managing production Kubernetes clusters
- Experience treating Kubernetes clusters as cattle (we provision and destroy them on demand in an automated fashion)
- Experience managing production ArgoCD
- Experience with IaC tools such as Terraform
- Experience with deployment tools including GitHub Actions and Helm
- Experience with monitoring tools such as Datadog, Prometheus, Grafana
- Understanding of networking concepts (VPC architecture, protocols, etc.)
- Any experience with on-call paging tools (Opsgenie, PagerDuty, etc.)
Nice-to-haves
While not specifically required, tell us if you have any of the following.
- Writing complex workflows, troubleshooting, and proposing improvements in GitHub Actions and Helm
- Experience in building highly automated, self-service infrastructure
- Deep understanding of GCP services, especially GKE, AI/ML services, Pub/Sub
- Troubleshooting production incidents
- Experience with MongoDB
- Experience with high-traffic architectures
- Experience with large ArgoCD architectures
- Experience with writing templated ArgoCD ApplicationSets
- Experience with IaC in Kubernetes, such as Crossplane and Config Connector
- Experience writing infrastructure unit/integration/e2e tests
Compensation
Up to 9 million JPY annually.