As a Site Reliability Engineer (SRE) at Degica, you will play a critical role at the intersection of software engineering and infrastructure operations. This position is ideal for engineers who are passionate about automation, systems design, and building scalable, reliable platforms.
In this role, you won’t be limited to just managing cloud infrastructure—you will take ownership of the platform’s overall health, performance, and developer experience.
Responsibilities
- Cloud Infrastructure Management: Actively participate in improving and maintaining our AWS infrastructure. Architect, implement, and maintain robust and secure infrastructure in a cloud-native environment using Terraform. You’ll ensure high availability, scalability, and resilience of our systems.
- CI/CD and Deployment Automation: Design and improve continuous integration and continuous delivery pipelines that empower development teams to ship software reliably and rapidly.
- Observability & Monitoring: Implement end-to-end observability tooling—including metrics, logging, distributed tracing, and alerting—to provide real-time insight into platform performance and help reduce mean time to detection and resolution.
- Platform Quality & Reliability: Champion best practices for reliability, scalability, and performance across engineering teams.
- Secure the system and adhere to compliance
- Be part of the teams on-call rotation
Requirements
- 2+ years in SRE roles working with the AWS platform.
- 2+ years experience in a software development role
- Hands-on experience with observability tools, preferably Datadog.
- Proficiency in Terraform.
- Proficiency in at least one scripting or programming language (Ruby/Rails, Python, Go, Shell Script, etc.).
- Experience on working with CI/CD tools such as GitHub Actions, Jenkins, Circle CI, etc.
Nice to haves
While not specifically required, tell us if you have any of the following.
- Strong communication skills to work closely with outside companies and various departments inside the organization
- Knowledge of TCP/IP and other networking protocols
- Experience with AWS Direct connect