What you will be doing
Design, build, and manage Kubernetes-based infrastructure, ensuring scalability, performance, and security.
Automate infrastructure provisioning and configuration using Terraform, Ansible, and Helm Charts.
Optimize CI/CD pipelines with GitHub Actions and ArgoCD to streamline software delivery.
Manage AWS cloud environments, ensuring cost efficiency, security, and performance optimization.
Develop and maintain scripts and automation tools using Python and Bash for operational efficiency.
Monitor, troubleshoot, and improve system reliability, reducing downtime and enhancing incident response.
Implement best practices in Infrastructure as Code (IaC), security, and DevOps methodologies.
Collaborate closely with development teams to support microservices and distributed applications.
Deliver well-constructed, explanatory technical documentation for architectures that we develop, and plan service integration, deployment automation and configuration management to business requirements within the infrastructure and Hadoop ecosystem.
Observe and provide feedback on the current state of the infrastructure, identify opportunities to improve resiliency, reduce the occurrence of incidents and automate repetitive administrative and operational tasks.
Be versed in cloud architecture, service integrations, and operational visibility on common cloud (AWS and Azure) platforms. Understanding ecosystem deployment options and how to automate them via API calls is a plus.
What we are looking for
3+ years of experience in a DevOps, SRE, or Infrastructure role.
Ability to debug and optimize code and automate tasks.
Deep understanding of DevOps principles, system monitoring, logging, and security best practices.
Strong expertise in Kubernetes, including deployment, monitoring, and troubleshooting.
Proficiency in Infrastructure as Code (IaC) tools like Terraform and Ansible.
Cloud-based solutions experiences such as Amazon AWS, Google Cloud, or Azure
Hands-on experience with Helm Charts for managing Kubernetes applications.
Scripting and automation with Python and Bash.
Experience with CI/CD tools, including GitHub Actions and ArgoCD.
Ability to work under pressure, multitask effectively, and handle production incidents.
Strong problem-solving and troubleshooting skills.
Understands the concepts of public cloud Infrastructure
Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
Apply via :
boards.eu.greenhouse.io