Job Purpose
The DevOps Engineer / Site Reliability Engineer (SRE) will strengthen our team by ensuring the resilience, scalability, efficiency, and security of our systems on Google Cloud Platform. This dual role leads initiatives in automation, observability, and security: designing, implementing, and optimising CI/CD pipelines, and improving system observability, resilience, and adherence to security best practices through proactive monitoring, incident response, and robust infrastructure design.
Roles & Responsibilities
Lead the conception, execution, and management of automated CI/CD pipelines on Google Cloud Platform, leveraging robust services like Cloud Build, Cloud Deploy, and Cloud Run.
Architect and nurture scalable, fault-tolerant infrastructure on Google Cloud Platform, emphasising resilience, observability, and security, drawing on services such as Compute Engine, Kubernetes Engine, and Cloud Storage.
Collaborate closely with development teams to seamlessly integrate automated testing and deployment processes into the software development lifecycle, ensuring swift and efficient application delivery while adhering to security requirements.
Implement and oversee monitoring, logging, and alerting mechanisms on Google Cloud Platform to maintain system health, performance, and security posture, using tools like Cloud Monitoring and Cloud Logging (formerly Stackdriver).
Develop and refine automation scripts and tools to streamline operational workflows, enhance efficiency, minimize manual intervention, and enforce security policies and controls.
Spearhead incident response strategies and conduct thorough post-incident reviews, pinpointing root causes and implementing preventive measures to mitigate the risk of recurrence, with a focus on security incident response.
Proactively identify opportunities to elevate system observability, resilience, performance, and security, championing the adoption of industry-leading practices in SRE and security methodologies.
Automate infrastructure provisioning, configuration management, and deployment using advanced tools like Terraform, Ansible, or Puppet, ensuring consistency, scalability, and adherence to security standards across environments.
Evaluate emerging technologies, tools, and methodologies to optimize automation, observability, reliability, and security on Google Cloud Platform, staying abreast of industry trends and best practices.
Mentor and empower junior team members, imparting knowledge and guidance in DevOps, SRE, and security principles, fostering their professional growth and development.
Required Skills
Proficiency in scripting and automation using languages such as Python or Bash, with a deep understanding of software development principles.
Expertise in containerization and orchestration technologies, particularly Docker and Kubernetes, adeptly managing deployments on Google Kubernetes Engine (GKE) and leveraging Helm for package management.
In-depth knowledge of networking concepts, including TCP/IP, DNS, load balancing, and firewalls, with hands-on experience in Google Cloud VPC networking, network security, and Identity and Access Management (IAM).
Mastery of infrastructure as code tools such as Terraform, Ansible, or Puppet, enabling seamless management of Google Cloud Platform resources, configurations, and security controls.
Familiarity with Google Cloud Platform services such as Cloud Run for deploying and managing containerized applications, Apigee for API management, and Cloud Functions for serverless computing.
Proficient with version control systems such as Git, utilizing branching strategies and CI/CD pipelines for automated testing, deployment workflows, and security scanning.
Experience in incident management, security incident response, and post-incident analysis, employing tools like Prometheus, Grafana, and Security Command Center for monitoring, alerting, and security analytics.
Strong understanding of security best practices, standards, and compliance frameworks (e.g., CIS benchmarks, NIST, GDPR, HIPAA), with hands-on experience implementing security controls and mitigating security risks.
Excellent communication and collaboration skills, with a proven track record of effectively working in cross-functional teams, driving alignment across stakeholders, and promoting a culture of security and collaboration.
Relevant certifications such as Google Cloud Professional DevOps Engineer, Certified Kubernetes Administrator (CKA), Certified Information Systems Security Professional (CISSP), or AWS Certified DevOps Engineer are highly desirable.
Apply via:
docs.google.com