Description
We are pleased to announce the following Contract Opportunity within Digital IT department in the Technology Division. In keeping with our current business needs, we are looking for a person who meets the criteria indicated below.
Reporting to the Manager Service Availability, the position holder will be responsible for system planning, deployment, monitoring and administration of Linux, Windows, VMware, RHEV, container management, Open stack and storage systems. This includes general system administration tasks, software and hardware installations, system configuration, backups, system monitoring, performance tuning, troubleshooting, skills transfer, internal customer management, configuration management and documentation.
Job Responsibilities
Responsible for ensuring all Microservices, other Container Services and all container Platforms are running optimally.
Responsible for interfacing with application development teams to support applications fit within the container infrastructure with scalability, reliability and security designed and implemented from the onset.
Responsible for building, maintaining and supporting high-performance, fault-tolerant, scalable and distributed container management systems.
Responsible for automation of infrastructure at massive scale to support scaling of digital services.
Monitoring of container management systems performance for potential bottlenecks, identifying possible solutions, and collaborating with the application development teams to implement those fixes.
Working closely with the customer – application teams internal to Safaricom, to drive adoption of the container solutions and to assist the teams in coming up with the best approach for their migrations, including architecture evaluation, setting up orchestration for application deployments. Guiding development teams on best practices to containerize applications.
Proactively building and implementing monitoring services, including end to end monitoring, scripting and automation, modern tooling and maintenance software. Use of AI and Machine learning to perform log analysis and create predictive models that will assist in identifying potential failures.
Providing software-related operations support, including managing level two and level three incident and problem management
Creating prototypes, designing and building modules and solutions in an iterative agile cycle, develop, maintain, and optimize the business outcome
Developing and executing automation scripts and maintenance jobs
Managing incidents as level two and level three support diagnosis and troubleshooting of technical issues and review solutions proposed to resolve ensuring the MTTR is met across all domains.
Qualifications
Degree in computer science/Information Technology or Engineering,
3+ years’ experience in infrastructure support in an environment with mixed Windows, Linux and UNIX server environment
Certification in VMware (VCP), Linux, Kubernetes, Cloud solutions and DevOps tools.
Experience with Configuration Management and Automation E.g. Chef, Ansible, puppet and Terraforms.
Good working knowledge of high availability environments with clustering and distributed storage technologies.
Experience with Container/PaaS orchestration/management platforms such as Kubernetes, OpenShift and others.
Extensive knowledge of enterprise Linux Server Operating Systems such as RHEL/ CentOS, Ubuntu.
Good understanding of System Administration Techniques – e.g. OS Performance, Windows, Virtualization, Linux enterprise, server hardware and storage.
Strong oral, written, interpersonal communication and leadership skills
Programming skills in preferably python
Experience with managing the development lifecycle for agile software development projects.
Good understanding of build tools, CI/CD, Devops and Agile principles
Experience working in a multi tenancy environment.
Experience in documentation
go to method of application »
Use the link(s) below to apply on company website.
Apply via :