Overview
Ibex. is looking for a Site Reliability Engineer, to join our team. This role offers the opportunity to work with cutting-edge technologies, drive automation, and enhance infrastructure resilience. If you are a proactive problem-solver with a passion for optimizing systems, we would love to hear from you!
Responsibilities
CI/CD & Containerization
- Design and maintain CI/CD pipelines for Kubernetes platforms.
- Build and manage Docker images and dependencies.
Infrastructure & Environment Management
- Configure and automate environments across on-prem Kubernetes, AWS EKS, VMware, and cloud platforms.
- Ensure consistency, scalability, and high availability.
Kubernetes & Platform Operations
- Deploy and manage Kubernetes clusters, storage (Open EBS), and logging stacks (ELK).
- Maintain resilient and highly available systems.
Automation & Configuration
- Automate infrastructure using Ansible, Puppet, and scripting.
- Reduce manual effort through APIs, CLI tools, and custom automation.
Application Modernization
- Support migration of monolithic apps to microservices on Kubernetes (on-prem & cloud).
- Monitoring & Observability
- Build and manage dashboards using Grafana.
- Improve system visibility and performance monitoring.
Qualifications
- Strong Linux (Red Hat, Ubuntu) administration skills
- Deep expertise in GitLab (CI/CD pipelines)
- Strong hands-on experience with Kubernetes (on-premises & cloud)
- Proficiency in Docker & container runtimes
- Strong scripting skills (Bash, Python, PowerShell)
- Advanced knowledge of Grafana dashboards & integrations
- Strong understanding of APIs (integration, data handling, DB storage); ability to utilize APIs, CLI tools, and custom scripting to automate repetitive operational tasks, improving efficiency and reducing manual effort
- Good experience with Ansible & Puppet
- Deep expertise in Terraform (IaC)
- Proficiency with Azure CLI & AWS CLI
The candidate should have 5 to 7 years of relevant experience.