Who we are
Ilkari is a privately-held start-up based in Dublin, Ireland. We deliver hyper-private scale innovation and technology to safeguard and secure data, enabling true data sovereignty even as the pace of change accelerates. Our best-in-breed sovereign technology delivers privacy and control over where companies’ data resides, where it flows, and how it’s accessed.
We’re here to rewrite the story of data sovereignty – empowering innovators, pioneers and visionaries to make their mark. We believe the sky is not the limit. We strive for the perfect balance of simplicity and excellence in everything we do, and we’re looking for people who are ready to join our journey and rewrite the story of data sovereignty.
Role overview
We’re looking for a hands-on Infrastructure Engineer to help operate, improve, and scale our OpenStack, Proxmox, Kubernetes, and hybrid cloud platforms.
The role focuses on platform reliability, automation, observability, operational readiness, and secure-by-default infrastructure practices. Working closely with Infrastructure, Engineering, Security, TOC, Service Delivery, and Architecture teams, you’ll help ensure our environments are scalable, supportable, and operationally resilient.
This role combines strong operational ownership with continuous improvement, automation, and infrastructure engineering practices to support both internal platforms and customer-facing services.
What you'll do
• Operate and continuously improve OpenStack, Proxmox, Kubernetes, and supporting infrastructure platforms to ensure reliability, scalability, security, and operational stability across private, shared, and hybrid environments.
• Own day-to-day platform operations including patching, upgrades, maintenance, backup and restore validation, lifecycle management, change execution, and operational readiness activities.
• Strengthen infrastructure automation using Terraform, Ansible, Python, Bash, and GitOps practices to improve consistency, reduce manual operational effort, and support predictable platform changes.
• Improve infrastructure observability through monitoring, logging, alerting, tracing, and dashboarding to support proactive issue detection, operational visibility, and faster incident response.
• Support incident, problem, and change management processes by contributing to investigations, root cause analysis, remediation planning, rollback readiness, and continuous operational improvement.
• Partner with Engineering, Product, Security, TOC, and Architecture teams to deliver repeatable, secure, and operationally supportable platform solutions.
• Support customer onboarding and operational readiness activities, ensuring infrastructure provisioning, monitoring, access, automation, and support requirements are aligned prior to go-live.
• Contribute to high availability, disaster recovery, migration, and platform resilience initiatives, including validation of restore and recovery procedures.
• Create and maintain clear operational documentation, runbooks, standards, and knowledge-sharing materials to improve consistency and supportability across teams.
• Participate in the on-call rota and support production incident response activities as required.
What are we looking for
• 5+ years of hands-on experience in Infrastructure Engineering, Platform Engineering, Cloud Operations, or Systems Engineering roles.
• Strong Linux systems administration experience, including troubleshooting, performance tuning, hardening, and operational support.
• Hands-on experience supporting OpenStack, Proxmox, Kubernetes, VSphere or similar virtualization/container platforms in production environments.
• Experience with infrastructure automation using Terraform, Ansible, Python, Bash, or equivalent tooling.
• Experience implementing or supporting GitOps and infrastructure-as-code practices.
• Practical experience with observability and monitoring platforms such as Datadog, Prometheus, Grafana, Loki, ELK, or equivalent tooling.
• Working knowledge of incident, problem, and change management practices in production environments.
• Good understanding of networking fundamentals including TCP/IP, DNS, routing, load balancing, and firewall concepts.
• Experience supporting production environments with operational accountability, including participation in incident response and on-call support.
• Strong collaboration and communication skills, including documentation and cross-functional working.
Education
• A Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related discipline is desirable. Equivalent hands-on experience, self-directed learning, and relevant vendor certifications will also be considered when supported by strong practical technical capability.
Technical Proficiency
(some or all the following are preferred, but not essential depending on experience and demonstration of ongoing learning)
• Relevant certifications across cloud, Linux, Kubernetes, automation, virtualization, or infrastructure technologies (e.g., AWS, Azure, Red Hat, OpenStack, Kubernetes, Terraform, Ansible, Cisco, or similar) are beneficial
• ITIL Foundation or equivalent operational governance certification desirable.
What's in it for you
• Private life and health insurance for you and your family.
• Four weeks per year to work from anywhere for eligible employees.
• Gym reimbursement.
• Company bus applicable for employees based in Málaga city.
• Learning Pocket for personal development.
• A hybrid working model with flexible hours.
• 3 volunteering paid days each year.
• Generous referral bonus programme.