The Platform Operations Engineer is responsible for maintaining and supporting on-premises infrastructure platforms that underpin mission-critical systems. This role ensures platform reliability and operational stability, and drives infrastructure modernisation and automation initiatives while maintaining robust support for existing agency systems.
The role also supports the agency’s transition towards cloud adoption and hybrid infrastructure, including the migration and modernisation of existing workloads. You will collaborate closely with internal teams and vendors to ensure the stability, performance, and security of essential government IT infrastructure.
Key Responsibilities
- Maintain and support critical infrastructure platforms, including compute, storage, virtualisation, and supporting systems across development, staging, and production environments.
- Implement platform standards, automation, and modern operational practices to improve efficiency and reliability.
- Support infrastructure enhancement and modernisation initiatives in alignment with enterprise architecture standards.
- Manage virtualisation platforms (e.g., VMware, Hyper-V, Nutanix), including capacity monitoring, performance optimisation, and lifecycle management.
- Execute patching and upgrade strategies using automation while minimising service disruption.
- Provide L2/L3 technical support, including incident management, root cause analysis, and problem resolution.
- Maintain backup, disaster recovery (DR), and high-availability (HA) solutions.
- Implement and enforce security controls, including access management, system hardening, and compliance monitoring.
- Collaborate with application, network, and vendor teams to ensure platform stability, scalability, and service performance.
- Develop and maintain documentation, runbooks, and standard operating procedures (SOPs).
- Troubleshoot infrastructure and network-related issues to ensure end-to-end service availability.
- Support cloud migration and hybrid infrastructure initiatives, including workload assessment, migration planning, and post-migration operational support.
Requirements
- 3–7 years of relevant experience in infrastructure or platform operations.
- Strong problem-solving and analytical skills with the ability to work independently.
- Experience in enterprise infrastructure environments.
- Familiarity with vendor and contract management.
- Strong communication skills to engage both technical and non-technical stakeholders.
- Strong teamwork, organisational, and interpersonal skills.
Technical Requirements
- Experience with virtualisation platforms (e.g., VMware vSphere, Hyper-V, Nutanix).
- Experience with storage systems (SAN, NAS) and enterprise backup solutions (e.g., Commvault, Veeam, Cohesity).
- Proficiency in Red Hat Linux and/or Windows Server administration.
- Familiarity with monitoring and observability platforms.
- Solid understanding of networking concepts, including TCP/IP, DNS, DHCP, routing and switching, firewalls, load balancing, and network segmentation.
- Understanding of data centre network fundamentals.
- Experience with high-availability and disaster recovery solutions.
- Experience supporting infrastructure modernisation and migration initiatives.
- Strong documentation skills.
- Familiarity with cloud platforms (e.g., AWS, Azure) and hybrid infrastructure concepts.
Desired Technical Skills
- Experience with automation tools (e.g., Ansible, Puppet, Chef).
- Familiarity with scripting languages (e.g., Python, PowerShell, Bash).
- Experience implementing and maintaining monitoring and observability solutions (e.g., Prometheus, Grafana, ELK, Dynatrace, Splunk).
- Familiarity with containerisation and orchestration platforms (e.g., Docker, Kubernetes).
- Experience with Infrastructure as Code (IaC) for automated provisioning and configuration.
Qualifications
Degree or Diploma in Computer Science, Engineering, Information Technology, or a related field.
Desired Certifications
- VMware Certified Professional (VCP).
- Microsoft Certified: Windows Server.
- Red Hat Certified Engineer (RHCE).
- ITIL 4 Foundation.
- AWS or Azure Associate-level (or equivalent) cloud certifications.
Any personal data you share with us during the application process will be processed strictly in compliance with applicable data protection laws and our Privacy Notice.