What We'll Bring:
Responsible for managing and executing software and platform releases across the UK region. Includes coordinating with the Centralised Release Management team, raising and managing CAB tickets for all release types (major, minor, hotfixes), and ensuring smooth deployment through sanity testing and documentation. The job family also involves frontline production support, maintaining release traceability, tracking progress, and overseeing Kanban workflows. Effective communication with development and QA teams is essential to ensure visibility and alignment throughout the release cycle.
What You'll Bring:
The Lead DevSecOps & Release Engineer - Global Business Platforms (UK Region) is responsible for ensuring the operational integrity, release governance, and stakeholder alignment of the global platforms deployed in the UK region. Acting as a bridge between regional and global teams, this role drives platform reliability, compliance, and continuous improvement through technical leadership and process excellence.
Release & Deployment Management
* Lead the regional execution of all release types (major, minor, emergency) through automated CI/CD pipelines, ensuring timely and high-quality deployments.
* Coordinate with Global Platform testers to oversee post-deployment validation, ensuring issues are identified, documented, and resolved efficiently.
* Support and deputise for the Operations and Technical Release Manager in managing the full CAB ticket lifecycle, maintaining robust deployment traceability and compliance.
* Collaborate with the Operations and Technical Release Manager to lead, manage, and prioritise Kanban workflows, driving effective tracking and continuous improvement of release processes.
* Facilitate cross-functional coordination of development and change activities across the UK region, ensuring alignment with business objectives and minimising disruption.
* Champion continuous improvement by analysing release outcomes, gathering feedback, and implementing process enhancements.
* Communicate release status and risks to stakeholders, ensuring transparency and proactive issue resolution.
Testing & Change Management
* Coordinate regression testing of changes to Global Business Platforms deployed in the UK region in collaboration with Global Platform testers, ensuring all updates impacting UK stakeholders are thoroughly validated and documented.
* Act as a key liaison to the Operations and Technical Release Manager, leading the coordination of future development, maintenance, and change activities between global and UK teams to ensure alignment, minimise risk, and support successful delivery.
* Facilitate effective communication between global and UK teams, ensuring that testing outcomes, risks, and dependencies are clearly documented and addressed.
* Drive continuous improvement in testing and coordination processes by gathering feedback, identifying bottlenecks, and implementing best practices.
Platform Operations & Monitoring
* Provide technical leadership for the operational management of Global Business Platforms deployed in the UK region, ensuring compliance with SLAs, SLOs, RTOs, and other service commitments.
* Oversee incident management, monitoring, and escalation processes to maintain platform stability and minimise service disruptions, including participation in a 24x7 out-of-hours support rota.
* Collaborate with the Operations and Technical Release Manager to produce and analyse reports on platform availability, performance, and capacity, supporting data-driven decision-making and proactive capacity planning.
* Drive continuous improvement in operational processes by identifying risks, implementing best practices, and optimising platform reliability.
Incident & Problem Management
* Provide technical leadership in troubleshooting and resolving regional issues with the Global Business Platforms deployed in the UK region, ensuring rapid restoration of service and minimal business disruption.
* Technically lead critical and high-severity incident response, acting as the primary technical representative on incident bridges and ensuring effective communication and resolution.
* Coordinate and participate in post-incident reviews (post-mortems/PIRs) with global and regional teams, driving the identification and implementation of corrective actions.
* Analyse incident data to identify root causes, predict potential future issues, and implement preventative controls, fostering a culture of continuous improvement and operational resilience.
* Document incident outcomes and share lessons learned to enhance team knowledge and prevent recurrence.
Risk, Compliance & Governance
* Lead the identification, assessment, documentation, and management of risks and issues related to Global Business Platforms deployed in the UK region, ensuring proactive mitigation and platform stability.
* Oversee and support compliance and audit activities for the Global Business Platforms deployed in the UK region, ensuring adherence to regulatory requirements and audit readiness.
* Collaborate with risk, compliance, and audit stakeholders to address findings, implement corrective actions, and continuously improve risk and compliance processes.
* Monitor evolving risks and regulatory changes, adapting controls and processes to maintain ongoing compliance and operational resilience.
Impact You'll Make:
* Extensive experience (5+ years) in technical leadership roles within cloud-native, containerized environments, with a proven track record of delivering complex solutions at scale.
* Deep expertise in Kubernetes, specifically managing and supporting GKE (Google Kubernetes Engine) clusters and containers in production environments.
* Advanced proficiency with Helm, including designing, installing, and managing applications using Helm charts.
* Hands-on experience with Wiz for Cloud Native Application Protection and container security, including policy definition and incident response.
* Strong background in GCP cloud networking, load balancing, and traffic management, with the ability to architect and troubleshoot complex network topologies.
* Good understanding of Cloudflare.
* Comprehensive knowledge of GCP compute services and Google Cloud Storage (GCS) for managing both structured and unstructured data.
* Significant experience with PostgreSQL for database design, optimization, and management in high-availability environments.
* Proven ability to design and support Kafka-based data streaming architectures.
* Experience administering, managing, and maintaining Dataproc clusters.
* Expertise in implementing and managing Identity and Access Management (IAM) solutions using Ping and/or Keycloak.
* In-depth experience with Kong APIM for API lifecycle management, security, and governance.
* Strong monitoring and observability skills, including creating and interpreting dashboard alerts from GCP Native Observability, Prometheus, Grafana, and OpenTelemetry.
* Demonstrated leadership in CI/CD pipeline management, particularly using Harness for automated build and deployment processes.
* Advanced skills in infrastructure automation with Terraform, including integration with Harness IaC pipelines.
* Experience managing secrets and sensitive data using HashiCorp Vault.
* Experience working with Redis for session storage management.
* Solid understanding of application security best practices, including integrating and acting on findings from Checkmarx One or similar tools.
* Ability to translate high-level architectural blueprints into detailed low-level designs.
* Excellent communication, mentoring, and stakeholder management skills, with a passion for developing team capabilities and fostering a culture of technical excellence.
* Ability to collaborate effectively with cross-functional teams to deliver robust, secure, and scalable solutions aligned with business objectives.
TransUnion Job Title
Lead Engineer, Development Ops