We're looking for a Site Reliability Engineer to join our LSEG's (London Stock Exchange Group) Real Time SRE team. As part of the SRE team, you will play a central role in supporting our large, globally-distributed market data infrastructure. You'll be responsible for providing support for critical Real Time applications, identifying and automating away development and operational toil, and driving adoption of the latest and greatest tools and platforms.Ideal candidates will be strong individual contributors, possess a mix of operational support and development skills, have experience working with complex distributed systems, have strong written and verbal communication skills, and have experience scaling complex technical solutions. WHAT YOU'LL BE DOING: Working across multiple functions (infrastructure, networks, data quality, and development teams) to ensure service health for LSEG clients Ensure end-to-end quality for application migrations to cloud, from high-quality documentation to observability flows, to operational acceptance testing and beyond Support a diverse set of applications deployed across on-premise data centers, public and private cloud Automating operational and development toil via scripting skills and familiarity with Git Leverage the latest tools and develop processes to accelerate LSEG's response to and recovery from service-impacting incidents Crafting reusable dashboards and alerting pipelines from raw metrics and logs Integrate cloud services, tooling platforms, and process automation together to accelerate Mean Time to Detection and Resolution for incidents Designing, authoring, and maintaining CI/CD pipelines and patterns Define Service Level Objectives and implement monitoring to ensure our systems are available and healthy WHAT YOU'LL BRING: At least 5 years of hands-on industry experience in DevOps or SRE, or relevant equivalent (experience in supporting financial markets, real-time data, or client-facing applications a plus) Hands-on experience supporting cloud applications (AWS/Azure preferred) Fluent in writing well-documented, production-grade Python code Excellent grasp of key components of the software development lifecycle, including testing, code quality and security scans, artifact versioning, environment promotion, and deployment models Strong background in Unix/Linux concepts, configuration, and shell scripting Experience in working with container-based technologies (Docker, Kubernetes, and managed services like Azure AKS and AWS EKS) Experience deploying changes to large, distributed systems in both cloud and on-premises Excellent practical and theoretical knowledge of Continuous Integration / Continuous Delivery pipelines (CI/CD), sufficient to design, author, and maintain CI/CD codebases Experience designing and implementing observability components like alerts, dashboards, monitors, log-parsing pipelines, and automated remediation flows (experience with DataDog is a plus) Detail-oriented approach to writing crystal-clear technical documentation WHAT YOU'LL GET IN RETURN: Experience coordinating business-critical technical initiatives at a massive scale Hands on expertise supporting low-latency market data applications Work on cutting-edge platforms and initiatives that keep your skills at the forefront of the industry Exposure to a wide range of technologies, tools, and cloud providers The support of a dynamic and growing team interested always interested in sharpening our skillsWe recognize that to attract the best talent, we need to be flexible and we are open to discussing work arrangements with you.LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are dedicated, open-access partners with a dedication to excellence in delivering the services our customers expect from us. With extensive experience, deep knowledge and worldwide presence across financial markets, we enable businesses and economies around the world to fund innovation, manage risk and create jobs. It's how we've contributed to supporting the financial stability and growth of communities and economies globally for more than 300 years. Through a comprehensive suite of trusted financial market infrastructure services - and our open-access model - we provide the flexibility, stability and trust that enable our customers to pursue their ambitions with confidence and clarity.LSEG is headquartered in the United Kingdom, with significant operations in 65 countries across EMEA, North America, Latin America and Asia Pacific. We employ 25,000 people globally, more than half located in Asia Pacific. LSEG's ticker symbol is LSEG. OUR PEOPLE: People are at the heart of what we do and drive the success of our business. Our values of Integrity, Partnership, Excellence and Change shape how we think, how we do things and how we help our people fulfil their potential. We embrace diversity and actively seek to attract individuals with unique backgrounds and perspectives. We break down barriers and encourage teamwork, enabling innovation and rapid development of solutions that make a difference. Our workplace generates an enriching and rewarding experience for our people and customers alike. Our vision is to build an inclusive culture in which everyone feels encouraged to fulfil their potential.We know that real personal growth cannot be achieved by simply climbing a career ladder - which is why we encourage and enable a wealth of avenues and interesting opportunities for everyone to broaden and deepen their skills and expertise. As a global organisation spanning 65 countries and one rooted in a culture of growth, opportunity, diversity and innovation, LSEG is a place where everyone can grow, develop and fulfil your potential with meaningful careers.Join us and be part of a team that values innovation, quality, and continuous improvement. If you're ready to take your career to the next level and make a significant impact, we'd love to hear from you.LSEG is a leading global financial markets infrastructure and data provider. Our purpose is driving financial stability, empowering economies and enabling customers to create sustainable growth.Our purpose is the foundation on which our culture is built. Our values of Integrity, Partnership , Excellence and Change underpin our purpose and set the standard for everything we do, every day. They go to the heart of who we are and guide our decision making and everyday actions.Working with us means that you will be part of a dynamic organisation of 25,000 people across 65 countries. However, we will value your individuality and enable you to bring your true self to work so you can help enrich our diverse workforce.We are proud to be an equal opportunities employer. This means that we do not discriminate on the basis of anyone's race, religion, colour, national origin, gender, sexual orientation, gender identity, gender expression, age, marital status, veteran status, pregnancy or disability, or any other basis protected under applicable law. Conforming with applicable law, we can reasonably accommodate applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.You will be part of a collaborative and creative culture where we encourage new ideas. We are committed to sustainability across our global business and we are proud to partner with our customers to help them meet their sustainability objectives. Our charity, the LSEG Foundation provides charitable grants to community groups that help people access economic opportunities and build a secure future with financial independence. Colleagues can get involved through fundraising and volunteering.LSEG offers a range of tailored benefits and support, including healthcare, retirement planning, paid volunteering days and wellbeing initiatives.Please
Jan 09, 2026
Full time
We're looking for a Site Reliability Engineer to join our LSEG's (London Stock Exchange Group) Real Time SRE team. As part of the SRE team, you will play a central role in supporting our large, globally-distributed market data infrastructure. You'll be responsible for providing support for critical Real Time applications, identifying and automating away development and operational toil, and driving adoption of the latest and greatest tools and platforms.Ideal candidates will be strong individual contributors, possess a mix of operational support and development skills, have experience working with complex distributed systems, have strong written and verbal communication skills, and have experience scaling complex technical solutions. WHAT YOU'LL BE DOING: Working across multiple functions (infrastructure, networks, data quality, and development teams) to ensure service health for LSEG clients Ensure end-to-end quality for application migrations to cloud, from high-quality documentation to observability flows, to operational acceptance testing and beyond Support a diverse set of applications deployed across on-premise data centers, public and private cloud Automating operational and development toil via scripting skills and familiarity with Git Leverage the latest tools and develop processes to accelerate LSEG's response to and recovery from service-impacting incidents Crafting reusable dashboards and alerting pipelines from raw metrics and logs Integrate cloud services, tooling platforms, and process automation together to accelerate Mean Time to Detection and Resolution for incidents Designing, authoring, and maintaining CI/CD pipelines and patterns Define Service Level Objectives and implement monitoring to ensure our systems are available and healthy WHAT YOU'LL BRING: At least 5 years of hands-on industry experience in DevOps or SRE, or relevant equivalent (experience in supporting financial markets, real-time data, or client-facing applications a plus) Hands-on experience supporting cloud applications (AWS/Azure preferred) Fluent in writing well-documented, production-grade Python code Excellent grasp of key components of the software development lifecycle, including testing, code quality and security scans, artifact versioning, environment promotion, and deployment models Strong background in Unix/Linux concepts, configuration, and shell scripting Experience in working with container-based technologies (Docker, Kubernetes, and managed services like Azure AKS and AWS EKS) Experience deploying changes to large, distributed systems in both cloud and on-premises Excellent practical and theoretical knowledge of Continuous Integration / Continuous Delivery pipelines (CI/CD), sufficient to design, author, and maintain CI/CD codebases Experience designing and implementing observability components like alerts, dashboards, monitors, log-parsing pipelines, and automated remediation flows (experience with DataDog is a plus) Detail-oriented approach to writing crystal-clear technical documentation WHAT YOU'LL GET IN RETURN: Experience coordinating business-critical technical initiatives at a massive scale Hands on expertise supporting low-latency market data applications Work on cutting-edge platforms and initiatives that keep your skills at the forefront of the industry Exposure to a wide range of technologies, tools, and cloud providers The support of a dynamic and growing team interested always interested in sharpening our skillsWe recognize that to attract the best talent, we need to be flexible and we are open to discussing work arrangements with you.LSEG (London Stock Exchange Group) is more than a diversified global financial markets infrastructure and data business. We are dedicated, open-access partners with a dedication to excellence in delivering the services our customers expect from us. With extensive experience, deep knowledge and worldwide presence across financial markets, we enable businesses and economies around the world to fund innovation, manage risk and create jobs. It's how we've contributed to supporting the financial stability and growth of communities and economies globally for more than 300 years. Through a comprehensive suite of trusted financial market infrastructure services - and our open-access model - we provide the flexibility, stability and trust that enable our customers to pursue their ambitions with confidence and clarity.LSEG is headquartered in the United Kingdom, with significant operations in 65 countries across EMEA, North America, Latin America and Asia Pacific. We employ 25,000 people globally, more than half located in Asia Pacific. LSEG's ticker symbol is LSEG. OUR PEOPLE: People are at the heart of what we do and drive the success of our business. Our values of Integrity, Partnership, Excellence and Change shape how we think, how we do things and how we help our people fulfil their potential. We embrace diversity and actively seek to attract individuals with unique backgrounds and perspectives. We break down barriers and encourage teamwork, enabling innovation and rapid development of solutions that make a difference. Our workplace generates an enriching and rewarding experience for our people and customers alike. Our vision is to build an inclusive culture in which everyone feels encouraged to fulfil their potential.We know that real personal growth cannot be achieved by simply climbing a career ladder - which is why we encourage and enable a wealth of avenues and interesting opportunities for everyone to broaden and deepen their skills and expertise. As a global organisation spanning 65 countries and one rooted in a culture of growth, opportunity, diversity and innovation, LSEG is a place where everyone can grow, develop and fulfil your potential with meaningful careers.Join us and be part of a team that values innovation, quality, and continuous improvement. If you're ready to take your career to the next level and make a significant impact, we'd love to hear from you.LSEG is a leading global financial markets infrastructure and data provider. Our purpose is driving financial stability, empowering economies and enabling customers to create sustainable growth.Our purpose is the foundation on which our culture is built. Our values of Integrity, Partnership , Excellence and Change underpin our purpose and set the standard for everything we do, every day. They go to the heart of who we are and guide our decision making and everyday actions.Working with us means that you will be part of a dynamic organisation of 25,000 people across 65 countries. However, we will value your individuality and enable you to bring your true self to work so you can help enrich our diverse workforce.We are proud to be an equal opportunities employer. This means that we do not discriminate on the basis of anyone's race, religion, colour, national origin, gender, sexual orientation, gender identity, gender expression, age, marital status, veteran status, pregnancy or disability, or any other basis protected under applicable law. Conforming with applicable law, we can reasonably accommodate applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs.You will be part of a collaborative and creative culture where we encourage new ideas. We are committed to sustainability across our global business and we are proud to partner with our customers to help them meet their sustainability objectives. Our charity, the LSEG Foundation provides charitable grants to community groups that help people access economic opportunities and build a secure future with financial independence. Colleagues can get involved through fundraising and volunteering.LSEG offers a range of tailored benefits and support, including healthcare, retirement planning, paid volunteering days and wellbeing initiatives.Please
Are you an engineer who thrives on solving complex reliability challenges across cloud platforms? We're looking for a Site Reliability Engineer who can combine strong technical capability with a pragmatic approach to automation, monitoring, and service delivery. You'll help keep Tribal's education-driven SaaS products highly available, scalable, and performant. At Tribal Tribal is a leading EdTech business providing market-leading software solutions to the global education market. We research, develop, and deliver the products, services, and solutions that education institutions worldwide rely on to support their core mission: educating students, delivering exceptional learning experiences, and achieving successful outcomes. Our Platform Engineering function is at the heart of this, ensuring our systems are designed and maintained to the highest standards of reliability and security. As part of the SRE & Operations team, you'll play a key role in delivering Tribal's products through the public cloud as SaaS services across AWS and Azure. The Role As a Site Reliability Engineer, you'll design, build, and operate large-scale systems with an emphasis on reliability, efficiency, and automation. You'll work across deployment, monitoring, and incident response to ensure our platforms stay healthy and our customers experience uninterrupted service. You'll be involved in: Maintaining and improving production systems for availability, latency, and scalability Supporting application deployment and configuration to production environments Building or enhancing automation tools (Ansible, scripts, utilities) Implementing and managing observability tools such as DataDog or New Relic Analyzing logs and metrics to identify trends and improve reliability Supporting incident response and performing root-cause analysis Collaborating closely with engineering and customer teams to deliver proactive, preventative support Participating in on-call and out-of-hours rotations in line with Tribal's On-Call Policy This is a full-time, fully remote UK-based role, with occasional national travel for team collaboration or customer engagements. What you'll bring Strong experience with AWS (or Azure) environments Solid knowledge of Linux, Apache, and PHP in a production context Familiarity with automation/configuration tools such as Ansible Experience with monitoring and logging platforms (e.g. DataDog, New Relic, Azure Monitor) Good understanding of database fundamentals (SQL Server / Oracle) Hands-on troubleshooting and problem-solving skills Customer-facing experience with incident or service management tools (RemedyForce, ServiceNow) Strong written and verbal communication skills, able to translate technical details clearly Nice-to-have: Experience coding or scripting (Python, PowerShell, or Bash) Understanding of CI/CD pipelines (Azure DevOps or similar) ITIL Foundation or cloud certifications (AWS SysOps Administrator, AWS Solutions Architect) Note to applicants: We welcome applications from individuals who already have the right to work in the UK. As an equal opportunity employer, Tribal celebrate diversity and are committed to creating an inclusive environment for all employees. We make sure that our recruitment and selection processes never discriminate based upon any protected characteristics and actively welcome applications from all groups, not least those underrepresented in the tech sector. Note to all applicants - Tribal reserve the right to close an advertisement to applications ahead of the advertised closure date. For this reason, shortlisting may take place prior to the closing date on some occasions. With this in mind, please do not hesitate to apply early. Application Form Forename(1) Forename(2) Forename(3) Surname Known as or preferred name Phone Email Address Address Address Line 1 Address Line 2 City State/Province ZIP / Postal Country Upload your CV No File Chosen No File Chosen Do you have eligibility to work in the UK? Do you have eligibility to work in the UK? No Yes Minimum Salary Expectation Do you currently work for Tribal? Where did you find out about this opportunity? Where did you find out about this opportunity? Required field Linkedin Someone who works at Tribal Tribal Website University Job Board Other Join our Talent Pool! Join our Talent Pool! Yes, I opt into the talent pool and agree for you to contact me Join our Talent Pool! Join our Talent Pool! Yes, I opt into the talent pool and agree for you to contact me Please confirm if you wish to opt into our Talent pool and agree to receive communications about potential career opportunities at Tribal For details on how Tribal will use and retain your data click here Recruitment Agencies: Please review our recruitment agency statement here
Jan 01, 2026
Full time
Are you an engineer who thrives on solving complex reliability challenges across cloud platforms? We're looking for a Site Reliability Engineer who can combine strong technical capability with a pragmatic approach to automation, monitoring, and service delivery. You'll help keep Tribal's education-driven SaaS products highly available, scalable, and performant. At Tribal Tribal is a leading EdTech business providing market-leading software solutions to the global education market. We research, develop, and deliver the products, services, and solutions that education institutions worldwide rely on to support their core mission: educating students, delivering exceptional learning experiences, and achieving successful outcomes. Our Platform Engineering function is at the heart of this, ensuring our systems are designed and maintained to the highest standards of reliability and security. As part of the SRE & Operations team, you'll play a key role in delivering Tribal's products through the public cloud as SaaS services across AWS and Azure. The Role As a Site Reliability Engineer, you'll design, build, and operate large-scale systems with an emphasis on reliability, efficiency, and automation. You'll work across deployment, monitoring, and incident response to ensure our platforms stay healthy and our customers experience uninterrupted service. You'll be involved in: Maintaining and improving production systems for availability, latency, and scalability Supporting application deployment and configuration to production environments Building or enhancing automation tools (Ansible, scripts, utilities) Implementing and managing observability tools such as DataDog or New Relic Analyzing logs and metrics to identify trends and improve reliability Supporting incident response and performing root-cause analysis Collaborating closely with engineering and customer teams to deliver proactive, preventative support Participating in on-call and out-of-hours rotations in line with Tribal's On-Call Policy This is a full-time, fully remote UK-based role, with occasional national travel for team collaboration or customer engagements. What you'll bring Strong experience with AWS (or Azure) environments Solid knowledge of Linux, Apache, and PHP in a production context Familiarity with automation/configuration tools such as Ansible Experience with monitoring and logging platforms (e.g. DataDog, New Relic, Azure Monitor) Good understanding of database fundamentals (SQL Server / Oracle) Hands-on troubleshooting and problem-solving skills Customer-facing experience with incident or service management tools (RemedyForce, ServiceNow) Strong written and verbal communication skills, able to translate technical details clearly Nice-to-have: Experience coding or scripting (Python, PowerShell, or Bash) Understanding of CI/CD pipelines (Azure DevOps or similar) ITIL Foundation or cloud certifications (AWS SysOps Administrator, AWS Solutions Architect) Note to applicants: We welcome applications from individuals who already have the right to work in the UK. As an equal opportunity employer, Tribal celebrate diversity and are committed to creating an inclusive environment for all employees. We make sure that our recruitment and selection processes never discriminate based upon any protected characteristics and actively welcome applications from all groups, not least those underrepresented in the tech sector. Note to all applicants - Tribal reserve the right to close an advertisement to applications ahead of the advertised closure date. For this reason, shortlisting may take place prior to the closing date on some occasions. With this in mind, please do not hesitate to apply early. Application Form Forename(1) Forename(2) Forename(3) Surname Known as or preferred name Phone Email Address Address Address Line 1 Address Line 2 City State/Province ZIP / Postal Country Upload your CV No File Chosen No File Chosen Do you have eligibility to work in the UK? Do you have eligibility to work in the UK? No Yes Minimum Salary Expectation Do you currently work for Tribal? Where did you find out about this opportunity? Where did you find out about this opportunity? Required field Linkedin Someone who works at Tribal Tribal Website University Job Board Other Join our Talent Pool! Join our Talent Pool! Yes, I opt into the talent pool and agree for you to contact me Join our Talent Pool! Join our Talent Pool! Yes, I opt into the talent pool and agree for you to contact me Please confirm if you wish to opt into our Talent pool and agree to receive communications about potential career opportunities at Tribal For details on how Tribal will use and retain your data click here Recruitment Agencies: Please review our recruitment agency statement here
Once For All is a high-growth, cloud-based, SaaS subscription business. Our technology helps our customers to manage their supply chain governance, risk management and compliance. We work across public and private sector and have over 250k customers across the UK across 20 different sectors including construction, transport, retail, hospitality education, facility and property management, manufacturing, local and central government. Role Summary Join our Reliability and Platform group partnering with 10 Agile SCRUM teams to scale and harden a suite of microservices on Microsoft Azure. You will own production reliability for tier-1 services, set and track SLOs, automate operations, and lead incident response to keep our next-generation Supplier Risk Assessment platform fast, secure, and available. This role is fully remote role. Job Responsibilities Define SLOs, SLIs, and error budgets for critical services. Architect resilient multi-region and zone-aware workloads on Azure and AKS. Build infrastructure as code with Terraform or Bicep. Enforce policy as code. Design safe releases with progressive delivery, automated rollbacks, and feature flags. Lead on-call rotations, incident response, postmortems, and corrective actions. Implement end-to-end observability: metrics, logs, traces, dashboards, alerts. Plan capacity, tune performance, and optimize cost without impacting reliability. Secure the stack with Managed Identity, Key Vault, workload identity, and network segmentation. Establish backup, disaster recovery, and tested restore procedures with clear RPO and RTO. Mentor engineers and raise reliability standards across product teams Candidate Requirements 10+ years in SRE, platform, or production-facing engineering roles running large-scale systems. 7+ years hands on with Microsoft Azure: AKS, Front Door or Application Gateway, VNets, Private Link, Key Vault, Monitor, Log Analytics, Application Insights, Service Bus, Storage, SQL or Cosmos DB. 6+ years operating Kubernetes in production, including at least 3 years on AKS (network policies, PodDisruptionBudgets, HPA/VPA, node pools, upgrade playbooks). 5+ years infrastructure as code with Terraform or Bicep and Git based workflows. 5+ years designing observability and SLO based alerting using OpenTelemetry and Kusto queries. 4+ years running canary or blue green deployments in Azure DevOps or GitHub Actions. Proven incident command experience with measurable MTTR and MTTD improvements. Strong automation skills in Python or Go, plus Bash and PowerShell. Solid understanding of security hardening, container image scanning, SBOM, and least privilege. Experience with performance testing, p95 and p99 tuning, caching and connection pool strategies. Nice To Have Multi tenant SaaS and data sovereignty patterns. Service mesh, eBPF, or advanced traffic shaping. Compliance and audit trail design. FinOps practice with cost per request or per tenant KPIs. What We Offer Health and Wellbeing: Private Medical Insurance or wellness fund, 24/7 Employee Assistance Programme. Financial Benefits: Pension, Life Assurance (3x salary). Time Off: 25 days holiday + 8 bank holidays, holiday purchase scheme (+5 days), paid and unpaid volunteering days. Growth and Development: Ongoing CPD, team offsites, and company events. Everyday Perks: Home office budget, high spec laptop and peripherals. Work Setup: Fully remote within UK time zones, optional access to our Basingstoke office. Tech Stack You Will Use Azure, AKS, Terraform or Bicep, Azure DevOps or GitHub Actions, Docker, Helm, Service Bus, Storage, SQL Server, Cosmos DB, Key Vault, Azure Monitor, Log Analytics, Application Insights, Prometheus, Grafana, OpenTelemetry, Feature flagging tools. Interview Process Intro and role overview with Talent. Technical deep dive on Azure and AKS architecture. Practical exercise: propose SLOs and an alert plan for a sample service, plus a release safety plan. Culture and collaboration interview with Engineering.
Jan 01, 2026
Full time
Once For All is a high-growth, cloud-based, SaaS subscription business. Our technology helps our customers to manage their supply chain governance, risk management and compliance. We work across public and private sector and have over 250k customers across the UK across 20 different sectors including construction, transport, retail, hospitality education, facility and property management, manufacturing, local and central government. Role Summary Join our Reliability and Platform group partnering with 10 Agile SCRUM teams to scale and harden a suite of microservices on Microsoft Azure. You will own production reliability for tier-1 services, set and track SLOs, automate operations, and lead incident response to keep our next-generation Supplier Risk Assessment platform fast, secure, and available. This role is fully remote role. Job Responsibilities Define SLOs, SLIs, and error budgets for critical services. Architect resilient multi-region and zone-aware workloads on Azure and AKS. Build infrastructure as code with Terraform or Bicep. Enforce policy as code. Design safe releases with progressive delivery, automated rollbacks, and feature flags. Lead on-call rotations, incident response, postmortems, and corrective actions. Implement end-to-end observability: metrics, logs, traces, dashboards, alerts. Plan capacity, tune performance, and optimize cost without impacting reliability. Secure the stack with Managed Identity, Key Vault, workload identity, and network segmentation. Establish backup, disaster recovery, and tested restore procedures with clear RPO and RTO. Mentor engineers and raise reliability standards across product teams Candidate Requirements 10+ years in SRE, platform, or production-facing engineering roles running large-scale systems. 7+ years hands on with Microsoft Azure: AKS, Front Door or Application Gateway, VNets, Private Link, Key Vault, Monitor, Log Analytics, Application Insights, Service Bus, Storage, SQL or Cosmos DB. 6+ years operating Kubernetes in production, including at least 3 years on AKS (network policies, PodDisruptionBudgets, HPA/VPA, node pools, upgrade playbooks). 5+ years infrastructure as code with Terraform or Bicep and Git based workflows. 5+ years designing observability and SLO based alerting using OpenTelemetry and Kusto queries. 4+ years running canary or blue green deployments in Azure DevOps or GitHub Actions. Proven incident command experience with measurable MTTR and MTTD improvements. Strong automation skills in Python or Go, plus Bash and PowerShell. Solid understanding of security hardening, container image scanning, SBOM, and least privilege. Experience with performance testing, p95 and p99 tuning, caching and connection pool strategies. Nice To Have Multi tenant SaaS and data sovereignty patterns. Service mesh, eBPF, or advanced traffic shaping. Compliance and audit trail design. FinOps practice with cost per request or per tenant KPIs. What We Offer Health and Wellbeing: Private Medical Insurance or wellness fund, 24/7 Employee Assistance Programme. Financial Benefits: Pension, Life Assurance (3x salary). Time Off: 25 days holiday + 8 bank holidays, holiday purchase scheme (+5 days), paid and unpaid volunteering days. Growth and Development: Ongoing CPD, team offsites, and company events. Everyday Perks: Home office budget, high spec laptop and peripherals. Work Setup: Fully remote within UK time zones, optional access to our Basingstoke office. Tech Stack You Will Use Azure, AKS, Terraform or Bicep, Azure DevOps or GitHub Actions, Docker, Helm, Service Bus, Storage, SQL Server, Cosmos DB, Key Vault, Azure Monitor, Log Analytics, Application Insights, Prometheus, Grafana, OpenTelemetry, Feature flagging tools. Interview Process Intro and role overview with Talent. Technical deep dive on Azure and AKS architecture. Practical exercise: propose SLOs and an alert plan for a sample service, plus a release safety plan. Culture and collaboration interview with Engineering.
Role Summary At FINBOURNE, innovation and client focus are at the core of everything we do. Our industry-leading SaaS solutions, LUSID and LUMINESCE, empower clients with advanced IBOR and operational capabilities. We are seeking a passionate and experienced Senior DevOps Engineer with over five years of expertise to join our dynamic team. In this role, you will directly impact the software development lifecycle by designing, implementing, and optimising systems that enhance developer productivity and accelerate innovation. You'll also mentor junior engineers, influence architectural decisions, and collaborate across teams to establish and maintain robust DevOps practices. As a cloud-native organisation, we operate on Kubernetes in AWS and Azure, with applications written in C# .NET Core and tools developed in C#, Python, Go, and Rust. Key Responsibilities Collaborate with product development teams to design and implement solutions that improve the software development lifecycle. Partner with developers to understand their needs, advocate for developer-centric tools and practices, and ensure that they evolve to meet the changing needs of the organis Build and enhance tools, including CI/CD pipelines, to facilitate efficient code integration, testing, and deployment in a cloud or on-premises environment. Write readable, efficient code in languages such as Go, Python, Bash, C#, or similar, to automate software delivery processes. Create and manage monitoring and alerting systems to proactively identify issues in production and improve system observability. Participate in software architecture discussions, providing a DevOps perspective to ensure applications are designed for scalability, reliability, and maintainability. Monitor and troubleshoot systems across Linux environments, Kubernetes clusters, Istio service mesh, and application layers, ensuring high availability and performance. Continuously assess performance, scalability, and cost-effectiveness of the platform, suggesting improvements to reduce operational overhead while optimising resource utilis Mentor junior engineers and share knowledge through documentation, workshops, and regular team discussions. Skills and Experience We are looking for a skilled software engineer with experience in developing and running applications on Kubernetes and deep understanding of Linux, networking and container technology. You will have at least five years' experience working with large solutions. You will have: Proven experience as a DevOps engineer or in a similar software engineering role. Experience building, maintaining and releasing containerised software to production in a large organisation. Proficiency in programming languages such as Go, Python, or C#. Excellent knowledge of Linux systems and networking protocols (e.g., TCP/IP, HTTP/S, DNS, VPNs). Expertise with container and orchestration technologies, including Docker and Kubernetes. Hands-on experience with Helm for packaging, deploying, and managing Kubernetes applications. Experience with monitoring and logging solutions like Prometheus, Grafana, ELK Stack, or similar. Knowledge of security best practices in DevOps and cloud environments. Terraform, Ansible or Chef experience is preferred. Nice to haves: knowledge of Concourse, Nexus, SonarQube, various AWS services. Key Attributes Strong problem-solving skills with a focus on finding creative and efficient solutions. Excellent communication and collaboration skills with the ability to work effectively across teams. Ability to thrive in a fast-paced, dynamic environment and handle multiple competing priorities. Passion for continuous learning and improving processes. A team-player with a hands-on attitude and bias toward action. Just some of our benefits Competitive salary plus performance based bonus. Health & Wellbeing: A competitive health insurance policy that disregards previous medical history. This also includes dental, optical, mental health support and comprehensive cancer cover. Cycle to work scheme and Gym discounts: Buy a bike and cycling accessories out of your pre-tax salary and spread the cost over 12 months, as well huge discounts off Hussle, KOBOX and Nuffield Health gyms. Flexible and remote working: We have a mature attitude towards flexible and remote working. Whether you're a night owl, morning person, parent, carer or simply need flexibility to work a different pattern to the norm, we're committed to helping you be productive and work in a way that is best for you. Professional learning and development: External training and accreditations are supported, as well internal training and development programs. Maternity, paternity and adoption leave: Paid maternity, paternity and adoption leave, which includes 13 weeks full pay for maternity and adoption leave and 6 weeks full pay for paternity leave Holiday: 25 days holiday plus bank holidays About FINBOURNE We are a young, dynamic financial technology company aiming to re-engineer the world of investing to make it clearer, faster and more cost effective for everyone. At FINBOURNE, we offer a hugely supportive environment to build a career, with continuous learning and development opportunities. We have a collaborative culture of testing and exploring problems together to find the best evidence-based solutions. We respect your independent thought, your intellectual curiosity and your opinion. Our solution is open, API first and developer friendly - a true first for the asset management industry. You can see what our team is busy building on Github. For more information about us please visit our website.
Jan 01, 2026
Full time
Role Summary At FINBOURNE, innovation and client focus are at the core of everything we do. Our industry-leading SaaS solutions, LUSID and LUMINESCE, empower clients with advanced IBOR and operational capabilities. We are seeking a passionate and experienced Senior DevOps Engineer with over five years of expertise to join our dynamic team. In this role, you will directly impact the software development lifecycle by designing, implementing, and optimising systems that enhance developer productivity and accelerate innovation. You'll also mentor junior engineers, influence architectural decisions, and collaborate across teams to establish and maintain robust DevOps practices. As a cloud-native organisation, we operate on Kubernetes in AWS and Azure, with applications written in C# .NET Core and tools developed in C#, Python, Go, and Rust. Key Responsibilities Collaborate with product development teams to design and implement solutions that improve the software development lifecycle. Partner with developers to understand their needs, advocate for developer-centric tools and practices, and ensure that they evolve to meet the changing needs of the organis Build and enhance tools, including CI/CD pipelines, to facilitate efficient code integration, testing, and deployment in a cloud or on-premises environment. Write readable, efficient code in languages such as Go, Python, Bash, C#, or similar, to automate software delivery processes. Create and manage monitoring and alerting systems to proactively identify issues in production and improve system observability. Participate in software architecture discussions, providing a DevOps perspective to ensure applications are designed for scalability, reliability, and maintainability. Monitor and troubleshoot systems across Linux environments, Kubernetes clusters, Istio service mesh, and application layers, ensuring high availability and performance. Continuously assess performance, scalability, and cost-effectiveness of the platform, suggesting improvements to reduce operational overhead while optimising resource utilis Mentor junior engineers and share knowledge through documentation, workshops, and regular team discussions. Skills and Experience We are looking for a skilled software engineer with experience in developing and running applications on Kubernetes and deep understanding of Linux, networking and container technology. You will have at least five years' experience working with large solutions. You will have: Proven experience as a DevOps engineer or in a similar software engineering role. Experience building, maintaining and releasing containerised software to production in a large organisation. Proficiency in programming languages such as Go, Python, or C#. Excellent knowledge of Linux systems and networking protocols (e.g., TCP/IP, HTTP/S, DNS, VPNs). Expertise with container and orchestration technologies, including Docker and Kubernetes. Hands-on experience with Helm for packaging, deploying, and managing Kubernetes applications. Experience with monitoring and logging solutions like Prometheus, Grafana, ELK Stack, or similar. Knowledge of security best practices in DevOps and cloud environments. Terraform, Ansible or Chef experience is preferred. Nice to haves: knowledge of Concourse, Nexus, SonarQube, various AWS services. Key Attributes Strong problem-solving skills with a focus on finding creative and efficient solutions. Excellent communication and collaboration skills with the ability to work effectively across teams. Ability to thrive in a fast-paced, dynamic environment and handle multiple competing priorities. Passion for continuous learning and improving processes. A team-player with a hands-on attitude and bias toward action. Just some of our benefits Competitive salary plus performance based bonus. Health & Wellbeing: A competitive health insurance policy that disregards previous medical history. This also includes dental, optical, mental health support and comprehensive cancer cover. Cycle to work scheme and Gym discounts: Buy a bike and cycling accessories out of your pre-tax salary and spread the cost over 12 months, as well huge discounts off Hussle, KOBOX and Nuffield Health gyms. Flexible and remote working: We have a mature attitude towards flexible and remote working. Whether you're a night owl, morning person, parent, carer or simply need flexibility to work a different pattern to the norm, we're committed to helping you be productive and work in a way that is best for you. Professional learning and development: External training and accreditations are supported, as well internal training and development programs. Maternity, paternity and adoption leave: Paid maternity, paternity and adoption leave, which includes 13 weeks full pay for maternity and adoption leave and 6 weeks full pay for paternity leave Holiday: 25 days holiday plus bank holidays About FINBOURNE We are a young, dynamic financial technology company aiming to re-engineer the world of investing to make it clearer, faster and more cost effective for everyone. At FINBOURNE, we offer a hugely supportive environment to build a career, with continuous learning and development opportunities. We have a collaborative culture of testing and exploring problems together to find the best evidence-based solutions. We respect your independent thought, your intellectual curiosity and your opinion. Our solution is open, API first and developer friendly - a true first for the asset management industry. You can see what our team is busy building on Github. For more information about us please visit our website.
At Hometrack, our cloud platform is central to delivering industry-leading solutions. As a Platform Engineer, you will be responsible for the end-to-end lifecycle of our cloud infrastructure and data platforms, encompassing design, operation, infrastructure-as-code, and DevOps pipelines. You will report to the Head of Operational Engineering and collaborate closely with the wider Engineering team within a cross-functional structure. This role involves working with internal and external clients to deliver projects and business-as-usual (BAU) work with a focus on best practices, high standards, and meeting defined SLAs. You will also be a key driver in defining engineering disciplines, identifying opportunities for continuous improvement, and championing automation efforts across the platform. Key Responsibilities Collaboration: Partner with Software and Data Engineers to ensure all platform requirements for security, scalability, availability, monitoring, and support are met. Compliance: Help maintain platform monitoring to meet strict performance, capacity, availability, and security controls, specifically aligning with ISO-27001 standards. DevOps Enablement: Define and promote DevOps best practices for building, testing, and reliably releasing software components. Release Management: Assist in the coordination of releases, covering deployment, configuration, and successful rollout of software components and systems to various environments. Infrastructure-as-Code (IaC): Write and maintain robust IaC to facilitate the automated and consistent creation and management of environments, constantly seeking opportunities to simplify and enhance existing solutions. Support & Incident Response: Serve as part of the 3rd Line support team for the platform, including leading critical incident response and resolution efforts. Availability: Participate in an On-Call Rota and take part in out-of-hours releases as required. Essential Skills Strong experience building and supporting Microsoft Azure cloud-based architecture and infrastructure. Solid understanding of basic networking principles, including IPv4. Hands-on experience with DevOps, CI/CD, and automation tooling, such as Terraform, Azure DevOps, and GitHub. Proven track record in providing reliable support for cloud-based infrastructure. Experience in implementing and improving existing cloud solutions, aligning them with the Microsoft Azure Well-Architected Framework (covering security, capacity, performance, availability, and monitoring). Ability to support daily BAU services for internal and external clients, including SFTP, Virtual Desktops, BPM software, IAM, and other core services. Expertise in troubleshooting high-performance and business-critical solutions for both API and web-based applications. Strong communication and stakeholder management skills, with the ability to articulate complex technical topics clearly to non-technical audiences. Preferred Skills Relevant experience managing commercial infrastructure within the financial services sector. Previous experience working on a dedicated SRE, Platform, or DevOps team. A pragmatic approach to problem-solving, appreciating simplicity over complexity while knowing when to navigate between both. Capability to understand and accurately attribute cloud costs to specific teams and services. Hometrack delivers the market-leading valuation service to lenders and across the property technology and financial technology industries. Our primary commercial focus is on financial services, where we serve the mortgage lender segment, including 9 of the top 10 mortgage providers. All qualified applicants will receive consideration for employment without regard to race, colour, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed. Benefits Everyday Flex - greater flexibility over where and when you work 25 days annual leave + extra days for years of service Day off for volunteering & Digital detox day Festive Closure - business closed for period between Christmas and New Year Cycle to work and electric car schemes Free Calm App membership Enhanced Parental leave Fertility Treatment Financial Support Group Income Protection and private medical insurance Gym on-site in London - or membership in regional offices 7.5% pension contribution by the company Discretionary annual bonus up to 10% of base salary Talent referral bonus up to £5K
Jan 01, 2026
Full time
At Hometrack, our cloud platform is central to delivering industry-leading solutions. As a Platform Engineer, you will be responsible for the end-to-end lifecycle of our cloud infrastructure and data platforms, encompassing design, operation, infrastructure-as-code, and DevOps pipelines. You will report to the Head of Operational Engineering and collaborate closely with the wider Engineering team within a cross-functional structure. This role involves working with internal and external clients to deliver projects and business-as-usual (BAU) work with a focus on best practices, high standards, and meeting defined SLAs. You will also be a key driver in defining engineering disciplines, identifying opportunities for continuous improvement, and championing automation efforts across the platform. Key Responsibilities Collaboration: Partner with Software and Data Engineers to ensure all platform requirements for security, scalability, availability, monitoring, and support are met. Compliance: Help maintain platform monitoring to meet strict performance, capacity, availability, and security controls, specifically aligning with ISO-27001 standards. DevOps Enablement: Define and promote DevOps best practices for building, testing, and reliably releasing software components. Release Management: Assist in the coordination of releases, covering deployment, configuration, and successful rollout of software components and systems to various environments. Infrastructure-as-Code (IaC): Write and maintain robust IaC to facilitate the automated and consistent creation and management of environments, constantly seeking opportunities to simplify and enhance existing solutions. Support & Incident Response: Serve as part of the 3rd Line support team for the platform, including leading critical incident response and resolution efforts. Availability: Participate in an On-Call Rota and take part in out-of-hours releases as required. Essential Skills Strong experience building and supporting Microsoft Azure cloud-based architecture and infrastructure. Solid understanding of basic networking principles, including IPv4. Hands-on experience with DevOps, CI/CD, and automation tooling, such as Terraform, Azure DevOps, and GitHub. Proven track record in providing reliable support for cloud-based infrastructure. Experience in implementing and improving existing cloud solutions, aligning them with the Microsoft Azure Well-Architected Framework (covering security, capacity, performance, availability, and monitoring). Ability to support daily BAU services for internal and external clients, including SFTP, Virtual Desktops, BPM software, IAM, and other core services. Expertise in troubleshooting high-performance and business-critical solutions for both API and web-based applications. Strong communication and stakeholder management skills, with the ability to articulate complex technical topics clearly to non-technical audiences. Preferred Skills Relevant experience managing commercial infrastructure within the financial services sector. Previous experience working on a dedicated SRE, Platform, or DevOps team. A pragmatic approach to problem-solving, appreciating simplicity over complexity while knowing when to navigate between both. Capability to understand and accurately attribute cloud costs to specific teams and services. Hometrack delivers the market-leading valuation service to lenders and across the property technology and financial technology industries. Our primary commercial focus is on financial services, where we serve the mortgage lender segment, including 9 of the top 10 mortgage providers. All qualified applicants will receive consideration for employment without regard to race, colour, national origin, religion, sexual orientation, gender, gender identity, age, physical disability, or length of time spent unemployed. Benefits Everyday Flex - greater flexibility over where and when you work 25 days annual leave + extra days for years of service Day off for volunteering & Digital detox day Festive Closure - business closed for period between Christmas and New Year Cycle to work and electric car schemes Free Calm App membership Enhanced Parental leave Fertility Treatment Financial Support Group Income Protection and private medical insurance Gym on-site in London - or membership in regional offices 7.5% pension contribution by the company Discretionary annual bonus up to 10% of base salary Talent referral bonus up to £5K