Real-Time ML Engineer: Large-Scale GPU Inference

  • Harrington Starr
  • Feb 23, 2026
Full-time, I.T. & Communications

Job Description

A leading tech recruitment firm is seeking a Machine Learning Engineer to design and implement large-scale systems for training and real-time inference. The role involves collaborating with multidisciplinary teams to enhance models using GPU acceleration and distributed computing. Ideal candidates will have strong machine learning expertise, proficiency in Python and C++, and experience developing low-latency ML systems. Competitive remuneration and opportunities for progression are offered.