Senior CUDA Engineer – High-Performance Computing for Intelligent Systems
We’re working with a fast-growing, well-funded technology company developing advanced software that powers intelligent, real-time computing systems across multiple industries. Their solutions run on everything from high-performance workstation setups to compact embedded computing boards, with an unwavering focus on speed, reliability, and efficiency.
Following a major investment round, they’re entering an exciting scale-up phase and are seeking a Senior CUDA Engineer to take ownership of GPU-focused development. You’ll be at the heart of optimizing their most performance-critical code, ensuring their multi-platform systems run flawlessly.
Key Responsibilities:
-
Translate complex algorithms into efficient CUDA kernels, balancing throughput and latency.
-
Profile and optimize GPU workloads using industry-standard tools.
-
Debug functional and memory issues using advanced GPU debugging utilities.
-
Maintain and enhance a cross-platform, CMake-based CUDA build system.
-
Integrate and optimize third-party CUDA libraries for production use.
What You’ll Bring:
-
Strong expertise in CUDA programming, GPU memory hierarchies, and optimization strategies.
-
Proficiency with GPU profiling and debugging tools.
-
Solid C++ skills with experience in multi-platform builds.
-
Ability to make clear decisions around performance, architecture, and integration trade-offs.
-
Comfort working in a fast-paced, hands-on engineering environment.
Desirable Extras:
-
Experience with embedded GPU deployments and cross-compilation workflows.
-
Knowledge of CUDA libraries such as cuBLAS, Thrust, or CUTLASS.
Why This Role Stands Out:
-
Direct impact on the performance and reliability of advanced computing systems.
-
Autonomy to shape technical infrastructure in a high-calibre engineering team.
-
Join at a key inflection point with strong financial backing and a bold growth plan.
Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.
George Wiles