Distributed AI Systems Engineer
Remote
USA
Permanent
Articificial Intelligence
Focus
- Build infrastructure for distributed model training
- Optimize compute scheduling across large GPU fleets
- Improve performance of LLM training pipelines
Tech
- PyTorch Distributed
- Ray
- CUDA
- HPC networking (InfiniBand / RDMA)
Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.

Reece Waldon

Submit Your CV
Similar Jobs
1
Permanent
AI Silicon Verification EngineerTechnology
Articificial Intelligence
Focus: verification of AI accelerator designs simulation and validation of neural compute pipelines Tech: SystemVerilog UVM RTL Darwin Recruitment is acting as an Employment See more…
to $250,000/year
San Francisco
USA
1
Permanent
AI Datacenter ArchitectTechnology
Articificial Intelligence
Focus: design large-scale AI compute clusters optimize networking, storage and scheduling Tech: HPC networking Kubernetes GPU scheduling Darwin Recruitment is acting as an Employment See more…
to $250,000/year
San Francisco
USA
1
Permanent
AI Performance Optimization EngineerTechnology
Articificial Intelligence
Focus: protect AI accelerators against firmware and side-channel attacks secure boot and attestation for AI hardware Tech: firmware security hardware root of trust enclave See more…
to $250,000/year
San Francisco
USA