Distributed AI Systems Engineer

globe (3)

Remote

globe (3)

USA

hourglass (1)

Permanent

business-cards (1)

Articificial Intelligence

1

Focus

  • Build infrastructure for distributed model training
  • Optimize compute scheduling across large GPU fleets
  • Improve performance of LLM training pipelines

Tech

  • PyTorch Distributed
  • Ray
  • CUDA
  • HPC networking (InfiniBand / RDMA)

Darwin Recruitment is acting as an Employment Agency in relation to this vacancy.

Reece Waldon

Submit Your CV

This field is for validation purposes and should be left unchanged.
Name_1
Max. file size: 512 MB.

Similar Jobs

1

Permanent

AI Silicon Verification Engineer

Technology

Articificial Intelligence

Focus: verification of AI accelerator designs simulation and validation of neural compute pipelines Tech: SystemVerilog UVM RTL Darwin Recruitment is acting as an Employment See more…

to $250,000/year

San Francisco

USA

1

Permanent

AI Datacenter Architect

Technology

Articificial Intelligence

Focus: design large-scale AI compute clusters optimize networking, storage and scheduling Tech: HPC networking Kubernetes GPU scheduling Darwin Recruitment is acting as an Employment See more…

to $250,000/year

San Francisco

USA

1

Permanent

AI Performance Optimization Engineer

Technology

Articificial Intelligence

Focus: protect AI accelerators against firmware and side-channel attacks secure boot and attestation for AI hardware Tech: firmware security hardware root of trust enclave See more…

to $250,000/year

San Francisco

USA