← Back to Careers

Member of Technical Staff

Winter InternshipSan Francisco, CA

Winter internship opportunity to work on cutting-edge GPU optimization projects. You'll contribute to real production systems while learning from experienced engineers and researchers.

What You'll Do

  • Implement GPU optimization techniques for specific model architectures
  • Contribute to our profiling and benchmarking tools
  • Work on performance analysis and bottleneck identification
  • Assist with research experiments and data collection
  • Present findings to the technical team

What We Look For

Core Technical Expertise

  • GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
  • Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
  • LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
  • Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.

Ideal Background

  • Publications or open-source contributions in GPU computing or ML/AI for code are a plus.
  • Hands-on experience with large-scale experiments, benchmarking, and performance tuning.

Apply for this Position

PDF files only, max 5MB