← Back to Careers
Member of Technical Staff
Winter Internship•San Francisco, CA
Winter internship opportunity to work on cutting-edge GPU optimization projects. You'll contribute to real production systems while learning from experienced engineers and researchers.
What You'll Do
- • Implement GPU optimization techniques for specific model architectures
- • Contribute to our profiling and benchmarking tools
- • Work on performance analysis and bottleneck identification
- • Assist with research experiments and data collection
- • Present findings to the technical team
What We Look For
Core Technical Expertise
- • GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
- • Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
- • LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
- • Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.
Ideal Background
- • Publications or open-source contributions in GPU computing or ML/AI for code are a plus.
- • Hands-on experience with large-scale experiments, benchmarking, and performance tuning.