Member of Technical Staff

Winter Internship•San Francisco, CA

Winter internship opportunity to work on cutting-edge GPU optimization projects. You'll contribute to real production systems while learning from experienced engineers and researchers.

What You'll Do

• Implement GPU optimization techniques for specific model architectures
• Contribute to our profiling and benchmarking tools
• Work on performance analysis and bottleneck identification
• Assist with research experiments and data collection
• Present findings to the technical team

What We Look For

Core Technical Expertise

• GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns.
• Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads.
• LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation).
• Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.

Ideal Background

• Publications or open-source contributions in GPU computing or ML/AI for code are a plus.
• Hands-on experience with large-scale experiments, benchmarking, and performance tuning.

Member of Technical Staff

What You'll Do

What We Look For

Core Technical Expertise

Ideal Background

Apply for this Position