NVIDIA-posted 3 days ago
$184,000 - $287,500/Yr
Senior
Santa Clara, CA
Craft a resume that recruiters will want to see with Teal's resume Matching Mode

We are now looking for a Senior Software Engineer for Deep Learning Inference Workflows! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning software team.

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.
  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.
  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.
  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.
  • 6+ years of software development experience.
  • Strong experience with C++11/C++14/C++17.
  • Strong grasp of Machine Learning concepts, especially Natural Language Processing.
  • Excellent communication skills, and an aptitude for collaboration and teamwork.
  • Proficiency in Python.
  • Experience in software performance benchmarking, profiling, and optimizations.
  • Background in compiler development.
  • Experience in working with TensorRT, PyTorch, ONNX Runtime, JAX, TRT-LLM, vLLM, SGLang, or other ML frameworks.
  • Experience with HuggingFace Diffusers and Transformers libraries.
  • Equity and benefits.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service