Tech Job Finder - Find Software, Technology Sales and Product Manager Jobs.

Deep Learning Performance Software Engineer

at Nvidia

Industry not specified

Mid Level | No visa sponsorship | Data Science/AI/ML

Posted 6 hours ago

Compensation: Not specified
Currency: Not specified
City: Shanghai
Country: China

Join NVIDIA's team to develop GPU-accelerated deep learning software. You will create and optimize high-performance kernels using a tile-based GPU programming model and contribute to end-to-end performance improvements. The role involves working with TileGym, the Triton CUDA TileIR backend, and CUDA Tile, plus performance analysis, profiling, and code optimization. Excellent C/C++ skills and 3 years of relevant experience are required; GPU programming experience (CUDA or OpenCL) is desired, and Python and MLIR experience are a plus.

We are now looking for a Deep Learning Performance Software Engineer!

We are expanding our research and development for deep learning and seek excellent Software Engineers and Senior Software Engineers to join our team. We specialize in developing GPU-accelerated deep learning software. Researchers around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in numerous areas. Join the team that builds software to enable new solutions. The ability to work in a fast-paced, customer-oriented team and excellent communication skills are required.

What you’ll be doing:

  • Develop TileGym, the Triton CUDA TileIR backend, and CUDA Tile

  • Develop highly optimized deep learning kernels using a tile-based GPU programming model

  • Optimize end-to-end performance using a tile-based GPU programming model

  • Carry out performance analysis, optimization, and tuning


What we need to see:

  • Master's or PhD, or equivalent experience, in a relevant discipline (CE, CS&E, CS, AI)

  • Agile software development skills helpful

  • Excellent C/C++ programming and software design skills

  • Python experience a plus

  • MLIR experience a plus

  • AI agent experience a plus

  • Performance modelling, profiling, debugging, and code optimization, or architectural knowledge of CPUs and GPUs

  • GPU programming experience (CUDA or OpenCL) desired

  • 3 years of relevant work experience


NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people on the planet working for us. If you're creative and autonomous, we want to hear from you!
