GTC Japan 2017 Part 6: New NVIDIA TensorRT 3

Name: GTC Japan 2017 Part 6: New NVIDIA TensorRT 3
Uploaded: 2017-12-15T18:41:01.000Z
Duration: 4 min 19 s
Channel: NVIDIA
Description: - TensorRT optimizes computational graphs and generates Volta CUDA programs, providing amazing performance gains for deep neural networks. - Using TensorRT with TensorFlow, image recognition speed on V100 GPUs is 5,740 times faster than CPUs, while natural language translation is 100 times faster. -

December 15, 2017

NVIDIA

TL;DR

NVIDIA's TensorRT and Volta GPUs offer significant speed and efficiency improvements for deep neural networks, enabling faster image recognition and natural language translation.

Transcript

so we the tensor RT takes this computational graph and runs it through an optimizing compiler and outputs a Volta CUDA program this Volta could a program was written by software so software wrote software and this Volta could a program for a deep neural network could predict many things depends on what you taught it to do the performance that we've... Read More

Key Insights

🉐 TensorRT, combined with Volta GPUs, offers exceptional performance gains for deep neural networks in both image recognition and natural language translation.
⌛ The speed improvements achieved by TensorRT and Volta GPUs compared to CPUs are staggering, with gains of thousands of times in image recognition and hundreds of times in natural language translation.
👻 Volta GPUs drastically reduce the infrastructure required for image recognition, allowing for significant cost and energy savings.

Install to Summarize YouTube Videos and Get Transcripts

Explore YouTube Video Summarizer or Get YouTube Transcript Extractor

Questions & Answers

Q: How does TensorRT improve the performance of deep neural networks?

TensorRT takes a computational graph and compiles it into an optimized Volta CUDA program, resulting in significantly faster performance compared to traditional CPUs.

Q: What is the speed difference between running image recognition on CPUs versus V100 GPUs?

With TensorRT and V100 GPUs, image recognition speed is a remarkable 5,740 times faster than using CPUs, allowing for real-time processing of large volumes of images.

Q: How does TensorRT improve natural language translation speed?

TensorRT enables a 100-fold speed increase in natural language translation, allowing for faster translation from English to Chinese, German, French, and other languages.

Q: How does Volta GPUs revolutionize image recognition infrastructure?

Volta GPUs can achieve image recognition at a rate of 45,000 images per second, requiring only one server with 8 GPUs, which is a significant improvement in efficiency and cost compared to traditional CPU-based solutions.

Summary & Key Takeaways

TensorRT optimizes computational graphs and generates Volta CUDA programs, providing amazing performance gains for deep neural networks.
Using TensorRT with TensorFlow, image recognition speed on V100 GPUs is 5,740 times faster than CPUs, while natural language translation is 100 times faster.
Volta GPUs can achieve image recognition at a rate of 45,000 images per second, requiring only one server with 8 GPUs and consuming 3000 watts.

Read in Other Languages (beta)

English

Share This Summary 📚

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Explore More Summaries from NVIDIA 📚

NVIDIA GeForce RTX Launch Event at Gamescom 2018

NVIDIA

Synthetic Brain Project | King’s College London on NVIDIA Cambridge-1

NVIDIA

An Introduction to NVIDIA Iray for 3ds Max Rendering Plug-In

NVIDIA

NVIDIA GTC DC Keynote - Day Two

NVIDIA

Track and Understand VR User Behavior with EASE

NVIDIA

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Transcript

Key Insights

🉐 TensorRT, combined with Volta GPUs, offers exceptional performance gains for deep neural networks in both image recognition and natural language translation.

⌛ The speed improvements achieved by TensorRT and Volta GPUs compared to CPUs are staggering, with gains of thousands of times in image recognition and hundreds of times in natural language translation.

👻 Volta GPUs drastically reduce the infrastructure required for image recognition, allowing for significant cost and energy savings.

Questions & Answers

Q: How does TensorRT improve the performance of deep neural networks?

TensorRT takes a computational graph and compiles it into an optimized Volta CUDA program, resulting in significantly faster performance compared to traditional CPUs.

Q: What is the speed difference between running image recognition on CPUs versus V100 GPUs?

With TensorRT and V100 GPUs, image recognition speed is a remarkable 5,740 times faster than using CPUs, allowing for real-time processing of large volumes of images.

Q: How does TensorRT improve natural language translation speed?

TensorRT enables a 100-fold speed increase in natural language translation, allowing for faster translation from English to Chinese, German, French, and other languages.

Q: How does Volta GPUs revolutionize image recognition infrastructure?

Summary & Key Takeaways

TensorRT optimizes computational graphs and generates Volta CUDA programs, providing amazing performance gains for deep neural networks.

Using TensorRT with TensorFlow, image recognition speed on V100 GPUs is 5,740 times faster than CPUs, while natural language translation is 100 times faster.

Volta GPUs can achieve image recognition at a rate of 45,000 images per second, requiring only one server with 8 GPUs and consuming 3000 watts.