No
Yes
View More
View Less
Working...
Close
OK
Cancel
Confirm
System Message
Delete
Schedule
An unknown error has occurred and your request could not be completed. Please contact support.
Scheduled
Wait Listed
Personal Calendar
Speaking
Conference Event
Meeting
Interest
Schedule TBD
Conflict Found
This session is already scheduled at another time. Would you like to...
Loading...
Please enter a maximum of {0} characters.
Please enter a maximum of {0} words.
must be 50 characters or less.
must be 40 characters or less.
Session Summary
We were unable to load the map image.
This has not yet been assigned to a map.
Search Catalog
Reply
Replies ()
Search
New Post
Microblog
Microblog Thread
Post Reply
Post
Your session timed out.
This web page is not optimized for viewing on a mobile device. Visit this site in a desktop browser to access the full set of features.
2017 GTC San Jose

S7458 - Deploying Unique DL Networks as Micro-Services with TensorRT, user Extensible Layers, and GPU Rest Engine

Session Speakers
Session Description

Once you have trained your neural network to do some unique and interesting task, you might wonder how to make it available to colleagues, collaborators, or perhaps the world. One of the best ways to do that is to create a REST-based microservice. Then anyone with the URL can make a request and get an answer from your neural network. We'll show how three technologies come together to make that possible: 1. TensorRT provides low-latency, high-throughput inference; 2. Custom layer support in TensorRT allows you to express your unique deep learning secret sauce within TensorRT; 3. GPU Rest Engine gives you a fast and easy way to create a GPU-powered microservice. We'll show the steps necessary for you to start creating your own deep learning-powered microservices.


Additional Session Information
Intermediate
Talk
Deep Learning and AI Tools and Libraries
Automotive Cloud Services
25 minutes
Session Schedule