No
Yes
View More
View Less
Working...
Close
OK
Cancel
Confirm
System Message
Delete
Schedule
An unknown error has occurred and your request could not be completed. Please contact support.
Scheduled
Wait Listed
Personal Calendar
Speaking
Conference Event
Meeting
Interest
Schedule TBD
Conflict Found
This session is already scheduled at another time. Would you like to...
Loading...
Please enter a maximum of {0} characters.
Please enter a maximum of {0} words.
must be 50 characters or less.
must be 40 characters or less.
Session Summary
We were unable to load the map image.
This has not yet been assigned to a map.
Search Catalog
Reply
Replies ()
Search
New Post
Microblog
Microblog Thread
Post Reply
Post
Your session timed out.
This web page is not optimized for viewing on a mobile device. Visit this site in a desktop browser to access the full set of features.
2017 GTC San Jose

S7362 - Benchmarking the New Unified Memory of CUDA 8

Session Speakers
Session Description

We'll evaluate CUDA 8's new unified memory's impact to applications with benchmarks and share practices on how to tune or build high-performance apps. Since CUDA 6, unified memory has aimed at simplifying the programmability of heterogeneous memory management while maintaining good performance. However, practical limitations prevent applications from fully taking advantage of it. The CUDA 8 release highlights an updated unified memory that both simplifies programmability and improves performance, especially when married with the new Pascal GPU architecture. We'll evaluate the new system, benchmark its performance, and share our best practices in tuning code, which could be good reference for app developers. In addition, we'll explore options and solutions on moving/exchanging data efficiently between heterogeneous devices, such as NVMe/NVRAM in modern data center or cloud environments.


Additional Session Information
Intermediate
Talk
Data Center and Cloud Computing
Hardware / Semiconductor Software
25 minutes
Session Schedule