No
Yes
View More
View Less
Working...
Close
OK
Cancel
Confirm
System Message
Delete
Schedule
An unknown error has occurred and your request could not be completed. Please contact support.
Scheduled
Wait Listed
Personal Calendar
Speaking
Conference Event
Meeting
Interest
Schedule TBD
Conflict Found
This session is already scheduled at another time. Would you like to...
Loading...
Please enter a maximum of {0} characters.
Please enter a maximum of {0} words.
must be 50 characters or less.
must be 40 characters or less.
Session Summary
We were unable to load the map image.
This has not yet been assigned to a map.
Search Catalog
Reply
Replies ()
Search
New Post
Microblog
Microblog Thread
Post Reply
Post
Your session timed out.
This web page is not optimized for viewing on a mobile device. Visit this site in a desktop browser to access the full set of features.
2017 GTC San Jose

S7375 - Petabyte Data Pipelines: Massively Distributed SQL Data Warehouse on GPUs

Session Speakers
Session Description

With exponential data growth and the end of Moore's law, enabling data warehouses to scale is a huge challenge. Storing a petabyte in a data warehouse is incredibly costly, and often times non-performant. BlazingDB opens up a whole new level of speed with GPU power, while using data lake technologies to store massive data sets. We'll demonstrate how BlazingDB leverages GPUs for writing and reading, where compression and data skipping are key, and then for SQL analytics, where sorting, aggregations, and joining see huge performance bumps. This demo will be performed on a Microsoft Azure N Series GPU cluster for processing and Azure File Store for cold storage, showing a fully functional BlazingDB cloud deployment processing a massive data set.


Additional Session Information
All
Talk
AI Startup Accelerated Analytics Data Center and Cloud Computing
Higher Education / Research Software
25 minutes
Session Schedule