An Open Road to Swift DataFrame Scaling

Published: 09/15/2020  

Last Updated: 09/15/2020

[embed]https://w.soundcloud.com/player/?url=https%3A%2F%2Fapi.soundcloud.com%2Ftracks%2F893564005&auto_play=false&show_artwork=false&visual=false[/embed]



Alex Baden
Technical director, OmniSci*

Devin Petersohn
Machine learning engineer, Intel

Data scientists spend 60 percent of their time cleaning and preprocessing data, turning dirty data into clear insights. DataFrame libraries such as pandas provide exceptional tooling for data wrangling, yet pandas becomes slower and harder to use as data volumes grow. Alex Baden and Devin Petersohn explore the challenges and considerations of DataFrame scaling. They discuss how the Intel® Distribution of Modin* and OmniSci solution, part of the Intel® oneAPI AI Analytics Toolkit, offers an open road to quick, transparent scaling across heterogeneous architectures. They also explain how the solution’s integration with the rest of the Python* ecosystem lets data scientists focus on extracting value from data rather than on provisioning and orchestrating resources.


Product and Performance Information

1

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.