As the generation of data becomes more prolific, the amount of time and computational resources necessary to perform analyses on these data increases. What is less well studied, however, is the data pre-processing steps that must be applied before any meaningful analysis can begin. This problem of taking data in some initial form and transforming it into a desired one is known as data integration, and it can be a substantial part of the overall computational load associated with data analytics. Join us for the November WashU Data Users Group meeting as Roger Chamberlain, professor of computer science & engineering, will discuss the execution of data integration via a range of compute engines and the various applications of data integration.
Free and open to all, pre-registration required.