IBM Unveils New Data Prep Tool Designed to Help Speed DataOps

Daniel G. Hernandez

ARMONK, N.Y., June 28, 2019 /PRNewswire/ -- IBM (NYSE: IBM) today announced a new data preparation solution designed to help clients improve their DataOps processes to get their data ready for AI quickly and efficiently.

Data preparation is an integral step in building machine learning and predictive models, but it's also one of the most cumbersome and time-consuming, leading many data scientists to devote up to 80 percent of their time to it.[1] And while the quality of the data remains a critical factor in producing accurate models – and more accurate insights – the time-intensive process can stall AI projects.

To ease this process, IBM today introduced InfoSphere Advanced Data Preparation, a new solution designed to help clients transform raw datasets by formatting, structuring and enriching them for analytic processing and standard reporting. Jointly developed with data prep software provider Trifacta, the new InfoSphere solution is engineered to work in conjunction with clients' existing data environments, including data lakes.

Among its many features, the new InfoSphere solution includes an intuitive dashboard for visualizing the data prep process, including tracking data quality and lineage (where the data originated, and where it's been). Clients can then move the resulting cleaned datasets into the business analytics tool of their choice.

"The new InfoSphere solution adds to our growing stable of dataops services and capabilities that are designed to help organizations automate much of the cumbersome preparation work and get to the business of conducting data science and building AI models fast," said Daniel G. Hernandez [pictured], Vice President, IBM Data and AI.