Clean Data for AI & ML – Insights and Innovation Forum

Commit to clean data for more accurate predictions. No excuses!

Impactful use cases, demos, clean data with data preparation, automated machine learning, and analytics modernization roadmap

Join Capgemini, Waterline Data, Trifacta, and DataRobot on December 6th, 2018

“80% of data science is cleaning the data and 20% is complaining about cleaning the data”

— Kaggle founder and CEO Anthony Goldbloom

We all know it, but what do we do about it? Clean data is the foundation of any robust and accurate predictive analytics initiative. But data never arrives in its desired format for analysis; instead, it is often inconsistent, incomplete, and needs to be transformed and combined to meet specific standards. In addition, building predictive models is often a time-consuming and complex process. Committing to clean data for more accurate predictions means committing to data preparation and automated machine learning for business users. Waterline Data, Trifacta, and DataRobot can help.

Where: Capgemini, Accelerated Solutions Environment (79 5th Avenue, 3rd FL, New York, NY 10003)

Who: Capgemini, Waterline Data, Trifacta, and DataRobot Executives

Date: December 6th, 2018

Time: 3:00 p.m. ET – 6:30 p.m. ET

Food: Yes! Drinks and hors d’oeuvres

Who should attend?

  1. Digital Customer Experience Professionals
  2. Big Data and Analytics/BI Professionals
  3. Data Architects/Data Scientists/Machine Learning Experts
  4. Innovation Teams within Businesses
  5. Business or Technology Data-Driven Professionals

Meet the Experts:

Arindam Choudhury
Vice President – Global Big Data Practice Leader, Financial Services

Arindam is a Business Leader with extensive experience in the financial services industry. His current charter is to lead Capgemini’s Big Data Practice for Financial Services.
As a hands-on leader, he excels at conceptualizing, creating and delivering transformation solutions for his clients that leverage advances in Big Data & Analytics. He believes that Data and Analytics drives Digital Transformation and evangelize this belief to his clients, both internal and external.

Alex Gorelik
Founder, CTO
Waterline Data

Alex Gorelik is the founder and CTO of Waterline Data, his 3rd start-up. He is also a thought leader and a published author. Prior to Waterline Data, Alex served as General Manager of Informatica’s Data Quality Business Unit, driving Marketing, Product Management and R&D for an $80M business. Alex joined Informatica from IBM, where he was an IBM Distinguished Engineer for the Information Integration team. IBM acquired Alex’s second startup, Exeros, where he was founder, CTO and VP of Engineering.
Alex holds a B.S. in Computer Science from Columbia University School of Engineering and a M.S. in Computer Science from Stanford University.

Sean Kandel
Chief Technical Officer and Co-founder

Sean is Trifacta’s Chief Technical Officer and Co-founder. He completed his Ph.D. at Stanford University, where his research focused on user interfaces for database systems. At Stanford, Sean led development of new tools for data transformation and discovery, such as Data Wrangler. He previously worked as a data analyst at Citadel Investment Group.

H.P. Bunaes
Director, Banking Practice

H.P. Bunaes leads the banking practice at DataRobot, helping banks leverage AI and machine learning for predictive analytics and data mining. H.P. has 35 years of experience in banking, with broad banking domain knowledge and deep expertise in data and analytics. Prior to joining DataRobot, H.P. held a variety of leadership positions at SunTrust Banks, initially leading the design and development of the risk data and analytics platform used enterprise wide for risk management.


Overall agenda:



3:00 PM – 3:10 PM Welcome & Introduction
3:10 PM – 4:30 PM Why clean data is important to avoid “garbage in, garbage out”, and how to get to clean data fast and easy?

  • 3:10 PM – 3:30 PM: Capgemini’s Perspectives
  • 3:30 PM – 3:50 PM: Waterline’s Perspectives
  • 3:50 PM – 4:10 PM: Trifacta’s Perspectives
  • 4:10 PM – 4:30 PM: DataRobot’s Perspectives
4:30 PM – 5:30 PM Demo Breakouts (Participants can watch ongoing demos in separate rooms)

  • Room 1: Capgemini Demo
  • Room 2: Trifacta Demo
  • Room 3: DataRobot Demo
  • Room 4: Waterline Demo
5:30 PM – 6:00 PM Panel Discussion – Best Practices and the Future of Artificial Intelligence
6:00 PM – 6:30 PM Networking + Drinks

Reserve Your Spot – Register Now!

CapgeminiWater LineTrifactaDataRobot

Event details

December 6, 2018 3:00 pm
December 6, 2018 6:30 pm
Capgemini, Accelerated Solutions Environment (79 5th Avenue, 3rd FL, New York, NY 10003)