Head of "Big Data Management" (BigDaMa) Group, Technische Universität Berlin
Data scientists spend most of their time collecting, curating, and organizing data from heterogeneous and often dirty sources. In this process, datasets have to be cleaned from errors, equal entities from different data sources have to be matched, and data values have to be transformed into a common desired representation. In this talk, I will discuss the need for scalable techniques for data preparation and shed light on our vision for future data curation systems.
Ziawasch Abedjan is Juniorprofessor and the head of the "Big Data Management" (BigDaMa) Group at the TU Berlin in Germany and a Principal Investigator in the Berlin Big Data Center (BBDC). Prior to that, Ziawasch was a postdoctoral associate at MIT CSAIL where he worked on various data integration topics. He received his PhD from the Hasso Plattner Institute in Potsdam, Germany. His current research focuses on data integration, data mining, and data profiling.