Business Intelligence for the Real Time Enterprise

August 31, 2015 - Kohala Coast, Hawaii

Panel


Can big data platforms EVER deliver real-time analytics?



Abstract

Column stores and in-memory databases have finally realized truly real-time Business Intelligence (BI) queries over terabytes of data by exploiting the multiple cores and increasing size of main memories and caches available in modern processors. Now "big data" platforms promise analytics on petabytes of semi-structured as well as structured data through massive parallelism that can be added elastically. But these systems were originally intended for batch processing of large volumes of data. Can these systems ever catch up to databases and provide truly real-time querying? Is it just a matter of the more advanced maturity of database technology, or are there inherent limitations in the Big Data platforms? Is the database technology transferable to these new platforms? Come listen to the top leaders in both industries debate the future of real-time analytics on big data.


Panel Moderator

Guy Lohman (IBM Almaden)


Guy Lohman

Dr. Guy M. Lohman is a Distinguished Research Staff Member and Manager of Disruptive Information Management Architectures in the Advanced Information Management Department at IBM Research -- Almaden in San Jose, California, where he has worked for 33 years. He currently manages the Blink research project, which contributed BLU Acceleration to DB2 for Linux, UNIX, and Windows (LUW) 10.5, and the query engine of the IBM Smart Analytics Optimizer for DB2 for z/OS V1.1 and the Informix Warehouse Accelerator products (2008-2010). He was the architect of the Query Optimizer of DB2 LUW and was responsible for its development from 1992 to 1997 (versions 2 - 5), as well as the invention and prototyping of Visual Explain, efficient sampling, the Index Advisor, and optimization of XQuery queries in DB2. Dr. Lohman was elected to the IBM Academy of Technology in 2002, and named an IBM Master Inventor in 2011. He has been awarded 39 U.S. patents and is the author of over 77 technical papers in the refereed academic literature.


Panel Speakers

Andy Pavlo (CMU)


Andy Pavlo

Andy Pavlo is an Assistant Professor of Databaseology in the Computer Science Department at Carnegie Mellon University.

















Anastasia Ailamaki (EPFL)


Anastasia Ailamaki

Anastasia Ailamaki is a Professor of Computer and Communication Sciences at the Ecole Polytechnique Federale de Lausanne (EPFL) and the CEO and co-founder of RAW Labs, a startup company in Switzerland. Her research interests are in database systems and applications, and in particular (a) in strengthening the interaction between the database software and emerging hardware and I/O devices, and (b) in automating data management to support computationally- demanding and data-intensive scientific applications. She has received an ERC Consolidator Award (2013), a Finmeccanica endowed chair from the Computer Science Department at Carnegie Mellon (2007), a European Young Investigator Award from the European Science Foundation (2007), an Alfred P. Sloan Research Fellowship (2005), eight best-paper awards in database, storage, and computer architecture conferences (2001-2012), and an NSF CAREER award (2002). She holds a Ph.D. in Computer Science from the University of Wisconsin-Madison in 2000. She is a senior member of the IEEE and a member of the ACM, serves as the ACM SIGMOD vice chair, and has served as a CRA-W mentor.



Fatma Özcan (IBM Almaden)


Fatma Ozcan

Fatma Özcan is a Research Staff Member and a manager at IBM Almaden Research Center. Her current research focuses on platforms and infra-structure for large-scale data analysis. Dr Özcan got her PhD degree in computer science from University of Maryland, College Park. She has over 10 years of experience in semi-structured and structured data management, query processing and optimization, and has delivered core technologies into IBM DB2 and BigInsights products. She is the co-author of the book "Heterogeneous Agent Systems", and co-author of several conference papers and patents.







Marcel Kornacker (Cloudera)


Marcel Kornacker

Marcel Kornacker is the Chief Architect for database technology at Cloudera and creator of the Cloudera Impala project. Following his graduation in 2000 with a PhD in databases from UC Berkeley, he held engineering positions at several database-related start-up companies. Marcel joined Google in 2003 where he worked on several ads serving and storage infrastructure projects, then became tech lead for the distributed query engine component of Google's F1 project.









Michael Stonebraker (MIT)


Michael Stonebraker

Dr. Stonebraker has been a pioneer of data base research and technology for more than a quarter of a century. He was the main architect of the INGRES relational DBMS, and the object-relational DBMS, POSTGRES. These prototypes were developed at the University of California at Berkeley where Stonebraker was a Professor of Computer Science for twenty five years. More recently at M.I.T. he was a co-architect of the Aurora/Borealis stream processing engine, the C-Store column-oriented DBMS, the H- Store transaction processing engine, the SciDB array DBMS, and the Data Tamer data curation system. Presently he serves as Chief Technology Officer of Paradigm4 and Tamr, Inc.

Professor Stonebraker was awarded the ACM System Software Award in 1992 for his work on INGRES. Additionally, he was awarded the first annual SIGMOD Innovation award in 1994, and was elected to the National Academy of Engineering in 1997. He was awarded the IEEE John Von Neumann award in 2005, and is presently an Adjunct Professor of Computer Science at M.I.T, where he is co-director of the Intel Science and Technology Center focused on big data.



Michael J Franklin (UC Berkeley)


Michael J Franklin

Michael J Franklin is the Thomas M. Siebel Professor of Computer Science and Chair of the Computer Science Division in the EECS Department at the University of California, Berkeley. Prof. Franklin is also the Director of the Algorithms, Machines, and People Laboratory (AMPLab) at UC Berkeley, a leading academic Big Data analytics research center. AMPLab has produced industry-changing open source software including Apache Spark and BDAS, the Berkeley Data Analytics Stack. Prof. Franklin is a co-PI and Executive Committee member for the Berkeley Institute of Data Science, a campus-wide initiative to advance Data Science Environments. He was founder and CTO of Truviso, a data analytics company that was subsequently purchased by Cisco Systems. He currently serves on the Technical Advisory Boards of a number of data-driven technology companies, including Databricks, an AMPLab spinout. He is a Fellow of the ACM and a two-time winner of the ACM SIGMOD "Test of Time" award, and received the outstanding Advisor Award from the Computer Science Graduate Student Association at Berkeley. He received the Ph.D. in Computer Science from the University of Wisconsin in 1993, a Master of Software Engineering from the Wang Institute of Graduate Studies in 1986, and the B.S. in Computer and Information Science from the University of Massachusetts in 1983.