The need for unification between low latency and analytical infrastructure is clear. Data movement (ETL) is a productivity killer. Until now there was no good answer; only compromises. Either companies were forced to suffer the complexity of moving large volumes of data between their systems (in the typical ETL pipeline) or they had to make do with the limited real-time functionality within the Hadoop framework (primarily HBase) that provides some table abstractions but is complex to operate and optimized for scanning workloads more appropriate to analytical jobs rather than the random access performance critical for real-time applications. Given this choice, most opted for two systems and ETL.