DataStax Developer Blog

Support CQL3 tables in Hadoop, Pig and Hive

By Alex Liu -  July 22, 2013 | 26 Comments

The evolution of Cassandra's querying mechanisms


Thrift API

The first generation is the Thrift API: batch_mutate, get, get_slice, insert, multiget_slice, remove, and similar methods. It is a low-level querying mechanism that operates directly on Cassandra's underlying storage structures, such as ColumnOrSuperColumn, key, SlicePredicate, and ColumnParent, which makes it inefficient to develop applications on top of, even though a few Cassandra client libraries simplify the experience of querying Cassandra storage.

CQL

The first step toward addressing the awkwardness of the low-level Thrift querying API was CQL, introduced in the Cassandra 0.8 release. It provides a modern query language with a schema and is fully backward compatible with Thrift column families. Because it uses a higher-level, database-like query language to query Cassandra storage, traditional database developers find it much easier to adopt Cassandra.

CQL3

The third generation is CQL3, which "transposes" wide rows and unpacks them into named columns. Under the hood, CQL3 uses composite columns and composite keys. CQL3 also adds other features, such as collections and the binary CQL protocol.

The first generation of Cassandra Hadoop driver

The first generation is based on the Thrift column families of the first-generation Cassandra querying mechanism. Because CQL is backward compatible with Thrift column families, the Thrift-based Hadoop support works fairly well with CQL. But it doesn't provide any higher-level abstraction for composite keys and composite columns: decomposing them is done in client-side code such as UDFs. To decompose composite keys and composite columns, the developer must use Cassandra's internal data-type Java classes, which is hard for many application developers. The second generation of the Cassandra Hadoop driver addresses this issue by using CQL3 as a high-level abstraction layer over Cassandra storage.

The second generation of Cassandra Hadoop driver

The second generation is based on CQL3. Decomposing composite keys and composite columns is done by CQL3 under the hood. It also uses the CQL3 paging mechanism to page through wide rows in a more natural way.

It introduces a few new classes: CqlPagingInputFormat, CqlPagingRecordReader, CqlOutputFormat and CqlRecordWriter.

Input format

The input format is Map<String, ByteBuffer>, Map<String, ByteBuffer>. The first map is the name-to-value mapping of the partition key columns and clustering columns. The second map is the name-to-value mapping of the remaining columns.
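As an illustration, here is a minimal, self-contained Java sketch of what a map task might do with those two maps. The class name, column names, and decoding helper are hypothetical; real code would decode each ByteBuffer according to the column's actual CQL type (a text column is UTF-8, as assumed here).

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class CqlValueDecoding {
    // Decode a CQL text column from its ByteBuffer representation (UTF-8).
    static String asText(ByteBuffer buf) {
        // duplicate() so we don't disturb the buffer's position for other readers
        return StandardCharsets.UTF_8.decode(buf.duplicate()).toString();
    }

    public static void main(String[] args) {
        // Simulated "keys" map: partition key column m plus clustering column n.
        Map<String, ByteBuffer> keys = new LinkedHashMap<>();
        keys.put("m", ByteBuffer.wrap("user-1".getBytes(StandardCharsets.UTF_8)));
        keys.put("n", ByteBuffer.wrap("2013-07-22".getBytes(StandardCharsets.UTF_8)));

        // Simulated second map: the remaining columns of the row.
        Map<String, ByteBuffer> columns = new LinkedHashMap<>();
        columns.put("event", ByteBuffer.wrap("login".getBytes(StandardCharsets.UTF_8)));

        System.out.println(asText(keys.get("m")) + " / " + asText(columns.get("event")));
    }
}
```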

CQL3 pagination

Up to the 1.2.6 release, CQL3 doesn't provide automatic pagination, so I use the following algorithm to page through CQL3 tables. The basic idea is to query on the partition columns and clustering columns, store their values when a page ends, and then use those values to compose the CQL3 query for the next page.

Say we have a table with the following primary key:

PRIMARY KEY (m, n, o, p)
where m is the partition column and
n, o, p are the clustering columns.

We want to run the following query
SELECT * FROM test

The first query is

SELECT * FROM test
WHERE token(m) > start_token_of_split
AND token(m) < end_token_of_split
LIMIT 1000

We store the last values of m, n, o, and p as m_end, n_end, o_end, and p_end after the first query completes.

The next query is

SELECT * FROM test
WHERE token(m) = token(m_end)
AND n = n_end
AND o = o_end
AND p > p_end
LIMIT 1000

If it reaches the end of p, the next query is

SELECT * FROM test
WHERE token(m) = token(m_end)
AND n = n_end
AND o > o_end
LIMIT 1000

otherwise, we use the latest value of p as the new bound p_end1 for the next query

SELECT * FROM test
WHERE token(m) = token(m_end)
AND n = n_end
AND o = o_end
AND p > p_end1
LIMIT 1000

Once it reaches the end of n (that is, the end of the partition), the next query moves on to the next partition:

SELECT * FROM test
WHERE token(m) > token(m_end)
AND token(m) < end_token_of_split
LIMIT 1000

then we continue the same loop until it reaches the end of the split.
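The query composition above can be sketched in Java as plain string building. The class and method names below are hypothetical, not the driver's actual API, and a real implementation would bind values through prepared statements rather than concatenate them into query text.

```java
public class PagingQueryBuilder {
    // First query of a split: scan the split's token range.
    static String firstQuery(String startToken, String endToken, int limit) {
        return "SELECT * FROM test WHERE token(m) > " + startToken
             + " AND token(m) < " + endToken + " LIMIT " + limit;
    }

    // Next page inside the same partition: pin m, n, o and advance on p.
    static String nextPageInPartition(String mEnd, String nEnd, String oEnd,
                                      String pEnd, int limit) {
        return "SELECT * FROM test WHERE token(m) = token(" + mEnd + ")"
             + " AND n = " + nEnd + " AND o = " + oEnd
             + " AND p > " + pEnd + " LIMIT " + limit;
    }

    // End of the partition: move on to the next token in the split.
    static String nextPartition(String mEnd, String endToken, int limit) {
        return "SELECT * FROM test WHERE token(m) > token(" + mEnd + ")"
             + " AND token(m) < " + endToken + " LIMIT " + limit;
    }

    public static void main(String[] args) {
        System.out.println(firstQuery("start_token_of_split", "end_token_of_split", 1000));
        System.out.println(nextPartition("m_end", "end_token_of_split", 1000));
    }
}
```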

For tables with more than one column in the partition key

PRIMARY KEY ((m, n), o, p)
where m and n are the partition columns and
o and p are the clustering columns.

We use the following query

SELECT * FROM test
WHERE token(m, n) > token(m_end, n_end)
AND token(m, n) < end_token_of_split
LIMIT 1000

If the clustering columns are defined as descending, the queries above should use the less-than comparison operator instead.

CQL3 auto-paging through the native protocol has since been developed, so this auto-paging mechanism will be plugged into CqlPagingRecordReader soon.

Input parameters


Some input parameters can be configured through CqlConfigHelper:

CqlConfigHelper.setInputColumns -- select specific columns
CqlConfigHelper.setInputCQLPageRowSize -- the number of rows per page
CqlConfigHelper.setInputWhereClauses -- the where clause on indexed columns

Output format

The output format is Map<String, ByteBuffer>, List<ByteBuffer>. The map is the name-to-value mapping of the partition key columns and clustering columns. The list holds the values of the other columns. CqlRecordWriter takes the column values and binds them to the prepared CQL query.
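A small, self-contained Java sketch of shaping one output record may help here. The class name, column names, and the example UPDATE statement are made up for illustration: the map carries the key columns by name, while the list carries ByteBuffers positionally, lining up with the "?" markers of the configured prepared statement.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class CqlOutputShaping {
    // Serialize a text value the way a CQL text column expects (UTF-8 bytes).
    static ByteBuffer text(String s) {
        return ByteBuffer.wrap(s.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        // Key columns (partition + clustering), addressed by name.
        Map<String, ByteBuffer> keys = new LinkedHashMap<>();
        keys.put("m", text("user-1"));
        keys.put("n", text("2013-07-22"));

        // Values bound positionally, e.g. for a hypothetical
        // "UPDATE test SET o = ?, p = ?" output query.
        List<ByteBuffer> bound = Arrays.asList(text("o-value"), text("p-value"));

        System.out.println(keys.size() + " key columns, " + bound.size() + " bound values");
    }
}
```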

Output parameters


The output CQL query can be configured through CqlConfigHelper:

CqlConfigHelper.setOutputCql

The first generation of Pig Cassandra driver

The first-generation Pig Cassandra driver is based on the Thrift column family based first generation of the Hadoop Cassandra driver and uses the CassandraStorage class.

CQL3 table support

The first generation uses the Thrift describe_keyspace call to get the metadata of the column families. Because CQL3 tables do not appear in the results of the describe_keyspace Thrift API call, the first-generation Pig Cassandra driver can't query CQL3 tables. To fix that, we query the system tables system.schema_columnfamilies and system.schema_columns to retrieve the metadata of CQL3 tables. But because CQL3 uses composite columns and composite keys under the hood, the first-generation Pig Cassandra driver is still not efficient for CQL3 tables.

The second generation of Pig Cassandra driver

The second generation is based on the CQL3-based second generation of the Hadoop Cassandra driver and uses the CqlStorage class.

CQL3 table support

Because the second generation of the Hadoop Cassandra driver uses CQL3 under the hood, we can easily decompose composite keys and composite columns using CQL3 queries.

The URL format for CqlStorage is

cql://[username:password@]<keyspace>/<columnfamily>[?[page_size=<size>]
[&columns=<col1,col2>][&output_query=<prepared_statement_query>][&where_clause=<clause>]
[&split_size=<size>][&partitioner=<partitioner>][&use_secondary=true|false]]

where

page_size -- the number of rows per page
columns -- the columns to select in the CQL query
where_clause -- the where clause on indexed columns; it needs URL encoding
split_size -- the number of rows per split
partitioner -- the Cassandra partitioner
use_secondary -- enable Pig partition filter push-down
output_query -- the CQL query for writes, in prepared-statement format
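Since where_clause (and, depending on the release, output_query) must be URL-encoded before being embedded in the CqlStorage URL, a short Java sketch of that encoding step may be useful. The keyspace and table names below are made up; only the use of the standard URLEncoder is the point.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;

public class CqlStorageUrl {
    // URL-encode a query fragment so it can safely ride inside the storage URL.
    static String encode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new RuntimeException(e); // UTF-8 is always available
        }
    }

    public static void main(String[] args) {
        // Hypothetical keyspace "test1" and table "test".
        String outputQuery = encode("UPDATE test SET col4 = ?, col5 = ?");
        String url = "cql://test1/test?output_query=" + outputQuery;
        System.out.println(url);
    }
}
```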

Schema


The input schema is

(value, value, value), where each value's schema has the name of the column and the value of the column.

The output schema is

(((name, value), (name, value)), (value ... value), (value ... value))
where the first tuple is the map of partition columns and clustering columns,
and the remaining tuples are the lists of bound values for the output prepared CQL query.

Pig partition filter push down

Set use_secondary to true to enable it.
We generate the where_clause from Pig partition filter queries and pass it to CqlConfigHelper.setInputWhereClauses. The partition filter queries are pushed down to CqlPagingRecordReader, which sends less data back to Pig.

The first generation of Hive Cassandra driver

The first generation is based on the first generation of the Hadoop Cassandra driver, which uses Thrift column families. We need to use the second generation of the Hadoop Cassandra driver to improve querying on the composite columns that CQL3 tables use under the hood.

The second generation of Hive Cassandra driver

The second generation uses the second generation of the Hadoop Cassandra driver to query CQL3 tables. Basically, it sets the input and output CQL queries and maps the input and output values to Hive data types.

All metadata is retrieved from the system tables system.schema_columnfamilies and system.schema_columns.

All CQL3 tables have auto-generated Hive tables created through CqlStorageHandler, which has the following parameters:

cassandra.page.size -- the number of rows per page for CqlPagingRecordReader
cassandra.cql3.type -- the CQL data types for the mapped Hive columns
cassandra.columns.mapping -- the CQL columns for the mapped Hive columns

Push-down conditions will be implemented in a similar way to the Pig partition filter push-down. We will also expand the default mappings to include collections.



Comments

  1. Alex Holmansky says:

    Great writeup! Could you add some information on how CQL3 collections are handled by the new drivers – especially in Pig and Hive?

  2. Alex Liu says:

    Collections are decomposed by using the validator in Pig the same way as CassandraStorage. Collection in Hive is by default mapped to binary, so it has the same issue as CassandraStorageHandler which needs client side UDF to decompose it. I am working to improve the collection support in Hive, basic idea is to map C* collection directly to Hive collection data type. I will update the blog once it’s done.

  3. Sam Johnson says:

    Is it possible to use cql3 map reduce to only get a slice of timeline data as input. Basically I have a CF like this
    CREATE TABLE testcf {
    uid varchar, time timestamp, event varchar,
    PRIMARY KEY (uid, timestamp);

    Now I want the input to the map reduce to use only timelines between 2 given dates. But when I try to set the inputWhereClause it doesn’t work

    CqlConfigHelper.setInputWhereClauses(job.getConfiguration(), "time>=1374785223228 AND time<=1375118518985");

    I get the following error – Caused by: InvalidRequestException(why:Invalid restrictions found on event_timestamp)

    Any suggestions on how to do this?

    1. Alex Liu says:

      The where clause only applies to columns with a secondary index; it doesn't apply to primary key columns. That's the reason why it throws InvalidRequestException.

      It needs to read through all columns of the row.

      1. Sam Johnson says:

        Is this a limitation in the cql3 mapreduce framework. You can do this with thrift map reduce using slice predicate to send only a specific timeline set to the mapper. Is there any support to do this in CQL3 or plans to do it in the future?

        1. Alex Liu says:

          The CqlPagingRecordReader depends on CQL query functions, so it's limited to what CQL offers. I will open a ticket for it; hopefully it can be implemented in the future.

  4. James Schappet says:

    Alex, thanks for all the hard work, seems like this is starting to come together.

    I am trying to write data to Cassandra CQL 3 Table using:

    STORE G INTO ‘cql://keyapse/col_family?output_query=not sure what goes here’ USING CqlStorage();

    Can you give me an example of what the output_query would look like:

    ie. &output_query=UPDATE col_family SET col1=$0, col2=$3 WHERE KEY=$2

    1. Alex Liu says:

      output_query=UPDATE col_family SET col1=?, col2=?

      you may need encode it.

      1. James Schappet says:

        I got a good answer to this question on StackOverFlow.

        1. Alex Liu says:

          It will be using url encoding for next release. The post on StackOverFlow is working for now.

  5. Alex Holmansky says:

    So I tried the following experiment:

    CREATE KEYSPACE test1 WITH replication = {
    'class': 'SimpleStrategy',
    'replication_factor': '1'
    };

    USE test1;

    create table test (
    col1 text,
    col2 text,
    col3 text,
    col4 text,
    col5 text,
    primary key(col1,col2,col3)
    );

    Then I tried to load some data from a CSV file and store it into this table using Pig like this:

    recs = load 'cfs://test/test_data.csv' using PigStorage(',') as (col1:chararray, col2:chararray, col3:chararray, col4:chararray, col5:chararray);

    insert_recs = foreach recs generate TOTUPLE(TOTUPLE( TOTUPLE('col1',col1), TOTUPLE('col2',col2), TOTUPLE('col3', col3)), TOTUPLE(col4, col5));

    store insert_recs into 'cql://test1/test' using CqlStorage();

    This doesn’t work for me. What am I missing?
    Thanks!

    1. Alex Liu says:

      The correct script should be

      store insert_recs into ‘cql://test1/test?output_query=update test set col4 @ #, col5 @ #’ using CqlStorage();

      or url encode “update test set col4 = ? , col5 = ?” as output_query depends the release.

  6. James Schappet says:

    Is it or would it be possible to have support for the ‘now()’ function?

    This way we could get a TimeUUID generated field without passing it in.

    1. Alex Liu says:

      yes, We will create some helper UDF for it.

  7. Cyril Scetbon says:

    As it’s not included in Cassandra trunk, what is the link pointing to the 2nd generation of the Hive Cassandra driver ?

    1. Alex Liu says:

      It’s in DSE 3.1 release which is a commercial platform integrating Cassandra with Hive, Solr and other open source projects. It’s free to use for development.

  8. karthigaimuthu says:

    Does pig require CFS to store and process the intermediate data to and from cassandra.With core distribution of apache pig I can’t perform any analytics on top of the hadoop using HDFS with Pig.

    1. Alex Liu says:

      The Pig driver that comes with Cassandra doesn't need CFS. It's a general driver which should work with HDFS.

  9. James Schappet says:

    Alex, With the 1.2.8 token based paging, does pig go through all the tokens for a given cluster or just the tokens for the given column family?

    I have created a small version of my table in the same cluster (<1000 rows) to do some testing with PIG, but it's still taking the same time to load data, regardless of using the big table 5m rows or the small table <1000 rows.

    There is also another table in the Cluster with over 100m rows.

    1. Alex Liu says:

      It goes through all the tokens for a split (range of tokens) of a column family, the default split size is 64k.

      The default page size is 1000 rows, so you need to have more rows to test the performance

      1. Christopher Smith says:

        I’m finding cases where with very large tables it does not appear to be splitting the loads. I’m not specifying any of the tuning parameters in my loads, but I’m wondering… how does the split know what range of tokens will work out to 64K rows, or is supposed to be splitting on a range of 64K tokens?

  10. Student says:

    Hey,

    I want to store a CQL3-Map from Pig. What is the data format I have to choose in Pig? The CQL3-Table is:

    CREATE TABLE test
    (firstname VARCHAR,
    surname VARCHAR,
    averagekm DOUBLE,
    firstyear INT,
    lastyear INT,
    km MAP,
    PRIMARY KEY (firstname, surname)
    );

  11. aaron says:

    Hi I am wondering if cql3 has limit feature.

    For example: lets say I have a table with 1000 rows. So, when I run my mapreduce project, I want the mapper to only fetch “x’ number of rows, lets say I want mapper to read only 100 rows from the table.

    I was thinking “CqlConfigHelper.setInputCQLPageRowSize” will do the trick but pagerowsize reads all the rows from the table but reads 100 rows at a time.

    Any help would be great on how I can achieve this.

    Thank you

  12. Diallous says:

    Hi,
    I would like to know how to use slice_start and slice_end in load comand from pig Load ‘…’ Using cassandraStotage().
    for a simple user table like this :
    CREATE TABLE user (
    id int,
    email text,
    name text,
    PRIMARY KEY (id)
    )
    which have values :
    (1, ‘first’, ‘first@mail.com’)
    (2, ‘second’, ‘second@mail.com’)
    (3, ‘third’, ‘third@mail.com’)
    (4, ‘fourth’, ‘fourth@mail.com’)
    (5, ‘fifth’, ‘fifth@mail.com”)

    If I want to query data (names & emails) of user table using pig load function and where_clause (id in 2, 3 ,4)

    grunt> myrows = LOAD ‘cassandra://myKeyspace/user?slice_start=?&slice_end=?&limit=?&reversed=true’ USING CassandraStorage();
    [&output_query=][&where_clause=]

    Somebody can help me on the exact syntaxe to create this query in pig

    grunt > myRows = grunt> myrows = LOAD ‘cassandra://myKeyspace/user?…………………………………………’ USING CassandraStorage();

    Thanks!
