DataStax Enterprise 2.0 Documentation

DataStax Enterprise Release Notes

DataStax Enterprise 2.0

  • Apache Cassandra 1.0.8
  • Apache Hadoop 1.0.0
  • Apache Hive 0.8.1
  • Apache Pig 0.8.3
  • Apache Sqoop 1.4.1
  • Apache Solr 4.0
  • DataStax OpsCenter 2.0

What's New

With version 2.0 of DataStax Enterprise, DataStax has extended its big data platform to include:

  • Integration with Apache Solr™ - Enables DataStax Enterprise to use full-text search; hit highlighting; multiple search attributes; search rich documents, such as PDF and Microsoft Word; and use geo-spatial search. By including Solr 4.0, near real-time indexing can be performed to manage real-time, analytic, and enterprise search features within a single integrated platform.
  • Hadoop Update - DataStax Enterprise 2.0 includes Hadoop 1.0.0.
  • Improved Hive Support - DataStax Enterprise is now enabled for any JDBC compliant user interface to connect to and work with Hive from within the server. This version of Hive includes support for binary data and support for wide rows (up to 2 billion columns).
  • Sqoop - Apache Sqoop supports migration of RDBMS data into the DataStax Enterprise server. You can now easily import data from RDBMS’s such as Oracle, Microsoft SQL Server, MySQL, Sybase, DB2, and others.
  • Elastic Workload Re-provisioning - Provides the ability to re-provision existing nodes to assume a different workload, such as changing a real-time node to an analytic node, and the ability to change the overall usage and capacity of a cluster.
  • Log4j - Apache log4j is a Java-based logging framework that provides runtime application feedback. It provides the ability to control the granularity of log statements using an external configuration file. Additionally, with the Cassandra Appender you can store the log4j messages in a column family where they're available for in-depth analysis using the Hadoop and Solr capabilities provided by DataStax Enterprise.

For more detailed information, see About DataStax Enterprise and What’s New in DataStax Enterprise 2.0? white paper.

Other issues

Do not use SolrJ (Java Interface for Solr) to add cores in DataStax Enterprise.

DataStax Enterprise 1.0

DataStax Enterprise 1.0.x is the first release of the DataStax commercial database platform. It is built on Apache Cassandra and designed for managing both real-time and analytic data workloads. Real-time data is managed with Cassandra and analytic operations are carried out via Apache Hadoop. DataStax Enterprise server is able to support both real-time and analytic workloads in the same cluster of machines with smart workload isolation transparently. This ensures that neither workload competes with the other for data or computing resources.

For component-specific information, refer to the components release notes and documentation.

DataStax Enterprise 1.0.2

  • Apache Cassandra 1.0.7 (updated from 1.0.5)
  • Apache Hadoop 0.20.204.1
  • Apache Hive 0.7.1
  • Apache Pig 0.8.3
  • DataStax OpsCenter 1.4

Changes in 1.0.2

Pig driver now support integer data types. This is in addition to current support for ASCII, UTF8, and long types. Exceptions are no longer thrown for bytes, UUID, and counters, but the data isn't returned correctly.

DataStax Enterprise 1.0.1

  • Apache Cassandra 1.0.5
  • Apache Hadoop 0.20.204.1
  • Apache Hive 0.7.1
  • Apache Pig 0.8.3
  • DataStax OpsCenter 1.4

Resolved Issues in 1.0.1

Issue Description
363 Create the CassandraFS (cfs) keyspace with replication strategy options that respect the currently configured snitch.
368 Fix Debian packages so they do not start the DSE service by default.