Kris Hahn

<p>DataStax has released Brisk 1.0 Beta 2! You can download Brisk from the&nbsp;DataStax web site.</p>

<h2>New Features in Brisk 1.0 Beta 2</h2>

<p>The following new features have been added in this release:</p>

<table border="1" cellpadding="5" cellspacing="0">
	<thead>
		<tr>
			<td valign="top" width="73">
			<h3>Feature</h3>
			</td>
			<td valign="top" width="370">
			<h3>Description</h3>
			</td>
		</tr>
	</thead>
	<tbody>
		<tr>
			<td valign="top" width="73">
			<p>BRISK-12</p>
			</td>
			<td valign="top" width="370">
			<p>Apache Pig Integration. See the&nbsp;DataStax Documentation&nbsp;for more information about using Pig in Brisk.</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="73">
			<p>BRISK-89</p>
			</td>
			<td valign="top" width="370">
			<p>Job Tracker Failover. See the&nbsp;DataStax Documentation&nbsp;for more information about using the new brisktool movejt command.</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="73">
			<p>BRISK-207</p>
			</td>
			<td valign="top" width="370">
			<p>New Snappy Compression Codec built on&nbsp;Google <a href="http://code.google.com/p/snappy">Snappy</a>&nbsp;is now used internally for automatic CassandraFS block compression.</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="73">
			<p><a href="https://datastax.jira.com/browse/BRISK-180">BRISK-180</a></p>
			</td>
			<td valign="top" width="370">
			<p>Automap Cassandra Column Families to Hive Tables in the Brisk Hive Metastore.</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="73">
			<p><a href="https://datastax.jira.com/browse/BRISK-152">BRISK-152</a></p>
			</td>
			<td valign="top" width="370">
			<p>Add a second HDFS layer in CassandraFS for long-term data storage. This is needed because the blocks column family in CFS requires frequent compactions - Hadoop uses it during MapReduce processing to store small files and temporary data. Compaction cleans this temporary data up after it is not needed anymore. Now there is the cfs:/// and cfs-archive:/// endpoints within CFS. The blocks column family in cfs-archive:/// has compaction disabled to improve performance for static data stored in CFS.</p>
			</td>
		</tr>
	</tbody>
</table>

<h2>Major Fixes in Brisk 1.0 Beta 2</h2>

<p>Brisk 1.0 Beta 2 also incudes the following major fixes. For details on all fixes in Beta 2, see the&nbsp;Brisk Jira Project Web site:</p>

<table border="1" cellpadding="5" cellspacing="0">
	<tbody>
		<tr>
			<td valign="top" width="109">
			<h3>Issue</h3>
			</td>
			<td valign="top" width="334">
			<h3>Description</h3>
			</td>
		</tr>
		<tr>
			<td valign="top" width="109">
			<p>BRISK-126</p>
			</td>
			<td valign="top" width="334">
			<p>Remove multiple slf4j warnings</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="109">
			<p>BRISK-203</p>
			</td>
			<td valign="top" width="334">
			<p>Use batchMutate instead of insert in HiveCassandraOutputFormat</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="109">
			<p>BRISK-219</p>
			</td>
			<td valign="top" width="334">
			<p>Cassandra super columns not mapping in Hive</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="109">
			<p>BRISK-220</p>
			</td>
			<td valign="top" width="334">
			<p>Improve performance of hadoop fs -ls</p>
			</td>
		</tr>
		<tr>
			<td valign="top" width="109">
			<p>CASSANDRA-2683</p>
			</td>
			<td valign="top" width="334">
			<p>Compaction issue causing secondary index corruption.</p>
			</td>
		</tr>
	</tbody>
</table>

<h2>Open Issues</h2>

<p>For a description of the open issues in Brisk, see the&nbsp;Brisk Jira Project Web site.</p>

<h2>About Brisk</h2>

<p>Brisk is an open-source Hadoop and Hive distribution developed by DataStax that utilizes Apache Cassandra for its core services and storage. Brisk provides Hadoop MapReduce capabilities using CassandraFS, an HDFS-compatible storage layer inside Cassandra. By replacing HDFS with CassandraFS, users are able to leverage their current MapReduce jobs on Cassandra’s peer-to-peer, fault-tolerant, and scalable architecture. Brisk is also able to support dual workloads, allowing you to use the same cluster of machines for both real-time applications and data analytics without having to move the data around between systems.</p>

<p>Brisk is available via Apache license v2.0, and contains the following components:</p>

<ul>
	<li>Apache Hadoop 0.20.203.0 + (<a href="https://issues.apache.org/jira/browse/HADOOP-7172">HADOOP-7172</a>,&nbsp;<a href="https://issues.apache.org/jira/browse/HADOOP-5759">HADOOP-5759</a>,&nbsp;<a href="https://issues.apache.org/jira/browse/HADOOP-7255">HADOOP-7255</a>)</li>
	<li>Cassandra 0.8.1</li>
	<li>Apache Hive 0.7</li>
	<li>Apache Pig 0.8.3</li>
</ul>


Brisk 1.0 Beta 2 Released

Kris Hahn

Share

Share

New Features in Brisk 1.0 Beta 2

Feature

Description

Major Fixes in Brisk 1.0 Beta 2

Issue

Description

Open Issues

About Brisk

More Technology

Knowledge Graphs for RAG without a GraphDB

How Winweb Built its AI Assistant with DataStax Astra DB and LangChain

Vercel + Astra DB: Get Data into Your GenAI Apps Fast

Simplifying Agent Development with Astra DB Connector for Vertex AI Search

One-stop Data API for Production GenAI