<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>DataStax Support Forums &#187; Topic: ETL Tools to transfer data from Cassandra into other relational databases</title>
		<link>http://www.datastax.com/support-forums/topic/etl-tools-to-transfer-data-from-cassandra-into-other-relational-databases</link>
		<description>Software, Support, and Training for Apache Cassandra</description>
		<language>en-US</language>
		<pubDate>Wed, 19 Jun 2013 04:56:05 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.3</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.datastax.com/support-forums/search.php</link>
		</textInput>
		<atom:link href="http://www.datastax.com/support-forums/rss/topic/etl-tools-to-transfer-data-from-cassandra-into-other-relational-databases" rel="self" type="application/rss+xml" />

		<item>
			<title>cko on "ETL Tools to transfer data from Cassandra into other relational databases"</title>
			<link>http://www.datastax.com/support-forums/topic/etl-tools-to-transfer-data-from-cassandra-into-other-relational-databases#post-8153</link>
			<pubDate>Thu, 27 Dec 2012 06:46:11 +0000</pubDate>
			<dc:creator>cko</dc:creator>
			<guid isPermaLink="false">8153@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Hi Srini,&#60;/p&#62;
&#60;p&#62;Thanks for your response.&#60;/p&#62;
&#60;p&#62;A few months ago, we did try Pentaho briefly. The job caused an OutOfMemoryError in Cassandra. The Cassandra Pentaho plugin does not handle retrieving rows in batches. It was mentioned that the next version of the plugin will address that issue. We will look at it again when it is available.&#60;/p&#62;
&#60;p&#62;Thanks again.&#60;br /&#62;
Chin
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Srini on "ETL Tools to transfer data from Cassandra into other relational databases"</title>
			<link>http://www.datastax.com/support-forums/topic/etl-tools-to-transfer-data-from-cassandra-into-other-relational-databases#post-8015</link>
			<pubDate>Tue, 18 Dec 2012 18:09:37 +0000</pubDate>
			<dc:creator>Srini</dc:creator>
			<guid isPermaLink="false">8015@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;This should help you. It's just an overview. Drill down more as per your requirement.&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://ovum.com/2012/03/08/pentaho-expands-big-data-coverage/&#34; rel=&#34;nofollow&#34;&#62;http://ovum.com/2012/03/08/pentaho-expands-big-data-coverage/&#60;/a&#62;&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://wiki.pentaho.com/display/DATAMINING/Cassandra+Source+and+Sink+in+Weka&#34; rel=&#34;nofollow&#34;&#62;http://wiki.pentaho.com/display/DATAMINING/Cassandra+Source+and+Sink+in+Weka&#60;/a&#62;
&#60;/p&#62;</description>
		</item>
		<item>
			<title>cko on "ETL Tools to transfer data from Cassandra into other relational databases"</title>
			<link>http://www.datastax.com/support-forums/topic/etl-tools-to-transfer-data-from-cassandra-into-other-relational-databases#post-7972</link>
			<pubDate>Fri, 14 Dec 2012 03:13:03 +0000</pubDate>
			<dc:creator>cko</dc:creator>
			<guid isPermaLink="false">7972@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;We will use Cassandra as logging storage in one of our web application. The application only insert rows into Cassandra but never update or delete any rows. The CF is expected to grow by about 0.5 million rows per day.&#60;/p&#62;
&#60;p&#62;We need to transfer the data in Cassandra to another relational database daily. Due to the large size of the CF, instead of truncating the relational table and reloading all rows into it each time, we plan to run a job to select the &#34;delta&#34; rows since the last run and insert them into the relational database.&#60;/p&#62;
&#60;p&#62;We know we can use Java, Pig or Hive to extract the delta rows to a flat file and load the data into the target relational table. We are particularly interested in a process that can extract delta rows without scanning the entire CF.&#60;/p&#62;
&#60;p&#62;Has anyone used any other ETL tools to do this kind of delta extraction from Cassandra? We appreciate any comments and experience.&#60;/p&#62;
&#60;p&#62;Thanks,&#60;br /&#62;
Chin
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
