<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>DataStax Support Forums &#187; Topic: Re-indexing partial fields in DSE 2.0</title>
		<link>http://www.datastax.com/support-forums/topic/re-indexing-partial-fields-in-dse-20</link>
		<description>Software, Support, and Training for Apache Cassandra</description>
		<language>en-US</language>
		<pubDate>Sun, 19 May 2013 14:36:36 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.3</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.datastax.com/support-forums/search.php</link>
		</textInput>
		<atom:link href="http://www.datastax.com/support-forums/rss/topic/re-indexing-partial-fields-in-dse-20" rel="self" type="application/rss+xml" />

		<item>
			<title>Anonymous on "Re-indexing partial fields in DSE 2.0"</title>
			<link>http://www.datastax.com/support-forums/topic/re-indexing-partial-fields-in-dse-20#post-2039</link>
			<pubDate>Tue, 05 Jun 2012 21:18:13 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">2039@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;&#38;gt; a) How do we keep lucene indexes and CFs in sync? Will index write be attempted first and after that CFs get updated? If CF writes fail, do we rely on commitlog to replay the mutations?&#60;/p&#62;
&#60;p&#62;The index is in RAM until it is flushed, however the commit log guarantees that data is not lost.&#60;/p&#62;
&#60;p&#62;&#38;gt; b) Will read-repair work even for native solr queries also?. If we read/write at QUORUM, are we guaranteed to see the correct results returning from nodes?&#60;/p&#62;
&#60;p&#62;DSE supports writes at quorum.&#60;/p&#62;
&#60;p&#62;&#38;gt; c) Also we faced a power-cut scenario some time back and corrupted many of our indexes in hadoop. I guess enterprise search is also vulnerable to such issues. In those cases, will a simple re-build of indexes suffice to restore the data?&#60;/p&#62;
&#60;p&#62;Lucene is generally pretty good in regards to handling things like power outages.  With DSE the advantage over regular Solr is the index can be completely rebuilt from the data stored in Cassandra (which as a side notes is also compressed, something Solr / Lucene do not offer).
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "Re-indexing partial fields in DSE 2.0"</title>
			<link>http://www.datastax.com/support-forums/topic/re-indexing-partial-fields-in-dse-20#post-2003</link>
			<pubDate>Tue, 29 May 2012 05:27:20 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">2003@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Thanks Jake.&#60;br /&#62;
Right now, we have built a proprietary distributed search over hadoop using native lucene and are looking for alternatives.&#60;/p&#62;
&#60;p&#62;I have a few more questions on this architecture&#60;/p&#62;
&#60;p&#62;a) How do we keep lucene indexes and CFs in sync? Will index write be attempted first and after that CFs get updated? If CF writes fail, do we rely on commitlog to replay the mutations?&#60;/p&#62;
&#60;p&#62;b) Will read-repair work even for native solr queries also?. If we read/write at QUORUM, are we guaranteed to see the correct results returning from nodes?&#60;/p&#62;
&#60;p&#62;c) Also we faced a power-cut scenario some time back and corrupted many of our indexes in hadoop. I guess enterprise search is also vulnerable to such issues. In those cases, will a simple re-build of indexes suffice to restore the data?&#60;/p&#62;
&#60;p&#62;Regards,&#60;br /&#62;
Ravi
&#60;/p&#62;</description>
		</item>
		<item>
			<title>tjake on "Re-indexing partial fields in DSE 2.0"</title>
			<link>http://www.datastax.com/support-forums/topic/re-indexing-partial-fields-in-dse-20#post-2002</link>
			<pubDate>Tue, 29 May 2012 02:39:08 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">2002@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;That is correct.  Since we hold all the fields in Cassandra we can rebuild the document internally. Please note, to get this behavior using the solr apis you must pass a extra http param. see ,&#60;/p&#62;
&#60;p&#62;&#60;a href=&#34;http://www.datastax.com/docs/datastax_enterprise2.0/search/dse_search_cluster#updating-individual-fields-in-a-solr-document&#34; rel=&#34;nofollow&#34;&#62;http://www.datastax.com/docs/datastax_enterprise2.0/search/dse_search_cluster#updating-individual-fields-in-a-solr-document&#60;/a&#62;
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "Re-indexing partial fields in DSE 2.0"</title>
			<link>http://www.datastax.com/support-forums/topic/re-indexing-partial-fields-in-dse-20#post-2001</link>
			<pubDate>Mon, 28 May 2012 14:43:40 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">2001@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;We are evaluating DSE 2.0 for it's ability to re-index fields individually. Our application has lot of update-able fields. We currently store all these fields in the index itself.&#60;/p&#62;
&#60;p&#62;I am curious as to how DSE 2.0 handles partial updates for fields? Even though it looks like the request updates a single field, under-the-hood will the entire document be read from cassandra and then delete/add to lucene performed?&#60;/p&#62;
&#60;p&#62;Is my understanding correct or is there some finer details I am missing&#60;/p&#62;
&#60;p&#62;Regards,&#60;br /&#62;
Ravi
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
