<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="bbPress/1.0.3" -->
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<title>DataStax Support Forums &#187; Topic: MapReduce Spill Failure</title>
		<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure</link>
		<description>Software, Support, and Training for Apache Cassandra</description>
		<language>en-US</language>
		<pubDate>Thu, 23 May 2013 21:38:33 +0000</pubDate>
		<generator>http://bbpress.org/?v=1.0.3</generator>
		<textInput>
			<title><![CDATA[Search]]></title>
			<description><![CDATA[Search all topics from these forums.]]></description>
			<name>q</name>
			<link>http://www.datastax.com/support-forums/search.php</link>
		</textInput>
		<atom:link href="http://www.datastax.com/support-forums/rss/topic/mapreduce-spill-failure" rel="self" type="application/rss+xml" />

		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure/page/2#post-1521</link>
			<pubDate>Tue, 03 Apr 2012 13:27:04 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1521@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Nevermind. I see my error: MyReducer should be parameterized as &#60;code&#62;&#38;lt;Text,IntWritable,Text,IntWritable&#38;gt;&#60;/code&#62; and then the reduce method should have the following signature:&#60;br /&#62;
&#60;code&#62;public void reduce(Text key, Iterable&#38;lt;IntWritable&#38;gt; counts, Context context)&#60;/code&#62;&#60;/p&#62;
&#60;p&#62;Everything is working now. Thanks again for all the help.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1518</link>
			<pubDate>Mon, 02 Apr 2012 20:19:46 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1518@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;That solved my spill problem. One last question and I'll be all set:&#60;/p&#62;
&#60;p&#62;My mapper runs fine now and the job succeeds. The output file, however, looks like it didn't aggregate the results (all counts are 1 and the same key shows up multiple times in the output file). My reducer and job configuration look exactly like the WordCount example as far as I can tell (included below). Can you see anything I'm doing wrong?&#60;/p&#62;
&#60;pre&#62;&#60;code&#62;public static class MyReducer extends Reducer&#38;lt;Text, Iterable&#38;lt;IntWritable&#38;gt;, Text, IntWritable&#38;gt; {
        private IntWritable result = new IntWritable();

        public void reduce(Text key, Iterable&#38;lt;IntWritable&#38;gt; counts, Context context) throws Exception {
            int sum = 0;
            for (IntWritable count : counts) {
                sum += count.get();
            }
            result.set(sum);
            context.write(key, result);

        }
    }&#60;/code&#62;&#60;/pre&#62;
&#60;p&#62;And in the job configuration, I have:&#60;br /&#62;
&#60;pre&#62;&#60;code&#62;job.setCombinerClass(MyReducer.class);
 job.setReducerClass(MyReducer.class);&#60;/code&#62;&#60;/pre&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1517</link>
			<pubDate>Mon, 02 Apr 2012 19:49:19 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1517@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;col2.value().array() isn't safe (could generate a huge amount of garbage.  use ByteBufferUtil.string(col2.value()),  try that
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1516</link>
			<pubDate>Mon, 02 Apr 2012 19:35:01 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1516@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Barely anything. Here's the map method in its entirety:&#60;/p&#62;
&#60;pre&#62;&#60;code&#62;public void map(ByteBuffer key, SortedMap&#38;lt;ByteBuffer, IColumn&#38;gt; columns, Context context) throws IOException, InterruptedException {
        IColumn col1 = (IColumn)columns.get(ByteBuffer.wrap(&#38;quot;column1&#38;quot;.getBytes(&#38;quot;UTF-8&#38;quot;)));
        IColumn col2 =(IColumn) columns.get(ByteBuffer.wrap(&#38;quot;column2&#38;quot;.getBytes(&#38;quot;UTF-8&#38;quot;)));
        if(col1 != null &#38;amp;&#38;amp; col1.value()!=null &#38;amp;&#38;amp; col2 != null &#38;amp;&#38;amp; col2.value()!=null ){
            long value = ByteBufferUtil.toLong(col1.value());
            if(value &#38;gt; 0 &#38;amp;&#38;amp; value &#38;lt;= THRESHOLD){
                Text outputKey =      null;
                try{
                    outputKey =   new Text(col2.value().array());
                }catch(Exception e){
                    System.out.println(&#38;quot;key is null&#38;quot;);
                }
                if(outputKey != null){
                    context.write(outputKey,new IntWritable(1));
                }
            }
        }

    }&#60;/code&#62;&#60;/pre&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1515</link>
			<pubDate>Mon, 02 Apr 2012 19:24:16 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1515@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Yeah that's fine.  I have no idea why 10G of spill is generated from 600M of data.  What exactly is the mapper doing with the data?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1514</link>
			<pubDate>Mon, 02 Apr 2012 19:19:34 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1514@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;As far as I can tell, I am setting that... in my Job configuration, I do the following:&#60;/p&#62;
&#60;pre&#62;&#60;code&#62;List&#38;lt;ByteBuffer&#38;gt; colList = new ArrayList&#38;lt;ByteBuffer&#38;gt;();
  colList.add(ByteBuffer.wrap(&#38;quot;column1&#38;quot;.getBytes(&#38;quot;UTF-8&#38;quot;)));
  colList.add(ByteBuffer.wrap(&#38;quot;column2&#38;quot;.getBytes(&#38;quot;UTF-8&#38;quot;)));
  SlicePredicate predicate = new SlicePredicate().setColumn_names(colList);
  //snip
  ConfigHelper.setInputSlicePredicate(job.getConfiguration(), predicate);&#60;/code&#62;&#60;/pre&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1512</link>
			<pubDate>Mon, 02 Apr 2012 17:24:53 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1512@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Can you avoid range slicing? do you know the columns you need? if so you can set cassandra.slice.predicate.column_names
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1508</link>
			<pubDate>Mon, 02 Apr 2012 16:59:22 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1508@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;I thought I did, but upon re-working my Mapper, I'm still having issues. I don't understand why my mapper is spilling at all... my column family has a bunch of columns but in the slice predicate, I'm just looking at 2 of them. My mapper is quite simple: for each row, it looks at column1 and if the value is &#38;gt; Threshold, it writes the value of column2 and an IntWritable set to 1 to the context. &#60;/p&#62;
&#60;p&#62;My reducer just sums up the ints and outputs the total for each key.&#60;/p&#62;
&#60;p&#62;Even though my test column family only contains 300 MB of data, approximately 10 GB of spill files are produced. I can see in the spill files that it contains not only the value of column2, but column1 as well (which I never write to the context). I thought the spill files should only contain the output of the map operation that doesn't fit in memory any more?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Stephanie on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1499</link>
			<pubDate>Mon, 02 Apr 2012 15:31:05 +0000</pubDate>
			<dc:creator>Stephanie</dc:creator>
			<guid isPermaLink="false">1499@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Hi Yossarian, &#60;/p&#62;
&#60;p&#62;Were you able to get help with everything?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1490</link>
			<pubDate>Mon, 02 Apr 2012 13:46:57 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1490@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;I didn't think it was too small (it had 6 GB free and my column family on which I was running the job was only 600 MB). When I changed the directory to an explicit dir on a filesystem with more space, it worked. Apparently, my mapper has a bug and it's generating way too much output. &#60;/p&#62;
&#60;p&#62;Thanks for the help.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1487</link>
			<pubDate>Mon, 02 Apr 2012 13:29:06 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1487@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;According to this your mapred.local.dir is ${hadoop.tmp.dir}/mapred/local&#60;/p&#62;
&#60;p&#62;Is your /tmp dir small?&#60;/p&#62;
&#60;p&#62;Can you try setting this to fixed directory in mapred-site.xml ?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1486</link>
			<pubDate>Mon, 02 Apr 2012 13:12:38 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1486@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Here's the job.xml:&#60;/p&#62;
&#60;pre&#62;&#60;code&#62;&#38;lt;?xml version=&#38;quot;1.0&#38;quot; encoding=&#38;quot;UTF-8&#38;quot; standalone=&#38;quot;no&#38;quot;?&#38;gt;&#38;lt;configuration&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3n.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.s3native.NativeS3FileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.cache.levels&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.inode.mode&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;SYNC&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.tmp.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;/tmp/hadoop-${user.name}&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.native.lib&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;map.sort.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.util.QuickSort&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;dse.cfs.repair.batchsize&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;300&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.client.idlethreshold&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.thrift.address&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;nyarchapp01d&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.system.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/mapred/system&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.persist.jobstatus.hours&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.skip.checksum.errors&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.default.name&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;cfs:///&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.cluster.reduce.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reducer.new-api&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.child.tmp&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;./tmp&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.har.impl.disable.cache&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.partitioner.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.cassandra.dht.RandomPartitioner&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.skip.reduce.max.skip.groups&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.heartbeats.in.second&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;100&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.dns.nameserver&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;default&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.sort.factor&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.input.columnfamily&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;MyEvent&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.timeout&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;600000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.max.tracker.failures&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.rpc.socket.factory.class.default&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.net.StandardSocketFactory&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.jobhistory.lru.cache.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;5&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.hdfs.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.hdfs.DistributedFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.queue.default.acl-administer-jobs&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;*&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.key.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.io.Text&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.skip.map.auto.incr.proc.count&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.complete.cancel.delegation.tokens&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.mapfile.bloom.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1048576&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.reduce.shuffle.connect.timeout&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;180000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.blacklist.fault-timeout-window&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;180&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;tasktracker.http.threads&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;40&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.shuffle.merge.percent&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.66&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.inputformat.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.cassandra.hadoop.ColumnFamilyInputFormat&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.ftp.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.ftp.FTPFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.compress&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.bytes.per.checksum&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;512&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.combine.recordsBeforeProgress&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.healthChecker.script.timeout&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;600000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;topology.node.switch.mapping.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.net.ScriptBasedMapping&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.thrift.port&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;9160&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reduce.slowstart.completed.maps&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.05&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reduce.max.attempts&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.input.keyspace&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;ReportingKeyspace&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.ramfs.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.InMemoryFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.skip.map.max.skip.records&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.cluster.map.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.local.block.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;67108864&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.security.group.mapping&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.security.ShellBasedUnixGroupsMapping&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.persist.jobstatus.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;/jobtracker/jobsInfo&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jar&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;cfs:/tmp/hadoop-apparch/mapred/staging/apparch/.staging/job_201203301707_0004/job.jar&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3.buffer.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/s3&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;job.end.retry.attempts&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;dse.consistencylevel.read&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;ONE&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.file.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.LocalFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.local.dir.minspacestart&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.compression.type&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;RECORD&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;cassandra.input.predicate&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0f00010b000000060000000c6f66666572696e67546b65790000000a646576696365436f64650000000d616666696c69617465436f64650000000b73657276696365546b65790000001373746172744576656e7454696d657374616d7000000015656e64456c6170736564506c61795365636f6e647300&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;dfs.permissions&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;topology.script.number.args&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;100&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.mapfile.bloom.error.rate&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.005&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.cluster.max.reduce.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.max.tracker.blacklists&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.profile.maps&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0-2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.userlog.retain.hours&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;24&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.persist.jobstatus.active&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.security.authorization&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;local.cache.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10737418240&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.min.split.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.map.tasks&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;3&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.cfs.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.datastax.bdp.hadoop.cfs.CassandraFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.child.java.opts&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-Xmx256m&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.counters.limit&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;120&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.value.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.io.SortedMapWritable&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;dse.autoconfigure.node&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.queue.name&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;default&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.retiredjobs.cache.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.server.listen.queue.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;128&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;job.end.retry.interval&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;30000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.inmem.merge.threshold&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.skip.attempts.to.start.skipping&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.tasktracker.outofband.heartbeat.damper&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1000000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.checkpoint.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/dfs/namesecondary&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reduce.tasks&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.merge.recordsBeforeProgress&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.userlog.limit.kb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.reduce.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;webinterface.private.actions&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.security.token.service.use_ip&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.sort.spill.percent&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.80&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.shuffle.input.buffer.percent&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.70&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.name&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;offering count&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.map.tasks.speculative.execution&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.util.hash.type&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;murmur&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.map.max.attempts&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.acl-view-job&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt; &#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.handler.count&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.reduce.shuffle.read.timeout&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;180000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.expiry.interval&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;600000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.maxtasks.per.job&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.job.history.block.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;3145728&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;keep.failed.task.files&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.profile.reduces&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0-2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.client.tcpnodelay&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.compression.codec&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.io.compress.DefaultCodec&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.map.index.skip&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.working.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;cfs:/user/apparch&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.server.tcpnodelay&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.blacklist.fault-bucket-width&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;15&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.used.genericoptionsparser&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.mapper.new-api&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.map.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.logfile.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10000000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reduce.tasks.speculative.execution&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;cfs:/tmp/hadoop-apparch/mapred/staging/apparch/.staging/job_201203301707_0004&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.tasktracker.outofband.heartbeat&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.reduce.input.limit&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.security.authentication&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;simple&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.checkpoint.period&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;3600&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.reuse.jvm.num.tasks&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;24&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.completeuserjobs.maximum&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;100&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;jobtracker.thrift.address&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.0.0.0:9290&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.tracker.task-controller&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.mapred.DefaultTaskController&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3.maxRetries&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.cluster.max.map.memory.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;-1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.reduce.shuffle.maxfetchfailures&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.acl-modify-job&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt; &#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.local.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/mapred/local&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.hftp.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.hdfs.HftpFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.trash.interval&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3.sleepTimeSeconds&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.submit.replication&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.har.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.HarFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.map.output.compression.codec&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.datastax.bdp.hadoop.compression.SnappyCodec&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.dns.interface&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;default&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${dse.job.tracker}&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;dse.consistencylevel.write&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;ONE&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.submithost&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;nyarchapp01d&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.seqfile.sorter.recordlimit&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1000000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.line.input.format.linespermap&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.taskScheduler&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.mapred.JobQueueTaskScheduler&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.webhdfs.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.hdfs.web.WebHdfsFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.local.dir.minspacekill&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.sort.record.percent&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.05&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.reduce.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.myorg.hadoop.MyEventReducer&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.kfs.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.kfs.KosmosFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.temp.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/mapred/temp&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.reduce.tasks.maximum&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.checkpoint.edits.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${fs.checkpoint.dir}&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.tasks.sleeptime-before-sigkill&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;5000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.reduce.input.buffer.percent&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.indexcache.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.plugins&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.datastax.bdp.hadoop.thriftfs.ThriftJobTrackerPlugin&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.split.metainfo.maxsize&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10000000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.skip.reduce.auto.incr.proc.count&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.logfile.count&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.job.submithostaddress&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10.40.199.130&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.seqfile.compress.blocksize&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;1048576&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3.block.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;67108864&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.taskmemorymanager.monitoring-interval&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;5000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.queue.default.state&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;RUNNING&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.acls.enabled&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.jobtracker.staging.root.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;${hadoop.tmp.dir}/mapred/staging&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.queue.names&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;default&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.hsftp.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.hdfs.HsftpFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.tracker.http.address&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.0.0.0:50060&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.combine.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.myorg.hadoop.MyEventReducer&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.cfs-archive.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.datastax.bdp.hadoop.cfs.CassandraFileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.reduce.parallel.copies&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;5&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.seqfile.lazydecompress&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.output.dir&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;/tmp/hadoopout&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.sort.mb&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;100&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.client.connection.maxidletime&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.compress.map.output&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;true&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;hadoop.security.uid.cache.secs&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;14400&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.tracker.report.address&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;127.0.0.1:0&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.healthChecker.interval&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;60000&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.client.kill.max&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;ipc.client.connect.max.retries&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;10&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapreduce.map.class&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;com.myorg.hadoop.MyEventMapper&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.s3.impl&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.fs.s3.S3FileSystem&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.user.jobconf.limit&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;5242880&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.job.tracker.http.address&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;0.0.0.0:50030&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.file.buffer.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;4096&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.jobtracker.restart.recover&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.serializations&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.io.serializer.WritableSerialization&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.task.profile&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;false&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;jobclient.output.filter&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;FAILED&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;mapred.tasktracker.map.tasks.maximum&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;2&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;io.compression.codecs&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.BZip2Codec&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.checkpoint.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;67108864&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;property&#38;gt;&#38;lt;name&#38;gt;fs.local.subblock.size&#38;lt;/name&#38;gt;&#38;lt;value&#38;gt;2097152&#38;lt;/value&#38;gt;&#38;lt;/property&#38;gt;
&#38;lt;/configuration&#38;gt;&#60;/code&#62;&#60;/pre&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1481</link>
			<pubDate>Fri, 30 Mar 2012 21:17:25 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1481@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Can you post the job.xml file generated for your job?
&#60;/p&#62;</description>
		</item>
		<item>
			<title>Anonymous on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1480</link>
			<pubDate>Fri, 30 Mar 2012 21:10:47 +0000</pubDate>
			<dc:creator>Anonymous</dc:creator>
			<guid isPermaLink="false">1480@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;I set that property in both $DSE_HOME/resources/hadoop/conf/taskcontroller.cfg and added the xml property declaration to $DSE_HOME/resources/hadoop/conf/dse-core.xml but I still get the same error.
&#60;/p&#62;</description>
		</item>
		<item>
			<title>tjake on "MapReduce Spill Failure"</title>
			<link>http://www.datastax.com/support-forums/topic/mapreduce-spill-failure#post-1479</link>
			<pubDate>Fri, 30 Mar 2012 21:00:10 +0000</pubDate>
			<dc:creator>tjake</dc:creator>
			<guid isPermaLink="false">1479@http://www.datastax.com/support-forums/</guid>
			<description>&#60;p&#62;Sounds like you need to setup your mapred.local.dir on the nodes. &#60;/p&#62;
&#60;p&#62; &#38;lt;property&#38;gt;&#60;br /&#62;
    &#38;lt;name&#38;gt;mapred.local.dir&#38;lt;/name&#38;gt;&#60;br /&#62;
    &#38;lt;value&#38;gt;/d1/mapred/local,/d2/mapred/local,/d3/mapred/local,/d4/mapred/local&#38;lt;/value&#38;gt;&#60;br /&#62;
 &#38;lt;/property&#38;gt;&#60;/p&#62;
&#60;p&#62;This defaults to /tmp/hadoop-${user.name}
&#60;/p&#62;</description>
		</item>

	</channel>
</rss>
