Ok. We've set the JobTracker address based on the response from brisktool. When we execute now it fails with an EOFException (see detailed logging and stacktrace below). Any ideas?
From our application log:
06-07-2011 11:31:40 [http-8080-1] INFO controller.DataController - Computing targets graphs started at: Wed Jul 06 11:31:40 BST 2011
06-07-2011 11:31:40 [http-8080-1] INFO cassandra.AbstractStore - Truncating column family 'data_graphs'
06-07-2011 11:31:40 [http-8080-1] INFO client.CassandraClient - CassandraProxyClient(localhost, 9170, true, ROUND_ROBIN)
06-07-2011 11:31:40 [http-8080-1] INFO client.CassandraClient - Connected to cassandra at localhost:9170
06-07-2011 11:31:40 [http-8080-1] INFO hadoop.DependencyGraph - Configuring new DependencyGraph job against host localhost and port 9170
06-07-2011 11:31:40 [http-8080-1] INFO hadoop.DependencyGraph - Configuration Properties:
io.seqfile.compress.blocksize=1000000, fs.checkpoint.size=67108864, io.skip.checksum.errors=false, mapred.used.genericoptionsparser=true, fs.s3n.impl=org.apache.hadoop.fs.s3native.NativeS3FileSystem, fs.s3.maxRetries=4, webinterface.private.actions=false, fs.s3.impl=org.apache.hadoop.fs.s3.S3FileSystem, hadoop.native.lib=true, fs.checkpoint.edits.dir=${fs.checkpoint.dir}, ipc.server.listen.queue.size=128, fs.default.name=file:///, ipc.client.idlethreshold=4000, fs.hsftp.impl=org.apache.hadoop.hdfs.HsftpFileSystem, hadoop.tmp.dir=/tmp/hadoop-${user.name}, fs.checkpoint.dir=${hadoop.tmp.dir}/dfs/namesecondary, fs.s3.block.size=67108864, hadoop.security.authorization=false, io.serializations=org.apache.hadoop.io.serializer.WritableSerialization, hadoop.util.hash.type=murmur, io.seqfile.lazydecompress=true, io.file.buffer.size=4096, io.mapfile.bloom.size=1048576, fs.s3.buffer.dir=${hadoop.tmp.dir}/s3, hadoop.logfile.size=10000000, ipc.client.kill.max=10, io.compression.codecs=org.apache.hadoop.io.compress.DefaultCodec,org.apache.hadoop.io.compress.GzipCodec,org.apache.hadoop.io.compress.BZip2Codec, topology.script.number.args=100, fs.har.impl=org.apache.hadoop.fs.HarFileSystem, io.seqfile.sorter.recordlimit=1000000, fs.trash.interval=0, local.cache.size=10737418240, ipc.server.tcpnodelay=false, ipc.client.connect.max.retries=10, fs.ramfs.impl=org.apache.hadoop.fs.InMemoryFileSystem, hadoop.rpc.socket.factory.class.default=org.apache.hadoop.net.StandardSocketFactory, fs.kfs.impl=org.apache.hadoop.fs.kfs.KosmosFileSystem, fs.checkpoint.period=3600, topology.node.switch.mapping.impl=org.apache.hadoop.net.ScriptBasedMapping, hadoop.logfile.count=10, fs.ftp.impl=org.apache.hadoop.fs.ftp.FTPFileSystem, fs.file.impl=org.apache.hadoop.fs.LocalFileSystem, fs.hdfs.impl=org.apache.hadoop.hdfs.DistributedFileSystem, ipc.client.connection.maxidletime=10000, io.mapfile.bloom.error.rate=0.005, io.bytes.per.checksum=512, mapred.job.tracker=ondroid-si-1:8012, fs.har.impl.disable.cache=true, ipc.client.tcpnodelay=false, fs.hftp.impl=org.apache.hadoop.hdfs.HftpFileSystem, fs.s3.sleepTimeSeconds=10,
06-07-2011 11:31:41 [http-8080-1] ERROR hadoop.DependencyGraph - Caught exception waiting for job to complete:
java.io.IOException: Call to ondroid-si-1/10.20.5.191:8012 failed on local exception: java.io.EOFException
at org.apache.hadoop.ipc.Client.wrapException(Client.java:775)
at org.apache.hadoop.ipc.Client.call(Client.java:743)
at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
at org.apache.hadoop.mapred.$Proxy105.getProtocolVersion(Unknown Source)
at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:429)
at org.apache.hadoop.mapred.JobClient.init(JobClient.java:423)
at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:410)
at org.apache.hadoop.mapreduce.Job.<init>(Job.java:50)
at org.apache.hadoop.mapreduce.Job.<init>(Job.java:54)
at ntoklo.matrix.impl.computation.hadoop.DependencyGraph.run(DependencyGraph.java:87)
at ntoklo.matrix.impl.controller.DataController.computeGraphs(DataController.java:206)
at ntoklo.matrix.impl.MatrixImpl.serviceComputeGraphs(MatrixImpl.java:89)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at com.sun.jersey.server.impl.model.method.dispatch.AbstractResourceMethodDispatchProvider$TypeOutInvoker._dispatch(AbstractResourceMethodDispatchProvider.java:167)
at com.sun.jersey.server.impl.model.method.dispatch.ResourceJavaMethodDispatcher.dispatch(ResourceJavaMethodDispatcher.java:70)
at com.sun.jersey.server.impl.uri.rules.HttpMethodRule.accept(HttpMethodRule.java:279)
at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:136)
at com.sun.jersey.server.impl.uri.rules.ResourceClassRule.accept(ResourceClassRule.java:86)
at com.sun.jersey.server.impl.uri.rules.RightHandPathRule.accept(RightHandPathRule.java:136)
at com.sun.jersey.server.impl.uri.rules.RootResourceClassesRule.accept(RootResourceClassesRule.java:74)
at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1347)
at com.sun.jersey.server.impl.application.WebApplicationImpl._handleRequest(WebApplicationImpl.java:1279)
at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1229)
at com.sun.jersey.server.impl.application.WebApplicationImpl.handleRequest(WebApplicationImpl.java:1219)
at com.sun.jersey.spi.container.servlet.WebComponent.service(WebComponent.java:419)
at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:537)
at com.sun.jersey.spi.container.servlet.ServletContainer.service(ServletContainer.java:699)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
at com.google.inject.servlet.ServletDefinition.doService(ServletDefinition.java:216)
at com.google.inject.servlet.ServletDefinition.service(ServletDefinition.java:141)
at com.google.inject.servlet.ManagedServletPipeline.service(ManagedServletPipeline.java:93)
at com.google.inject.servlet.FilterChainInvocation.doFilter(FilterChainInvocation.java:63)
at com.google.inject.servlet.ManagedFilterPipeline.dispatch(ManagedFilterPipeline.java:122)
at com.google.inject.servlet.GuiceFilter.doFilter(GuiceFilter.java:110)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:859)
at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:588)
at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
at java.lang.Thread.run(Unknown Source)
Caused by: java.io.EOFException
at java.io.DataInputStream.readInt(Unknown Source)
at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:501)
at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446)
From the Cassandra/Brisk system.log:
WARN [pool-3-thread-1] 2011-07-06 11:34:10,110 Server.java (line 1110) Incorrect header or version mismatch from 10.20.5.191:57394 got version 3 expected version 4