I'm trying to setup and run simple pig scripts on the Cassandra data that I have. I referred to http://www.datastax.com/docs/datastax_enterprise2.1/analytics/about_pig#run-pig-mr this page; however, I'm getting a strange error that I couldn't find much documentation about.
grunt> test = LOAD 'cassandra://<one of my keyspaces>/<my column family>' USING CassandraStorage();
What I end up receiving is this:
2012-07-13 10:50:00,760 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 6007: Unable to check name cfs:/user/root
Details at logfile: /root/datastax/dse-2.1/pig_1342201428643.log
I also looked at the log file that it specified, and I receive this:
================================================================================
Pig Stack Trace
---------------
ERROR 6007: Unable to check name cfs:/user/root
java.lang.Error: org.apache.pig.backend.datastorage.DataStorageException: ERROR 6007: Unable to check name cfs:/user/root
at org.apache.pig.impl.logicalLayer.parser.QueryParser.LoadClause(QueryParser.java:1630)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.BaseExpr(QueryParser.java:1366)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.Expr(QueryParser.java:1013)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.Parse(QueryParser.java:800)
at org.apache.pig.impl.logicalLayer.LogicalPlanBuilder.parse(LogicalPlanBuildg Stack Trace
---------------
ERROR 6007: Unable to check name cfs:/user/root
java.lang.Error: org.apache.pig.backend.datastorage.DataStorageException: ERROR 6007: Unable to check name cfs:/user/root
at org.apache.pig.impl.logicalLayer.parser.QueryParser.LoadClause(QueryParser.java:1630)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.BaseExpr(QueryParser.java:1366)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.Expr(QueryParser.java:1013)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.Parse(QueryParser.java:800)
at org.apache.pig.impl.logicalLayer.LogicalPlanBuilder.parse(LogicalPlanBuilder.java:63)
at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1612)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1562)
at org.apache.pig.PigServer.registerQuery(PigServer.java:534)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:871)
at org.apache.pig.Ser.java:63)
at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1612)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1562)
at org.apache.pig.PigServer.registerQuery(PigServer.java:534)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:871)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:388)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:168)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:144)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:76)
at org.apache.pig.Main.run(Main.java:455)
at org.apache.pig.Main.main(Main.java:107)
Caused by: org.apache.pig.backend.datastorage.DataStorageException: ERROR 6007: Unable to check name cfs:/user/root
at org.apache.pig.backend.hadoop.datastorage.HDataStorage.isContainer(HDataStorage.java:210)
at org.apache.pig.backend.hadoop.datastorage.HDataStorage.asElement(HDataStorage.java:131)
at org.apache.pig.backend.hadoop.datastorage.HDataStorage.asElement(HDataStorage.java:141)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.getCurrentDir(QueryParser.java:175)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.LoadClause(QueryParser.java:1615)
... 14 more
Caused by: java.io.IOException: UnavailableException()
at com.datastax.bdp.hadoop.cfs.CassandraFileSystemThriftStore.readSlice(CassandraFileSystemThriftStore.java:688)
at com.datastax.bdp.hadoop.cfs.CassandraFileSystemThriftStore.readINodeColumns(CassandraFileSystemThriftStore.java:619)
at com.datastax.bdp.hadoop.cfs.CassandraFileSystemThriftStore.retrieveINode(CassandraFileSystemThriftStore.java:571)
at com.datastax.bdp.hadoop.cfs.CassandraFileSystemThriftStore.retrieveINode(CassandraFileSystemThriftStore.java:563)
at com.datastax.bdp.hadoop.cfs.CassandraFileSystem.getFileStatus(CassandraFileSystem.java:520)
at org.apache.hadoop.fs.FileSystem.exists(FileSystem.java:768)
at org.apache.pig.backend.hadoop.datastorage.HDataStorage.isContainer(HDataStorage.java:203)
... 18 more
Caused by: UnavailableException()
at org.apache.cassandra.thrift.Cassandra$get_slice_result.read(Cassandra.java:7764)
at org.apache.cassandra.thrift.Cassandra$Client.recv_get_slice(Cassandra.java:570)
at org.apache.cassandra.thrift.Cassandra$Client.get_slice(Cassandra.java:542)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source)
at java.lang.reflect.Method.invoke(Unknown Source)
at com.datastax.bdp.util.CassandraProxyClient.invoke(CassandraProxyClient.java:407)
at $Proxy1.get_slice(Unknown Source)
I checked that my column family and keyspace existed with the list command in this cli, so I'm not exactly sure what the issue is here.
If anyone could provide help, it'd be much appreciated.
