What is the preferred approach for reading a huge volume of data from Cassandra for reporting purposes? Does the approach differ if the read is from one column family (CF) versus multiple CFs? (My current need is to read from one CF.)
Is it optimal to use cassandra-jdbc and CQL with "select x,y,z…..where /some/ condition"? Or does this problem call for Hive or Pig?
To give a short description of my current environment: I have 2 DCs with 3 nodes in each DC, and the nodes are started with -s (for Solr).