Im likely to migrate our data to Cassandra in the coming months. We're about to enter a phase of 10x growth followed by another 10x soon after and it's more than our current setup can handle.
One aspect of this is file storage. We collect sensor data from many sources and flat files typically get included. This data will be put into Lustre which leads me to my question: how viable would it be to use Lustre as the file store for Cassandra? IE build servers with lots of RAM but minimal disk space and use a DFS to store the DB logs and such.
It sounds like Cassandra uses disk for compaction, backup and logging. Would it suffer if the files were networked and not local? Im thinking multiple 1Gb bonded network - maybe 10Gb if necessary.