I've been following a discussion about Cassandra capacity planning on serverfault.com where it was suggested that nodes should have no more than .5 Tb of data. IE for 5 Tb of data one would need at least 10 nodes. I haven't been able to verify this against any documentation though. Has anyone seen recommendation(s) like this?
Note - this is exclusive of how much RAM each node has. My understanding on this is that 'more is better - but don't give Java too much'.