I have data in Cassandra in following structure:
[default@data] list values;
Using default limit of 100
-------------------
RowKey: 123:1333065600000
=> (column=16000, value=331, timestamp=1333724446041000)
=> (column=76000, value=314, timestamp=1333724446042000)
=> (column=136000, value=333, timestamp=1333724446043000)
RowKey: 123:1332979200000
=> (column=6000, value=300, timestamp=1333743661692000)
=> (column=66000, value=302, timestamp=1333743661692001)
=> (column=126000, value=303, timestamp=1333743661694000)
=> (column=186000, value=234, timestamp=1333743661695000)
=> (column=246000, value=445, timestamp=1333743661696000)
=> (column=306000, value=331, timestamp=1333743661696001)
=> (column=366000, value=455, timestamp=1333743661698000)
RowKey: 121:1334102400000
=> (column=68608000, value=12, timestamp=1334173122715000)
=> (column=68668000, value=12, timestamp=1334173122715001)
In other words - number of columns can differ row per row. Now I need to do aggregation using pig of all values. I cannot change the structure. Any idea how to do that?
