I am trying to do the following and am having trouble finding the correct syntax.
- I want to use the LOAD function to load a single key/value "row" into a pig object.
- The contents of that row is then flattened into a list of keys.
- I then want to use that list of keys for another load function to select the key/value pairs from another column family.
The only way I can get this to work is by using a generic load function then applying filters to get at the data I want. Then joining the two pig objects together to filter the second column family.
I want to avoid having to pull the entire column familys into pig, it is way too much data.
Any suggestions?
Thanks!
