I installed Brisk (beta2) without any trouble on a single node running RHEL 5 Server.
I got my hive prompt.
I created a database and a table, then loaded my data:
create database ayon_brisk_test;
create table ayon_brisk_test.actions_by_day (dait string,scanned_products_id bigint,action_type_id int,action_success int,actions_count bigint);
load data local inpath '/tmp/actions_by_day.txt' overwrite into table ayon_brisk_test.actions_by_day;
So far so good!
I fired a query:
select dait, count(*) from ayon_brisk_test.actions_by_day group by dait order by dait desc;
It dumps the whole table!
select dait, count(*) from ayon_brisk_test.actions_by_day group by dait;
This also dumps the whole table.
select dait, sum(actions_count) from ayon_brisk_test.actions_by_day where action_success=1 group by dait;
This dumps the whole table too!
So the question is:
How do I make Brisk work?
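At this point Brisk behaves as if every query were select * from <table>, regardless of the query I actually give it: the group by, the aggregates, and the where clause all seem to be ignored.
Here is the table description: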
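One way to narrow this down, assuming Brisk's bundled Hive supports the stock Hive EXPLAIN statement (an assumption, not verified against beta2), is to print the plan for one of the failing queries without running it. In stock Hive the plan for an aggregate query normally shows a Group By Operator inside a map-reduce stage; a plan that contains only a fetch of the table would suggest the query is being planned as a plain table read.

```sql
-- Print the plan Hive builds for the aggregate query; nothing is executed.
-- Assumes EXPLAIN works in Brisk's Hive as it does in stock Hive.
explain
select dait, count(*) from ayon_brisk_test.actions_by_day group by dait;
```

Comparing this plan with the one for the where-filtered query would show whether the filter and the aggregation are dropped at planning time or at execution time.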
hive> desc formatted actions_by_day;
OK
# col_name data_type comment
dait string None
scanned_products_id bigint None
action_type_id int None
action_success int None
actions_count bigint None
# Detailed Table Information
Database: ayon_brisk_test
Owner: ayon
CreateTime: Fri Sep 02 01:07:54 UTC 2011
LastAccessTime: UNKNOWN
Protect Mode: None
Retention: 0
Location: cfs://null/user/hive/warehouse/ayon_brisk_test.db/actions_by_day
Table Type: MANAGED_TABLE
Table Parameters:
transient_lastDdlTime 1314925949
# Storage Information
SerDe Library: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
InputFormat: org.apache.hadoop.mapred.TextInputFormat
OutputFormat: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
Compressed: No
Num Buckets: -1
Bucket Columns: []
Sort Columns: []
Storage Desc Params:
serialization.format 1
