I am trying to union a bunch of tables and some of these fields are of type array of bigint. When I try to cast these in Hive, it keeps giving me a "Expression not in GROUP BY key 'field_name'.
My code is as follows:
CREATE TABLE sample_table stored AS avro AS
SELECT cast(cast(rand()* 1000000000000000000 AS BIGINT) AS string) id,
'PERSON'AS person,
name AS name,
age AS age,
gender AS gender
FROM table_a
UNION ALL
SELECT cast(cast(rand()* 1000000000000000000 AS BIGINT) AS string) id,
'PERSON'AS person,
name AS name,
collect_list(
CAST(NULL AS BIGINT)
) AS age,
null AS gender
FROM table_b
The error being generated is as follows:
SQL Error [500051] [HY000]: [Cloudera]HiveJDBCDriver ERROR processing query/statement. Error Code: 10025, SQL state: TStatus(statusCode:ERROR_STATUS, infoMessages:[*org.apache.hive.service.cli.HiveSQLException:Error while compiling statement: FAILED: SemanticException [Error 10025]: Line 4:7 Expression not in GROUP BY key 'age':28:27, org.apache.hive.service.cli.operation.Operation:toSQLException:Operation.java:400, org.apache.hive.service.cli.operation.SQLOperation:prepare:SQLOperation.java:187
Expression not in GROUP BY key
) is based on? – Raymond Nijland