Hive ERROR: Out of memory due to hash maps used in map-side aggregation

Latest recommended article published on 2022-03-04 13:03:39

_大漠孤烟_

Tags: hive

Article link: https://blog.csdn.net/lixucpf/article/details/20458617

Running the HiveQL statement: select collect_set(messageDate)[0],count(*) from incidents_hive group by substr(messageDate,8,2);

produced the following error:

URL:
http://RDCMaster.cluster:50030/taskdetails.jsp?jobid=job_201403041024_0002&tipid=task_201403041024_0002_m_000197

Possible error:
Out of memory due to hash maps used in map-side aggregation.

Solution:
Currently hive.map.aggr.hash.percentmemory is set to 0.5. Try setting it to a lower value. i.e 'set hive.map.aggr.hash.percentmemory = 0.25;'
-----
Diagnostic Messages for this Task:
java.lang.Throwable: Child Error
   at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:271)
Caused by: java.io.IOException: Task process exit with nonzero status of 65.
   at org.apache.hadoop.mapred.TaskRunner.run(TaskRunner.java:258)

FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.MapRedTask
MapReduce Jobs Launched:
Job 0: Map: 433 Reduce: 20   Cumulative CPU: 12732.44 sec   HDFS Read: 67006 HDFS Write: 0 FAIL
Total MapReduce CPU Time Spent: 0 days 3 hours 32 minutes 12 seconds 440 msec
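
The "Solution" hint in the log above can be applied as a session-level setting before rerunning the query. The sketch below shows one way to do that. The value 0.25 is the one suggested by the error message itself. The commented-out alternatives (disabling map-side aggregation, or replacing collect_set(...)[0] with min(...)) are assumptions that do not appear in the original post, and the min(...) variant only fits if any single representative messageDate per group is acceptable.

```sql
-- Option 1 (from the error message): reduce the fraction of mapper heap
-- that the map-side aggregation hash table is allowed to use.
set hive.map.aggr.hash.percentmemory = 0.25;

-- Option 2 (assumption, not from the original post): if lowering the
-- fraction is not enough, disable map-side aggregation so that all
-- grouping work happens on the reducers.
-- set hive.map.aggr = false;

-- Rerun the original query in the same session.
select collect_set(messageDate)[0], count(*)
from incidents_hive
group by substr(messageDate, 8, 2);

-- Option 3 (assumption): collect_set keeps every distinct messageDate of a
-- group in memory only to return one element. If any one value per group
-- will do, min() avoids building that set.
-- select min(messageDate), count(*)
-- from incidents_hive
-- group by substr(messageDate, 8, 2);
```

Settings issued with set apply only to the current Hive session, so they need to be run in the same session (or script) as the query.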