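The specific problem: I want to find the most frequent characters across all of the text in a Hive table.
I split the sentence in each row and flatMap it so that each row holds a single character, then groupBy, and finally count the high-frequency characters within each group. This runs out of memory (OOM).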
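For reference, here is a minimal plain-Python sketch of one way to avoid materializing every character before grouping: keep a running count per partition and merge only the counts. It is not Spark code. The names `count_chars_in_partition` and `top_chars` and the sample rows are made up for illustration, and it assumes a single global count rather than a per-group count. In Spark the analogous idea would be mapping each character to `(char, 1)` and using `reduceByKey` instead of grouping first, though whether that resolves the OOM in this particular job is an assumption.

```python
from collections import Counter


def count_chars_in_partition(rows):
    """Count characters in one partition, storing only counts, not the characters' rows."""
    counts = Counter()
    for sentence in rows:
        # Iterating over a str yields one character at a time,
        # so no intermediate list of single-character rows is built.
        counts.update(sentence)
    return counts


def top_chars(partitions, n):
    """Merge the per-partition counts and return the n most frequent characters."""
    total = Counter()
    for rows in partitions:
        total.update(count_chars_in_partition(rows))
    return total.most_common(n)


# Hypothetical sample data: two "partitions" of sentences.
partitions = [["你好世界", "你好"], ["世界你好"]]
result = top_chars(partitions, 2)
print(result)
```

The memory held at any point is proportional to the number of distinct characters, not to the total number of characters in the table.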
https://guotong1988.blog.csdn.net/article/details/116168895
https://guotong1988.blog.csdn.net/article/details/116189487