weixin_45663641 2022-12-08 12:29 采纳率: 0%
浏览 22

CDH flink yarn 添加角色实例启动不了

CDH6.3.2 集成flink 1.14.6
flink yarn 一个角色实例能启动,添加多个实例启动不了,下面是报错信息,有人懂得吗


ERROR StatusLogger Reconfiguration failed: No configuration found for '45ee12a7' at 'null' in 'null'
11:44:09.657 [main] ERROR org.apache.flink.yarn.cli.FlinkYarnSessionCli - Error while running the Flink session.
org.apache.flink.client.deployment.ClusterDeploymentException: Couldn't deploy Yarn session cluster
    at org.apache.flink.yarn.YarnClusterDescriptor.deploySessionCluster(YarnClusterDescriptor.java:426) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.run(FlinkYarnSessionCli.java:606) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.lambda$main$4(FlinkYarnSessionCli.java:862) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at java.security.AccessController.doPrivileged(Native Method) ~[?:1.8.0_112]
    at javax.security.auth.Subject.doAs(Subject.java:422) ~[?:1.8.0_112]
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1836) ~[flink-shaded-hadoop-2-uber-2.8.3-10.0.jar:2.8.3-10.0]
    at org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.main(FlinkYarnSessionCli.java:862) [flink-dist_2.11-1.14.6.jar:1.14.6]
Caused by: org.apache.flink.yarn.YarnClusterDescriptor$YarnDeploymentException: The YARN application unexpectedly switched to state FAILED during deployment. 
Diagnostics from YARN: Application application_1669088721620_0013 failed 1 times in previous 10000 milliseconds (global limit =2; local limit is =1) due to AM Container for appattempt_1669088721620_0013_000001 exited with  exitCode: 1
Failing this attempt.Diagnostics: [2022-12-08 11:44:09.261]Exception from container-launch.
Container id: container_1669088721620_0013_01_000001
Exit code: 1

[2022-12-08 11:44:09.270]Container exited with a non-zero exit code 1. Error file: prelaunch.err.
Last 4096 bytes of prelaunch.err :

[2022-12-08 11:44:09.272]Container exited with a non-zero exit code 1. Error file: prelaunch.err.
Last 4096 bytes of prelaunch.err :

For more detailed output, check the application tracking page: http://bigdata01:8088/cluster/app/application_1669088721620_0013 Then click on links to logs of each attempt.
. Failing the application.
If log aggregation is enabled on your cluster, use this command to further investigate the issue:
yarn logs -applicationId application_1669088721620_0013
    at org.apache.flink.yarn.YarnClusterDescriptor.startAppMaster(YarnClusterDescriptor.java:1219) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at org.apache.flink.yarn.YarnClusterDescriptor.deployInternal(YarnClusterDescriptor.java:607) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    at org.apache.flink.yarn.YarnClusterDescriptor.deploySessionCluster(YarnClusterDescriptor.java:419) ~[flink-dist_2.11-1.14.6.jar:1.14.6]
    ... 7 more

------------------------------------------------------------
 The program finished with the following exception:

org.apache.flink.client.deployment.ClusterDeploymentException: Couldn't deploy Yarn session cluster
    at org.apache.flink.yarn.YarnClusterDescriptor.deploySessionCluster(YarnClusterDescriptor.java:426)
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.run(FlinkYarnSessionCli.java:606)
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.lambda$main$4(FlinkYarnSessionCli.java:862)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:422)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1836)
    at org.apache.flink.runtime.security.contexts.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
    at org.apache.flink.yarn.cli.FlinkYarnSessionCli.main(FlinkYarnSessionCli.java:862)
Caused by: org.apache.flink.yarn.YarnClusterDescriptor$YarnDeploymentException: The YARN application unexpectedly switched to state FAILED during deployment. 
Diagnostics from YARN: Application application_1669088721620_0013 failed 1 times in previous 10000 milliseconds (global limit =2; local limit is =1) due to AM Container for appattempt_1669088721620_0013_000001 exited with  exitCode: 1
Failing this attempt.Diagnostics: [2022-12-08 11:44:09.261]Exception from container-launch.
Container id: container_1669088721620_0013_01_000001
Exit code: 1

[2022-12-08 11:44:09.270]Container exited with a non-zero exit code 1. Error file: prelaunch.err.
Last 4096 bytes of prelaunch.err :

[2022-12-08 11:44:09.272]Container exited with a non-zero exit code 1. Error file: prelaunch.err.
Last 4096 bytes of prelaunch.err :

For more detailed output, check the application tracking page: http://bigdata01:8088/cluster/app/application_1669088721620_0013 Then click on links to logs of each attempt.
. Failing the application.
If log aggregation is enabled on your cluster, use this command to further investigate the issue:
yarn logs -applicationId application_1669088721620_0013
    at org.apache.flink.yarn.YarnClusterDescriptor.startAppMaster(YarnClusterDescriptor.java:1219)
    at org.apache.flink.yarn.YarnClusterDescriptor.deployInternal(YarnClusterDescriptor.java:607)
    at org.apache.flink.yarn.YarnClusterDescriptor.deploySessionCluster(YarnClusterDescriptor.java:419)
    ... 7 more

  • 写回答

1条回答 默认 最新

  • 此木Y 2022-12-13 16:59
    关注

    修改yarn-site.xml,增加如下配置

    <property>
        <name>yarn.resourcemanager.webapp.address.rm1</name>
        <value>hadoop102</value>
    </property>
    <property>
        <name>yarn.resourcemanager.scheduler.address.rm2</name>
        <value>hadoop103</value>
    </property>
    <property>
        <name>yarn.resourcemanager.webapp.address.rm2</name>
        <value>hadoop103</value>
    </property>
    

    重启yarn,hdfs再测试

    评论

报告相同问题?

问题事件

  • 创建了问题 12月8日

悬赏问题

  • ¥15 关于lua调用DLL的c/c++动态库(相关搜索:数据库)
  • ¥15 prokka运行时失败
  • ¥15 求一款免费的pdf编辑js,web端用的
  • ¥30 申威平台是否兼容使用cmake,需要cmake版本在3.15版本以上
  • ¥20 UE5.2插件Remote Control Web Interface安装失败问题
  • ¥15 c#调用c++方法提示错误 无法封送处理泛型类型
  • ¥15 求分析下图晶体与三极管组成的振荡电路
  • ¥100 多线程+连接池+代理 运行一段时间线程阻塞
  • ¥15 关于#单片机#的问题:求一个使用C语言将重力加速度gx,gy,gz积分获取到速度的代码(相关搜索:c语言)
  • ¥15 matlab导致电脑重启问题