今天 Ambari的 上的集群一个结点 lost heatbeat 解决过程:
- HDFS块丢失
- https://blog.csdn.net/mnasd/article/details/82143653
- https://www.aliyun.com/jiaocheng/804017.html
跑一个MRjob 报如下错误:
retry.RetryInvocationHandler: java.io.EOFException: End of File Exception between local host is: "chb0-179004/192.168.179.4"; destination host is: "chb0-179005":8050; : java.io.EOFException; For more details see: http://wiki.apache.org/hadoop/EOFException, while invoking ApplicationClientProtocolPBClientImpl.getApplicationReport over null. Retrying after sleeping for 15000ms.
终极解决方法: 重启了我的虚拟机 ,就ok,