
Hadoop: "mapred.ReduceTask:java.net.ConnectExceptio

Source: 动视网 Editor: 小采 Date: 2020-11-09 07:49:10

Node 191 in the cluster ran into a fault, and the following showed up in the logs:

[plain]

2013-11-08 08:32:13,908 WARN org.apache.hadoop.mapred.ReduceTask: attempt_201311061017_18902_r_000000_0 copy failed: attempt_201311061017_18902_m_000003_0 from node-192
2013-11-08 08:32:13,921 WARN org.apache.hadoop.mapred.ReduceTask: java.net.ConnectException: Connection timed out
    at java.net.PlainSocketImpl.socketConnect(Native Method)
    at java.net.AbstractPlainSocketImpl.doConnect(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connectToAddress(Unknown Source)
    at java.net.AbstractPlainSocketImpl.connect(Unknown Source)
    at java.net.SocksSocketImpl.connect(Unknown Source)
    at java.net.Socket.connect(Unknown Source)
    at sun.net.NetworkClient.doConnect(Unknown Source)
    at sun.net.www.http.HttpClient.openServer(Unknown Source)
    at sun.net.www.http.HttpClient.openServer(Unknown Source)
    at sun.net.www.http.HttpClient.<init>(Unknown Source)
    at sun.net.www.http.HttpClient.New(Unknown Source)
    at sun.net.www.http.HttpClient.New(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.getNewHttpClient(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.plainConnect(Unknown Source)
    at sun.net.www.protocol.http.HttpURLConnection.connect(Unknown Source)
    at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getInputStream(ReduceTask.java:1631)
    at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.setupSecureConnection(ReduceTask.java:1588)
    at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.getMapOutput(ReduceTask.java:1488)
    at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.copyOutput(ReduceTask.java:1399)
    at org.apache.hadoop.mapred.ReduceTask$ReduceCopier$MapOutputCopier.run(ReduceTask.java:1331)

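The Hadoop code shows how the local hostname is determined. If slave.host.name is configured, it is used. Otherwise the name is obtained through DNS: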

[java]

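localFs = FileSystem.getLocal(fConf);
// An explicitly configured slave.host.name takes precedence.
if (fConf.get("slave.host.name") != null) {
  this.localHostname = fConf.get("slave.host.name");
}
// Otherwise fall back to the default host reported by DNS for the
// configured interface and nameserver.
if (localHostname == null) {
  this.localHostname =
    DNS.getDefaultHost(
      fConf.get("mapred.tasktracker.dns.interface", "default"),
      fConf.get("mapred.tasktracker.dns.nameserver", "default"));
}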
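The lookup order in the snippet above can be sketched in isolation. This is only an illustration under stated assumptions: `HostnameChooser`, `choose`, and `systemHostname` are hypothetical names that are not part of Hadoop, a plain `Map` stands in for the job configuration, and `InetAddress.getLocalHost()` stands in for `DNS.getDefaultHost`.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.Collections;
import java.util.Map;
import java.util.function.Supplier;

// Hypothetical helper (not Hadoop code): mirrors the lookup order shown above.
final class HostnameChooser {
    // Returns the configured "slave.host.name" if present,
    // otherwise whatever the fallback supplier reports.
    static String choose(Map<String, String> conf, Supplier<String> fallback) {
        String configured = conf.get("slave.host.name");
        return configured != null ? configured : fallback.get();
    }

    // Stand-in for DNS.getDefaultHost: asks the JVM for the local host name.
    static String systemHostname() {
        try {
            return InetAddress.getLocalHost().getHostName();
        } catch (UnknownHostException e) {
            return "localhost"; // assumption: a harmless default for this sketch
        }
    }

    public static void main(String[] args) {
        Map<String, String> empty = Collections.emptyMap();
        Map<String, String> pinned =
            Collections.singletonMap("slave.host.name", "node-191");
        System.out.println(choose(empty, HostnameChooser::systemHostname));
        System.out.println(choose(pinned, HostnameChooser::systemHostname));
    }
}
```

When nothing is configured, the name depends entirely on what the resolver returns, which is why the next step is to check how that name resolves.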

Pinging that hostname from the node:

[plain]

ping node-191
PING node-128-191.localhost (220.250.64.228) 56(84) bytes of data.
64 bytes from 220.250.64.228: icmp_seq=1 ttl=247 time=14.8 ms
64 bytes from 220.250.64.228: icmp_seq=2 ttl=247 time=14.3 ms
64 bytes from 220.250.64.228: icmp_seq=3 ttl=247 time=14.4 ms

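The address that answers is not 191's IP at all.

A look at the hosts file on that node showed that 191's hostname was not configured there either.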
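The hosts-file check can also be done programmatically. Below is a minimal sketch, assuming the usual hosts format (an IP address followed by one or more names, with `#` starting a comment). `HostsLookup` and `lookup` are hypothetical names, and the addresses in the sample text are placeholders from the 192.0.2.0/24 documentation range, not the cluster's real IPs.

```java
import java.util.Optional;

// Hypothetical helper: finds the IP that a hosts-format text assigns to a name.
final class HostsLookup {
    // Returns the IP of the first line that lists the given hostname
    // (compared case-insensitively), or empty if no line mentions it.
    static Optional<String> lookup(String hostsContent, String hostname) {
        for (String rawLine : hostsContent.split("\\R")) {
            // Drop trailing comments, then surrounding whitespace.
            int hash = rawLine.indexOf('#');
            String line = (hash >= 0 ? rawLine.substring(0, hash) : rawLine).trim();
            if (line.isEmpty()) {
                continue;
            }
            // fields[0] is the IP; the remaining fields are names and aliases.
            String[] fields = line.split("\\s+");
            for (int i = 1; i < fields.length; i++) {
                if (fields[i].equalsIgnoreCase(hostname)) {
                    return Optional.of(fields[0]);
                }
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        // Placeholder content; node-191 is deliberately missing.
        String hosts = "127.0.0.1 localhost\n192.0.2.192 node-192\n";
        System.out.println("node-191 present: " + lookup(hosts, "node-191").isPresent());
        System.out.println("node-192 present: " + lookup(hosts, "node-192").isPresent());
    }
}
```

To check a real node, the content of its hosts file (typically `/etc/hosts` on Linux) could be read into a string and passed to `lookup`.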

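That explains the problem: the hostname resolved to the wrong address, so connections made to it timed out.

The fix was to add 191's hostname to the hosts file on every node in the cluster and restart the TaskTracker.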
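After updating the hosts files, it may be worth confirming on each node that the name now resolves as intended. The following is a rough sketch, not part of the original fix: `ResolveCheck` and `resolvesTo` are hypothetical names, and the default expected address `192.0.2.191` is a placeholder, since the article does not give 191's real IP.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;

// Hypothetical helper: checks that a hostname resolves to an expected IP.
final class ResolveCheck {
    // True if any address the resolver returns for host equals expectedIp.
    static boolean resolvesTo(String host, String expectedIp) {
        try {
            for (InetAddress addr : InetAddress.getAllByName(host)) {
                if (addr.getHostAddress().equals(expectedIp)) {
                    return true;
                }
            }
        } catch (UnknownHostException e) {
            // An unresolvable name counts as a mismatch.
        }
        return false;
    }

    public static void main(String[] args) {
        // Defaults are placeholders; pass the real hostname and IP as arguments.
        String host = args.length > 0 ? args[0] : "node-191";
        String expected = args.length > 1 ? args[1] : "192.0.2.191";
        System.out.println(host + " resolves to " + expected + ": "
            + resolvesTo(host, expected));
    }
}
```

Run on every node with the hostname and its intended IP as arguments, a `false` result points to a node whose hosts file or DNS still returns the wrong address.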
