Netty中IOException:Connectionresetbypeer与java.。。。最近发现系统中出现了很多 IOException: Connection reset by peer 与 ClosedChannelException: null
深⼊看了看代码, 做了些测试, 发现 Connection reset 会在客户端不知道 channel 被关闭的情况下, 触发了 eventloop 的 ad() 操作抛出
⽽ ClosedChannelException ⼀般是由 Netty 主动抛出的, 在 AbstractChannel 以及 SSLHandler ⾥都可以看到 ClosedChannel 相关的代码
AbstractChannel
static final ClosedChannelException CLOSED_CHANNEL_EXCEPTION = new ClosedChannelException();
...
static {
CLOSED_CHANNEL_EXCEPTION.setStackTrace(EmptyArrays.EMPTY_STACK_TRACE);
NOT_YET_CONNECTED_EXCEPTION.setStackTrace(EmptyArrays.EMPTY_STACK_TRACE);
}
...
@Override
public void write(Object msg, ChannelPromise promise) {
ChannelOutboundBuffer outboundBuffer = this.outboundBuffer;
if (outboundBuffer == null) {
// If the outboundBuffer is null we know the channel was closed and so
// need to fail the future right away. If it is not null the handling of the rest
// will be done in flush0()
// See github/netty/netty/issues/2362
safeSetFailure(promise, CLOSED_CHANNEL_EXCEPTION);
/
/ release message now to prevent resource-leak
return;
}
outboundBuffer.addMessage(msg, promise);
}
在代码的许多部分, 都会有这个 ClosedChannelException, ⼤概的意思是说在 channel close 以后, 如果还调⽤了 write ⽅法, 则会将 write 的future 设置为 failure, 并将 cause 设置为 ClosedChannelException, 同样 SSLHandler 中也类似
-----------------
回到 Connection reset by peer, 要模拟这个情况⽐较简单, 就是在 server 端设置⼀个在 channelActive 的时候就 close channel 的 handler. ⽽在 client 端则写⼀个 Connect 成功后⽴即发送请求数据的 listener. 如下
client
public static void main(String[] args) throws IOException, InterruptedException {
Bootstrap b = new Bootstrap();
.channel(NioSocketChannel.class)
.handler(new ChannelInitializer<NioSocketChannel>() {
@Override
protected void initChannel(NioSocketChannel ch) throws Exception {
}
});
@Override
public void operationComplete(ChannelFuture future) throws Exception {
if (future.isSuccess()) {
future.channel().write(Unpooled.buffer().writeBytes("123".getBytes()));
future.channel().flush();
}
}
});
server
public class SimpleServer {
public static void main(String[] args) throws Exception {
EventLoopGroup bossGroup = new NioEventLoopGroup(1);
EventLoopGroup workerGroup = new NioEventLoopGroup();
ServerBootstrap b = new ServerBootstrap();
.channel(NioServerSocketChannel.class)
.option(ChannelOption.SO_REUSEADDR, true)
.childHandler(new ChannelInitializer<NioSocketChannel>() {
@Override
protected void initChannel(NioSocketChannel ch) throws Exception {
ch.pipeline().addLast(new SimpleServerHandler());
}
});
b.bind(8090).sync().channel().closeFuture().sync();
}
}
public class SimpleServerHandler extends ChannelInboundHandlerAdapter {
@Override
public void channelActive(ChannelHandlerContext ctx) throws Exception {
ctx.channel().close().sync();
}
@Override
public void channelRead(ChannelHandlerContext ctx, final Object msg) throws Exception {
System.out.println(123);
}
@Override
public void channelInactive(ChannelHandlerContext ctx) throws Exception {
System.out.println("inactive");
}
}
这种情况之所以能触发 connection reset by peer 异常, 是因为 connect 成功以后, client 段先会触发 connect 成功的 listener, 这个时候 server 段虽然断开了 channel, 也触发 channel 断开的事件 (它会触发⼀个客户端 read 事件, 但是这个 read 会返回 -1, -1 代表 channel 关闭, client 的 channelInactive 跟 channel  active 状态的改变都是在这时发⽣的), 但是这个事件是在 connect 成功的 listener 之后执⾏, 所以这个时候listener ⾥的 channel 并不知道⾃⼰已经断开, 它还是会继续进⾏ write 跟 flush 操作, 在调⽤ flush 后, eventloop 会进⼊ OP_READ 事件⾥,这时候 ad() 就会抛出 connection reset 异常. eventloop 代码如下
NioEventLoop
private static void processSelectedKey(SelectionKey k, AbstractNioChannel ch) {
final NioUnsafe unsafe = ch.unsafe();
if (!k.isValid()) {
// close the channel if the key is not valid anymore
unsafe.close(unsafe.voidPromise());
return;
}
try {
int readyOps = k.readyOps();
// Also check for readOps of 0 to workaround possible JDK bug which may otherwise lead
/
/ to a spin loop
if ((readyOps & (SelectionKey.OP_READ | SelectionKey.OP_ACCEPT)) != 0 || readyOps == 0) {
if (!ch.isOpen()) {
// Connection already closed - no need to handle write.
return;
}
}
if ((readyOps & SelectionKey.OP_WRITE) != 0) {
// Call forceFlush which will also take care of clear the OP_WRITE once there is nothing left to write
ch.unsafe().forceFlush();
}
if ((readyOps & SelectionKey.OP_CONNECT) != 0) {
// remove OP_CONNECT as otherwise Selector.select(..) will always return without blocking
// See github/netty/netty/issues/924
int ops = k.interestOps();
ops &= ~SelectionKey.OP_CONNECT;
k.interestOps(ops);
unsafe.finishConnect();
}
} catch (CancelledKeyException e) {
unsafe.close(unsafe.voidPromise());
}
}
这就是 connection reset by peer 产⽣的原因
------------------
再来看 ClosedChannelException 如何产⽣, 要复现他也很简单. ⾸先要明确, 并没有客户端主动关闭才会出现 ClosedChannelException 这么⼀说. 下⾯来看两种出现 ClosedChannelException 的客户端写法
client 1, 主动关闭 channel
public class SimpleClient {
private static final Logger logger = Logger(SimpleClient.class);
public static void main(String[] args) throws IOException, InterruptedException {
Bootstrap b = new Bootstrap();
.channel(NioSocketChannel.class)
.handler(new ChannelInitializer<NioSocketChannel>() {
@Override
protected void initChannel(NioSocketChannel ch) throws Exception {
}
});
@Override
public void operationComplete(ChannelFuture future) throws Exception {
if (future.isSuccess()) {
future.channel().close();
future.channel().write(Unpooled.buffer().writeBytes("123".getBytes())).addListener(new ChannelFutureListener() {
@Override
public void operationComplete(ChannelFuture future) throws Exception {
if (!future.isSuccess()) {
<("Error", future.cause());
}
}
});
future.channel().flush();
}
}
});
}
}
只要在 write 之前主动调⽤了 close, 那么 write 必然会知道 close 是 close 状态, 最后 write 就会失败, 并且 future ⾥的 cause 就是ClosedChannelException
--------------------
client 2. 由服务端造成的 ClosedChannelException
public class SimpleClient {
private static final Logger logger = Logger(SimpleClient.class);
public static void main(String[] args) throws IOException, InterruptedException {
Bootstrap b = new Bootstrap();
.channel(NioSocketChannel.class)
.handler(new ChannelInitializer<NioSocketChannel>() {
@Override
protected void initChannel(NioSocketChannel ch) throws Exception {
}
});
Channel channel = b.connect("localhost", 8090).sync().channel();
Thread.sleep(3000);
channel.writeAndFlush(Unpooled.buffer().writeBytes("123".getBytes())).addListener(new ChannelFutureListener() {
@Override
public void operationComplete(ChannelFuture future) throws Exception {
if (!future.isSuccess()) {
<("error", future.cause());
}
}peer
});
}
}
服务端
public class SimpleServer {
public static void main(String[] args) throws Exception {
EventLoopGroup bossGroup = new NioEventLoopGroup(1);
EventLoopGroup workerGroup = new NioEventLoopGroup();
ServerBootstrap b = new ServerBootstrap();
.channel(NioServerSocketChannel.class)
.option(ChannelOption.SO_REUSEADDR, true)
.childHandler(new ChannelInitializer<NioSocketChannel>() {
@Override
protected void initChannel(NioSocketChannel ch) throws Exception {
ch.pipeline().addLast(new SimpleServerHandler());
}
});
b.bind(8090).sync().channel().closeFuture().sync();
}
}
这种情况下,  服务端将 channel 关闭, 客户端先 sleep, 这期间 client 的 eventLoop 会处理客户端关闭的时间, 也就是 eventLoop 的processKey ⽅法会进⼊ OP_READ, 然后 read 出来⼀个 -1, 最后触发 client channelInactive 事件, 当 sleep 醒来以后, 客户端调⽤writeAndFlush, 这时候客户端 channel 的状态已经变为了 inactive, 所以 write 失败, cause 为 ClosedChannelException

版权声明:本站内容均来自互联网,仅供演示用,请勿用于商业和其他非法用途。如果侵犯了您的权益请与我们联系QQ:729038198,我们将在24小时内删除。