当我启动kafka时,它抱怨它无法连接到Zookeeper时迅速失败。我将Zookeeper作为独立的集群/集成运行。我很困惑,因为服务器之间没有防火墙(如zookeeper-shell.sh测试所证明)。
从/var/log/kafka/server.log
2016-02-24 16:07:12,101 INFO kafka.server.KafkaServer: [Kafka Server 1], Connecting to zookeeper on 10.7.20.100:2181,10.7.20.101:2181,10.7.20.102:2181 2016-02-24 16:07:20,291 FATAL kafka.server.KafkaServerStartable: Fatal error during KafkaServerStable startup. Prepare to shutdown org.I0Itec.zkclient.exception.ZkTimeoutException: Unable to connect to zookeeper server within timeout: 6000 at org.I0Itec.zkclient.ZkClient.connect(ZkClient.java:880) at org.I0Itec.zkclient.ZkClient.(ZkClient.java:98) at org.I0Itec.zkclient.ZkClient. (ZkClient.java:84) at kafka.server.KafkaServer.initZk(KafkaServer.scala:113) at kafka.server.KafkaServer.startup(KafkaServer.scala:69) at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:34) at kafka.Kafka$.main(Kafka.scala:46) at kafka.Kafka.main(Kafka.scala) 2016-02-24 16:07:20,294 INFO kafka.server.KafkaServer: [Kafka Server 1], shutting down 2016-02-24 16:07:20,312 INFO kafka.server.KafkaServer: [Kafka Server 1], shut down completed 2016-02-24 16:07:20,317 INFO kafka.server.KafkaServer: [Kafka Server 1], shutting down
但是,从/ opt / kafka安装目录中,我可以使用集合连接字符串连接到zookeeper-因此,我真的怀疑它是网络还是防火墙。
[me@dckafka01 kafka]$ cd /opt/kafka [me@dckafka01 kafka]$ bin/zookeeper-shell.sh 10.7.20.100:2181,10.7.20.101:2181,10.7.20.102:2181 Connecting to 10.7.20.100:2181,10.7.20.101:2181,10.7.20.102:2181 Welcome to ZooKeeper! JLine support is disabled WATCHER::WatchedEvent state:SyncConnected type:None path:null get /blah null cZxid = 0x400000009 ctime = Tue Feb 16 09:00:28 EST 2016 mZxid = 0x400000009 mtime = Tue Feb 16 09:00:28 EST 2016 pZxid = 0x40000017e cversion = 2 dataVersion = 0 aclVersion = 0 ephemeralOwner = 0x0 dataLength = 0 numChildren = 2 ls /blah [applications, registry]
符合预期。有人愿意我调查吗?
好吧-更改超时很有帮助。现在我需要追踪网络延迟
cat config / server.properties
# coding: UTF-8 # This file created by Chef from template. Do not hand edit this file log.dirs=/var/kafka port=9092 num.partitions=4 default.replication.factor=3 log.flush.interval.messages=1 log.retention.minutes=43200 log.retention.check.interval.ms=3600000 num.replica.fetchers=4 replica.fetch.wait.max.ms=5000 replica.lag.max.messages=10000 auto.leader.rebalance.enable=true num.network.threads=8 advertised.host.name=10.7.20.71 zookeeper.connection.timeout.ms=16000 broker.id=1 zookeeper.connect=10.7.20.100:2181,10.7.20.101:2181,10.7.20.102:2181