我已经将Zeppelin 0.5.5二进制文件下载到了redhat 6.3实例中.我也成功地建立了火花1.5.2.现在,我想让zeppelin使用spark版本而不是zeppelin附带的预编译版本.
为此,我编辑了conf/so,它只包含这条未注释的行:
export SPARK_HOME=/home/eron/spark-1.5.2/
我还编辑了zeppelin GUI,现在我已经设置了Master:
master = spark://master:7077
然后我点击重启并尝试再次阅读教程:
val bankText = sc.textFile("/mnt/zeppelin-0.5.5-incubating-bin-all/bank.csv")
org.apache.thrift.transport.TIOStreamTransport.read(TIOStreamTransport.java:132)中的org.apache.thrift.transport.TTransportException位于org.apache.thrift.transport.TTransport.readAll(TTransport.java:86).位于org.apache.thrift.protocol.TBinaryProtocol.readI32(TBinaryProtocol.java:318)的apache.thrift.protocol.TBinaryProtocol.readAll(TBinaryProtocol.java:429)org.apache.thrift.protocol.TBinaryProtocol.readMessageBegin(TBinaryProtocol. Java的:219)在org.apache.thrift.TServiceClient.receiveBase(TServiceClient.java:69)在org.apache.zeppelin.interpreter.thrift.RemoteInterpreterService $ Client.recv_interpret(RemoteInterpreterService.java:220)在org.apache.zeppelin .interpreter.thrift.RemoteInterpreterService $ Client.interpret(RemoteInterpreterService.java:205)org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java:211)org.apache.zeppelin.interpreter.LazyOpenInterpreter.interpret (LazyOpenInterpreter.java:93)org.apache.zeppelin.notebook.Paragraph.jobRun(Paragraph.java:207)org.apache.zeppelin.scheduler.Job.run(Job.java:170)at org.apache.zeppelin.scheduler.RemoteScheduler $ JobRunner.run(RemoteScheduler.java:304)at java .util.concurrent.Executors $ RunnableAdapter.call(Executors.java:471)在java.util.concurrent.FutureTask.run(FutureTask.java:262)在java.util.concurrent.ScheduledThreadPoolExecutor中$ $ ScheduledFutureTask.access 201(的ScheduledThreadPoolExecutor. Java的:178)在java.util.concurrent.ScheduledThreadPoolExecutor中$ ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:292)在java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)在java.util.concurrent.ThreadPoolExecutor中的$工人java.lang.Thread.run中的.run(ThreadPoolExecutor.java:615)(Thread.java:745)
还有,干得好
sc
收益率:
java.net.ConnectException:连接被拒绝在java.net.PlainSocketImpl.socketConnect(本机方法)在java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339)在java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200)在java.net.SocksSocketImpl.connect(SocksSocketImpl.java:392)的java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182),位于org.apache的java.net.Socket.connect(Socket.java:579). thrift.transport.TSocket.open(TSocket.java:182)org.apache.zeppelin.interpreter.remote.ClientFactory.create(ClientFactory.java:51)at org.apache.zeppelin.interpreter.remote.ClientFactory.create( ClientFactory.java:37)org.apache.com上的org.apache.commons.pool2.BasePooledObjectFactory.makeObject(BasePooledObjectFactory.java:60)org.apache.com.and.Gempl(GenericObjectPool.java:86) org.apache.commons.pool2.impl.Gener上的.commons.pool2.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:435)icObjectPool.borrowObject(GenericObjectPool.java:363)在org.apache.zeppelin.interpreter.remote.RemoteInterpreterProcess.getClient(RemoteInterpreterProcess.java:139)在org.apache.zeppelin.interpreter.remote.RemoteInterpreter.interpret(RemoteInterpreter.java: 192)在org.apache.zeppelin.interpreter.LazyOpenInterpreter.interpret(LazyOpenInterpreter.java:93)在org.apache.zeppelin.notebook.Paragraph.jobRun(Paragraph.java:207)在org.apache.zeppelin.scheduler.Job .run(Job.java:170)atg.apache.zeppelin.scheduler.RemoteScheduler $ JobRunner.run(RemoteScheduler.java:304)at java.util.concurrent.Executors $ RunnableAdapter.call(Executors.java:471)at at java.util.concurrent.FutureTask.run(FutureTask.java:262)java.util.concurrent.ScheduledThreadPoolExecutor $ ScheduledFutureTask.access $ 201(ScheduledThreadPoolExecutor.java:178)java.util.concurrent.ScheduledThreadPoolExecutor $ ScheduledFutureTask.run(ScheduledThreadPoolExecutor) .java:292)at java.util.concurrent.ThreadPoolExecutor.runWorker(Thread PoolExecutor.java:1145)java.util.concurrent.ThreadPoolExecutor $ Worker.run(ThreadPoolExecutor.java:615)at java.lang.Thread.run(Thread.java:745)花了0秒.
我做错了什么想法?
Zeppelin 0.5.5的版本不适用于任何高于1.5.0的Apache spark版本,因此您需要从github下载Zeppelin 0.6.0并使用命令"mvn clean package -Pspark-1.5 -DskipTests"Zeppelin 0.5重新编译. 5默认情况下构建Spark 1.4.