You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We find that Intel-MLlib will encounter error when using docker mode. The error is showed below:
22/07/12 04:44:01 INFO OneCCL: Initializing with IP_PORT: 10.10.1.144_3000
OneCCL (native): init
2022:07:12-04:44:01:(11155) ERROR: |ERROR| atl_ofi.cpp:637 send: fi_tsendmsg(prov_ep->tx, &msg, 0)
fails with ret: -2, strerror: No such file or directory
terminate called after throwing an instance of 'ccl::v1::exception'
what(): oneCCL: atl_ofi.cpp:637 send: EXCEPTION: OFI function error
.
Driver stacktrace:
at org.apache.spark.scheduler.DAGScheduler.failJobAndIndependentStages(DAGScheduler.scala:2454)
at org.apache.spark.scheduler.DAGScheduler.$anonfun$abortStage$2(DAGScheduler.scala:2403)
at org.apache.spark.scheduler.DAGScheduler.$anonfun$abortStage$2$adapted(DAGScheduler.scala:2402)
at scala.collection.mutable.ResizableArray.foreach(ResizableArray.scala:62)
at scala.collection.mutable.ResizableArray.foreach$(ResizableArray.scala:55)
at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:49)
at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:2402)
at org.apache.spark.scheduler.DAGScheduler.$anonfun$handleTaskSetFailed$1(DAGScheduler.scala:1160)
at org.apache.spark.scheduler.DAGScheduler.$anonfun$handleTaskSetFailed$1$adapted(DAGScheduler.scala:1160)
at scala.Option.foreach(Option.scala:407)
at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:1160)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.doOnReceive(DAGScheduler.scala:2642)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:2584)
at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:2573)
at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:49)
at org.apache.spark.scheduler.DAGScheduler.runJob(DAGScheduler.scala:938)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:2214)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:2235)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:2254)
at org.apache.spark.SparkContext.runJob(SparkContext.scala:2279)
at org.apache.spark.rdd.RDD.$anonfun$collect$1(RDD.scala:1030)
at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:112)
at org.apache.spark.rdd.RDD.withScope(RDD.scala:414)
at org.apache.spark.rdd.RDD.collect(RDD.scala:1029)
at com.intel.oap.mllib.stat.CorrelationDALImpl.computeCorrelationMatrix(CorrelationDALImpl.scala:40)
at org.apache.spark.ml.stat.spark321.Correlation.corr(Correlation.scala:86)
at org.apache.spark.ml.stat.Correlation$.corr(Correlation.scala:74)
at com.intel.hibench.sparkbench.ml.CorrelationExample$.run(CorrelationExample.scala:56)
at com.intel.hibench.sparkbench.ml.CorrelationExample$.main(CorrelationExample.scala:33)
at com.intel.hibench.sparkbench.ml.CorrelationExample.main(CorrelationExample.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
The text was updated successfully, but these errors were encountered:
xwu99
changed the title
Intel-MLLib does not support docker mode.
[Docker] Intel-MLLib does not support docker mode.
Apr 14, 2023
We find that Intel-MLlib will encounter error when using docker mode. The error is showed below:
The text was updated successfully, but these errors were encountered: