Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Docker] Intel-MLLib does not support docker mode. #222

Open
haojinIntel opened this issue Jul 12, 2022 · 0 comments
Open

[Docker] Intel-MLLib does not support docker mode. #222

haojinIntel opened this issue Jul 12, 2022 · 0 comments

Comments

@haojinIntel
Copy link
Collaborator

We find that Intel-MLlib will encounter error when using docker mode. The error is showed below:

22/07/12 04:44:01 INFO OneCCL: Initializing with IP_PORT: 10.10.1.144_3000
OneCCL (native): init
2022:07:12-04:44:01:(11155) ERROR: |ERROR| atl_ofi.cpp:637 send: fi_tsendmsg(prov_ep->tx, &msg, 0)
 fails with ret: -2, strerror: No such file or directory
terminate called after throwing an instance of 'ccl::v1::exception'
  what():  oneCCL: atl_ofi.cpp:637 send: EXCEPTION: OFI function error


.
Driver stacktrace:
        at org.apache.spark.scheduler.DAGScheduler.failJobAndIndependentStages(DAGScheduler.scala:2454)
        at org.apache.spark.scheduler.DAGScheduler.$anonfun$abortStage$2(DAGScheduler.scala:2403)
        at org.apache.spark.scheduler.DAGScheduler.$anonfun$abortStage$2$adapted(DAGScheduler.scala:2402)
        at scala.collection.mutable.ResizableArray.foreach(ResizableArray.scala:62)
        at scala.collection.mutable.ResizableArray.foreach$(ResizableArray.scala:55)
        at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:49)
        at org.apache.spark.scheduler.DAGScheduler.abortStage(DAGScheduler.scala:2402)
        at org.apache.spark.scheduler.DAGScheduler.$anonfun$handleTaskSetFailed$1(DAGScheduler.scala:1160)
        at org.apache.spark.scheduler.DAGScheduler.$anonfun$handleTaskSetFailed$1$adapted(DAGScheduler.scala:1160)
        at scala.Option.foreach(Option.scala:407)
        at org.apache.spark.scheduler.DAGScheduler.handleTaskSetFailed(DAGScheduler.scala:1160)
        at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.doOnReceive(DAGScheduler.scala:2642)
        at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:2584)
        at org.apache.spark.scheduler.DAGSchedulerEventProcessLoop.onReceive(DAGScheduler.scala:2573)
        at org.apache.spark.util.EventLoop$$anon$1.run(EventLoop.scala:49)
        at org.apache.spark.scheduler.DAGScheduler.runJob(DAGScheduler.scala:938)
        at org.apache.spark.SparkContext.runJob(SparkContext.scala:2214)
        at org.apache.spark.SparkContext.runJob(SparkContext.scala:2235)
        at org.apache.spark.SparkContext.runJob(SparkContext.scala:2254)
        at org.apache.spark.SparkContext.runJob(SparkContext.scala:2279)
        at org.apache.spark.rdd.RDD.$anonfun$collect$1(RDD.scala:1030)
        at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
        at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:112)
        at org.apache.spark.rdd.RDD.withScope(RDD.scala:414)
        at org.apache.spark.rdd.RDD.collect(RDD.scala:1029)
        at com.intel.oap.mllib.stat.CorrelationDALImpl.computeCorrelationMatrix(CorrelationDALImpl.scala:40)
        at org.apache.spark.ml.stat.spark321.Correlation.corr(Correlation.scala:86)
        at org.apache.spark.ml.stat.Correlation$.corr(Correlation.scala:74)
        at com.intel.hibench.sparkbench.ml.CorrelationExample$.run(CorrelationExample.scala:56)
        at com.intel.hibench.sparkbench.ml.CorrelationExample$.main(CorrelationExample.scala:33)
        at com.intel.hibench.sparkbench.ml.CorrelationExample.main(CorrelationExample.scala)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
@xwu99 xwu99 changed the title Intel-MLLib does not support docker mode. [Docker] Intel-MLLib does not support docker mode. Apr 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant