-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Azure Databricks java.lang.ClassNotFoundException: com.twosigma.flint.timeseries.TimeSeriesRDDBuilder #55
Comments
@josephd000 I think that is a good question for the Databricks folks. My understanding is there are some extra levels of indirection with Spark connection when working with a Databricks cluster and also some form of jar file loading logic built into Databricks runtime which is entirely proprietary, so you will need some additional steps to make it work on a Databricks cluster. Meanwhile if I do find something simple that make the Databricks use case work I'll let you know. |
@yitao-li , I went digging through the sparklyr.flint code and found the non-exported function,
I then created those "Libraries" on Databricks by passing in the "packages" and "repositories" where the Databricks Library GUI asks for "Coordinates" and "Repository", respectively. After installing these two "Libraries" on my cluster, I was able to successfully use |
@josephd000 Good to know! 👍 |
I have the same issue with Spark 3.1.1, Scala 2.12, Sparklyr 1.7.1 and Sparklyr.flint 0.2.1. I don't think I can install libraries on the cluster, I hope there will be some smooth solution soon. Thank you for the great looking package! |
@kehldaniel Did you also create a
or similar? |
Yes, (after trying hard with my own code that is running on my own laptop) I am running the exact same lines of code as in the original post by josephd000 and get the same error. |
Error
Expectation
That I can use basic sparklyr.flint functions on Azure Databricks without classpath errors by using
install.packages("sparklyr.flint")
.Details
I've created a "Library" with
flint-0.6.0
from Maven and installed it onto my cluster, detached and reattached my notebook, calledlibrary(sparklyr.flint)
beforespark_connect()
and it still can't find the library.Config
Reproducible code
The text was updated successfully, but these errors were encountered: