-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
java.io.NotSerializableException: scala.xml.NodeSeq$$anon$1 #201
Comments
What version of scala-xml are you using? We had a release recently which contained a related fix on serialization, #154. |
@ashawley |
I don't know enough about using the Spark shell, but there's a good chance it may be using the scala-xml version that is brought in for the scala-compiler rather than the one you specified with --packages. Going down the rabbit hole, it appears this is the Spark shell script that finally runs Java: https://github.com/apache/spark/blob/73f28530/bin/spark-class java -cp "$LAUNCH_CLASSPATH" org.apache.spark.launcher.Main "$@" And LAUNCH_CLASSPATH="$SPARK_JARS_DIR/*" The jars directory is compiled in this shell script: https://github.com/apache/spark/blob/7013eea/dev/make-distribution.sh Those jars come from 'assembly/target` # Copy jars
cp "$SPARK_HOME"/assembly/target/scala*/jars/* "$DISTDIR/jars/" And there are multiple maven build files referencing scala-compiler: https://github.com/apache/spark/blob/73f2853/pom.xml |
Seems like Spark maintainers could either:
|
I am writing the map function in spark to parse xml within the log. But I got the NotSerializableException. I cannot figure it out the reason. The trace stack is followed. How to walk around it? Anyone has suggestion?
The way I am using is
The text was updated successfully, but these errors were encountered: