Fix pyspark test #1909

Merged
merged 13 commits into from
Mar 30, 2023

Conversation

loomlike
Collaborator

Description

The environment variables that PySpark requires are not set properly when the tests run on AzureML.
This PR sets those env vars. Additionally, it increases the stack size in the driver and executors to avoid StackOverflowError.
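
For readers who want to reproduce the idea outside this repository, here is a minimal sketch of the two changes described above. It is not the code from this PR: the helper names (`configure_pyspark_env`, `stack_size_options`, `build_session`) and the `4m` stack size are assumptions made for illustration, since the description does not state the value used. The env var names match the lines reviewed below, and `spark.driver.extraJavaOptions` / `spark.executor.extraJavaOptions` are standard Spark configuration keys.

```python
import os
import sys

# Hypothetical value: the PR description does not state the stack size it uses.
STACK_SIZE = "4m"


def configure_pyspark_env(environ=os.environ):
    """Point PySpark at the current interpreter and drop any stale SPARK_HOME."""
    environ["PYSPARK_PYTHON"] = sys.executable
    environ["PYSPARK_DRIVER_PYTHON"] = sys.executable
    environ.pop("SPARK_HOME", None)
    return environ


def stack_size_options(stack_size=STACK_SIZE):
    """Spark settings that pass the JVM -Xss flag to the driver and executors."""
    jvm_flag = f"-Xss{stack_size}"
    return {
        "spark.driver.extraJavaOptions": jvm_flag,
        "spark.executor.extraJavaOptions": jvm_flag,
    }


def build_session(app_name="pyspark-test"):
    """Create a local SparkSession with the settings above (requires pyspark)."""
    # Imported lazily so the two helpers above work without pyspark installed.
    from pyspark.sql import SparkSession

    configure_pyspark_env()
    builder = SparkSession.builder.appName(app_name).master("local[*]")
    for key, value in stack_size_options().items():
        builder = builder.config(key, value)
    return builder.getOrCreate()
```

The env vars need to be set before the first SparkSession is created, which is why `build_session` calls `configure_pyspark_env` first. Whether `4m` is enough to avoid the StackOverflowError depends on the workload.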

Related Issues

#1898

Checklist:

  • I have followed the contribution guidelines and code style for this project.
  • I have added tests covering my contributions.

@loomlike
Collaborator Author

@miguelgfierro This PR is merging into your branch. You'll need to restore the commented-out tests in your branch before merging into Staging or Main.

Comment on lines +83 to +87

os.environ["PYSPARK_PYTHON"] = sys.executable
os.environ["PYSPARK_DRIVER_PYTHON"] = sys.executable
os.environ.pop("SPARK_HOME", None)

Collaborator

This is great, I actually hadn't thought of this option.

Spark is like a kid that is going through

@miguelgfierro miguelgfierro merged commit 72a5100 into miguel/error_spark_benchmark Mar 30, 2023
@miguelgfierro miguelgfierro deleted the jumin/fix_pyspark_test branch March 30, 2023 10:47