Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ignore hanging DistributedSuite #121

Merged
merged 1 commit into from
Mar 8, 2017
Merged

Conversation

ash211
Copy link

@ash211 ash211 commented Mar 7, 2017

Fixes #120

@ash211 ash211 changed the title Ingore hanging DistributedSuite Ignore hanging DistributedSuite Mar 7, 2017
@ash211
Copy link
Author

ash211 commented Mar 8, 2017

Passed!!!

@ash211 ash211 merged commit 7dc5bcd into master Mar 8, 2017
@ash211 ash211 deleted the ignore-hanging-distributedsuite branch March 8, 2017 03:30
dansanduleac pushed a commit that referenced this pull request Mar 9, 2017
mccheah pushed a commit that referenced this pull request Apr 27, 2017
mattsills pushed a commit to mattsills/spark that referenced this pull request Jul 17, 2020
### What changes were proposed in this pull request?

Updated kubernetes client.

### Why are the changes needed?

https://issues.apache.org/jira/browse/SPARK-27812
https://issues.apache.org/jira/browse/SPARK-27927

We need this fix fabric8io/kubernetes-client#1768 that was released on version 4.6 of the client. The root cause of the problem is better explained in apache#25785

### Does this PR introduce any user-facing change?

Nope, it should be transparent to users

### How was this patch tested?

This patch was tested manually using a simple pyspark job

```python
from pyspark.sql import SparkSession

if __name__ == '__main__':
    spark = SparkSession.builder.getOrCreate()
```

The expected behaviour of this "job" is that both python's and jvm's process exit automatically after the main runs. This is the case for spark versions <= 2.4. On version 2.4.3, the jvm process hangs because there's a non daemon thread running

```
"OkHttp WebSocket https://10.96.0.1/..." palantir#121 prio=5 os_prio=0 tid=0x00007fb27c005800 nid=0x24b waiting on condition [0x00007fb300847000]
"OkHttp WebSocket https://10.96.0.1/..." palantir#117 prio=5 os_prio=0 tid=0x00007fb28c004000 nid=0x247 waiting on condition [0x00007fb300e4b000]
```
This is caused by a bug on `kubernetes-client` library, which is fixed on the version that we are upgrading to.

When the mentioned job is run with this patch applied, the behaviour from spark <= 2.4.3 is restored and both processes terminate successfully

Closes apache#26093 from igorcalabria/k8s-client-update.

Authored-by: igor.calabria <[email protected]>
Signed-off-by: Dongjoon Hyun <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant