support Python 3.10 #3140

hirosassa · 2022-01-21T22:13:19Z

Description

This PR adds a Python 3.10 CI workflow so that Luigi is tested against, and supports, Python 3.10.

Motivation and Context

To keep up with the latest Python version.

Have you tested this? If so, how?

Ran tox locally on a Mac and in Docker (Ubuntu 20.04).

dlstadther · 2022-03-12T04:21:10Z

@hirosassa what do we need to solve these 3.10 errors? There have been a couple of 3.10-related PRs that I've merged in the past few days or so: a deprecation warning and the tenacity version upgrade. Are there additional changes that we need to make before updating this branch and retesting?

Sorry for my infrequent reviews and feedback here.

hirosassa · 2022-03-12T12:01:45Z

@dlstadther Thanks for your comment! I would like to discuss the CI failure.
The CI error on Python 3.10 occurs only on GitHub Actions (I could not reproduce it locally in Docker or on a Mac).

I found that the problem comes from the code below:
https://github.com/spotify/luigi/blob/master/luigi/lock.py#L65-L66
In CI, fh is actually empty, whereas it is expected to contain the command currently being executed, such as echo hello.

I tried adding a retry using tenacity, as in the diff linked below. CI sometimes passes, but it still fails some of the time.
https://github.com/hirosassa/luigi/pull/1/files#diff-e126af1be65ab218abe136e1248d7ec7fca5e00f0c3322d3c1eef50ff4389b20R82-R85

Do you have any ideas?

rjcortese · 2022-04-28T14:42:23Z

It is possible that the GitHub Actions CI runners use a different procfs configuration, such that a read of /proc/pid/cmdline comes back empty (there is already a comment related to that in the code). It might be that reading an empty file does not raise IOError. Do other Python versions work on the GitHub Actions CI runners?

Another reason /proc/pid/cmdline might be empty is that the process is a zombie; see the /proc/[pid]/cmdline section of https://man7.org/linux/man-pages/man5/proc.5.html.

Did Python 3.10 change anything about the way open() or fh.read() works?
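
For context on the empty-read case, here is a minimal sketch of reading a process command line from procfs. It is not the code in luigi/lock.py: the names `parse_cmdline` and `read_cmdline` and the `proc_root` parameter are hypothetical, introduced only so the logic can be exercised without a real process. It assumes the layout described in proc(5), where the file holds NUL-separated arguments.

```python
import os
from typing import Optional


def parse_cmdline(raw: bytes) -> Optional[str]:
    """Turn the raw bytes of /proc/<pid>/cmdline into a printable command.

    Arguments are separated by NUL bytes. An empty read yields None so the
    caller can tell "no command line available" apart from a real command.
    """
    if not raw:
        return None
    # Splitting on NUL leaves an empty trailing piece; drop empty pieces.
    args = [part.decode("utf8", errors="replace") for part in raw.split(b"\0") if part]
    return " ".join(args) if args else None


def read_cmdline(pid: int, proc_root: str = "/proc") -> Optional[str]:
    """Read and parse <proc_root>/<pid>/cmdline; None if missing or empty."""
    path = os.path.join(proc_root, str(pid), "cmdline")
    try:
        with open(path, "rb") as fh:
            return parse_cmdline(fh.read())
    except OSError:
        # The process is gone, or procfs is not available on this platform.
        return None
```

Returning None for both "file missing" and "file empty" keeps the caller simple, at the cost of not distinguishing the two cases.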

rschmidtner · 2022-11-23T12:34:11Z

Are there any plans to make luigi Python 3.10 compatible in the future?

hirosassa · 2022-11-23T20:38:20Z

@rschmidtner In my environment (including production), Luigi works well with Python 3.10.
However, some tests fail in this PR.

ravwojdyla · 2022-12-12T18:58:35Z

@hirosassa thanks for starting this PR! Regarding the cmdline issue from #3140 (comment), have you tried adding a short sleep (e.g. time.sleep(0.42)) between these two lines:

luigi/test/lock_test.py

Lines 39 to 40 in 38b0c2b

    external_process = subprocess.Popen(command)
    result = luigi.lock.getpcmd(external_process.pid)

EDIT: I'm not saying that this is a fix, but it might confirm the problem.

hirosassa · 2022-12-12T21:45:19Z

@ravwojdyla Thanks for your comment! I added a sleep in c5ff943, but it failed.

ravwojdyla · 2022-12-12T21:50:24Z

I added a sleep in c5ff943, but it failed.

This time test_getpcmd did NOT fail; test_get_info failed, which is the other test that failed in the past, probably for the same reason. Could you please add the sleep in the other test as well, here:

luigi/test/lock_test.py

Lines 61 to 62 in 38b0c2b

    p = subprocess.Popen(["yes", u"à我ф"], stdout=subprocess.PIPE)
    pid, cmd, pid_file = luigi.lock.get_info(self.pid_dir, p.pid)

hirosassa · 2022-12-12T22:19:50Z

@ravwojdyla Wow! Thank you. Finally, all the tests pass.

ravwojdyla · 2022-12-12T22:33:27Z

Finally, all the tests pass.

@hirosassa nice, good job! time.sleep(0.42) was somewhat arbitrary and very likely overkill; a few milliseconds would probably be sufficient. If you want, you can mention in the code comment that this sleep is "necessary" to make sure the test can read a populated /proc/*/cmdline on Linux.

EDIT: there might be a more idiomatic way to handle this in a test than a sleep (maybe something could be synchronized, etc.), but I will leave that up to you and the reviewers to decide.
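
One possible alternative to a fixed sleep is to poll until the value appears or a deadline passes. The sketch below is only an illustration under that assumption; `wait_for_value` is a hypothetical helper, not something in Luigi or its test suite.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def wait_for_value(
    fetch: Callable[[], Optional[T]],
    timeout: float = 5.0,
    interval: float = 0.01,
) -> Optional[T]:
    """Call fetch() until it returns a truthy value or timeout seconds pass.

    Returns the first truthy value, or None if the deadline is reached.
    """
    deadline = time.monotonic() + timeout
    while True:
        value = fetch()
        if value:
            return value
        if time.monotonic() >= deadline:
            return None
        time.sleep(interval)
```

In a test this might wrap the lookup, for example `wait_for_value(lambda: luigi.lock.getpcmd(external_process.pid))`, so the common case returns after a few milliseconds and only a slow runner pays for the full timeout.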

lallea · 2022-12-12T23:07:55Z

The sleep is fragile. On a bad day, it won't be enough and a longer sleep slows down tests. I suggest moving the getpcmd call to a separate function and wrapping it with tenacity.retry with a timeout of tens of seconds.

hirosassa · 2022-12-12T23:11:37Z

@lallea @ravwojdyla Thanks for your comments. I agree, and I'll try it.

ravwojdyla · 2022-12-13T16:27:12Z

I suggest moving the getpcmd call to a separate function and wrapping it with tenacity.retry with a timeout of tens of seconds.

@lallea that sounds good. AFAIU, you are suggesting retrying until the command line can be read from /proc/*/cmdline. I just want to point out that there can be multiple reasons why the cmdline file exists BUT is empty. At least two come to mind: the cmdline has not been written yet (it might be buffered, in process, etc.), or the process is a zombie. If you add a retry with "tens of seconds", then in the zombie case (I do not know how often that happens) you might actually wait for those "tens of seconds", and that would be in the "prod code", not the test. So there's a potential downside to that solution.
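
To tell the two cases apart, one option is to look at the process state in /proc/[pid]/stat, where the third field is `Z` for a zombie. The following is a hedged sketch, not part of this PR; `parse_proc_state` and `is_zombie` are hypothetical names, and the parsing assumes the stat layout described in proc(5).

```python
from typing import Optional


def parse_proc_state(stat_line: str) -> Optional[str]:
    """Extract the one-letter state from the contents of /proc/<pid>/stat.

    The line looks like "pid (comm) state ...". comm may itself contain
    spaces or parentheses, so split after the LAST closing parenthesis.
    """
    end = stat_line.rfind(")")
    if end == -1:
        return None
    fields = stat_line[end + 1:].split()
    return fields[0] if fields else None


def is_zombie(pid: int) -> bool:
    """Return True if the process is a zombie; False if not or unknown."""
    try:
        with open(f"/proc/{pid}/stat", "r", errors="replace") as fh:
            return parse_proc_state(fh.read()) == "Z"
    except OSError:
        # No such process, or no procfs on this platform.
        return False
```

A retry loop could check `is_zombie` and give up early instead of waiting out the full timeout.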

lallea · 2022-12-13T16:43:12Z

I'm suggesting breaking the call out into a new function in lock_test.py and putting the retry there, not in production code. I agree that it would be inappropriate in prod code.

hirosassa · 2022-12-25T01:14:36Z

@lallea @ravwojdyla I added retries around getpcmd and get_info in the test code. Please take a look.

test/lock_test.py

hirosassa · 2022-12-28T07:13:26Z

@dlstadther Hi Dillon! Could you review this PR?

hirosassa · 2022-12-29T22:22:45Z

@dlstadther Thank you for your review and merge.
Could you release a new version on PyPI? (There are many new features waiting to be released.)

dlstadther · 2023-01-16T14:18:36Z

Apologies @hirosassa; I have been a bit consumed as of late. @honnix, could the Spotify devs prep and release a new Luigi version?

honnix · 2023-01-17T09:19:44Z

@dlstadther We will take care of it. Looking at #3220, it seems we should get that in before dropping a new release.

dlstadther · 2023-01-17T12:41:55Z

Thanks @honnix !

hirosassa requested review from dlstadther and a team as code owners January 21, 2022 22:13

support python 3.10

4103d10

hirosassa force-pushed the python-310 branch from 7141710 to 4103d10 Compare March 15, 2022 20:47

Merge branch 'master' into python-310

8238ba0

Merge branch 'master' into python-310

bffc202

Merge branch 'master' into python-310

6d94d4e

add sleep

c5ff943

hirosassa force-pushed the python-310 branch from 95b24fc to c5ff943 Compare December 12, 2022 21:37

add more sleep

4a14959

add comment

cc02b3c

hirosassa added 3 commits December 25, 2022 09:06

add retry on getpcmd and get_info

276211e

fix

c7c95ed

fix

add0107

ravwojdyla previously approved these changes Dec 27, 2022

remove unused

e8a957c

hirosassa dismissed ravwojdyla’s stale review via e8a957c December 27, 2022 21:16

ravwojdyla approved these changes Dec 28, 2022

dlstadther approved these changes Dec 28, 2022

dlstadther merged commit 3217e67 into spotify:master Dec 29, 2022

hirosassa deleted the python-310 branch December 29, 2022 01:00
