ci: wart removal #8147

rb-determined-ai · 2023-10-13T20:19:19Z

We run some CLI list and describe commands after every single experiment, as a "to sanity check that basic CLI commands don't raise errors". That has made e2e test logs unreadable since the day it was added. For now, we can keep the tests, but don't dump stdout and stderr into the test logs. Some day, we should figure out what codepaths are being tested passively, and write proper tests for them.

Also, the test_task_logs has been failing intermittently with a timeout but no error message for weeks. Instead of using pytest.mark.timeout(), which can't be caught, implement our own timeout logic, and only dump stdout and stderr if the cli crashes.

netlify · 2023-10-13T20:19:25Z

✅ Deploy Preview for determined-ui ready!

Name	Link
🔨 Latest commit	`e8de877`
🔍 Latest deploy log	https://app.netlify.com/sites/determined-ui/deploys/653a95468cf9fa0008ecd30a
😎 Deploy Preview	https://deploy-preview-8147--determined-ui.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

tayritenour

Minor comment and possibly I'm wrong about it. Did we try forcing a test to fail and confirming that the logs come through?

tayritenour · 2023-10-18T17:36:07Z

e2e_tests/tests/cluster/test_logging.py

+        thread.join(timeout=5 * 60)
+        if thread.is_alive():
+            # The thread did not exit
+            raise ValueError("do_check_logs thread did not exit")


If it didn't exit, doesn't that mean that we timed out? Shouldn't we say that instead if that's the case?

There might be value to printing stdout in this case.

@MikhailKardash what do you mean? Isn't that what the except Exception clause does, is print the stdout/stderr from the task?

MikhailKardash · 2023-10-18T19:18:44Z

e2e_tests/tests/experiment/experiment.py

+    p = subprocess.Popen(cmd, stdout=subprocess.PIPE, stderr=subprocess.PIPE)
+    out, err = p.communicate()
+    ret = p.wait()
+    if ret:


Why not ret != 0 ?

wes-turner · 2023-10-25T21:10:12Z

e2e_tests/tests/cluster/test_logging.py

-        )
-    except socket.timeout:
-        raise TimeoutError(f"timed out waiting for {task_type} with id {task_id}")
+        result: Any = None


nit: since the type of result is Union[bool|Exception], could we make it more explicit and call it

exception: Optional[Exception] = None

A nit, but one I think makes the flow a tiny bit easier to read.

I think I was trying to provide positive confirmation that the thread actually set the value... but I don't check for it anywhere, and I only read it after joining the thread... So yeah, good idea, I like it that way.

wes-turner · 2023-10-25T22:00:08Z

e2e_tests/tests/cluster/test_logging.py

+
+        thread = threading.Thread(target=do_check_logs, daemon=True)
+        thread.start()
+        thread.join(timeout=5 * 60)


small nit that this stuffs configuration into the implementation.

Options I see, none of which I love (so indeed, maybe it's better as-is):

set a custom mark and in the test pull it out of request.node.keywords

abuse pytest.parameterize to set a fixture

honestly I'm inclined to leave it as-is. request.node.keywords is something I've never heard of, which makes me think it would be a lot less readable.

A strategy with 'abuse' in the bullet point sounds undesirable.

My take is that putting config into the implementation is the least evil.

rb-determined-ai · 2023-10-26T16:26:39Z

Minor comment and possibly I'm wrong about it. Did we try forcing a test to fail and confirming that the logs come through?

No, I hadn't, which was sloppy of me. Tests were in fact not coming through, because pytest.fail subclasses BaseException and not Exception. It's fixed now and tested now.

Thank you @tayritenour.

We run some CLI list and describe commands after every single experiment, as a "to sanity check that basic CLI commands don't raise errors". That has made e2e test logs unreadable since the day it was added. For now, we can keep the tests, but don't dump stdout and stderr into the test logs. Some day, we should figure out what codepaths are being tested passively, and write proper tests for them. Also, the test_task_logs has been failing intermittently with a timeout but no error message for weeks. Instead of using pytest.mark.timeout(), which can't be caught, implement our own timeout logic, and only dump stdout and stderr if the cli crashes.

amandavialva01 · 2023-10-26T19:13:31Z

e2e_tests/tests/experiment/experiment.py

amandavialva01

lgtm

rb-determined-ai requested review from a team as code owners October 13, 2023 20:19

rb-determined-ai requested review from amandavialva01 and caehd10 October 13, 2023 20:19

cla-bot bot added the cla-signed label Oct 13, 2023

rb-determined-ai force-pushed the rb/ci-warts branch from 85db2d7 to 9099f01 Compare October 13, 2023 20:27

tayritenour approved these changes Oct 18, 2023

View reviewed changes

MikhailKardash reviewed Oct 18, 2023

View reviewed changes

wes-turner reviewed Oct 25, 2023

View reviewed changes

rb-determined-ai force-pushed the rb/ci-warts branch from 9099f01 to df9f0d3 Compare October 26, 2023 16:25

rb-determined-ai force-pushed the rb/ci-warts branch from df9f0d3 to e8de877 Compare October 26, 2023 16:35

amandavialva01 reviewed Oct 26, 2023

View reviewed changes

e2e_tests/tests/experiment/experiment.py

Copy link

Contributor

amandavialva01 Oct 26, 2023

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm!

amandavialva01 approved these changes Oct 26, 2023

View reviewed changes

rb-determined-ai merged commit c09c529 into main Oct 26, 2023
71 of 81 checks passed

rb-determined-ai deleted the rb/ci-warts branch October 26, 2023 19:17

dannysauer added this to the 0.26.3 milestone Feb 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: wart removal #8147

ci: wart removal #8147

rb-determined-ai commented Oct 13, 2023

netlify bot commented Oct 13, 2023 •

edited

Loading

tayritenour left a comment

tayritenour Oct 18, 2023

MikhailKardash Oct 18, 2023

rb-determined-ai Oct 25, 2023

MikhailKardash Oct 18, 2023

wes-turner Oct 25, 2023

rb-determined-ai Oct 25, 2023

wes-turner Oct 25, 2023 •

edited

Loading

rb-determined-ai Oct 25, 2023

rb-determined-ai commented Oct 26, 2023

amandavialva01 Oct 26, 2023

amandavialva01 left a comment

ci: wart removal #8147

ci: wart removal #8147

Conversation

rb-determined-ai commented Oct 13, 2023

netlify bot commented Oct 13, 2023 • edited Loading

✅ Deploy Preview for determined-ui ready!

tayritenour left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wes-turner Oct 25, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rb-determined-ai commented Oct 26, 2023

Choose a reason for hiding this comment

amandavialva01 left a comment

Choose a reason for hiding this comment

netlify bot commented Oct 13, 2023 •

edited

Loading

wes-turner Oct 25, 2023 •

edited

Loading