[Datasets] Persist Datasets statistics to log file #30557

scottjlee · 2022-11-22T01:11:00Z

Why are these changes needed?

Currently, Dataset stats are only printed after execution, so there is no way to retrieve them if the job fails or crashes. Persisting them to a separate log file keeps the stats available for debugging. By default, the logs are written to /logs/ray-data.log.

The new logger, DatasetLogger, always writes logs to the ray-data.log file and optionally also writes to stdout (enabled by default). This lets users filter for Dataset logs in a dedicated file while keeping console logs for those who rely on them.
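To make the "always to file, optionally to stdout" behavior concrete, here is a minimal sketch using only Python's standard `logging` module. It is not the DatasetLogger code from this PR: the function name `build_dataset_style_logger`, the logger names, the sample message, and the temporary directory are all assumptions made for illustration.

```python
import logging
import os
import sys
import tempfile


def build_dataset_style_logger(name, log_path, log_to_stdout=True):
    """Hypothetical helper: always log to `log_path`, optionally also to stdout."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    # Do not hand records to ancestor loggers; this logger owns its handlers.
    logger.propagate = False
    logger.handlers.clear()

    # The file handler is attached unconditionally.
    file_handler = logging.FileHandler(log_path)
    file_handler.setFormatter(
        logging.Formatter("%(asctime)s %(levelname)s %(name)s: %(message)s")
    )
    logger.addHandler(file_handler)

    # The console handler is optional and enabled by default.
    if log_to_stdout:
        logger.addHandler(logging.StreamHandler(sys.stdout))
    return logger


# Usage: write to a temporary directory instead of a real Ray log directory.
log_dir = tempfile.mkdtemp()
log_path = os.path.join(log_dir, "ray-data.log")
logger = build_dataset_style_logger("sketch.dataset", log_path)
logger.info("Stage 1 read: 10/10 blocks executed")

# The message can be read back from the file even if the console output is lost.
with open(log_path) as f:
    persisted = f.read()
```

Passing `log_to_stdout=False` in this sketch leaves only the file handler attached, which corresponds to the "file only" mode described above.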

Related issue number

Closes #29575

Checks

I've signed off every commit (by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Scott Lee <[email protected]>

python/ray/data/_internal/dataset_logger.py

c21

Thanks @scottjlee, looks solid, have some comments.

c21 · 2022-11-29T20:27:44Z

Hi @clarkzinzow and @jianoaix - could you help take a look? Thanks.

jianoaix · 2022-11-30T20:56:29Z

So currently, this log is written to the worker log which is persisted already as .out, and in case of job failure, we can access the stats from worker log, right?

jianoaix · 2022-12-06T03:29:49Z

python/ray/data/_internal/dataset_logger.py

+        """
+        # Logger used to logging to log file (in addition to the root logger,
+        # which logs to stdout as normal). We set `logger.propagate` to False
+        # to ensure the file logger only logs to the file, and not stdout, by default.


This documentation seems confusing to me since the class-level comment says it writes to the file in addition to stdout.

Reworded the comments and documentation, both here and in the class-level comment, to clarify this. Let me know if anything is still confusing. Thanks!
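The confusion discussed in this thread comes down to how `logger.propagate` works in Python's standard `logging` module. The sketch below is not the PR's implementation; the logger names and the in-memory buffers (standing in for stdout and the log file) are assumptions made for illustration. It shows that a logger with `propagate = False` reaches only its own handlers, while `propagate = True` also forwards records to the handlers of its ancestors.

```python
import io
import logging

# In-memory stand-ins for the console and the log file (illustrative only).
console_buf = io.StringIO()
file_buf = io.StringIO()

# Parent logger plays the role of the logger that writes to stdout.
parent = logging.getLogger("sketch_parent")
parent.setLevel(logging.INFO)
parent.propagate = False  # keep this demo isolated from the real root logger
parent.handlers.clear()
parent.addHandler(logging.StreamHandler(console_buf))

# Child logger plays the role of the file logger.
child = logging.getLogger("sketch_parent.file")
child.handlers.clear()
child.addHandler(logging.StreamHandler(file_buf))

# With propagation off, the record reaches only the child's own handler.
child.propagate = False
child.info("file only")

# With propagation on, the record also reaches the parent's handler.
child.propagate = True
child.info("file and console")

file_output = file_buf.getvalue()
console_output = console_buf.getvalue()
```

Here `file_output` contains both messages and `console_output` contains only the second, which is the "file always, stdout optionally" split that the reworded comments describe.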

c21 · 2022-12-06T20:52:32Z

The failed CI test looks unrelated; see https://flakey-tests.ray.io/.

c21 · 2022-12-07T18:20:24Z

@clarkzinzow, @jianoaix any more comments? Thanks

add datasetlogger class

b61bae0

Signed-off-by: Scott Lee <[email protected]>

scottjlee requested review from ericl, scv119, clarkzinzow, jjyao, jianoaix and c21 as code owners November 22, 2022 01:11

scottjlee added 4 commits November 22, 2022 14:38

add call path in logs output

e1264f8

Signed-off-by: Scott Lee <[email protected]>

clean up

9b4167a

Signed-off-by: Scott Lee <[email protected]>

trailing newline

f09435a

Signed-off-by: Scott Lee <[email protected]>

formatter

6e41b89

Signed-off-by: Scott Lee <[email protected]>

scottjlee commented Nov 23, 2022

python/ray/data/_internal/dataset_logger.py (outdated, resolved)

c21 reviewed Nov 23, 2022

scottjlee added 7 commits November 23, 2022 14:23

remove path override param and add comments

b1d7ef1

Signed-off-by: Scott Lee <[email protected]>

lint

22f15d1

Signed-off-by: Scott Lee <[email protected]>

update test to read directly from log file

247eb12

Signed-off-by: Scott Lee <[email protected]>

add secondary logger and override methods

4d241b8

Signed-off-by: Scott Lee <[email protected]>

override methods

f6ad787

Signed-off-by: Scott Lee <[email protected]>

Merge branch 'master' into stats-logfile

f23dfd7

update stats tests

48a9db2

Signed-off-by: Scott Lee <[email protected]>

c21 reviewed Nov 29, 2022

python/ray/data/_internal/dataset_logger.py (resolved)

c21 assigned clarkzinzow, jianoaix and c21 Nov 29, 2022

scottjlee added 4 commits November 29, 2022 12:50

Merge branch 'master' into stats-logfile

60d6527

add check on having valid handlers before logging main handler

15a5752

Signed-off-by: Scott Lee <[email protected]>

Merge branch 'master' into stats-logfile

499800b

update unit tests

500472f

Signed-off-by: Scott Lee <[email protected]>

scottjlee requested review from gjoliver, avnishn, ArturNiederfahrenhorst, smorad, kouroshHakha, fishbone, stephanie-wang, suquark, ijrsvt and alanwguo as code owners December 5, 2022 18:36

scottjlee force-pushed the stats-logfile branch from d8c09b2 to 7c4db74 Compare December 5, 2022 19:51

c21 approved these changes Dec 5, 2022

scottjlee added 3 commits December 5, 2022 12:54

Merge branch 'master' into stats-logfile

a9e292d

update unit tests

f2c5180

Signed-off-by: Scott Lee <[email protected]>

Merge branch 'master' into stats-logfile

6777b2c

Signed-off-by: Scott Lee <[email protected]>

jianoaix reviewed Dec 6, 2022

address comments

24ac4f1

Signed-off-by: Scott Lee <[email protected]>

c21 added the tests-ok label ("The tagger certifies test failures are unrelated and assumes personal liability.") Dec 6, 2022

jianoaix approved these changes Dec 7, 2022

clarkzinzow merged commit a841c07 into ray-project:master Dec 7, 2022

scottjlee mentioned this pull request Dec 19, 2022

[Datasets] Postpone Dataset logger initialization until first logging method call #31190

Merged
