[AIR][Serve] Add windows check for pd.DataFrame comparison #26889

jiaodong · 2022-07-22T16:52:48Z

Why are these changes needed?

In previous implementation #26821 we have windows failure suggesting we behave differently on windows regarding datatype conversion.

In our https://sourcegraph.com/github.com/ray-project/ray/-/blob/python/ray/data/tests/test_dataset.py?L577 regarding use of TensorArray we seem to rely on pd'sassert_frame_equal rather than manually comparing frames.

This PR adds a quick conditional on windows only to ignore dtype for now.

  | unpacked_list = BatchingManager.split_dataframe(batched_df, 1)
  | assert len(unpacked_list) == 1
  | >       assert unpacked_list[0]["a"].equals(split_df["a"])
  | E       assert False
  | E        +  where False = <bound method NDFrame.equals of 0    1\n1    2\n2    3\n3    4\nName: a, dtype: int32>(0    1\n1    2\n2    3\n3    4\nName: a, dtype: int64)
  | E        +    where <bound method NDFrame.equals of 0    1\n1    2\n2    3\n3    4\nName: a, dtype: int32> = 0    1\n1    2\n2    3\n3    4\nName: a, dtype: int32.equals

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

richardliaw · 2022-07-22T16:59:31Z

python/ray/serve/tests/test_air_integrations.py

@@ -90,8 +91,13 @@ def test_dataframe_with_tensorarray(self):

        unpacked_list = BatchingManager.split_dataframe(batched_df, 1)
        assert len(unpacked_list) == 1
-        assert unpacked_list[0]["a"].equals(split_df["a"])
-        assert unpacked_list[0]["b"].equals(split_df["b"])
+        # On windows, conversion dtype is not preserved.


sad.... nice find!

cc: @clarkzinzow for awareness .. not sure if its related to TensorArray, but in the worst case this means AIR could mess up user's tensor dtype among full / half / mixed precision training =.=

…t#26889 n previous implementation ray-project#26821 we have windows failure suggesting we behave differently on windows regarding datatype conversion. In our https://sourcegraph.com/github.com/ray-project/ray/-/blob/python/ray/data/tests/test_dataset.py?L577 regarding use of TensorArray we seem to rely on pd'sassert_frame_equal rather than manually comparing frames. This PR adds a quick conditional on windows only to ignore dtype for now. Signed-off-by: Rohan138 <[email protected]>

…t#26889 n previous implementation ray-project#26821 we have windows failure suggesting we behave differently on windows regarding datatype conversion. In our https://sourcegraph.com/github.com/ray-project/ray/-/blob/python/ray/data/tests/test_dataset.py?L577 regarding use of TensorArray we seem to rely on pd'sassert_frame_equal rather than manually comparing frames. This PR adds a quick conditional on windows only to ignore dtype for now. Signed-off-by: Stefan van der Kleij <[email protected]>

add windows check

465aee6

jiaodong marked this pull request as ready for review July 22, 2022 16:54

jiaodong requested review from scv119, richardliaw, clarkzinzow and matthewdeng July 22, 2022 16:54

jiaodong assigned scv119, matthewdeng, richardliaw, clarkzinzow and simon-mo Jul 22, 2022

richardliaw reviewed Jul 22, 2022

View reviewed changes

scv119 approved these changes Jul 22, 2022

View reviewed changes

scv119 merged commit a03716e into ray-project:master Jul 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AIR][Serve] Add windows check for pd.DataFrame comparison #26889

[AIR][Serve] Add windows check for pd.DataFrame comparison #26889

jiaodong commented Jul 22, 2022

richardliaw Jul 22, 2022

jiaodong Jul 22, 2022

[AIR][Serve] Add windows check for pd.DataFrame comparison #26889

[AIR][Serve] Add windows check for pd.DataFrame comparison #26889

Conversation

jiaodong commented Jul 22, 2022

Why are these changes needed?

Related issue number

Checks

richardliaw Jul 22, 2022

Choose a reason for hiding this comment

jiaodong Jul 22, 2022

Choose a reason for hiding this comment