[FEA] Test estimators with hypothesis #4960

csadorf · 2022-10-28T16:51:54Z

Many of cuml's tests compare results between different implementations, e.g., the GPU and CPU implementations of the same estimator, or against third-party implementations, notably scikit-learn. We expect estimators to behave very similarly overall and their results to be identical up to numerical precision.

Further, estimators are usually tested only against a specific combination of inputs and example datasets. This approach likely misses rare edge cases and cannot provide confidence in the equivalence of implementations across a wide range of inputs and datasets. Using hypothesis to test estimators and compare results therefore has two positive effects:

The API surface is tested significantly more often against edge cases and extreme values.
Any validation through the comparison of results is done on a more diverse set of input datasets.
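To illustrate the kind of comparison test I have in mind, here is a minimal sketch. It does not use cuml at all: `ridge_closed_form` and `ridge_lstsq` are hypothetical stand-ins for two implementations of the same estimator (e.g., CPU and GPU), and the `regression_datasets` strategy, its bounds, and the tolerances are assumptions chosen for the example.

```python
import numpy as np
from hypothesis import given, settings, strategies as st
from hypothesis.extra import numpy as hnp


def ridge_closed_form(X, y, alpha=1.0):
    # Stand-in "reference" implementation: solve the normal equations.
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + alpha * np.eye(d), X.T @ y)


def ridge_lstsq(X, y, alpha=1.0):
    # Stand-in "candidate" implementation: least squares on an augmented system.
    d = X.shape[1]
    A = np.vstack([X, np.sqrt(alpha) * np.eye(d)])
    b = np.concatenate([y, np.zeros(d)])
    return np.linalg.lstsq(A, b, rcond=None)[0]


@st.composite
def regression_datasets(draw, max_samples=20, max_features=5):
    # Hypothetical strategy: small dense datasets with bounded, finite values.
    n = draw(st.integers(min_value=1, max_value=max_samples))
    d = draw(st.integers(min_value=1, max_value=max_features))
    elements = st.floats(min_value=-10, max_value=10)
    X = draw(hnp.arrays(np.float64, (n, d), elements=elements))
    y = draw(hnp.arrays(np.float64, (n,), elements=elements))
    return X, y


@settings(max_examples=50, deadline=None)
@given(regression_datasets())
def test_ridge_implementations_agree(dataset):
    X, y = dataset
    # Results should be identical up to numerical precision.
    np.testing.assert_allclose(
        ridge_closed_form(X, y), ridge_lstsq(X, y), rtol=1e-6, atol=1e-6
    )
```

In a real test the two stand-in functions would be replaced by the estimators under comparison; the dataset strategy is the part that would be shared across tests.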

A potential downside is an increase in test implementation complexity and test runtime. The former can be mitigated through a well-designed abstraction of hypothesis strategies and may then actually lead to a reduction in complexity; the latter can be mitigated by limiting the number of hypothesis iterations and potentially running hypothesis tests only as part of the stress tests.
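One possible way to limit the number of iterations is hypothesis' settings profiles. The following is only a sketch: the profile names `unit` and `stress`, the example counts, and the `HYPOTHESIS_PROFILE` environment variable are assumptions for illustration, not existing cuml configuration.

```python
import os

from hypothesis import settings

# Hypothetical profile for regular test runs: few examples, short runtime.
settings.register_profile("unit", max_examples=10)

# Hypothetical profile for stress tests: many examples, no per-example deadline.
settings.register_profile("stress", max_examples=1000, deadline=None)

# Select the profile via an (assumed) environment variable, defaulting to "unit".
settings.load_profile(os.environ.get("HYPOTHESIS_PROFILE", "unit"))
```

This would typically live in a `conftest.py`, so that individual tests do not need to hard-code their own example counts.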

I suggest the following breakdown for implementation:

#4960 (comment)

csadorf added the "feature request" and "? - Needs Triage" labels Oct 28, 2022

csadorf added the "2 - In Progress" label Oct 28, 2022

csadorf mentioned this issue Nov 1, 2022

[FEA] Expand hypothesis testing to all suitable tests for linear models. #4964

Open

csadorf mentioned this issue Dec 13, 2022

Expand hypothesis testing for linear models #5065

Merged
