Run ModelLauncher with a BenchmarkRunner and BenchmarkMetric #2681

Closed
wants to merge 5 commits into from

esantorella

Summary: Similar to how Jenatton was migrated in D60996475, this change switches the ModelLauncher problem to use `BenchmarkRunner` and `BenchmarkMetric`.

Differential Revision: D61397457

@facebook-github-bot added the CLA Signed label Aug 20, 2024
@facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D61397457

@codecov-commenter
codecov-commenter commented Aug 20, 2024

Codecov Report

Attention: Patch coverage is 92.50936% with 20 lines in your changes missing coverage. Please review.

Project coverage is 95.26%. Comparing base (6e7e798) to head (f4bc194).

| Files | Patch % | Lines |
|---|---|---|
| ax/benchmark/tests/stubs.py | 0.00% | 11 Missing ⚠️ |
| ax/benchmark/runners/botorch_test.py | 89.09% | 6 Missing ⚠️ |
| ax/benchmark/runners/base.py | 88.23% | 2 Missing ⚠️ |
| ...nchmark/tests/runners/test_botorch_test_problem.py | 96.29% | 1 Missing ⚠️ |

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2681      +/-   ##
==========================================
- Coverage   95.28%   95.26%   -0.02%     
==========================================
  Files         495      495              
  Lines       47800    47867      +67     
==========================================
+ Hits        45545    45600      +55     
- Misses       2255     2267      +12     

esantorella added a commit to esantorella/Ax that referenced this pull request Aug 20, 2024
…k#2681)

Summary:
Pull Request resolved: facebook#2681

Similar to how Jenatton was migrated in D60996475, this change switches the ModelLauncher problem to use `BenchmarkRunner` and `BenchmarkMetric`.

Differential Revision: D61397457

Reviewed By: Balandat
esantorella and others added 5 commits August 21, 2024 07:14
…book#2674)

Summary:
Pull Request resolved: facebook#2674

Context:

This is an alternative to D61431979.

Note: There are benchmarks that do not use `BenchmarkRunner`, but I plan to have them all use `BenchmarkRunner` in the future.

`BenchmarkRunner` technically supports benchmarks without a ground truth, but that functionality is never used, and there aren't any Ax benchmarks that are noisy *and* don't have a ground truth. It is not conceptually clear how such a case should be benchmarked, so it is better not to over-engineer for a need that may never arise. Instead, benchmarks that lack a ground truth but are deterministic can be treated as noiseless problems with a ground truth, and we can remove support for problems without a ground truth.

Also, `BenchmarkRunner` has some methods that must either be defined or not defined depending on whether there is a ground truth. They can't be abstract because they will not always be defined. With this change, we can make the ground-truth methods abstract and get rid of the rest.

This PR:
- Rewrites docstrings
- Removes method `get_Y_Ystd`
- Makes `get_Y_true` and other methods abstract
- Removes functionality for the case where `get_Y_true` raises a `NotImplementedError`
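
The effect of the last two bullets can be illustrated with a minimal sketch. It does not use the Ax classes: `RunnerSketch`, `QuadraticRunner`, `noise_std`, and `run` are hypothetical names chosen for illustration, and only the method name `get_Y_true` comes from the description above. The sketch assumes observations are the ground truth plus Gaussian noise, so a deterministic problem is simply one with `noise_std == 0`.

```python
import random
from abc import ABC, abstractmethod
from typing import Dict, List


class RunnerSketch(ABC):
    """Hypothetical stand-in for a benchmark runner (not the Ax class)."""

    def __init__(self, noise_std: float = 0.0, seed: int = 0) -> None:
        self.noise_std = noise_std
        self._rng = random.Random(seed)

    @abstractmethod
    def get_Y_true(self, params: Dict[str, float]) -> List[float]:
        """Return the ground-truth outcomes for one parameterization."""

    def run(self, params: Dict[str, float]) -> List[float]:
        # Observed value = ground truth + Gaussian noise. With noise_std == 0
        # the observation equals the ground truth exactly.
        return [
            y + self.noise_std * self._rng.gauss(0.0, 1.0)
            for y in self.get_Y_true(params)
        ]


class QuadraticRunner(RunnerSketch):
    """Deterministic toy problem treated as noiseless with a ground truth."""

    def get_Y_true(self, params: Dict[str, float]) -> List[float]:
        return [params["x"] ** 2]
```

Because `get_Y_true` is abstract, a subclass that forgets to define it fails at instantiation with a `TypeError`, rather than at evaluation time through a `NotImplementedError`.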

Reviewed By: ItsMrLin

Differential Revision: D61483962
Summary:
Pull Request resolved: facebook#2675

Context:

In a future refactor that will enable more flexible and powerful best-point functionality, every `BenchmarkProblem`'s runner will be able to produce an "oracle" value (possibly the ground truth) for any arm, in-sample or not, with a function like `BenchmarkRunner.evaluate_oracle(arm=arm)`, with the problem handling computation and the runner formatting results. However, the current `BenchmarkRunner` and `BenchmarkMetric` setup doesn't cover every benchmark. Consolidating on `BenchmarkRunner` and `BenchmarkMetric` will enable the refactor, make it easier to universalize functionality like handling of constraints, noise, and inference regret, and allow for deleting some LOC for more custom problems.

Current `BenchmarkRunner`s only handle problems that can consume tensor-valued arguments: BoTorch synthetic problems and surrogate problems. This isn't a good fit for problems like Jenatton that have a hierarchical search space and can have some parameters not passed. Because Ax always passes parameters and only sometimes represents them as tensors, a `TParameterization` is a more natural abstraction to handle parameters than a tensor.

This PR:
- Introduces `ParamBasedTestProblem`, which is like a BoTorch synthetic test problem but consumes a `TParameterization` rather than a tensor
- Adds `ParamBasedProblemRunner`, which shares a base class `SyntheticProblemRunner` and most functionality with `BotorchTestProblemRunner` (so it is a `BenchmarkRunner` and supports both observed and unobserved noise).
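
To make the tensor-versus-parameterization distinction concrete, here is a rough sketch of a problem that consumes a dictionary of parameters. It is not the Ax implementation: `ParamBasedProblemSketch`, `TinyHierarchicalProblem`, `evaluate_true`, and the parameter names are hypothetical, and `TParameterization` is redefined locally as an assumption about its shape (a mapping from parameter names to values).

```python
from abc import ABC, abstractmethod
from typing import Dict, Union

# Local aliases, assumed for this sketch (not imported from Ax).
TParamValue = Union[None, str, bool, float, int]
TParameterization = Dict[str, TParamValue]


class ParamBasedProblemSketch(ABC):
    """Hypothetical test problem that takes named parameters, not a tensor."""

    @abstractmethod
    def evaluate_true(self, params: TParameterization) -> float:
        """Return the noiseless objective value for one parameterization."""


class TinyHierarchicalProblem(ParamBasedProblemSketch):
    """Toy hierarchical problem: 'branch' decides which parameter is active."""

    def evaluate_true(self, params: TParameterization) -> float:
        # Only the parameter on the chosen branch needs to be present;
        # the other one may be missing from the dictionary entirely.
        if params["branch"] == "left":
            return (float(params["x_left"]) - 1.0) ** 2
        return (float(params["x_right"]) + 1.0) ** 2
```

A call such as `TinyHierarchicalProblem().evaluate_true({"branch": "left", "x_left": 3.0})` works without any value for `x_right`, which a fixed-width tensor input cannot express without padding or sentinel values.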

Differential Revision: D60996475
…#2676)

Summary:
Pull Request resolved: facebook#2676

This PR:
- Has Jenatton use `ParamBasedTestProblem` so that it can use `ParamBasedProblemRunner`, and also have it use `BenchmarkMetric`; get rid of specialized Jenatton runners and metrics. This enables Jenatton to handle noisy problems, whether noise levels are observed or not, like other benchmark problems, and will make it easy to add constraints or benefit from other new functionality.
- Does *not* clean up the now-unnecessary Jenatton metric file; that happens in the next diff.

Differential Revision: D61502458

Reviewed By: Balandat
Summary: I didn't do this in the previous diff so that it would be easier to review.

Differential Revision: D61431983
…k#2681)

Summary:
Pull Request resolved: facebook#2681

Similar to how Jenatton was migrated in D60996475, this change switches the ModelLauncher problem to use `BenchmarkRunner` and `BenchmarkMetric`.

Reviewed By: Balandat

Differential Revision: D61397457
@facebook-github-bot

This pull request has been merged in 9a83f87.

Labels
CLA Signed, fb-exported, Merged