Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[RLlib] Update autoregressive actions example. #47829

Merged

Commits on Sep 27, 2024

  1. Defined a new autoregressive actions environment in which the agent c…

    …annot only watch the state but needs to also watch the first action. Furthermore, implemented the 'ValueFunctionAPI' in the 'AutoregressiveActionsRLM' and ran some tests.
    
    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    330594e View commit details
    Browse the repository at this point in the history
  2. Fixed bug in stop criterium.

    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    2b04f6c View commit details
    Browse the repository at this point in the history
  3. Added short comment as suggested by @sven1977.

    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 27, 2024
    Configuration menu
    Copy the full SHA
    f217156 View commit details
    Browse the repository at this point in the history

Commits on Sep 28, 2024

  1. Fixed old stack autoregressive actions example.

    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 28, 2024
    Configuration menu
    Copy the full SHA
    ff0c4a1 View commit details
    Browse the repository at this point in the history

Commits on Sep 30, 2024

  1. Adapted stop reward in old stack autoregressive actions example to ne…

    …w 'AutoregressiveActionsEnv'.
    
    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 30, 2024
    Configuration menu
    Copy the full SHA
    738a6f1 View commit details
    Browse the repository at this point in the history
  2. Merged Master

    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 30, 2024
    Configuration menu
    Copy the full SHA
    5e2f9bd View commit details
    Browse the repository at this point in the history
  3. CHanged stop rewards also in BUILD file.

    Signed-off-by: simonsays1980 <[email protected]>
    simonsays1980 committed Sep 30, 2024
    Configuration menu
    Copy the full SHA
    797c16b View commit details
    Browse the repository at this point in the history