Skip to content

The meaning of some settings in the SafeLOOP algorithm. #287

Answered by Gaiejj
TomatoZ2 asked this question in Q&A
Discussion options

You must be logged in to vote

Certainly! Here's the explanation for these parameters:

  1. elite_size: This signifies the number of elite individuals selected from sampled individuals. It is commonly used in action planning of a model-based RL. (More details can refer to Constrained Model-based Reinforcement Learning with Robust Cross-Entropy Method and The Cross-Entropy Method for Combinatorial and Continuous Optimization)

  2. use_decay: This parameter indicates whether to use the weight decay technique when updating the dynamic model.

  3. num_iterations: This is the number of iterations performed in planning.

  4. num_particles: During planning, this is the number of models used for evaluating an action trajectory.

  5. num_samples

Replies: 6 comments 6 replies

Comment options

You must be logged in to vote
1 reply
@TomatoZ2
Comment options

Answer selected by TomatoZ2
Comment options

You must be logged in to vote
2 replies
@TomatoZ2
Comment options

@Gaiejj
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
3 replies
@TomatoZ2
Comment options

@Gaiejj
Comment options

@hdadong
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants