Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/20240911 enable squashed gaussian on ppo #130

Merged

Conversation

ishihara-y
Copy link
Contributor

Some distribution (like SquashedGaussian) does not have an analytical form for the entropy. Thus PPO can not use such distribution in the training.
However, when entropy coefficient of PPO training is 0, ppo does not need to compute the entropy of the policy distribution.
This PR enables using SquashedGaussian and other distributions in PPO when the entropy coefficient is 0.
New implementation ignores the computation of entropy in policy training with 0 coefficient.

@ishihara-y ishihara-y force-pushed the feature/20240911-enable-squashed-gaussian-on-ppo branch from 92b0df8 to 03908bc Compare September 17, 2024 09:20
@ishihara-y ishihara-y self-assigned this Sep 17, 2024
@sbsekiguchi
Copy link
Contributor

LGTM.

@sbsekiguchi sbsekiguchi merged commit 15c2a84 into master Sep 18, 2024
11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants