Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to choose beta value proper way? #2

Open
JasOlean opened this issue Mar 20, 2019 · 1 comment
Open

How to choose beta value proper way? #2

JasOlean opened this issue Mar 20, 2019 · 1 comment

Comments

@JasOlean
Copy link

In your case, you use beta = 100. So, how to choose proper beta value (not constant)? And large or small beta value is good or not?

@alecGraves
Copy link
Owner

This paper has an excellent overview of what the beta parameter is doing: https://arxiv.org/abs/1804.03599

To summarize, larger beta will result in a more disentangled latent representation but lower-fidelity reconstructions. Smaller beta will not impose disentangling as much, allowing for higher-fidelity reconstructions. At beta = 1, the B-VAE is equivalent to a plain VAE, so it should is usually set to a value greater than one.

Determining the proper beta depends on the problem and your goals. You can try several values for beta with your data, and you can create a custom training regimen that changes beta over time. This implementation assumes a constant beta, but you can rebuild the model with a different beta during training.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants