"Just Relax It" is a cutting-edge Python library designed to streamline the optimization of discrete probability distributions in neural networks, offering a suite of advanced relaxation techniques compatible with PyTorch.
- Technical Meeting 1 - Presentation
- Technical Meeting 2 - Jupyter Notebook
- Technical Meeting 3 - Jupyter Notebook
- Blog Post
- Documentation
- Tests
For many mathematical problems we need the ability to sample discrete random variables. The problem is that, due to the continuous nature of deep learning optimization, using truly discrete random variables is infeasible, so various relaxation methods are used instead. One of them, the Concrete distribution, or Gumbel-Softmax (a single distribution proposed in parallel by two research groups), is already implemented in common DL packages. In this project we implement several alternatives to it; for reference, a minimal Gumbel-Softmax sketch follows the list.
- Relaxed Bernoulli, also see the paper
- Correlated relaxed Bernoulli, also see the paper
- Gumbel-Softmax top-K, also see the paper
- Straight-Through Bernoulli, also see the paper
- Stochastic Times Smooth, also see the paper
- Invertible Gaussian with KL implemented, also see the paper
- Hard Concrete, also see the paper
- Logistic-Normal and Laplace-form approximation of Dirichlet, also see the wiki and stackexchange
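For context, the Gumbel-Softmax relaxation itself ships with PyTorch as `torch.nn.functional.gumbel_softmax`; a minimal sketch of relaxed vs. straight-through sampling:

```python
import torch
import torch.nn.functional as F

logits = torch.randn(4, requires_grad=True)

# Soft sample: a point inside the probability simplex,
# differentiable with respect to the logits.
soft = F.gumbel_softmax(logits, tau=0.5, hard=False)

# Straight-through sample: one-hot on the forward pass,
# soft gradients on the backward pass.
hard = F.gumbel_softmax(logits, tau=0.5, hard=True)

soft.sum().backward()
print(soft, hard, logits.grad, sep='\n')
```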
Install the stable release from PyPI:

```bash
pip install relaxit
```

Or install the latest version from GitHub:

```bash
pip install git+https://github.com/intsystems/relaxit
```

Or clone the repository and install it in editable mode:

```bash
git clone https://github.com/intsystems/relaxit
cd relaxit
pip install -e .
```
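To check the installed version (the standard `importlib.metadata` API; works with any of the three install options):

```bash
python -c "from importlib.metadata import version; print(version('relaxit'))"
```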
```python
import torch
from relaxit.distributions import InvertibleGaussian

# initialize distribution parameters
loc = torch.zeros(3, 4, 5, requires_grad=True)
scale = torch.ones(3, 4, 5, requires_grad=True)
temperature = torch.tensor([1e-0])

# initialize distribution
distribution = InvertibleGaussian(loc, scale, temperature)

# sample with reparameterization
sample = distribution.rsample()
print('sample.shape:', sample.shape)
print('sample.requires_grad:', sample.requires_grad)
```
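Since `rsample` is reparameterized, gradients flow from the sample back to the parameters; a minimal continuation of the snippet above:

```python
# Backpropagate through the reparameterized sample.
loss = sample.sum()
loss.backward()
print('loc.grad.shape:', loc.grad.shape)      # matches loc.shape
print('scale.grad.shape:', scale.grad.shape)  # matches scale.shape
```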
| Laplace Bridge | REINFORCE in Acrobot environment | VAE with discrete latents |
| --- | --- | --- |
For demonstration purposes, we divide our algorithms into three[^1] groups. Each group corresponds to its own demo code:
- Laplace bridge between Dirichlet and LogisticNormal distributions
- REINFORCE
- Other relaxation methods
We describe our demo experiments here.
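For context on the REINFORCE group: unlike the relaxations above, REINFORCE is a score-function estimator and needs no reparameterization. A minimal sketch with a toy objective (not the Acrobot demo itself):

```python
import torch
from torch.distributions import Categorical

# Score-function (REINFORCE) estimator:
# grad E[f(x)] = E[f(x) * grad log p(x)]
logits = torch.zeros(3, requires_grad=True)
dist = Categorical(logits=logits)

x = dist.sample()                  # non-differentiable discrete sample
reward = (x == 2).float()          # toy objective f(x)
loss = -reward * dist.log_prob(x)  # surrogate whose gradient is the estimator
loss.backward()
print(logits.grad)                 # stochastic estimate of -grad E[f(x)]
```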
Some of the alternatives to GS were already implemented in pyro, so we base our library on their codebase. To keep the library consistent, we integrate the distribution imports from pyro and torch into the library, so that all the categorical distributions can be imported from a single entrypoint, as illustrated below.
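For illustration, importing from the single entrypoint might look as follows; `InvertibleGaussian` is confirmed by the quick-start above, while the commented-out class name is our assumption based on the method list, so check the documentation for the exact names:

```python
# Single entrypoint: relaxed distributions are imported from
# relaxit.distributions, whether implemented natively or adapted
# from pyro / torch.
from relaxit.distributions import InvertibleGaussian
# Assumed name, based on the method list above -- verify in the docs:
# from relaxit.distributions import HardConcrete
```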
- Daniil Dorin (Basic code writing, Final demo, Algorithms)
- Igor Ignashin (Project wrapping, Documentation writing, Algorithms)
- Nikita Kiselev (Project planning, Blog post, Algorithms)
- Andrey Veprikov (Tests writing, Documentation writing, Algorithms)
- You are welcome to contribute to our project!
- About top-k GS
- VAE implementation with different latent distributions
- KL divergence between Dirichlet and Logistic-Normal implemented in R
- About score function (SF) and pathwise derivative (PD) estimators, VAE and REINFORCE
[^1]: We also implement the REINFORCE algorithm as a score-function estimator alternative to our relaxation methods, which are inherently pathwise derivative estimators. It is implemented only for the demo experiments and is not included in the package source code.