TensorFlow implementation of variational inference for Hidden Markov Model(HMM) using the continuous relaxation of categorical latent variables described in both The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables by Chris J. Maddison, Andriy Mnih, Yee Whye Teh and Categorical Reparameterization with Gumbel-Softmax by Eric Jang, Shixiang Gu, Ben Poole.
- Gumbel distribution
- Concrete random variable / Gumbel-Softmax distribution
- definition (reparameterization)
- properties (categorical under zero temperature; close to uniform under high temperature; convexity under certain conditions)
- Section 3.3 & Appendix C of The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables
- the modified training objective / ELBO
- Concrete in log-space
- temperature (annealing)
- Section 2.2 & 3.2 of Categorical Reparameterization with Gumbel-Softmax
- Straight-Through Gumbel-Softmax (Figure 2)
- Score Function / REINFORCE & control variates
- Extension:
- gradient estimates
- HHMM
- Gaussian observations, K = 2, N = 500, fixed temperature & exponential decay