Skip to content

Commit

Permalink
Fix errors in AIR
Browse files Browse the repository at this point in the history
  • Loading branch information
aleju committed Apr 22, 2016
1 parent 41edb12 commit 04ebaaa
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions neural-nets/Attend_Infer_Repeat.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,17 +31,17 @@

* (1) Introduction
* Assumption: Images are made up of distinct objects. These objects have visual and physical properties.
* They develop a framework for efficient inference in images (i.e. get from the image to a representation of the objects, i.e. inverse graphics).
* They developed a framework for efficient inference in images (i.e. get from the image to a latent representation of the objects, i.e. inverse graphics).
* Parts of the framework: High dimensional representations (e.g. object images), interpretable latent variables (e.g. for rotation) and generative processes (to combine object images with latent variables).
* Contributions:
* A scheme for efficient variational inference in latent spaces of variable dimensionality.
* Idea: Treat inference as an iterative process, implemented via an RNN that looks at one object at a time and learns an appropriate number of inference steps. (Attent-Infer-Repeat, AIR)
* Idea: Treat inference as an iterative process, implemented via an RNN that looks at one object at a time and learns an appropriate number of inference steps. (Attend-Infer-Repeat, AIR)
* End-to-end training via amortized variational inference (continuous variables: gradient descent, discrete variables: black-box optimization).
* AIR allows to train generative models that automatically learn to decompose scenes.
* AIR allows to recover objects and their attributes from rendered 3D scenes (inverse rendering).

* (2) Approach
* Just like in VAEs, the scene interpretation is viewed with a bayesdian approach.
* Just like in VAEs, the scene interpretation is treated with a bayesian approach.
* There are latent variables `z` and images `x`.
* Images are generated via a probability distribution `p(x|z)`.
* This can be reversed via bayes rule to `p(x|z) = p(x)p(z|x) / p(z)` which means that `p(x|z)p(z) / p(x) = p(z|x)`.
Expand Down

0 comments on commit 04ebaaa

Please sign in to comment.