Dealing With Multimodal Distributions
- Mixture of Gaussians
- Multiple Gaussians, with multiple means and variances (weight for each Guassian)
- Need quite a lot of distributions (exponential in number of parameters)
- Latent Variable Models
- Ways to solve:
- Conditional variational auto encoder
- Normalizing flow
- Stein variational gradient descent
- Autoregressive Discretization
- Discrete action space is easy, because we can use softmax
- However, discretizing a continuous action space is pretty difficult
- In autoregressive discretization, we first discretize the first dimension of the action
- Then, sample for the softmax, and have a value for the first dimension
- This feeds into another neural network, which discretizes the second dimension (and repeat)
uid: 202008311522 tags: #cs285