reparameterization trick

This page is from my personal notes, and has not been specifically reviewed for public consumption. It might be incomplete, wrong, outdated, or stupid. Caveat lector.

Links to this note

sparse mixture of experts

deep deterministic policy gradient

maximum-entropy reinforcement learning