Computer vision

| Short | Full |
|---|---|
| CV | computer vision |
| FOV | field of view |

Dynamics

| Short | Full |
|---|---|
| CG | center of gravity |

Frames

| Short | Full |
|---|---|
| DCM | direction cosine matrix |

Probability

| Short | Full |
|---|---|
| RV | random variable |
| PDF | probability density function |
| KL | Kullback-Leibler |
| GP | Gaussian process |

Neural networks

| Short | Full |
|---|---|
| MTS | multivariate time series |
| LSTM | long short-term memory |
| NN | neural network |
| CNN | convolutional neural network |
| MLP | multilayer perceptron |
| ReLU | rectified linear unit |
| SGD | stochastic gradient descent |
| GRU | gated recurrent unit |
| RNN | recurrent neural network |

Reinforcement learning

| Short | Full |
|---|---|
| RL | reinforcement learning |
| RLHF | reinforcement learning from human feedback |
| DAgger | dataset aggregation |
| gSDE | generalized state-dependent exploration |
| AMP | adversarial motion priors |
| MDP | Markov decision process |
| SAC | Soft Actor-Critic |
| SARSA | State, Action, Reward, State, Action |
| TD | temporal difference |
| GLIE | greedy in the limit with infinite exploration |
| DQN | Deep Q-Network |
| DDQN | Double DQN |
| TRPO | Trust Region Policy Optimization |
| PPO | Proximal Policy Optimization |
| DDPG | Deep Deterministic Policy Gradient |
| TD3 | Twin Delayed Deep Deterministic Policy Gradient |