This video introduces TF-Agents, a reinforcement learning library for TensorFlow. It explains the library's key components and shows how to use them to train a CartPole agent.
What does the Q in DQN stand for?
Which of the following is not an agent implemented in TF-Agents: DQN, PPO, or A2C?
What does Reverb stand for?