stable_learning_control.algos.tf2.policies.critics
Critic network structures.
Submodules
Classes
Soft Lyapunov critic Network. |
|
Soft Q critic network. |
Package Contents
- class stable_learning_control.algos.tf2.policies.critics.LCritic(obs_dim, act_dim, hidden_sizes, activation=nn.relu, name='lyapunov_critic', **kwargs)[source]
Bases:
tf.keras.Model
Soft Lyapunov critic Network.
- L
The layers of the network.
- Type:
Initialise the LCritic object.
- Parameters:
obs_dim (int) – Dimension of the observation space.
act_dim (int) – Dimension of the action space.
hidden_sizes (list) – Sizes of the hidden layers.
activation (
tf.keras.activations
, optional) – The activation function. Defaults totf.nn.relu
.name (str, optional) – The Lyapunov critic name. Defaults to
lyapunov_critic
.**kwargs – All kwargs to pass to the
tf.keras.Model
. Can be used to add additional inputs or outputs.
- L
- class stable_learning_control.algos.tf2.policies.critics.QCritic(obs_dim, act_dim, hidden_sizes, activation=nn.relu, output_activation=None, name='q_critic', **kwargs)[source]
Bases:
tf.keras.Model
Soft Q critic network.
- Q
The layers of the network.
- Type:
Initialise the QCritic object.
- Parameters:
obs_dim (int) – Dimension of the observation space.
act_dim (int) – Dimension of the action space.
hidden_sizes (list) – Sizes of the hidden layers.
activation (
tf.keras.activations
, optional) – The activation function. Defaults totf.nn.relu
.output_activation (
tf.keras.activations
, optional) – The activation function used for the output layers. Defaults toNone
which is equivalent to using the Identity activation function.name (str, optional) – The Lyapunov critic name. Defaults to
q_critic
.**kwargs – All kwargs to pass to the
tf.keras.Model
. Can be used to add additional inputs or outputs.
- Q