Module weight_ops

Weight vector operations for meta-learning and target network updates.

Provides Reptile-style meta updates, Polyak (exponential moving average) updates for SAC/TD3 target networks, and weight vector averaging.

Structs§

PolyakUpdate
Polyak (EMA) target network update strategy.
ReptileUpdate
Reptile meta-learning update strategy.

Traits§

WeightUpdate
Trait for weight vector update strategies.

Functions§

average_weight_vectors
Average N weight vectors element-wise: result[i] = mean(vectors[j][i] for all j)
polyak_update
Exponential moving average (Polyak update): target[i] = tau * source[i] + (1 - tau) * target[i]
reptile_update
Reptile weight update: params += lr * (task_params - params)
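
The three formulas above can be sketched on plain `f32` slices as follows. This is a minimal illustration, not the crate's actual API: the `_sketch` function names, the `f32` element type, the argument order, and the error handling (`Option` for averaging, `assert_eq!` on length mismatch) are all assumptions made for the example.

```rust
/// Element-wise mean: result[i] = mean(vectors[j][i] for all j).
/// Hypothetical signature; returns `None` if the input is empty
/// or the vectors do not all have the same length.
fn average_weight_vectors_sketch(vectors: &[Vec<f32>]) -> Option<Vec<f32>> {
    let first = vectors.first()?;
    if vectors.iter().any(|v| v.len() != first.len()) {
        return None;
    }
    let n = vectors.len() as f32;
    // Accumulate the per-index sums, then divide by the vector count.
    let mut result = vec![0.0f32; first.len()];
    for v in vectors {
        for (r, x) in result.iter_mut().zip(v) {
            *r += *x;
        }
    }
    for r in result.iter_mut() {
        *r /= n;
    }
    Some(result)
}

/// Polyak (EMA) update, in place:
/// target[i] = tau * source[i] + (1 - tau) * target[i].
/// Hypothetical signature; panics if the lengths differ.
fn polyak_update_sketch(target: &mut [f32], source: &[f32], tau: f32) {
    assert_eq!(target.len(), source.len(), "length mismatch");
    for (t, s) in target.iter_mut().zip(source) {
        *t = tau * *s + (1.0 - tau) * *t;
    }
}

/// Reptile update, in place: params += lr * (task_params - params).
/// Hypothetical signature; panics if the lengths differ.
fn reptile_update_sketch(params: &mut [f32], task_params: &[f32], lr: f32) {
    assert_eq!(params.len(), task_params.len(), "length mismatch");
    for (p, t) in params.iter_mut().zip(task_params) {
        *p += lr * (*t - *p);
    }
}
```

Note that the Polyak and Reptile formulas are the same interpolation written two ways: `tau * source + (1 - tau) * target` equals `target + tau * (source - target)`, so `tau` and `lr` play the same role. With `tau = 1.0` (or `lr = 1.0`) the destination is overwritten by the source, and with `0.0` it is left unchanged.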