YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

SAC Ant Agent

A SAC agent trained on the MuJoCo Ant-v4 environment.

Training Details

Algorithm: SAC
Timesteps: 2.4e6

- learning_rate=3e-4,
- buffer_size=1_000_000,      # 经验回放缓冲区大小. 这个参数PPO没有
- batch_size=256,             # 默认256
- tau=0.005,                  # 软更新系数
- gamma=0.99,                 # 折扣因子
- train_freq=1,               # 每步都训练，采集多少个环境步的数据后训练一次
- gradient_steps=1,           # 对replayBuffer中读取到的batch，进行多少次梯度下降更新

Usage

!pip install huggingface_sb3
# login to huggingFace
!huggingface-cli login

from stable_baselines3 import SAC
from huggingface_sb3 import load_from_hub

model_path = load_from_hub(   # This function only returns the path of the cached model
    repo_id="buffaX/sac-ant-v4", 
    filename="sac_ant.zip"
)

model = SAC.load(model_path)
print(model.actor)
print(model.critic)

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support