训练文本生成
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
hiyouga e3aaef7d4a fix layer norm name in PPO 2 years ago
..
utils fix layer norm name in PPO 2 years ago
__init__.py Initial commit 2 years ago
cli_demo.py alter rewards data type 2 years ago
export_model.py support BLOOM models 2 years ago
train_ppo.py alter rewards data type 2 years ago
train_pt.py alter rewards data type 2 years ago
train_rm.py alter rewards data type 2 years ago
train_sft.py alter rewards data type 2 years ago
web_demo.py alter rewards data type 2 years ago