[23/06/15] Now we support training the baichuan-7B model in this repo. Try the `--model_name_or_path baichuan-inc/baichuan-7B` argument to use the baichuan-7B model.
[23/06/03] Now we support quantized training and inference (aka [QLoRA](https://github.com/artidoro/qlora)). Try the `--quantization_bit 4/8` argument to work with a quantized model. (experimental feature)
[23/05/31] Now we support training the BLOOM & BLOOMZ models in this repo. Try the `--model_name_or_path bigscience/bloomz-7b1-mt` argument to use the BLOOMZ model.
We recommend using `--per_device_eval_batch_size=1` and `--max_target_length 128` for 4/8-bit evaluation.
### CLI Demo
```bash
python src/cli_demo.py \
    --model_name_or_path path_to_your_model \
    --checkpoint_dir path_to_checkpoint
```
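The quantization option from the changelog can presumably be combined with the demo command. The sketch below is a dry run: it only prints the command it would run, so it can be checked without loading a model. It assumes that `src/cli_demo.py` accepts `--quantization_bit` in the same way as the training scripts; `build_demo_cmd` is a hypothetical helper, not part of this repo, and the paths are the same placeholders used above.

```bash
#!/usr/bin/env bash
# Dry-run sketch: print a CLI demo command with quantized inference enabled.
# Assumption: src/cli_demo.py accepts --quantization_bit (experimental feature).
set -euo pipefail

# Hypothetical helper, not part of the repo.
build_demo_cmd() {
    local bits="$1"
    # The README only mentions 4 and 8 as supported bit widths.
    case "$bits" in
        4|8) ;;
        *) echo "quantization bit must be 4 or 8, got: $bits" >&2; return 1 ;;
    esac
    echo "python src/cli_demo.py" \
        "--model_name_or_path path_to_your_model" \
        "--checkpoint_dir path_to_checkpoint" \
        "--quantization_bit $bits"
}

# Print the command for 4-bit inference; copy it or pipe it to a shell to run it.
build_demo_cmd 4
```

Replace the placeholder paths with real ones before running the printed command.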
### Web Demo
```bash
python src/web_demo.py \
    --model_name_or_path path_to_your_model \
    --checkpoint_dir path_to_checkpoint
```
@ -236,7 +235,7 @@ python src/web_demo.py \
```bash
python src/export_model.py \
    --model_name_or_path path_to_your_model \
    --checkpoint_dir path_to_checkpoint \
    --output_dir path_to_export
```
@ -249,6 +248,8 @@ Please follow the [Model Card](https://github.com/facebookresearch/llama/blob/ma
Please follow the [RAIL License](https://huggingface.co/spaces/bigscience/license) to use the BLOOM & BLOOMZ models.
Please follow the [baichuan-7B License](https://huggingface.co/baichuan-inc/baichuan-7B/resolve/main/baichuan-7B%20%E6%A8%A1%E5%9E%8B%E8%AE%B8%E5%8F%AF%E5%8D%8F%E8%AE%AE.pdf) to use the baichuan-7B model.