使用vllm部署
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

22 lines
519 B

## 安装环境
2 years ago
```bash
conda create -n vllm python=3.8
pip install -r requirements.txt
```
## 启动项目
2 years ago
```bash
conda activate vllm
bash run_api_nohup.sh
bash run_model.sh
```
## 测试
```bash
curl -H "Content-Type: application/json" -X POST -d '{"texts": "User: 你好\nAssistant:"}' http://192.168.31.149:12000/predict
curl -H "Content-Type: application/json" -X POST -d '{"id": "b412fc98-bdd7-11ee-8d23-d5e5c66dd02e"}' http://192.168.31.149:12000/search
```
返回"status_code"不出现 400 则调用成功