- An ASK cluster is created. For more information, see Create an ASK cluster
- The cluster runs as expected. You can log on to the Container Service for Kubernetes (ACK) console, navigate to the Clusters page, and then check whether the cluster is in the Running state.
FastChat is an open platform for training, serving, and evaluating large language model based chatbots.
- create deployment & service
# only for gpu
kubectl apply -f fastchat-gpu.yaml
- wait deployment ready
$ kubectl get po|grep fastchat
---
NAME READY STATUS RESTARTS AGE
fastchat-65f7cbfbc5-gb7wd 1/1 Running 0 30m
- Using FastChat
Run the following command to port-forward:
kubectl port-forward -n <namespace> service/fastchat-svc 7860:7860
And then open the console using the following URL:
http://localhost:7860
tag | Date | release |
---|---|---|
v1.1.0 | 2023-12 | model: fastchat-t5-3b-v1.0 |