This repository was archived by the owner on Aug 4, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 14
Running XGen 7B Chatbot in your cloud
Andrey Cheptsov edited this page Jul 5, 2023
·
3 revisions
XGen-7B is a new LLM by Salesforce, released under Apache 2.0.
This repository contains the source code of a chatbot application, along with the dstack configuration that allows you to run this application in your cloud with a single command, automatically provisioning cloud resources for you.
pip install "dstack[aws,gcp,azure,lambda]" -U
dstack startOnce the dstack server is up, create a project with your cloud credentials (AWS, GCP, or Azure). Then, copy its dstack config command and run it to configure the CLI.
git clone https://github.com/dstackai/dstack-examples
cd dstack-examples
dstack initCreate a .dstack/profiles.yml file that points to the created project and describes the resources.
Example:
profiles:
- name: gcp project: gcp resources: memory: 48GB gpu: memory: 24GB default: true
dstack run xgen-chatbot --reload
dstack will automatically forward the port to your local machine, providing secure and convenient access.
For more details on how dstack works, check out its documentation.