plamo-webui

WebUI container demo of Preferred Networks PLaMo-13B

Preferred Networks has released PLaMo-13B, a large language model (LLM), so I built a simple container that runs it with gradio. https://tech.preferred.jp/ja/blog/llm-plamo/

The image is based on the PyTorch container from NGC: https://catalog.ngc.nvidia.com/orgs/nvidia/containers/pytorch

Tested on an NVIDIA A100 80GB PCIe. The model appears to need about 50GB of GPU memory.
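
Since the model appears to need about 50GB of GPU memory, it can help to check the host GPU before building. The following is a minimal sketch, not part of this repository: `has_enough_gpu_memory` is a hypothetical helper, and the `REQUIRED_MIB` threshold of 50000 is an assumption taken from the rough figure above.

```shell
# Assumed threshold in MiB, based on the "about 50GB" observation above.
REQUIRED_MIB=${REQUIRED_MIB:-50000}

# Hypothetical helper: reads one total-memory value (MiB) per line on stdin
# and succeeds if at least one GPU meets the threshold.
has_enough_gpu_memory() {
  awk -v need="$REQUIRED_MIB" '$1 + 0 >= need { ok = 1 } END { exit ok ? 0 : 1 }'
}

# Usage (requires nvidia-smi on the host):
#   nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits \
#     | has_enough_gpu_memory \
#     && echo "GPU memory looks sufficient" \
#     || echo "GPU memory may be too small"
```

The check only compares total memory per GPU. It does not account for memory already in use by other processes.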

build

docker build . -t plamo-webui

run

docker run plamo-webui

or

# run detached (as a daemon)
docker run --rm -d --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 -p 7860:7860 plamo-webui

Then open http://localhost:7860/ in a browser.
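
When the container runs detached, the WebUI may not answer immediately. How long startup takes is not stated here, so the retry budget below is an assumption. This sketch polls until a probe command succeeds; `wait_until` is a hypothetical helper, not part of this repository.

```shell
# Hypothetical helper: retry a probe command until it succeeds
# or the number of attempts is exhausted.
# Arguments: <tries> <delay-seconds> <command...>
wait_until() {
  tries=$1; delay=$2; shift 2
  i=0
  while [ "$i" -lt "$tries" ]; do
    "$@" && return 0
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Usage: poll the WebUI every 5 seconds, up to 120 times (assumed budget).
#   wait_until 120 5 curl -sf -o /dev/null http://localhost:7860/ \
#     && echo "WebUI is up"
```

Passing the probe as arguments keeps the helper independent of curl, so any command that exits 0 on success can be used instead.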
