[Samples] merge LLM samples to "text_generation" folder #1411
Conversation
port: #28248 connected to: openvinotoolkit/openvino.genai#1411
connected to: openvinotoolkit/openvino.genai#1411 Co-authored-by: Andrzej Kopytko <[email protected]>
If there are no more major comments, I will make similar changes to the Python samples.
You have a merge conflict
@ilya-lavrenov re-review please. The PR cannot be merged without a +1 from you.
This example showcases inference of text-generation Large Language Models (LLMs): `chatglm`, `LLaMA`, `Qwen` and other models with the same signature. The application doesn't have many configuration options to encourage the reader to explore and modify the source code. For example, change the device for inference to GPU. The sample features `openvino_genai.LLMPipeline` and configures it to run the simplest deterministic greedy sampling algorithm. There is also a Jupyter [notebook](https://github.com/openvinotoolkit/openvino_notebooks/tree/latest/notebooks/llm-chatbot) which provides an example of LLM-powered Chatbot in Python.
These samples showcase the use of OpenVINO's inference capabilities for text generation tasks, including different decoding strategies such as beam search, multinomial sampling, and speculative decoding. Each sample has a specific focus and demonstrates a unique aspect of text generation.
The applications don't have many configuration options to encourage the reader to explore and modify the source code. For example, change the device for inference to GPU.
There are also Jupyter notebooks for some samples. You can find links to them in the appropriate sample descritions.
Typo: descritions -> descriptions
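For context, here is a minimal Python sketch of the greedy-decoding pipeline the readme excerpt above describes. It assumes the `openvino-genai` package is installed and a model has already been exported as shown in the next section; the model directory name is illustrative.

```python
import openvino_genai

# LLMPipeline defaults to deterministic greedy decoding when no
# sampling parameters are set in the generation config.
pipe = openvino_genai.LLMPipeline("TinyLlama-1.1B-Chat-v1.0", "CPU")
print(pipe.generate("What is OpenVINO?", max_new_tokens=100))
```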
## Download and convert the model and tokenizers
The `--upgrade-strategy eager` option is needed to ensure `optimum-intel` is upgraded to the latest version.
Install [../../export-requirements.txt](../../export-requirements.txt) to convert a model. |
I propose to have a clear message in the readme that export-requirements.txt is needed for conversion/optimization and deployment-requirements.txt is needed to run the sample, instead of installing both and mentioning that export-requirements.txt isn't needed if the model is already exported.
It will be easier for developers to understand which dependencies are necessary for which stage (model preparation vs. model deployment).
"Install ../../export-requirements.txt to convert a model" looks more appropriate in this case.
```sh
pip install --upgrade-strategy eager -r ../../export-requirements.txt
pip install --upgrade-strategy eager -r ../../requirements.txt
optimum-cli export openvino --trust-remote-code --model TinyLlama/TinyLlama-1.1B-Chat-v1.0 TinyLlama-1.1B-Chat-v1.0
```
What about also adding two other options to prepare the model here:
- Download the converted model from HF: `huggingface-cli download "OpenVINO/TinyLlama-1.1B-Chat-v1.0-int8-ov" --local-dir TinyLlama-1.1B-Chat-v1.0-int8-ov`
- Convert the model and compress weights to int4 precision: `optimum-cli export openvino --trust-remote-code --model TinyLlama/TinyLlama-1.1B-Chat-v1.0 --weight-format int4 TinyLlama-1.1B-Chat-v1.0-int4`

It will help developers see that there are multiple options to prepare the model.
We can also emphasize that "Download the converted model from HF" can be the preferred option for this sample (no need to spend time on conversion).
Is huggingface-cli installed by export-requirements.txt as a dependency?
No, it isn't. Please add huggingface_hub to export-requirements.txt
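For reference, a Python sketch of the two preparation options discussed above, assuming `huggingface_hub` and `optimum-intel` are available (per the comment above, `huggingface_hub` may need to be added to export-requirements.txt first). `snapshot_download` is the programmatic counterpart of `huggingface-cli download`.

```python
# Option 1: download a pre-converted model from the Hugging Face Hub.
from huggingface_hub import snapshot_download

snapshot_download(
    "OpenVINO/TinyLlama-1.1B-Chat-v1.0-int8-ov",
    local_dir="TinyLlama-1.1B-Chat-v1.0-int8-ov",
)

# Option 2: convert locally and compress weights to int4 by shelling
# out to optimum-cli (equivalent to the command in the comment above).
import subprocess

subprocess.run(
    [
        "optimum-cli", "export", "openvino",
        "--trust-remote-code",
        "--model", "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
        "--weight-format", "int4",
        "TinyLlama-1.1B-Chat-v1.0-int4",
    ],
    check=True,
)
```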
## Sample Descriptions
### Common information
Follow [Get Started with Samples](https://docs.openvino.ai/2024/learn-openvino/openvino-samples/get-started-demos.html) to get common information about OpenVINO samples.
Discrete GPUs (dGPUs) usually provide better performance compared to CPUs. It is recommended to run larger models on a dGPU with 32GB+ RAM. For example, the model meta-llama/Llama-2-13b-chat-hf can benefit from being run on a dGPU. Modify the source code to change the device for inference to the GPU.
We can recommend using GPU without mentioning discrete GPU, because iGPU works perfectly with LLMs. The recommendation can be based on performance, not memory.
Which dGPU with 32GB+ RAM is meant here exactly?
Intel Arc has up to 16 GB of memory.
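As a sketch of the device change the readme asks the reader to make (not the sample's actual source), the target device is the second argument of `openvino_genai.LLMPipeline`; the model directory name is illustrative.

```python
import openvino_genai

# "GPU" targets the default GPU device (integrated or discrete);
# use "GPU.0", "GPU.1", ... to pick a specific one.
pipe = openvino_genai.LLMPipeline("TinyLlama-1.1B-Chat-v1.0", "GPU")
```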
```sh
./beam_search_causal_lm <MODEL_DIR> "<PROMPT 1>" ["<PROMPT 2>" ...]
```
### 3. Chat Sample (`chat_sample`) |
Please consider putting the chat sample in first place in the list, as it is the most popular sample.
## Sample Descriptions
### Common information
Follow [Get Started with Samples](https://docs.openvino.ai/2024/learn-openvino/openvino-samples/get-started-demos.html) to get common information about OpenVINO samples.
Clear instructions on how to build the samples should be provided. For example, https://github.com/openvinotoolkit/openvino.genai/blob/master/src/docs/BUILD.md could be extended with a sample build section, and those instructions linked from this readme.
Details: Update links to genai samples; related to openvinotoolkit/openvino.genai#1411. Tickets: *ticket-id*
Details: Update links to genai samples to the 2024.6 branch; related to openvinotoolkit/openvino.genai#1411. Tickets: *ticket-id*
port: #28248 connected to: openvinotoolkit/openvino.genai#1411
@DimaPastushenkov I addressed your comments in #1545. Please review.