Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Potential Performance Degradation with Large Document Sets #2122

Open
9 tasks done
T-Ezzoury opened this issue Nov 11, 2024 · 0 comments
Open
9 tasks done

[BUG] Potential Performance Degradation with Large Document Sets #2122

T-Ezzoury opened this issue Nov 11, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@T-Ezzoury
Copy link

Pre-check

  • I have searched the existing issues and none cover this bug.

Description

When working with large datasets, there could be noticeable delays or performance drops during document ingestion and context retrieval

Steps to Reproduce

1- Prepare a dataset containing a large number of documents.
2- Ingest the dataset using the tool’s document ingestion process.
3- Observe the performance during the ingestion and context retrieval phases.

Expected Behavior

The system should efficiently handle large document sets with minimal delays.

Actual Behavior

Noticeable delays and performance degradation are observed when processing a large number of documents.

Environment

Windows 11, NVIDIA® H100, H200

Additional Information

No response

Version

No response

Setup Checklist

  • Confirm that you have followed the installation instructions in the project’s documentation.
  • Check that you are using the latest version of the project.
  • Verify disk space availability for model storage and data processing.
  • Ensure that you have the necessary permissions to run the project.

NVIDIA GPU Setup Checklist

  • Check that the all CUDA dependencies are installed and are compatible with your GPU (refer to CUDA's documentation)
  • Ensure an NVIDIA GPU is installed and recognized by the system (run nvidia-smi to verify).
  • Ensure proper permissions are set for accessing GPU resources.
  • Docker users - Verify that the NVIDIA Container Toolkit is configured correctly (e.g. run sudo docker run --rm --gpus all nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi)
@T-Ezzoury T-Ezzoury added the bug Something isn't working label Nov 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant