Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable specifying the max_optimization_threads on post_upload() method for better indexing times on VMs with large CPU count #163

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

filipecosta90
Copy link
Contributor

@filipecosta90 filipecosta90 commented Jun 13, 2024

This pull request introduces enhancements to the Qdrant client configuration on issues observed on large vector count ( > 40M ). The key changes include:

  1. Max Optimization Threads:
    • Added the ability to specify the max_optimization_threads parameter within the post_upload() method, enabling better control over resource utilization during optimization tasks. By looking at the actual usage of qdrant cloud instances during ingestion and indexing i could see that we were not using the entire deployment VCPUs. Adding the ability control the ammount of optimization threads should give us better usage. By default it will follow qdrant's Optimizer config base model ("If null - have no limit and choose dynamically to saturate CPU")

confirmation that indeed this change is effective, using a 1M vector sample dataset :

"None" - meaning no limit and choose dynamically to saturate CPU

1000000it [00:26, 38223.74it/s]
Upload time: 26.269824364921078
Total import time: 36.38025348598603

QDRANT_MAX_OPTIMIZATION_THREADS=1 - The current value of master

1000000it [00:26, 37504.10it/s]
Upload time: 26.770209033973515
Total import time: 76.93960217398126

  1. Exponential Backoff:

@filipecosta90 filipecosta90 changed the title Enable specifying the max_optimization_threads on post_upload() for q… Enable specifying the max_optimization_threads on post_upload() method for better indexing times on VMs with large CPU count Jun 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant