Deletion of large number of graphs #1338
Comments
How many triples are in these 30K graphs? Are you deleting the graphs with transaction log mode set to Are you ultimately seeking to delete and reload these 30K graphs, as you indicate above?
That was an example; the number and size of the graphs can vary, but in total there could be up to billions of triples. Currently, we are completely disabling it with Yes, the goal is that some graphs may need to be removed and then reloaded into the database. The files loaded might be Turtle, TriG, or N-Quads. Thank you.
As detailed in the log_enable documentation:
So, The key is to not be in |
In my Virtuoso database instance I have around 30k graphs that I want to delete. I've searched for an efficient solution but haven't found one so far. Trying to delete so many graphs leads to severe performance problems. It would take less time to restart the database from scratch and re-ingest all the 30k graphs.
I tried different approaches, such as:
In my opinion, the main issue is that when some graphs are removed from the rdf_quad table, a re-indexing is triggered, and this takes the majority of the computational time. I tried to remove the indexes associated with the rdf_quad table, but this created inconsistencies.
The best approach would be to first remove all the graphs from the rdf_quad table (doing it in batches should also work), and then trigger the re-indexing once at the end of the removal. That way we reduce the re-indexing overhead by running it as few times as possible. It's fine if the database is not accessible during the deletion.
Do you have any suggestions on how this could be achieved, or any other recommendations on how to remove a large number of graphs from the database? Thank you.
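To make the batched removal idea concrete, here is a minimal sketch in Python that only generates an isql script; it does not connect to Virtuoso. The helper names (`batched`, `build_delete_script`), the batch size, and the example IRIs are hypothetical. The `log_enable(3, 1)` setting (row-by-row autocommit with logging on) is an assumption on my side rather than a confirmed recommendation, so it should be checked against the log_enable documentation before use.

```python
from typing import Iterable, Iterator, List


def batched(items: List[str], size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most `size` items."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_delete_script(graph_iris: Iterable[str], batch_size: int = 500) -> str:
    """Return isql statements that clear each graph, one checkpoint per batch.

    Assumption: log_enable(3, 1) is the desired mode for this session;
    adjust it to whatever the log_enable documentation suggests for your setup.
    """
    lines = ["log_enable(3, 1);"]
    for batch in batched(list(graph_iris), batch_size):
        for iri in batch:
            # Reject characters that would break out of the <...> IRI syntax.
            if any(ch.isspace() or ch in '<>"' for ch in iri):
                raise ValueError(f"unsafe graph IRI: {iri!r}")
            lines.append(f"SPARQL CLEAR GRAPH <{iri}>;")
        # Persist the work done so far before starting the next batch.
        lines.append("checkpoint;")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical graph names, used only for illustration.
    graphs = [f"http://example.org/graph/{i}" for i in range(5)]
    print(build_delete_script(graphs, batch_size=2), end="")
```

The output could be saved to a file and run through isql while the database is offline for other clients. Whether this avoids the indexing overhead I described is something I have not verified.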