Worst-Case-Optimal Similarity Joins on Graph Databases
dc.catalogador | gjm | |
dc.contributor.author | Arroyuelo Billiardi, Diego Gastón | |
dc.contributor.author | Bustos, Benjamin | |
dc.contributor.author | Gómez-Brandón, Adrián | |
dc.contributor.author | Hogan, Aidan | |
dc.contributor.author | Navarro, Gonzalo | |
dc.contributor.author | Reutter de la Maza, Juan | |
dc.date.accessioned | 2024-05-23T17:13:44Z | |
dc.date.available | 2024-05-23T17:13:44Z | |
dc.date.issued | 2024 | |
dc.description.abstract | We extend the concept of worst-case optimal equijoins in graph databases to the case where some nodes are required to be within the k-nearest neighbors (kNN) of others under some similarity function. We model the problem by superimposing the database graph with the kNN graph and show that a variant of Leapfrog TrieJoin (LTJ) implemented over a compact data structure called the Ring can be seamlessly extended to integrate similarity clauses with the equijoins in the LTJ query process, retaining worst-case optimality in many relevant cases. Our experiments on a benchmark that combines Wikidata and IMGpedia show that our enhanced LTJ algorithm outperforms by a considerable margin a baseline that first applies classic LTJ and then completes the query by applying the similarity predicates. The difference is more pronounced on queries where the similarity clauses are more densely connected to the query, becoming of an order of magnitude in some cases. | |
dc.fechaingreso.objetodigital | 2024-08-30 | |
dc.format.extent | 26 páginas | |
dc.fuente.origen | ORCID | |
dc.identifier.doi | 10.1145/3639294 | |
dc.identifier.uri | https://doi.org/10.1145/3639294 | |
dc.identifier.uri | https://dl.acm.org/doi/10.1145/3639294 | |
dc.identifier.uri | https://repositorio.uc.cl/handle/11534/85770 | |
dc.information.autoruc | Escuela de Ingeniería; Arroyuelo Billiardi, Diego Gastón; S/I; 1321287 | |
dc.information.autoruc | Escuela de Ingeniería; Reutter de la Maza, Juan; 0000-0002-2186-0312; 126898 | |
dc.issue.numero | 1 | |
dc.language.iso | en | |
dc.nota.acceso | Contenido parcial | |
dc.revista | Proceedings of the ACM on Management of Data | |
dc.rights | acceso restringido | |
dc.subject | Worst-case optimal joins | |
dc.subject | Leapfrog Triejoin | |
dc.subject | Graph patterns | |
dc.subject | Graph databases | |
dc.subject | Graph indexing | |
dc.subject | Similarity joins | |
dc.subject | Nearest-neighbor graphs | |
dc.subject.ddc | 600 | |
dc.subject.dewey | Tecnología | es_ES |
dc.title | Worst-Case-Optimal Similarity Joins on Graph Databases | |
dc.type | artículo | |
dc.volumen | 2 | |
sipa.codpersvinculados | 1321287 | |
sipa.codpersvinculados | 126898 | |
sipa.trazabilidad | ORCID;2024-05-20 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Worst-Case-Optimal Similarity Joins on Graph Databases.pdf
- Size:
- 28.56 KB
- Format:
- Adobe Portable Document Format
- Description: