Query Completion for Small-Scale Distributed Databases in PostgreSQL and MongoDB

Relational/SQL and document/JSON data stores are competing but complementary technologies in OLAP (On-Line Analytical Processing) systems. Whereas traditional approaches for performance comparison use query execution time, this paper compares two distributed setups deployed on PostgreSQL/Citus and M...

Full description

Saved in:
Bibliographic Details
Main Authors: Marin FOTACHE, Catalina BADEA, Marius-Iulian CLUCI, Codrin-Stefan ESANU
Format: Article
Language:English
Published: Bucharest University of Economic Studies 2025-01-01
Series:Database Systems Journal
Subjects:
Online Access:https://www.dbjournal.ro/archive/36/36_2.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Relational/SQL and document/JSON data stores are competing but complementary technologies in OLAP (On-Line Analytical Processing) systems. Whereas traditional approaches for performance comparison use query execution time, this paper compares two distributed setups deployed on PostgreSQL/Citus and MongoDB by focusing solely on query completion within a 10-minute timeout. The TPC-H benchmark was converted into a denormalized JSON schema in MongoDB. An initial set of 296 SQL queries was executed in PostgreSQL/Citus and then mapped to MongoDB’s Aggregation Framework. Completion success was collected across six scenarios defined by two data sizes (0.01 GB, 0.1 GB) and three node counts (3, 6, 9). Relationships between completion rates and query parameters were assessed using statistical tests and machine learning techniques.
ISSN:2069-3230