
Bibliographic Details
Main Author:	Alexandre Nakao França, Thiago
Format:	Digital resource
Language:
Published:	Zenodo 2025
Online Access:	https://doi.org/10.5281/zenodo.17089173

Table of Contents:

<h2>Database Creation</h2>
In this work, two datasets were created: a real dataset and a synthetic dataset.
<ul>
<li> The real dataset was obtained by collecting stock information for 7,984 distinct products across 177 online stores (suppliers) in a marketplace. The data was collected between October 8 and October 11, 2023. </li>
<li> The synthetic dataset includes all records from the real dataset, plus an expansion that ensures every supplier has every product of the real dataset in stock. It was created to analyze the performance of solution approaches in a significantly larger search space. </li>
</ul>
<h3>Real Dataset</h3>
The real dataset contains 635,723 stock records covering 177 suppliers and 7,984 distinct products. To preserve the privacy of the suppliers, all identifiers related to stores and products were anonymized. Each record contains:
<ul>
<li> Supplier name (anonymized) </li>
<li> Product name (anonymized) </li>
<li> Price </li>
<li> Quantity in stock </li>
<li> Date of last collection </li>
</ul>
Suppliers in this dataset vary significantly in stock capacity. For example, only 4 suppliers have more than 7,000 distinct products, while 50 suppliers have more than 5,000 products, and 20 suppliers have fewer than 1,000 products in stock.
<h3>Synthetic Dataset</h3>
The synthetic dataset is an expansion of the real dataset, ensuring that every supplier provides all 7,984 products. To build it, the real dataset was first replicated. Then, for each supplier–product pair that was originally missing, the following process was applied:
<ul>
<li> Price generation: a non-negative random sample was drawn from a normal distribution defined by the mean and standard deviation of the prices of the same product across suppliers in the real dataset. </li>
<li> Quantity generation: the same process was applied, using stock quantities instead of prices. </li>
</ul>
As a result, each of the 177 suppliers has all 7,984 products in stock, totaling 1,413,168 records (177 × 7,984). This synthetic dataset enables experiments in scenarios where product availability is maximized, creating a larger and more challenging search space.
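The expansion procedure described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions, since the description does not specify them: non-negativity is enforced here by redrawing negative samples, the population standard deviation is used (so a product with a single real record simply reuses that record's values), prices are rounded to two decimals, quantities are rounded to integers, and the collection date field is omitted. The names `draw_non_negative` and `expand_to_full_coverage` are hypothetical.

```python
import random
from collections import defaultdict
from statistics import mean, pstdev


def draw_non_negative(rng, mu, sigma):
    """Redraw from N(mu, sigma) until the sample is >= 0 (assumed strategy)."""
    while True:
        sample = rng.gauss(mu, sigma)
        if sample >= 0:
            return sample


def expand_to_full_coverage(records, seed=0):
    """Return the real records plus one synthetic record per missing pair.

    `records` is a list of (supplier, product, price, quantity) tuples.
    """
    rng = random.Random(seed)
    prices = defaultdict(list)
    quantities = defaultdict(list)
    suppliers = set()
    present = set()
    for supplier, product, price, quantity in records:
        prices[product].append(price)
        quantities[product].append(quantity)
        suppliers.add(supplier)
        present.add((supplier, product))

    expanded = list(records)  # step 1: replicate the real dataset
    for product in sorted(prices):
        # Per-product statistics computed from the real records only.
        price_mu, price_sigma = mean(prices[product]), pstdev(prices[product])
        qty_mu, qty_sigma = mean(quantities[product]), pstdev(quantities[product])
        for supplier in sorted(suppliers):
            if (supplier, product) in present:
                continue  # real record exists, keep it unchanged
            # step 2: fill the missing supplier-product pair
            price = round(draw_non_negative(rng, price_mu, price_sigma), 2)
            quantity = round(draw_non_negative(rng, qty_mu, qty_sigma))
            expanded.append((supplier, product, price, quantity))
    return expanded


# Toy data (invented for illustration): supplier S2 does not stock P2.
real = [
    ("S1", "P1", 10.0, 5),
    ("S2", "P1", 12.0, 7),
    ("S1", "P2", 3.5, 20),
]
synthetic = expand_to_full_coverage(real)
print(len(synthetic))  # 2 suppliers x 2 products = 4 records
```

On the full dataset, the same counting logic gives 177 suppliers times 7,984 products, which matches the 1,413,168 records reported above.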
