Saved in:
| Main Authors: | Li, Chen, Zhu, Ye, Cao, Yang, Zhang, Jinli, Annisa, Annisa, Cheng, Debo, Morimoto, Yasuhiko |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.03254 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Distributed Record Linkage in Healthcare Data with Apache Spark
by: Heydari, Mohammad, et al.
Published: (2024)
by: Heydari, Mohammad, et al.
Published: (2024)
Performance comparison of Dask and Apache Spark on HPC systems for Neuroimaging
by: Dugré, Mathieu, et al.
Published: (2024)
by: Dugré, Mathieu, et al.
Published: (2024)
Large-Scale Network Embedding in Apache Spark
by: Lin, Wenqing
Published: (2021)
by: Lin, Wenqing
Published: (2021)
Comparative analysis of large data processing in Apache Spark using Java, Python and Scala
by: Borodii, Ivan, et al.
Published: (2025)
by: Borodii, Ivan, et al.
Published: (2025)
Demystifying Object-based Big Data Storage Systems
by: Mondal, Anindita Sarkar, et al.
Published: (2024)
by: Mondal, Anindita Sarkar, et al.
Published: (2024)
Towards Polyglot Data Processing in Social Networks using the Hadoop-Spark ecosystem
by: Seabra, Antony, et al.
Published: (2025)
by: Seabra, Antony, et al.
Published: (2025)
Distributed Indexing Schemes for k-Dominant Skyline Analytics on Uncertain Edge-IoT Data
by: Lai, Chuan-Chi, et al.
Published: (2023)
by: Lai, Chuan-Chi, et al.
Published: (2023)
A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache Airflow
by: Medeiros, Daniel, et al.
Published: (2024)
by: Medeiros, Daniel, et al.
Published: (2024)
BigSUMO: A Scalable Framework for Big Data Traffic Analytics and Parallel Simulation
by: Sengupta, Rahul, et al.
Published: (2026)
by: Sengupta, Rahul, et al.
Published: (2026)
Analysis of Server Throughput For Managed Big Data Analytics Frameworks
by: Anagnostakis, Emmanouil, et al.
Published: (2025)
by: Anagnostakis, Emmanouil, et al.
Published: (2025)
Distributed Continuous Range-Skyline Query Monitoring over the Internet of Mobile Things
by: Lai, Chuan-Chi, et al.
Published: (2019)
by: Lai, Chuan-Chi, et al.
Published: (2019)
A Taxonomy of Schedulers -- Operating Systems, Clusters and Big Data Frameworks
by: Sliwko, Leszek
Published: (2025)
by: Sliwko, Leszek
Published: (2025)
Advancing Polyglot Big Data Processing using the Hadoop ecosystem
by: Seabra, Antony, et al.
Published: (2025)
by: Seabra, Antony, et al.
Published: (2025)
Prink: $k_s$-Anonymization for Streaming Data in Apache Flink
by: Groneberg, Philip, et al.
Published: (2025)
by: Groneberg, Philip, et al.
Published: (2025)
Big Data Architecture for Large Organizations
by: Ismail, Fathima Nuzla, et al.
Published: (2025)
by: Ismail, Fathima Nuzla, et al.
Published: (2025)
Trustworthy Scheduling for Big Data Applications
by: Tomaras, Dimitrios, et al.
Published: (2026)
by: Tomaras, Dimitrios, et al.
Published: (2026)
StreamShield: A Production-Proven Resiliency Solution for Apache Flink at ByteDance
by: Fang, Yong, et al.
Published: (2026)
by: Fang, Yong, et al.
Published: (2026)
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning
by: Lyu, Chenghao, et al.
Published: (2024)
by: Lyu, Chenghao, et al.
Published: (2024)
Zero-Execution Retrieval-Augmented Configuration Tuning of Spark Applications
by: Suri, Raunaq, et al.
Published: (2025)
by: Suri, Raunaq, et al.
Published: (2025)
A TEE-based Approach for Preserving Data Secrecy in Process Mining with Decentralized Sources
by: Basile, Davide, et al.
Published: (2026)
by: Basile, Davide, et al.
Published: (2026)
On Efficiently Partitioning a Topic in Apache Kafka
by: Raptis, Theofanis P., et al.
Published: (2022)
by: Raptis, Theofanis P., et al.
Published: (2022)
An Integrated (Crop Model, Cloud and Big Data Analytic) Framework to support Agriculture Activity Monitoring System
by: Akhter, Shamim, et al.
Published: (2024)
by: Akhter, Shamim, et al.
Published: (2024)
OCEP: An Ontology-Based Complex Event Processing Framework for Healthcare Decision Support in Big Data Analytics
by: Chandra, Ritesh, et al.
Published: (2025)
by: Chandra, Ritesh, et al.
Published: (2025)
CONFINE: Preserving Data Secrecy in Decentralized Process Mining
by: Goretti, Valerio, et al.
Published: (2024)
by: Goretti, Valerio, et al.
Published: (2024)
Big Data-Driven Fraud Detection Using Machine Learning and Real-Time Stream Processing
by: Liu, Chen, et al.
Published: (2025)
by: Liu, Chen, et al.
Published: (2025)
Cyberattack Data Analysis in IoT Environments using Big Data
by: Patidar, Neelam, et al.
Published: (2024)
by: Patidar, Neelam, et al.
Published: (2024)
AMECOS: A Modular Event-based Framework for Concurrent Object Specification
by: Albouy, Timothé, et al.
Published: (2024)
by: Albouy, Timothé, et al.
Published: (2024)
DOLMA: A Data Object Level Memory Disaggregation Framework for HPC Applications
by: Zheng, Haoyu, et al.
Published: (2025)
by: Zheng, Haoyu, et al.
Published: (2025)
An OPC UA-based industrial Big Data architecture
by: Hirsch, Eduard, et al.
Published: (2023)
by: Hirsch, Eduard, et al.
Published: (2023)
bigMICE: Multiple Imputation of Big Data
by: Morvan, Hugo, et al.
Published: (2026)
by: Morvan, Hugo, et al.
Published: (2026)
Enhancing ASIC Technology Mapping via Parallel Supergate Computing
by: Cai, Ye, et al.
Published: (2024)
by: Cai, Ye, et al.
Published: (2024)
Edge-assisted Parallel Uncertain Skyline Processing for Low-latency IoE Analysis
by: Lai, Chuan-Chi, et al.
Published: (2025)
by: Lai, Chuan-Chi, et al.
Published: (2025)
Flora: Efficient Cloud Resource Selection for Big Data Processing via Job Classification
by: Will, Jonathan, et al.
Published: (2025)
by: Will, Jonathan, et al.
Published: (2025)
A Review of Ontology-Driven Big Data Analytics in Healthcare: Challenges, Tools, and Applications
by: Chandra, Ritesh, et al.
Published: (2025)
by: Chandra, Ritesh, et al.
Published: (2025)
Humas: A Heterogeneity- and Upgrade-aware Microservice Auto-scaling Framework in Large-scale Data Centers
by: Hua, Qin, et al.
Published: (2024)
by: Hua, Qin, et al.
Published: (2024)
BlazingAML: High-Throughput Anti-Money Laundering (AML) via Multi-Stage Graph Mining
by: Ye, Haojie, et al.
Published: (2026)
by: Ye, Haojie, et al.
Published: (2026)
Towards Serverless Processing of Spatiotemporal Big Data Queries
by: Baumann, Diana, et al.
Published: (2025)
by: Baumann, Diana, et al.
Published: (2025)
A unified framework to improve the interoperability between HPC and Big Data languages and programming models
by: Piñeiro, César, et al.
Published: (2021)
by: Piñeiro, César, et al.
Published: (2021)
Unveiling Crowdfunding Futures: Analyzing Campaign Outcomes through Distributed Models and Big Data Perspectives
by: Pipitò, Giuseppe, et al.
Published: (2024)
by: Pipitò, Giuseppe, et al.
Published: (2024)
EdgeMiner: Distributed Process Mining at the Data Sources
by: Andersen, Julia, et al.
Published: (2024)
by: Andersen, Julia, et al.
Published: (2024)
Similar Items
-
Distributed Record Linkage in Healthcare Data with Apache Spark
by: Heydari, Mohammad, et al.
Published: (2024) -
Performance comparison of Dask and Apache Spark on HPC systems for Neuroimaging
by: Dugré, Mathieu, et al.
Published: (2024) -
Large-Scale Network Embedding in Apache Spark
by: Lin, Wenqing
Published: (2021) -
Comparative analysis of large data processing in Apache Spark using Java, Python and Scala
by: Borodii, Ivan, et al.
Published: (2025) -
Demystifying Object-based Big Data Storage Systems
by: Mondal, Anindita Sarkar, et al.
Published: (2024)