NFDI4DS | UHH-SEMS - Publication Details

Minguk Choi

ORCID: 0009-0003-4030-9344

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5051718290

Research Areas

Data Management and Algorithms
Algorithms and Data Compression
Explainable Artificial Intelligence (XAI)
Distributed systems and fault tolerance
Advanced Database Systems and Queries
Data Analysis with R
Data Stream Mining Techniques
Machine Learning and Data Classification
Data Quality and Management
Data Mining Algorithms and Applications

Dankook University
2023-2025

Reproducibility Report for ACM SIGMOD 2024 Paper: 'Can Learned Indexes be Built Efficiently? A Deep Dive into Sampling Trade-offs'

OPENALEX - Publications

Yihao Liu Yuanhui Luo Minguk Choi

In general, most of the main experiments in paper ''Can Learned Indexes be Built Efficiently? A Deep Dive into Sampling Trade-offs'' can successfully reproduced.

10.1145/3687998.3717055 article EN cc-by 2025-03-21

Reproducibility Report for ACM SIGMOD 2024 Paper: 'Optimizing Distributed Protocols with Query Rewrites'

OPENALEX - Publications

Atanu Saha Minguk Choi Qiaolin Yu David Chu

The authors provided detailed descriptions for setting up the re- quired system and performing experiments using individual scripts provided. While required considerable effort time, instructions facilitated evaluating results generating graphs that reproduced same patterns claimed by in paper Optimizing Distributed Protocols with Query Rewrites

10.1145/3687998.3717041 article EN cc-by 2025-03-21

An Empirical Study of Segmented Linear Regression Search in LevelDB

OPENALEX - Publications

Agung Rahmat Ramadhan Minguk Choi Yoojin Chung Jongmoo Choi

The purpose of this paper is proposing a novel search mechanism, called SLR (Segmented Linear Regression) search, based on the concept learned index. It motivated by our observation that lot big data, collected and used previous studies, have linearity property, meaning keys their stored locations show strong linear correlation. This leads us to design where we apply segmentation into well-known machine learning algorithm, regression, for identifying location from given key. We devise two...

10.3390/electronics12041018 article EN Electronics 2023-02-17

Can Learned Indexes be Built Efficiently? A Deep Dive into Sampling Trade-offs

OPENALEX - Publications

Minguk Choi Seehwan Yoo Jongmoo Choi

By embedding the distribution of keys in indexing structure, learned indexes can minimize index size and maximize lookup performance. Yet, one problems present is long index-building time. The conventional requires a complete traversal entire dataset, which makes it less practical than traditional index. This paper challenges efficiency build time to make practical. Our approach for time-efficient employ sampled learning. In this paper, we two error-bounded sampling schemes: Sample EB-PLA,...

10.1145/3654919 article EN Proceedings of the ACM on Management of Data 2024-05-29

Coming Soon ...