NFDI4DS | UHH-SEMS - Publication Details

Efficient and effective Querying by Image Content

OPENALEX - Publications

Christos Faloutsos Ronald Barber Myron Flickner John W. Hafner W. Niblack and 2 more

10.1007/bf00962238 article EN Journal of Intelligent Information Systems 1994-07-01

DB2 with BLU acceleration

OPENALEX - Publications

Vijayshankar Raman Gopi Attaluri Ronald Barber Naresh Chainani David Kalmuk and 13 more

DB2 with BLU Acceleration deeply integrates innovative new techniques for defining and processing column-organized tables that speed read-mostly Business Intelligence queries by 10 to 50 times improve compression 3 times, compared traditional row-organized tables, without the complexity of indexes or materialized views on those tables. But is much more than just a column store. Exploiting frequency-based dictionary main-memory query technology from Blink project at IBM Research - Almaden,...

10.14778/2536222.2536233 article EN Proceedings of the VLDB Endowment 2013-08-01

Memory-efficient hash joins

OPENALEX - Publications

Ronald Barber Guy M. Lohman Ippokratis Pandis Venkat Raman Richard Sidle and 4 more

We present new hash tables for joins, and a join based on them, that consumes far less memory is usually faster than recently published in-memory joins. Our not restricted to outer fit wholly in memory. Key this concise table (CHT), linear probing has 100% fill factor, uses sparse bitmap with embedded population counts almost entirely avoid collisions. This also serves as Bloom filter use multi-table study the random access characteristics of renew case non-partitioned introduce variant...

10.14778/2735496.2735499 article EN Proceedings of the VLDB Endowment 2014-12-01

Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations

OPENALEX - Publications

Yingjun Wu Jia Yu Yuanyuan Tian Richard Sidle Ronald Barber

Database administrators construct secondary indexes on data tables to accelerate query processing in relational database management systems (RDBMSs). These are built top of the most frequently queried columns according statistics. Unfortunately, maintaining multiple same can be extremely space consuming, causing significant performance degradation due potential exhaustion memory space. In this paper, we demonstrate that there exist many opportunities exploit column correlations for...

10.1145/3299869.3319861 article EN Proceedings of the 2022 International Conference on Management of Data 2019-06-18

In-memory BLU acceleration in IBM's DB2 and dashDB: Optimized for modern workloads and hardware architectures

OPENALEX - Publications

Ronald Barber Guy M. Lohman Vijayshankar Raman Richard Sidle Sam Lightstone and 1 more

Although the DRAM for main memories of systems continues to grow exponentially according Moore's Law and become less expensive, we argue that memory hierarchies will always exist many reasons, both economic practical, in particular due concurrent users competing working perform joins grouping. We present in-memory BLU Acceleration used IBM's DB2 Linux, UNIX, Windows, now also dashDB cloud offering, which was designed implemented from ground up exploit but is not limited what fits does...

10.1109/icde.2015.7113372 article EN 2015-04-01

Wildfire

OPENALEX - Publications

Ronald Barber Matt Huras Guy M. Lohman C. Mohan Rene Mueller and 8 more

We demonstrate Hybrid Transactional and Analytics Processing (HTAP) on the Spark platform by Wildfire prototype, which can ingest up to ~6 million inserts per second node simultaneously perform complex SQL analytics queries. Here, a simplified mobile application uses recommend advertising customers based upon their distance from stores interest in products sold these stores, while continuously graphing results as those move respond ads with purchases.

10.1145/2882903.2899406 article EN Proceedings of the 2022 International Conference on Management of Data 2016-06-16

Joins on encoded and partitioned data

OPENALEX - Publications

Jae-Gil Lee Gopi Attaluri Ronald Barber Naresh Chainani Oliver Draese and 14 more

Compression has historically been used to reduce the cost of storage, I/Os from that and buffer pool utilization, at expense CPU required decompress data every time it is queried. However, significant additional efficiencies can be achieved by deferring decompression as late in query processing possible performing operations directly on still-compressed data. In this paper, we investigate benefits challenges joins compressed (or encoded) We demonstrate benefit independently optimizing...

10.14778/2733004.2733008 article EN Proceedings of the VLDB Endowment 2014-08-01

Query by image content using multiple objects and multiple features: user interface issues

OPENALEX - Publications

D. Lee Ronald Barber W. Niblack Myron Flickner John W. Hafner and 1 more

On-line collections of images are growing larger and more common, tools needed to efficiently manage, organize, navigate through them. The authors have developed a prototype system called QBIC which allows complex multi-object multi-feature queries large image databases. based on content-the colors, textures, shapes, positions the objects/regions they contain. computes numeric features represent properties uses similarity measures these for retrieval. focus paper is its user interface...

10.1109/icip.1994.413534 article EN 2002-12-17

Indexing for complex queries on a query-by-content image database

OPENALEX - Publications

D. Lee Ronald Barber W. Niblack Myron Flickner John W. Hafner and 1 more

We describe how the QBIC (Query By Image Content) system handles "multi-*" queries-queries on large image collections involving multifeatures of each as a whole and multiple objects within image. The queries are based properties content-such colors, textures, shapes, edges. computes set features to above properties, uses distance-like measures provide similarity retrieval, has graphical interface that enable users pose visually. In this paper, we present indexing algorithms allow these run...

10.1109/icpr.1994.576246 article EN 2002-12-17

Db2 event store

OPENALEX - Publications

Christian Garcia-Arellano Hamdi Roumani Richard Sidle Josh Tiefenbach Kostas Rakopoulos and 14 more

The requirements of Internet Things (IoT) workloads are unique in the database space. While significant effort has been spent over last decade rearchitecting OLTP and Analytics for public cloud, little done to rearchitect IoT cloud. In this paper we present IBM Db2 Event Store ™ , a cloud-native system designed specifically workloads, which require extremely high-speed ingest, efficient open data storage, near real-time analytics. Additionally, by leveraging SQL compiler, optimizer runtime,...

10.14778/3415478.3415552 article EN Proceedings of the VLDB Endowment 2020-08-01

Efficient query by image content for very large image databases

OPENALEX - Publications

Ronald Barber W. Equitz W. Flickner W. Niblack Dragutin Petković and 1 more

The QBIC (query by image content) project in the IBM Almaden Research Center San Jose, CA, is conducting a theoretical, experimental, and prototyping study of problem querying large still databases efficiently based on content. Since difficult, aim to discover general principles, but at same time identify target application(s) for which concrete pilot systems will be prototyped. A number algorithms have been developed that allow user search color, texture, shape. can focused either objects...

10.1109/cmpcon.1993.289627 article EN 2002-12-30

Ultimedia Manager: Query By Image Content and its applications

OPENALEX - Publications

Ronald Barber Myron Flickner John W. Hafner D. Lee W. Niblack and 23 more

IBM Almaden Research Center's project on Query By Image Content (QBIC) is studying means to retrieve images from large image databases using contents such as color, texture, shape and layout. In this paper, we describe the beta version of PC-based Ultimedia Manager product, which based QBIC technology. We outline product philosophy give a demonstration current version. The expected be announced soon, together with an OEM offering search query engine.< <ETX...

10.1109/cmpcon.1994.282889 article EN 2002-12-17

WiSer: A Highly Available HTAP DBMS for IoT Applications

OPENALEX - Publications

Ronald Barber Christian Garcia-Arellano Ronen Grosman Guy M. Lohman C. Mohan and 8 more

In a classic transactional distributed database management system (DBMS), write transactions invariably synchronize with coordinator before final commitment. While enforcing serializability, this model has long been criticized for not satisfying the applications' availability requirements. When entering era of Internet Things (IoT), problem become more severe, as an increasing number applications call capability hybrid and analytical processing (HTAP), where aggregation constraints need to...

10.1109/bigdata47090.2019.9006519 article EN 2021 IEEE International Conference on Big Data (Big Data) 2019-12-01

Native Cloud Object Storage in Db2 Warehouse: Implementing a Fast and Cost-Efficient Cloud Storage Architecture

OPENALEX - Publications

David Kalmuk Christian Garcia-Arellano Ronald Barber Richard Sidle Kostas Rakopoulos and 17 more

Database systems built on traditional storage subsystems typically store their data in small blocks referred to as pages (commonly sized a multiple of 4KB for historical reasons). These subsystems, example network attached block storage, were designed efficient random-access I/O patterns at the level, and size is usually configurable by application based its needs. For large scale analytic databases cloud environments, these are not cost effective when compared object database that exploit...

10.1145/3626246.3653393 article EN 2024-05-23

HERMIT in action

OPENALEX - Publications

Yingjun Wu Jia Yu Yuanyuan Tian Richard Sidle Ronald Barber

Database administrators construct secondary indexes on data tables to accelerate query processing in relational database management systems (RDBMSs). These are built top of the most frequently queried columns according statistics. Unfortunately, maintaining multiple same can be extremely space consuming, causing significant performance degradation due potential exhaustion memory space. However, we find that there indeed exist many opportunities save storage by exploiting column correlations....

10.14778/3352063.3352090 article EN Proceedings of the VLDB Endowment 2019-08-01

Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations (Extended Version)

OPENALEX - Publications

Yingjun Wu Jia Yu Yuanyuan Tian Richard Sidle Ronald Barber

Database administrators construct secondary indexes on data tables to accelerate query processing in relational database management systems (RDBMSs). These are built top of the most frequently queried columns according statistics. Unfortunately, maintaining multiple same can be extremely space consuming, causing significant performance degradation due potential exhaustion memory space. In this paper, we demonstrate that there exist many opportunities exploit column correlations for...

10.48550/arxiv.1903.11203 preprint EN other-oa arXiv (Cornell University) 2019-01-01

WiSer: A Highly Available HTAP DBMS for IoT Applications

OPENALEX - Publications

Ronald Barber Christian Garcia-Arellano Ronen Grosman Guy M. Lohman C. Mohan and 8 more

In a classic transactional distributed database management system (DBMS), write transactions invariably synchronize with coordinator before final commitment. While enforcing serializability, this model has long been criticized for not satisfying the applications' availability requirements. When entering era of Internet Things (IoT), problem become more severe, as an increasing number applications call capability hybrid and analytical processing (HTAP), where aggregation constraints need to...

10.48550/arxiv.1908.01908 preprint EN other-oa arXiv (Cornell University) 2019-01-01

System for the Evaluation and Classification of Imperfections in Auto Bodywork

OPENALEX - Publications

Ronald Barber

Quality aspects are more important every day, since they have a major impact on the final product. The automobile industry is not unaware of this fact, being an issue car’s sheet quality analysis. Nowadays, most systems for analysis implemented outside production line and performed manually. In work, autonomous system proposed in order to enable automatic quantification classification imperfections produced sheets composing auto bodywork due squeezing process. consists motorized capture...

10.4172/2167-7670.1000126 article EN cc-by Advances in Automobile Engineering 2015-01-01

Go, server, go!

OPENALEX - Publications

Ronald Barber Guy M. Lohman Rene Mueller Ippokratis Pandis Venkat Raman and 1 more

In data centers today, servers are stationary and flows on a hierarchical network of switches routers. But such static server arrangements require very scalable networks, many applications bottlenecked by bandwidth. addition, density is kept low to enable maintenance upgrades, as well increase air flow. this paper, we propose design in which move physically, communicate via point-to-point connections (instead switches). We argue that allows transfer bandwidth scale linearly with the number...

10.1145/2523616.2523634 article EN 2013-10-01