NFDI4DS | UHH-SEMS - Publication Details

The pochoir stencil compiler

OPENALEX - Publications

Yuan Tang Rezaul Chowdhury Bradley C. Kuszmaul Chi-Keung Luk Charles E. Leiserson

A stencil computation repeatedly updates each point of a d-dimensional grid as a function of itself and its near neighbors. Parallel cache-efficient stencil algorithms based on "trapezoidal decompositions" are known, but most programmers find them difficult to write. The Pochoir stencil compiler allows a programmer to write a simple specification of a stencil in a domain-specific stencil language embedded in C++, which the Pochoir compiler then translates into high-performing Cilk code that employs an efficient parallel cache-oblivious algorithm. Pochoir supports...

10.1145/1989493.1989508 article EN 2011-06-04

Airborne LiDAR Assisted Obstacle Recognition and Intrusion Detection Towards Unmanned Aerial Vehicle: Architecture, Modeling and Evaluation

Yiming Miao Yuan Tang Bander Alzahrani Ahmed Barnawi Tarik Alafif and 1 more

With the rapid development of wireless communication and flight control technologies, unmanned aerial vehicles (UAVs) have been widely used in multiple application scenarios. A typical scenario is massive crowd management in the multi-million annual Hajj pilgrimage to Mecca, where UAVs are utilized to conduct monitoring by carrying sensory devices. The safe flight of a UAV is crucial for ensuring the successful execution of missions. With the aim to overcome the disadvantage caused by ground station intrusion detection, the combination...

10.1109/tits.2020.3023189 article EN IEEE Transactions on Intelligent Transportation Systems 2020-10-01

A joint global and local path planning optimization for UAV task scheduling towards crowd air monitoring

Yuan Tang Yiming Miao Ahmed Barnawi Bander Alzahrani Reem Alotaibi and 1 more

10.1016/j.comnet.2021.107913 article EN Computer Networks 2021-02-09

Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency

Yuan Tang Ronghui You Haibin Kan Jesmin Jahan Tithi Pramod Ganapathi and 1 more

State-of-the-art cache-oblivious parallel algorithms for dynamic programming (DP) problems usually guarantee asymptotically optimal cache performance without any tuning of cache parameters, but they often fail to exploit the theoretically best parallelism at the same time. While these algorithms achieve cache-optimality through the use of a recursive divide-and-conquer (DAC) strategy, scheduling tasks at the granularity of task dependency introduces artificial dependencies in addition to those arising from the defining recurrence...

10.1145/2688500.2688514 article EN 2015-01-24

Intelligent Fabric Enabled 6G Semantic Communication System for In-Cabin Scenarios

Yuan Tang Ning Zhou Yu Qiao Di Wu Chong Hou and 2 more

With the large-scale commercialization of 5G, the global industry has started the exploration of the next generation of mobile communication technology (6G). From the Internet, to IoT, and then to the smart connection of everything, 6G will transform from 5G's service objects of people and things to an intelligent networking agent that supports human–machine–object. 6G networks should have the characteristics of ubiquitous intelligence and perception, which poses challenges for network construction. Therefore, we propose a Semantic Communication...

10.1109/tits.2022.3174704 article EN IEEE Transactions on Intelligent Transportation Systems 2022-06-03

Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model

Xu Han Yuan Tang Zhaoxuan Wang Xianzhi Li

10.1145/3664647.3681173 article EN 2024-10-26

AUTOGEN

Rezaul Chowdhury Pramod Ganapathi Jesmin Jahan Tithi Charles Bachmeier Bradley C. Kuszmaul and 3 more

We present AUTOGEN, an algorithm that for a wide class of dynamic programming (DP) problems automatically discovers highly efficient cache-oblivious parallel recursive divide-and-conquer algorithms from inefficient iterative descriptions of DP recurrences. AUTOGEN analyzes the set of DP table locations accessed by the iterative algorithm when run on a DP table of small size, and automatically identifies a recursive access pattern and a corresponding provably correct recursive algorithm for solving the DP recurrence. We use AUTOGEN to autodiscover efficient algorithms for several well-known problems. Our experimental results show...

10.1145/2851141.2851167 article EN 2016-02-22

Extending the Nested Parallel Model to the Nested Dataflow Model with Provably Efficient Schedulers

David Dinh Harsha Vardhan Simhadri Yuan Tang

The nested parallel (a.k.a. fork-join) model is widely used for writing parallel programs. However, the two composition constructs, i.e. "||" (parallel) and ";" (serial), that comprise the nested-parallel model are insufficient in expressing "partial dependencies" in a program. We propose a new dataflow composition construct "↝" to express partial dependencies in algorithms in a processor- and cache-oblivious way, thus extending the Nested Parallel (NP) model to the Nested Dataflow (ND) model. We redesign several divide-and-conquer algorithms ranging from dense linear algebra...

10.1145/2935764.2935797 article EN 2016-07-08

Provably Efficient Scheduling of Cache-oblivious Wavefront Algorithms

Rezaul Chowdhury Pramod Ganapathi Yuan Tang Jesmin Jahan Tithi

Iterative wavefront algorithms for evaluating dynamic programming recurrences exploit optimal parallelism but show poor cache performance. Tiled-iterative wavefront algorithms achieve optimal cache complexity and high parallelism but are cache-aware and hence are not portable and not cache-adaptive. On the other hand, standard cache-oblivious recursive divide-and-conquer algorithms have optimal serial cache complexity but often have low parallelism due to artificial dependencies among subtasks. Recently, we introduced cache-oblivious wavefront (COW) algorithms, which do not have any artificial dependencies, but they are too complicated to develop, analyze, implement,...

10.1145/3087556.3087586 article EN 2017-07-20

Semantic-Driven Efficient Service Network Towards Smart Healthcare System in Intelligent Fabric

Wenjing Xiao Yuan Tang Jia Liu Di Wu Bander Alzahrani and 2 more

The advantages of intelligent fabrics enable health service systems to perceive and transmit data more efficiently and safely, where massive sensitive data related to behavior are collected through the multi-source flexible sensor network. However, due to its complex network structure, such systems face the challenge of low-power transmission and processing in the fabric space. Recently, deep learning-powered semantic communication (DeepSC) has emerged as a promising paradigm. To tackle the above problem, we introduce the concept of cognition into...

10.1109/tnse.2022.3202901 article EN IEEE Transactions on Network Science and Engineering 2022-09-05

Point-LGMask: Local and Global Contexts Embedding for Point Cloud Pre-Training With Multi-Ratio Masking

Yuan Tang Xianzhi Li Jinfeng Xu Yu Qiao Long Hu and 2 more

Self-supervised learning has achieved great success in both natural language processing and 2D vision, where masked modeling is a quite popular pre-training scheme. However, extending masking to 3D point cloud understanding that combines local and global features poses a new challenge. In our work, we present Point-LGMask, a novel method to embed local and global contexts with multi-ratio masking, which is effective for self-supervised feature learning of point clouds but unfortunately ignored by existing works. Specifically, to avoid...

10.1109/tmm.2023.3282568 article EN IEEE Transactions on Multimedia 2023-06-08

Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model

Xu Han Yuan Tang Zhaoxuan Wang Xianzhi Li

Existing Transformer-based models for point cloud analysis suffer from quadratic complexity, leading to compromised point cloud resolution and information loss. In contrast, the newly proposed Mamba model, based on state space models (SSM), outperforms Transformer in multiple areas with only linear complexity. However, the straightforward adoption of Mamba does not achieve satisfactory performance on point cloud tasks. In this work, we present Mamba3D, a state space model tailored for point cloud learning to enhance local feature extraction, achieving superior...

10.48550/arxiv.2404.14966 preprint EN arXiv (Cornell University) 2024-04-23

Cross-Domain Feature Semantic Calibration for Zero-Shot Sketch-Based Image Retrieval

Xuewan He Jielei Wang Qianxin Xia Guoming Lu Yuan Tang and 1 more

10.1109/icme57554.2024.10687519 article EN 2024 IEEE International Conference on Multimedia and Expo (ICME) 2024-07-15

Autogen

Rezaul Chowdhury Pramod Ganapathi Stephen L. Tschudi Jesmin Jahan Tithi Charles Bachmeier and 4 more

We present Autogen, an algorithm that for a wide class of dynamic programming (DP) problems automatically discovers highly efficient cache-oblivious parallel recursive divide-and-conquer algorithms from inefficient iterative descriptions of DP recurrences. Autogen analyzes the set of DP table locations accessed by the iterative algorithm when run on a DP table of small size and automatically identifies a recursive access pattern and a corresponding provably correct recursive algorithm for solving the DP recurrence. We use Autogen to autodiscover efficient algorithms for several well-known problems. Our experimental results show...

10.1145/3125632 article EN ACM Transactions on Parallel Computing 2017-03-31

Improving Parallelism of Recursive Stencil Computations without Sacrificing Cache Performance

Yuan Tang Ronghui You Haibin Kan Jesmin Jahan Tithi Pramod Ganapathi and 1 more

The state-of-the-art "trapezoidal decomposition algorithm" for stencil computations on modern multicore machines uses recursive divide-and-conquer (DAC) to achieve asymptotically optimal cache complexity cache-obliviously. But the same DAC approach restricts parallelism by introducing artificial dependencies among subtasks in addition to those arising from the defining stencil equations. As a result, the trapezoidal decomposition algorithm has suboptimal parallelism.

10.1145/2686745.2686752 article EN 2014-10-20

Cache-oblivious wavefront: improving parallelism of recursive dynamic programming algorithms without losing cache-efficiency

Yuan Tang Ronghui You Haibin Kan Jesmin Jahan Tithi Pramod Ganapathi and 1 more

State-of-the-art cache-oblivious parallel algorithms for dynamic programming (DP) problems usually guarantee asymptotically optimal cache performance without any tuning of cache parameters, but they often fail to exploit the theoretically best parallelism at the same time. While these algorithms achieve cache-optimality through the use of a recursive divide-and-conquer (DAC) strategy, scheduling tasks at the granularity of task dependency introduces artificial dependencies in addition to those arising from the defining recurrence...

10.1145/2858788.2688514 article EN ACM SIGPLAN Notices 2015-01-24

Weight Balancing on Boundaries and Skeletons

Luis Barba Otfried Cheong Jean-Lou De Carufel Michael Gene Dobbins Rudolf Fleischer and 8 more

Given a polygonal region containing a target point (which we assume is the origin), it is not hard to see that there are two points on the perimeter that are antipodal, i.e., whose midpoint is the origin. We prove three generalizations of this fact. (1) For any polygon (or any bounded closed region with connected boundary) containing the origin, it is possible to place a given set of weights on the boundary so that their barycenter (center of mass) coincides with the origin, provided that the largest weight does not exceed the sum of the other weights. (2) On the boundary of any 3-dimensional polyhedron containing the origin, there exist three points that form an equilateral...

10.1145/2582112.2582142 article EN 2014-05-02

AUTOGEN

Rezaul Chowdhury Pramod Ganapathi Jesmin Jahan Tithi Charles Bachmeier Bradley C. Kuszmaul and 3 more

We present AUTOGEN, an algorithm that for a wide class of dynamic programming (DP) problems automatically discovers highly efficient cache-oblivious parallel recursive divide-and-conquer algorithms from inefficient iterative descriptions of DP recurrences. AUTOGEN analyzes the set of DP table locations accessed by the iterative algorithm when run on a DP table of small size, and automatically identifies a recursive access pattern and a corresponding provably correct recursive algorithm for solving the DP recurrence. We use AUTOGEN to autodiscover efficient algorithms for several well-known problems. Our experimental results show...

10.1145/3016078.2851167 article EN ACM SIGPLAN Notices 2016-02-27

Active Perception in Non-Visual Recognition Environments by Stretchable Tentacle Sensor Arrays

Yamei Yue Yuan Tang Qi Wang Wenjing Xiao Jia Liu and 4 more

Smoke, fog, or other light-interference environments present an intrinsic obstruction for visual recognition techniques to explore objects and surroundings. Alternatively, tactile perceptions, rather than visual observations, are commonly used by burrowing and deep-sea animals to communicate with their environments. Bio-inspired by this natural wisdom, here, we demonstrate stretchable tentacle sensor arrays, which can recognize surrounding objects located in non-visual conditions such as smoke and dark environments. Each is composed...

10.1021/acsami.2c04717 article EN ACS Applied Materials & Interfaces 2022-06-06

PDF: A Probability-Driven Framework for Open World 3D Point Cloud Semantic Segmentation

Jinfeng Xu Siyuan Yang Xianzhi Li Yuan Tang Yixue Hao and 2 more

Existing point cloud semantic segmentation networks cannot identify unknown classes and update their knowledge, due to a closed-set and static perspective of the real world, which would induce the intelligent agent to make bad decisions. To address this problem, we propose a Probability-Driven Framework (PDF) for open world semantic segmentation that includes (i) a lightweight U-decoder branch to identify unknown classes by estimating the uncertainties, (ii) a flexible pseudo-labeling scheme to supply geometry features along with probability distribution...

10.48550/arxiv.2404.00979 preprint EN arXiv (Cornell University) 2024-04-01

Networking strategy of small hydropower micro network under weak communication conditions

Zifan Zhang Zhanhong Liang Yuan Tang Zhifeng Chen Na Shen and 1 more

A microgrid with small hydropower as the main power source can absorb or transmit power to the grid during grid-connected operation. When the microgrid is operating off-grid due to sudden reasons such as line faults, it is necessary to remove power sources or loads to maintain the frequency stability of the microgrid. Due to the difficulty of achieving high-quality communication in mountainous areas where small hydropower is concentrated, a common solution is to use high-frequency generation tripping and low-frequency load shedding without communication. This...

10.21203/rs.3.rs-5244412/v1 preprint EN cc-by Research Square (Research Square) 2024-10-18

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Yuan Tang Xu Han Xianzhi Li Yu Qiao Yixue Hao and 2 more

10.1145/3664647.3681257 article EN 2024-10-26

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors

Yuan Tang Xu Han Xianzhi Li Qiao Yu Yixue Hao and 2 more

Large 2D vision-language models (2D-LLMs) have gained significant attention by bridging Large Language Models (LLMs) with images using a simple projector. Inspired by their success, large 3D point cloud-language models (3D-LLMs) also integrate point clouds into LLMs. However, directly aligning point clouds with LLMs requires expensive training costs, typically in hundreds of GPU-hours on A100, which hinders the development of 3D-LLMs. In this paper, we introduce MiniGPT-3D, an efficient and powerful 3D-LLM that achieves multiple SOTA...

10.48550/arxiv.2405.01413 preprint EN arXiv (Cornell University) 2024-05-02

JIMR: Joint Semantic and Geometry Learning for Point Scene Instance Mesh Reconstruction

Yu Qiao Xianzhi Li Yuan Tang Jinfeng Xu Long Hu and 2 more

Point scene instance mesh reconstruction is a challenging task since it requires both scene-level instance segmentation and instance-level mesh reconstruction from partial observations simultaneously. Previous works either adopt a detection backbone or a segmentation one, and then directly employ a mesh reconstruction network to produce complete meshes from incomplete point clouds. To further boost the mesh quality with both local details and global smoothness, in this work, we propose JIMR, a joint framework with two cascaded stages for semantic and geometry understanding. In the first stage,...

10.1109/tvcg.2024.3398737 article EN IEEE Transactions on Visualization and Computer Graphics 2024-01-01