NFDI4DS | UHH-SEMS - Publication Details

Learning Structured Sparsity in Deep Neural Networks

OPENALEX - Publications

Wei Wen Chunpeng Wu Yandan Wang Yiran Chen Hai Li

High demand for computation resources severely hinders deployment of large-scale Deep Neural Networks (DNN) in resource constrained devices. In this work, we propose a Structured Sparsity Learning (SSL) method to regularize the structures (i.e., filters, channels, filter shapes, and layer depth) DNNs. SSL can: (1) learn compact structure from bigger DNN reduce cost; (2) obtain hardware-friendly structured sparsity efficiently accelerate DNNs evaluation. Experimental results show that...

10.48550/arxiv.1608.03665 preprint EN other-oa arXiv (Cornell University) 2016-01-01

TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning

OPENALEX - Publications

Wei Wen Cong Xu Feng Yan Chunpeng Wu Yandan Wang and 2 more

High network communication cost for synchronizing gradients and parameters is the well-known bottleneck of distributed training. In this work, we propose TernGrad that uses ternary to accelerate deep learning in data parallelism. Our approach requires only three numerical levels {-1,0,1}, which can aggressively reduce time. We mathematically prove convergence under assumption a bound on gradients. Guided by bound, layer-wise ternarizing gradient clipping improve its convergence. experiments...

10.48550/arxiv.1705.07878 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Visual saliency detection by spatially weighted dissimilarity

OPENALEX - Publications

Lijuan Duan Chunpeng Wu Jun Miao Laiyun Qing Yu Fu

In this paper, a new visual saliency detection method is proposed based on the spatially weighted dissimilarity. We measured by integrating three elements as follows: dissimilarities between image patches, which were evaluated in reduced dimensional space, spatial distance patches and central bias. The inversely corresponding distance. A weighting mechanism, indicating bias for human fixations to center of image, was employed. principal component analysis (PCA) dimension reducing used our...

10.1109/cvpr.2011.5995676 article EN 2011-06-01

Coordinating Filters for Faster Deep Neural Networks

OPENALEX - Publications

Wei Wen Cong Xu Chunpeng Wu Yandan Wang Yiran Chen and 1 more

Very large-scale Deep Neural Networks (DNNs) have achieved remarkable successes in a large variety of computer vision tasks. However, the high computation intensity DNNs makes it challenging to deploy these models on resource-limited systems. Some studies used low-rank approaches that approximate filters by basis accelerate testing. Those works directly decomposed pre-trained Low-Rank Approximations (LRA). How train toward lower-rank space for more efficient DNNs, however, remains as an open...

10.1109/iccv.2017.78 article EN 2017-10-01

Handwritten Character Recognition by Alternately Trained Relaxation Convolutional Neural Network

OPENALEX - Publications

Chunpeng Wu Wei Fan Yuan He Jun Sun Satoshi Naoi

Deep learning methods have recently achieved impressive performance in the area of visual recognition and speech recognition. In this paper, we propose a hand- writing method based on relaxation convolutional neural network (R-CNN) alternately trained (ATR-CNN). Previous regularize CNN at full-connected layer or spatial-pooling layer, however, focus layer. The convolution adopted our R-CNN, unlike traditional does not require neurons within feature map to share same kernel, endowing with...

10.1109/icfhr.2014.56 article EN 2014-09-01

FPGA Acceleration of Recurrent Neural Network Based Language Model

OPENALEX - Publications

Sicheng Li Chunpeng Wu Hai Li Boxun Li Yu Wang and 1 more

Recurrent neural network (RNN) based language model (RNNLM) is a biologically inspired for natural processing. It records the historical information through additional recurrent connections and therefore very effective in capturing semantics of sentences. However, use RNNLM has been greatly hindered high computation cost training. This work presents an FPGA implementation framework training acceleration. At architectural level, we improve parallelism RNN scheme reduce computing resource...

10.1109/fccm.2015.50 article EN 2015-05-01

Supra‐Photothermal CO2 Methanation over Greenhouse‐Like Plasmonic Superstructures of Ultrasmall Cobalt Nanoparticles

OPENALEX - Publications

Mujin Cai Chaoran Li Xingda An Biqing Zhong Yuxuan Zhou and 12 more

Improving the solar-to-thermal energy conversion efficiency of photothermal nanomaterials at no expense other physicochemical properties, e.g., catalytic reactivity metal nanoparticles, is highly desired for diverse applications but remains a big challenge. Herein, synergistic strategy developed enhanced by greenhouse-like plasmonic superstructure 4 nm cobalt nanoparticles while maintaining their intrinsic reactivity. The silica shell plays key role in retaining superstructures efficient use...

10.1002/adma.202308859 article EN Advanced Materials 2023-11-06

MeDNN: A distributed mobile system with enhanced partition and deployment for large-scale DNNs

OPENALEX - Publications

Jiachen Mao Zhongda Yang Wei Wen Chunpeng Wu Linghao Song and 4 more

Deep Neural Networks (DNNs) are pervasively used in a significant number of applications and platforms. To enhance the execution efficiency large-scale DNNs, previous attempts focus mainly on client-server paradigms, relying powerful external infrastructure, or model compression, with complicated pre-processing phases. Though effective, these methods overlook optimization DNNs distributed mobile devices. In this work, we design implement MeDNN, local computing system enhanced partitioning...

10.1109/iccad.2017.8203852 article EN 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) 2017-11-01

Niche Applications of MXene Materials in Photothermal Catalysis

OPENALEX - Publications

Zhiyi Wu Jiahui Shen Chaoran Li Chengcheng Zhang Chunpeng Wu and 3 more

MXene materials have found emerging applications as catalysts for chemical reactions due to their intriguing physical and applications. In particular, broad light response strong photothermal conversion capabilities are likely render MXenes promising candidates catalysis, which is drawing increasing attention in both academic research industrial satisfy all three criteria of a desirable catalyst: absorption, effective heat management, versatile surface reactivity. However, specific...

10.3390/chemistry5010036 article EN cc-by Chemistry 2023-03-06

A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation

OPENALEX - Publications

Chunpeng Wu Wei Wen Tariq Afzal Yongmei Zhang Yiran Chen and 1 more

Recently, DNN model compression based on network architecture design, e.g., SqueezeNet, attracted a lot attention. No accuracy drop image classification is observed these extremely compact networks, compared to well-known models. An emerging question, however, whether techniques hurt DNNs learning ability other than classifying images single dataset. Our preliminary experiment shows that methods could degrade domain adaptation (DA) ability, though the performance preserved. Therefore, we...

10.1109/cvpr.2017.88 article EN 2017-07-01

Phosphorization-Induced “Fence Effect” on the Active Hydrogen Species Migration Enables Tunable CO2 Hydrogenation Selectivity

OPENALEX - Publications

Chunpeng Wu Jiahui Shen Xingda An Zhiyi Wu Shuairen Qian and 10 more

Incorporating phosphorus (P) into the active metals of a catalyst is an effective strategy to enhance catalytic performance. However, mechanisms underlying influence introduced species on performance remain largely unknown. Herein, we observe pronounced shift in product selectivity CO2 hydrogenation from CH4 CO upon introducing P Ru/SiO2 catalysts. This alteration attributed role as "fence" hindering migration H species. The adsorbed CO, key intermediate for methanation, preferentially...

10.1021/acscatal.4c00742 article EN ACS Catalysis 2024-05-17

Neuromorphic computing's yesterday, today, and tomorrow – an evolutional view

OPENALEX - Publications

Yiran Chen Hai Li Chunpeng Wu Chang Song Sicheng Li and 4 more

10.1016/j.vlsi.2017.11.001 article EN Integration 2017-11-29

MeDNN: a distributed mobile system with enhanced partition and deployment for large-scale DNNs

OPENALEX - Publications

Jiachen Mao Zhongda Yang Wei Wen Chunpeng Wu Linghao Song and 4 more

Deep Neural Networks (DNNs) are pervasively used in a significant number of applications and platforms. To enhance the execution efficiency large-scale DNNs, previous attempts focus mainly on client-server paradigms, relying powerful external infrastructure, or model compression, with complicated pre-processing phases. Though effective, these methods overlook optimization DNNs distributed mobile devices. In this work, we design implement MeDNN, local computing system enhanced partitioning...

10.5555/3199700.3199800 article EN International Conference on Computer Aided Design 2017-11-13

SmoothOut: Smoothing Out Sharp Minima to Improve Generalization in Deep Learning

OPENALEX - Publications

Wei Wen Yandan Wang Feng Yan Cong Xu Chunpeng Wu and 2 more

In Deep Learning, Stochastic Gradient Descent (SGD) is usually selected as a training method because of its efficiency; however, recently, problem in SGD gains research interest: sharp minima Neural Networks (DNNs) have poor generalization; especially, large-batch tends to converge minima. It becomes an open question whether escaping can improve the generalization. To answer this question, we propose SmoothOut framework smooth out DNNs and thereby nutshell, perturbs multiple copies DNN by...

10.48550/arxiv.1805.07898 preprint EN other-oa arXiv (Cornell University) 2018-01-01

A new learning method for inference accuracy, core occupation, and performance co-optimization on TrueNorth chip

OPENALEX - Publications

Wei Wen Chunpeng Wu Yandan Wang Kent W. Nixon Qing Wu and 3 more

IBM TrueNorth chip uses digital spikes to perform neuromorphic computing and achieves ultrahigh execution parallelism power efficiency. However, in chip, low quantization resolution of the synaptic weights significantly limits inference (e.g., classification) accuracy deployed neural network model. Existing workaround, i.e., averaging results over multiple copies instantiated spatial temporal domains, rapidly exhausts hardware resources slows down computation. In this work, we propose a...

10.1145/2897937.2897968 preprint EN 2016-05-25

MAT: A Multi-strength Adversarial Training Method to Mitigate Adversarial Attacks

OPENALEX - Publications

Chang Song Hsin-Pai Cheng Huanrui Yang Sicheng Li Chunpeng Wu and 3 more

Some recent work revealed that deep neural networks (DNNs) are vulnerable to so-called adversarial attacks where input examples intentionally perturbed fool DNNs. In this work, we revisit the DNN training process includes into dataset so as improve DNN's resilience attacks, namely, training. Our experiments show different strengths, i.e., perturbation levels of examples, have working ranges resist attacks. Based on observation, propose a multi-strength method (MAT) combines with strengths...

10.1109/isvlsi.2018.00092 preprint EN 2018-07-01

Understanding the design of IBM neurosynaptic system and its tradeoffs: A user perspective

OPENALEX - Publications

Hsin-Pai Cheng Wei Wen Chunpeng Wu Sicheng Li Hai Li and 1 more

As a large-scale commercial spiking-based neuromorphic computing platform, IBM TrueNorth processor received tremendous attentions in society. However, one of the known issues design is limited precision synaptic weights. The current workaround running multiple neural network copies which average value each weight close to that original network. We theoretically analyze impacts low data chip on inference accuracy, core occupation, and performance, present probability-biased learning method...

10.23919/date.2017.7926972 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2017-03-01

ApesNet: a pixel‐wise efficient segmentation network for embedded devices

OPENALEX - Publications

Chunpeng Wu Hsin‐Pai Cheng Sicheng Li Hai Li Yiran Chen

Road scene understanding and semantic segmentation is an on-going issue for computer vision. A precise can help a machine learning model understand the real world more accurately. In addition, well-designed efficient be used on source limited devices. The authors aim to implement high-level, in embedded device with finite power resources. Toward this goal, propose ApesNet, pixel-wise network which understands road scenes near real-time has achieved promising accuracy. key findings authors'...

10.1049/iet-cps.2016.0027 article EN cc-by IET Cyber-Physical Systems Theory & Applications 2016-11-12

Enhanced photochemical effects of plasmonic cluster catalysts through aggregated nanostructures

OPENALEX - Publications

Xu Hu Zhijie Zhu Yuxuan Zhou Shuang Liu Chunpeng Wu and 11 more

Here we present an effective strategy to achieve strongly enhanced catalytic activity of platinum–copper bimetallic clusters through augmented plasmonic photochemical effects aggregated nanostructure.

10.1039/d4gc00560k article EN Green Chemistry 2024-01-01

Hierarchical Carbon Nanocages as Superior Supports for Photothermal CO2 Catalysis

OPENALEX - Publications

Zhijie Chen Xudong Dong Zi‐Xuan Sun Xingda An Chaoran Li and 13 more

The exploitation of hierarchical carbon nanocages with superior light-to-heat conversion efficiency, together their distinct structural, morphological, and electronic properties, in photothermal applications could provide effective solutions to long-standing challenges diverse areas. Here, we demonstrate the discovery pristine nitrogen-doped as supports for highly loaded, small-sized Ru particles toward enhanced CO2 catalysis. A record CO production rate 3.1 mol·gRu-1·h-1 above 90%...

10.1021/acsnano.4c04691 article EN ACS Nano 2024-07-17

Coordinating Filters for Faster Deep Neural Networks

OPENALEX - Publications

Wei Wen Cong Xu Chunpeng Wu Yandan Wang Yiran Chen and 1 more

Very large-scale Deep Neural Networks (DNNs) have achieved remarkable successes in a large variety of computer vision tasks. However, the high computation intensity DNNs makes it challenging to deploy these models on resource-limited systems. Some studies used low-rank approaches that approximate filters by basis accelerate testing. Those works directly decomposed pre-trained Low-Rank Approximations (LRA). How train toward lower-rank space for more efficient DNNs, however, remains as an open...

10.48550/arxiv.1703.09746 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Detection of Front-View Vehicle with Occlusions Using AdaBoost

OPENALEX - Publications

Chunpeng Wu Lijuan Duan Jun Miao Faming Fang Xuebin Wang

In this paper, we propose a vehicle detection method based on AdaBoost. We focus the of front-view car and bus with occlusions highway. Samples different occlusion situations are selected into training set. By using basic rotated Haar-like features extracted from samples in set, train an AdaBoost-based cascade detector. The performance tests static images short time videos show that (1) our approach detects cars more effectively than buses (2) real-time video proceeds at 30 frames per second.

10.1109/iciecs.2009.5365582 article EN International Conference on Information Engineering and Computer Science 2009-12-01