NFDI4DS | UHH-SEMS - Publication Details

Regularizing Activation Distribution for Training Binarized Deep Networks

OPENALEX - Publications

Ruizhou Ding Ting-Wu Chin Zeye Liu Diana Marculescu

Binarized Neural Networks (BNNs) can significantly reduce the inference latency and energy consumption in resource-constrained devices due to their pure-logical computation fewer memory accesses. However, training BNNs is difficult since activation flow encounters degeneration, saturation, gradient mismatch problems. Prior work alleviates these issues by increasing bits adding floating-point scaling factors, thereby sacrificing BNN's efficiency. In this paper, we propose use distribution...

10.1109/cvpr.2019.01167 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Quantized deep neural networks for energy efficient hardware-based inference

OPENALEX - Publications

Ruizhou Ding Zeye Liu R.D. Blanton Diana Marculescu

Deep Neural Networks (DNNs) have been adopted in many systems because of their higher classification accuracy, with custom hardware implementations great candidates for high-speed, accurate inference. While progress achieving large scale, highly DNNs has made, significant energy and area are required due to massive memory accesses computations. Such demands pose a challenge any DNN implementation, yet it is more natural handle platform. To alleviate the increased demand storage energy,...

10.1109/aspdac.2018.8297274 article EN 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC) 2018-01-01

LightNN

OPENALEX - Publications

Ruizhou Ding Zeye Liu Rongye Shi Diana Marculescu R.D. Blanton

Application-specific integrated circuit (ASIC) implementations for Deep Neural Networks (DNNs) have been adopted in many systems because of their higher classification speed. However, although they may be characterized by better accuracy, larger DNNs require significant energy and area, thereby limiting wide adoption. The consumption is driven both memory accesses computation. Binarized (BNNs), as a trade-off between accuracy consumption, can achieve great reduction, good large due to its...

10.1145/3060403.3060465 article EN Proceedings of the Great Lakes Symposium on VLSI 2022 2017-05-10

Quantized deep neural networks for energy efficient hardware-based inference

OPENALEX - Publications

Ruizhou Ding Zeye Liu R.D. Blanton Diana Marculescu

Deep Neural Networks (DNNs) have been adopted in many systems because of their higher classification accuracy, with custom hardware implementations great candidates for high-speed, accurate inference. While progress achieving large scale, highly DNNs has made, significant energy and area are required due to massive memory accesses computations. Such demands pose a challenge any DNN implementation, yet it is more natural handle platform. To alleviate the increased demand storage energy,...

10.5555/3201607.3201609 article EN Asia and South Pacific Design Automation Conference 2018-01-22

Characterization of Locked Sequential Circuits via ATPG

OPENALEX - Publications

Danielle Duvalsaint Zeye Liu Ananya Ravikumar R.D. Blanton

Hardware security-related threats such as the insertion of malicious circuits, overproduction, and reverse engineering are increasing concern in IC industry. To mitigate these threats, various design-for-trust techniques have been developed, including sequential logic locking. Sequential locking protects a non-scanned design by employing key-controlled entrance FSM, transitions, or combination both techniques. Current methods for characterizing (attacking) security sequentially locked...

10.1109/itc-asia.2019.00030 article EN 2019-09-01

FLightNNs

OPENALEX - Publications

Ruizhou Ding Zeye Liu Ting-Wu Chin Diana Marculescu R.D. Blanton

To improve the throughput and energy efficiency of Deep Neural Networks (DNNs) on customized hardware, lightweight neural networks constrain weights DNNs to be a limited combination (denoted as k ϵ {1, 2}) powers 2. In such networks, multiply-accumulate operation can replaced with single shift operation, or two shifts an add operation. provide even more design flexibility, for each convolutional filter optimally chosen instead being fixed every filter. this paper, we formulate selection...

10.1145/3316781.3317828 article EN 2019-05-23

Lightening the Load with Highly Accurate Storage- and Energy-Efficient LightNNs

OPENALEX - Publications

Ruizhou Ding Zeye Liu R.D. Blanton Diana Marculescu

Hardware implementations of deep neural networks (DNNs) have been adopted in many systems because their higher classification speed. However, while they may be characterized by better accuracy, larger DNNs require significant energy and area, thereby limiting wide adoption. The consumption is driven both memory accesses computation. Binarized (BNNs), as a tradeoff between accuracy consumption, can achieve great reduction good for large due to regularization effect. BNNs show poor when...

10.1145/3270689 article EN ACM Transactions on Reconfigurable Technology and Systems 2018-09-30

Achieving 100% Cell-Aware Coverage by Design

OPENALEX - Publications

Zeye Liu Ben Niewenhuis Soumya Mittal R. D. Blanton

A comprehensive investigation of new integrated circuit design and fabrication technologies is crucial for yielding reliable parts. Prior work proposed a novel logic characterization vehicle called the Carnegie Mellon Logic Characterization Vehicle (CM-LCV), an implementation flow that ensures test chip to be product-like with near optimal testability diagnosability. This describes enhanced methodology CM-LCV not only guarantees 100% intra-cell defect all standard cells but also reflects...

10.3850/9783981537079_0329 article EN 2016-01-01

Test chip design for optimal cell-aware diagnosability

OPENALEX - Publications

Soumya Mittal Zeye Liu Ben Niewenhuis R.D. Blanton

Rapid yield learning in a new manufacturing process via test chips is greatly enhanced with "Design for Diagnosis" methodology. Prior work on logic-based chip design demonstrated an implementation flow that ensures 100% intra-cell fault coverage using minimal set. However, testability alone does not guarantee good diagnosability. Since diagnosis inherently function of design, it crucial the defect-level resolution and accuracy. This describes methodology Carnegie-Mellon Logic...

10.1109/test.2016.7805850 article EN 2016-11-01

CompactNet: High Accuracy Deep Neural Network Optimized for On-Chip Implementation

OPENALEX - Publications

Abhinav Goel Zeye Liu R.D. Blanton

Recent research has focused on Deep Neural Networks (DNNs) implemented directly in hardware. However, larger DNNs require significant energy and area, thereby limiting their wide adoption. We propose a novel DNN quantization technique corresponding hardware solution, CompactNet that optimizes the use of resources even further, through dynamic allocation memory for each parameter. Experimental results MNIST CIFAR-10 datasets, show reduces requirement by over 80%, 12-fold, area 7-fold, when...

10.1109/bigdata.2018.8622329 article EN 2021 IEEE International Conference on Big Data (Big Data) 2018-12-01

LightNN: Filling the Gap between Conventional Deep Neural Networks and Binarized Networks

OPENALEX - Publications

Ruizhou Ding Zeye Liu Rongye Shi Diana Marculescu R.D. Blanton

Application-specific integrated circuit (ASIC) implementations for Deep Neural Networks (DNNs) have been adopted in many systems because of their higher classification speed. However, although they may be characterized by better accuracy, larger DNNs require significant energy and area, thereby limiting wide adoption. The consumption is driven both memory accesses computation. Binarized (BNNs), as a trade-off between accuracy consumption, can achieve great reduction, good large due to its...

10.48550/arxiv.1802.02178 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Logic characterization vehicle design reflection via layout rewiring

OPENALEX - Publications

Phillip Fynan Zeye Liu Benjamin Niewenhuis Soumya Mittal Marcin Strajwas and 1 more

Continued scaling of semiconductor fabrication processes has made achieving yield targets increasingly difficult. The design and various types test vehicles is one approach for enabling fast learning. Recent work introduced the Carnegie Mellon logic characterization vehicle (CM-LCV). CM-LCV methodology uses regularity existing testability theory to produce logic-based designs that are both highly testable diagnosable. For be effective learning, it must reflect characteristics actual product...

10.1109/test.2016.7805849 article EN 2016-11-01

Front-end layout reflection for test chip design

OPENALEX - Publications

Zeye Liu Phillip Fynan R.D. Blanton

Fast yield ramping in a new technology to meet aggressive time-to-market deadlines requires comprehensive design and fabrication methodology for silicon test structures that systematically explores validates the technology. Prior work proposed novel logic characterization vehicle (LCV), along with an implementation flow produces chip ensures demographics resemble real products, near-optimal testability diagnosability. This describes efficiently incorporates FEOL layout properties into easily...

10.1109/test.2017.8242041 article EN 2017-10-01

Improving Test Chip Design Efficiency via Machine Learning

OPENALEX - Publications

Zeye Liu Qicheng Huang Chenlei Fang R. D. Blanton

Competitive position in the semiconductor field depends on yield which is becoming more challenging to achieve high levels due increasing complexity associated with design and fabrication of leading-edge integrated circuits (ICs). Consequently, test chips, especially full-flow logic are increasingly employed investigate complex interaction between layout features process before during product ramp. However, designing a quality chip can be time-consuming huge space. This work describes...

10.1109/itc44170.2019.9000131 article EN 2019-11-01

Back-End Layout Reflection for Test Chip Design

OPENALEX - Publications

Zeye Liu R.D. Blanton

At advanced technology nodes, complex interactions between layout features and the process can lead to manufacturability issues that reduce yield. Due huge number of geometries inherent random logic, logic-only test chips are increasingly employed during yield ramp. This work describes a design methodology incorporates into an optimally testable full-flow logic chip. Experiments comparing properties among test-chip, benchmark, actual product designs demonstrate efficacy methodology....

10.1109/iccd.2018.00074 article EN 2022 IEEE 40th International Conference on Computer Design (ICCD) 2018-10-01

FLightNNs: Lightweight Quantized Deep Neural Networks for Fast and Accurate Inference

OPENALEX - Publications

Ruizhou Ding Zeye Liu Ting-Wu Chin Diana Marculescu Raúl Cortés D. and 1 more

To improve the throughput and energy efficiency of Deep Neural Networks (DNNs) on customized hardware, lightweight neural networks constrain weights DNNs to be a limited combination (denoted as $k\in\{1,2\}$) powers 2. In such networks, multiply-accumulate operation can replaced with single shift operation, or two shifts an add operation. provide even more design flexibility, $k$ for each convolutional filter optimally chosen instead being fixed every filter. this paper, we formulate...

10.48550/arxiv.1904.02835 preprint EN other-oa arXiv (Cornell University) 2019-01-01

A Low Power Test Pattern Generator for BIST

OPENALEX - Publications

Shaochong Lei Feng Liang Zeye Liu Xiaoying Wang Zhen Wang

To tackle the increasing testing power during built-in self-test (BIST) operations, this paper proposes a new test pattern generator (TPG). With proposed reconfigurable LFSR, Johnson counter, decompressor and XOR gate network, introduced TPG can produce single input change (SIC) sequences with few repeated vectors. The SIC minimize switching activities of circuit under (CUT). Simulation results on ISCAS benchmarks demonstrate that method effectively save power, does not impose high impact...

10.1587/transele.e93.c.696 article EN IEICE Transactions on Electronics 2010-01-01

A unified solution to reduce test power and test volume for Test-per-scan schemes

OPENALEX - Publications

Shaochong Lei Zhen Wang Zeye Liu Feng Liang

This paper proposes a unified solution to reduce test power and volume for test-per-scan schemes. With the self-testing using MISR Parallel SRSG (STUMPS) architecture developed reconfigurable Johnson counter, proposed pattern generator (TPG) applies two transition sequences all scan chains, primary inputs of circuit under (CUT) keep unchanged at most times. Therefore, switching activities both in combinational block chains can be reduced simultaneously. If generated vectors that do not...

10.1587/elex.7.1364 article EN IEICE Electronics Express 2010-01-01

Path Delay Test of the Carnegie Mellon Logic Characterization Vehicle

OPENALEX - Publications

Ben Niewenhuis Balaji Ravikumar Zeye Liu R.D. Blanton

Previous work on the Carnegie Mellon Logic Characterization Vehicle (CM-LCV) has achieved optimal testability for static fault models. This explores enhancements to CM- LCV that make delay faults optimally testable, with specific focus path model. Results from a design experiment indicate modified CM-LCV can achieve up 100% robust coverage, significant improvement estimated 55.22% coverage reference benchmark design.

10.1109/vts.2019.8758651 article EN 2019-04-01

High Defect-Density Yield Learning using Three-Dimensional Logic Test Chips

OPENALEX - Publications

Zeye Liu R.D. Blanton

Test vehicles of various types that aim to identify yield detractors are essential for maturing a new semiconductor process before high volume production. Due large number unpredictable geometries created by place-and-route, test focus on random logic the utmost importance. Prior work utilizes two-dimensional regular array blocks has demonstrated significant superiority over conventional approaches. In this work, third dimension is added ensure efficient diagnosis multiple defects frequently...

10.1109/itc44778.2020.9325244 article EN 2020-11-01

A low cost test pattern generator for test-per-clock BIST scheme

OPENALEX - Publications

Shaochong Lei Zhen Wang Zeye Liu Feng Liang

Test power and test overhead are crucial to VLSI SOC testing. This paper proposes a low cost pattern generator (TPG) for test-per-clock built-in self-test (BIST) scheme. The proposed method utilizes two-dimensional TPG bit-XOR array reduce area overhead, generates single input change (SIC) sequences transitions of the circuit under (CUT). Simulation results on ISCAS benchmarks demonstrate that can achieve high fault coverage effectively power.

10.1587/elex.7.672 article EN IEICE Electronics Express 2010-01-01

IPSA: Integer Programming via Sparse Approximation for Efficient Test-Chip Design

OPENALEX - Publications

Qicheng Huang Chenlei Fang Zeye Liu Ruizhou Ding R. D. Blanton

Logic test chips are a key component of the yield learning process, which aim to investigate characteristics actual products that will be fabricated at high volume. Mathematically, design logic chip with such an objective may involve solving constrained under-determined equation for integer vector solution, is unfortunately, NP-hard. Existing methods not applicable due lack accuracy or computational complexity. We propose method called IPSA (Integer Programming via Sparse Approximation)...

10.1109/iccd46524.2019.00011 article EN 2022 IEEE 40th International Conference on Computer Design (ICCD) 2019-11-01