NFDI4DS | UHH-SEMS - Publication Details

Jie Gu

ORCID: 0000-0003-2912-7294

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5005991726

Research Areas

Low-power high-performance VLSI design
Atomic and Molecular Physics
Advanced Memory and Neural Computing
Advanced Chemical Physics Studies
Parallel Computing and Optimization Techniques
Analog and Mixed-Signal Circuit Design
Neuroscience and Neural Engineering
Atmospheric Ozone and Climate
Semiconductor materials and devices
CCD and CMOS Imaging Sensors
Advancements in Semiconductor Devices and Circuit Design
Advancements in PLL and VCO Technologies
VLSI and Analog Circuit Testing
Ferroelectric and Negative Capacitance Devices
X-ray Spectroscopy and Fluorescence Analysis
Embedded Systems Design Techniques
Advanced Measurement and Metrology Techniques
VLSI and FPGA Design Techniques
Spectroscopy and Laser Applications
Atomic and Subatomic Physics Research
Electromagnetic Compatibility and Noise Suppression
Cold Atom Physics and Bose-Einstein Condensates
Physical Unclonable Functions (PUFs) and Hardware Security
Photochemistry and Electron Transfer Studies
Interconnection Networks and Systems

Northwestern University
2016-2025

Northwest Normal University
2023

General Motors (United States)
2015-2019

General Motors (Poland)
2016

The University of Tokyo
2008-2015

Tokyo Medical and Dental University
2015

Shihezi University
2015

MaxLinear (United States)
2010-2014

Texas Instruments (United States)
2009-2011

University of Minnesota
2005-2009

15.3 A 65nm 3T Dynamic Analog RAM-Based Computing-in-Memory Macro and CNN Accelerator with Retention Enhancement, Adaptive Analog Sparsity and 44TOPS/W System Energy Efficiency

OPENALEX - Publications

Zhengyu Chen Xi Chen Jie Gu

Computing-In-Memory (CIM) techniques which incorporate analog computing inside memory macros have shown significant advantages in efficiency for deep learning applications. While earlier CIM were limited by lower bit precision, e.g. binary weights [1], recent works 4-to-8b precision the weights/inputs and up to 20b output values [2, 3]. Sparsity application features also been exploited at system level further improve computation [4, 5]. To enable higher bit-wise operations commonly utilized...

10.1109/isscc42613.2021.9366045 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2021-02-13

Charge transfer in collisions of O+ with H and H+ with O

OPENALEX - Publications

P. C. Stancil D. R. Schultz M. Kimura Jie Gu Gerhard Hirsch and 1 more

Cross sections and rate coefficients for total fine-structure resolved charge transfer in collisions of O+ with H H+ O are presented collision energies between 0.1 meV/u 10 MeV/u temperatures 107 K. The results obtained utilizing new quantal semiclassical molecular-orbital close-coupling, classical trajectory Monte Carlo, continuum distorted wave calculations conjunction previous experimental theoretical data. Applications to various astrophysical atmospheric environments discussed.

10.1051/aas:1999419 article EN Astronomy and Astrophysics Supplement Series 1999-12-01

A 65nm Systolic Neural CPU Processor for Combined Deep Learning and General-Purpose Computing with 95% PE Utilization, High Data Locality and Enhanced End-to-End Performance

OPENALEX - Publications

Yuhao Ju Jie Gu

Despite recent progress on building highly efficient deep neural network (DNN) accelerators, few works have targeted improving the end-to-end performance of deep-learning tasks, where inter-layer pre/post-processing, data alignment and movement across memory processing units often dominate execution time. An improvement to computation requires cohesive cooperation between accelerator CPU with flow management. Figure 15.2.1 shows most commonly used heterogeneous architecture, containing a...

10.1109/isscc42614.2022.9731757 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2022-02-20

A 28 nm 0.6 V Low Power DSP for Mobile Applications

OPENALEX - Publications

Nathan Ickes Gordon Gammie Mahmut E. Sinangil Rahul Rithe Jie Gu and 10 more

Processors for next generation mobile devices will need to operate across a wide supply voltage range in order support both high performance and power efficiency modes of operation. However, the effects local transistor threshold ( <i xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">V</i> <sub xmlns:xlink="http://www.w3.org/1999/xlink">T</sub> ) variation, already significant issue today's advanced process technologies, further exacerbated at low...

10.1109/jssc.2011.2169689 article EN IEEE Journal of Solid-State Circuits 2011-11-17

Cyclic locking and memristor-based obfuscation against CycSAT and inside foundry attacks

OPENALEX - Publications

Amin Rezaei Yuanqi Shen Shuyu Kong Jie Gu Hai Zhou

The high cost of IC design has made chip protection one the first priorities semiconductor industry. Although there is a common impression that combinational circuits must be designed without any cycles, with cycles can as well. Such cyclic used to reliably lock ICs. Moreover, since memristor compatible CMOS structure, it possible efficiently obfuscate using polymorphic memristor-CMOS gates. In this case, layouts different functionalities look exactly identical, making impossible even for an...

10.23919/date.2018.8341984 article EN Design, Automation & Test in Europe Conference & Exhibition (DATE), 2015 2018-03-01

Visualizing Thermally Activated Memristive Switching in Percolating Networks of Solution‐Processed 2D Semiconductors

OPENALEX - Publications

Vinod K. Sangwan Sonal V. Rangnekar Joohoon Kang Jianan Shen Hong‐Sub Lee and 8 more

Abstract Memristive systems present a low‐power alternative to silicon‐based electronics for neuromorphic and in‐memory computation. 2D materials have been increasingly explored memristive applications due their novel biomimetic functions, ultrathin geometry ultimate scaling limits, potential fabricating large‐area, flexible, printed devices. While the switching mechanism in memristors based on single nanosheets is similar conventional oxide memristors, nanosheet composite films complicated...

10.1002/adfm.202107385 article EN publisher-specific-oa Advanced Functional Materials 2021-09-23

A 65-nm Humanoid Robot System-on-Chip Using Time-Domain 3-D Footstep Planning and Mixed-Signal ZMP Gait Scheduler With Inverse Kinematics

OPENALEX - Publications

Qiankai Cao Juin Chuen Oh Jie Gu

10.1109/jssc.2025.3541484 article EN IEEE Journal of Solid-State Circuits 2025-01-01

Humanoid Robot Control: A Mixed-Signal Footstep Planning SoC with ZMP Gait Scheduler and Neural Inverse Kinematics

OPENALEX - Publications

Qiankai Cao Y. J. Li Juin Chuen Oh Jie Gu

10.1145/3658617.3698482 article EN Proceedings of the 28th Asia and South Pacific Design Automation Conference 2025-01-20

Headset-Integrated Brain-Machine Interface for Mind Imagery and Control in VR/MR Applications

OPENALEX - Publications

Zhiwei Zhong Yijie Wei Lance Christopher Go Yipeng Xiong Yifan Li Jie Gu

10.1145/3658617.3698713 article EN Proceedings of the 28th Asia and South Pacific Design Automation Conference 2025-01-20

Charging Process and Coulomb-Force-Directed Printing of Nanoparticles with Sub-100-nm Lateral Resolution

OPENALEX - Publications

Chad R. Barry Jie Gu Heiko O. Jacobs

This article reports on a new charging process and Coulomb-force-directed assembly of nanoparticles onto charged surface areas with sub-100-nm resolution. The is accomplished using flexible nanostructured thin silicon electrode. Electrical nanocontacts have been created as small 50 nm by placing the electrode an electret surface. used to inject charge into sized areas. Nanoparticles were assembled patterns, lateral resolution 60 has observed for first time. A comparison nanoparticle patterns...

10.1021/nl0511972 article EN Nano Letters 2005-08-27

CNC machine tool work offset error compensation method

OPENALEX - Publications

Jie Gu John S. Agapiou Sheri Kurgin

10.1016/j.jmsy.2015.04.001 article EN Journal of Manufacturing Systems 2015-10-01

High-Throughput Dynamic Time Warping Accelerator for Time-Series Classification With Pipelined Mixed-Signal Time-Domain Computing

OPENALEX - Publications

Zhengyu Chen Jie Gu

Time-series classification (TSC) is a challenging problem in machine learning and significant efforts have been made to improve its speed computation efficiency. Among various approaches, dynamic time warping (DTW) algorithm one of the most prevalent methods for TSC due succinctness generality. To throughput operation, this work presents mixed-signal DTW accelerator utilizing time-domain (TD) computing where signals are encoded processed using pulses. A pipelined operation enabled by...

10.1109/jssc.2020.3021066 article EN publisher-specific-oa IEEE Journal of Solid-State Circuits 2020-09-17

A theoretical study of the absorption spectrum of singlet CH 2

OPENALEX - Publications

Jie Gu Gerhard Hirsch Robert J. Buenker Martin Brumm Gerald Osmann and 2 more

10.1016/s0022-2860(99)00256-2 article EN Journal of Molecular Structure 2000-02-01

A High-Speed Variation-Tolerant Interconnect Technique for Sub-Threshold Circuits Using Capacitive Boosting

OPENALEX - Publications

Jonggab Kil Jie Gu C.H. Kim

This paper describes an interconnect technique for subthreshold circuits to improve global wire delay and reduce the variation due process-voltage-temperature (PVT) fluctuations. By internally boosting gate voltage of driver transistors, operating region is shifted from super-threshold enhancing performance improving tolerance PVT variations. Simulations a clock distribution network using proposed shows 66%-76% reduction in 3sigma skew value 84%-88% tree compared conventional drivers. A...

10.1109/tvlsi.2007.915455 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2008-03-27

On-Chip Supply Noise Regulation Using a Low-Power Digital Switched Decoupling Capacitor Circuit

OPENALEX - Publications

Jie Gu Hanyong Eom Chris H. Kim

On-chip resonant supply noise in the mid-frequency range (i.e., 50-300 MHz) has been identified as dominant component modern microprocessors. To overcome limited efficiency of conventional decoupling capacitors reducing noise, this paper proposes a low-power digital switched capacitor circuit. By adaptively switching connectivity decaps according to measured amount charge provided by is dramatically boosted leading an increased damping on-chip network. Analysis on transfer during events...

10.1109/jssc.2009.2020454 article EN IEEE Journal of Solid-State Circuits 2009-05-27

Design and Implementation of Active Decoupling Capacitor Circuits for Power Supply Regulation in Digital ICs

OPENALEX - Publications

Jie Gu Ramesh Harjani C.H. Kim

Control of on-chip power supply noise has become a major challenge for continuous scaling CMOS technology. Conventional passive decoupling capacitors (decaps) exhibit significant area and leakage penalties. To improve the efficiency regulation, this paper proposes distributed active decap circuit use in digital integrated circuits (ICs). The proposed design uses an operational amplifier to boost performance conventional decaps. Simulations proved its enhanced effect comparison with also...

10.1109/tvlsi.2008.2004543 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2009-01-16

Error compensation and accuracy improvements in 5-axis machine tools using the global offset method

OPENALEX - Publications

Jie Gu John S. Agapiou Sheri Kurgin

10.1016/j.jmsy.2017.04.015 article EN Journal of Manufacturing Systems 2017-05-19

Hybrid Memristor-CMOS Obfuscation Against Untrusted Foundries

OPENALEX - Publications

Amin Rezaei Jie Gu Hi Zhou

The high cost of IC design has made chip protection one the first priorities semiconductor industry. In addition, with growing number untrusted foundries, possibility inside foundry attack is escalating. However, by taking advantage polymorphic gates, layouts circuits different functionalities look exactly identical, making it impossible even for an attacker to distinguish defined functionality looking at its layout. Moreover, since memristor compatible CMOS structure, possible efficiently...

10.1109/isvlsi.2019.00102 article EN 2019-07-01

Charge Transfer in Collisions of C+with H and H+with C

OPENALEX - Publications

P. C. Stancil C. C. Havener Predrag Krstić D. R. Schultz M. Kimura and 4 more

Charge transfer rate coefficients for collisions of C+ with H and H+ C are presented temperatures from 30,000 to 107 K 10 K, respectively. The were calculated recommended cross sections deduced in a recent theoretical experimental investigation that took into account previous measurements. Nonadiabatic radial coupling is the dominant mechanism both reactions above ~50,000 but lower reaction proceeds primarily by radiative charge transfer. Implications, due magnitude coefficients, various...

10.1086/305937 article EN The Astrophysical Journal 1998-08-01

Width Quantization Aware FinFET Circuit Design

OPENALEX - Publications

Jie Gu John Keane Sachin S. Sapatnekar Chris Kim

This paper presents a statistical leakage estimation method for FinFET devices considering the unique width quantization property. Monte Carlo simulations show that conventional approach underestimates average current of by as much 43% while proposed gives precise with an error less than 5%. Design example on dynamic logic circuits shows effectiveness

10.1109/cicc.2006.320916 article EN 2006-09-01

A 28nm 0.6V low-power DSP for mobile applications

OPENALEX - Publications

Gordon Gammie Nathan Ickes Mahmut E. Sinangil Rahul Rithe Jie Gu and 10 more

A multimedia applications processor is fabricated using a 28nm low-power process technology for ultra-low-power applications. Based on 4-issue, 32 register version of the TMS320C64X+ VLIW DSP, this System Chip (SoC) includes 32kB L1 and 128kB L2 caches, I2S, SPI, UART, MultiMediaCard, external memory interfaces (Fig. 7.5.1). The design incorporates over 600k instances custom low-voltage logic cells 43 (1.6 Mb) 6T SRAM. Utilizing ultra-low-voltage (ULV) optimized standard-cell libraries SRAM...

10.1109/isscc.2011.5746251 article EN 2011-02-01

A Sparse Convolution Neural Network Accelerator for 3D/4D Point-Cloud Image Recognition on Low Power Mobile Device with Hopping-Index Rule Book for Efficient Coordinate Management

OPENALEX - Publications

Qiankai Cao Jie Gu

This work presents the first 3D/4D sparse CNN (SCNN) accelerator for point cloud image recognition on low power devices. A special hopping-index rule book method and efficient data search technique were developed to mitigate overhead of coordinate management SCNN. 65nm test chip images was demonstrated with 7.09–13.6 TOPS/W efficiency state-of-the-art frame rate.

10.1109/vlsitechnologyandcir46769.2022.9830178 article EN 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits) 2022-06-12

A Systolic Neural CPU Processor Combining Deep Learning and General-Purpose Computing With Enhanced Data Locality and End-to-End Performance

OPENALEX - Publications

Yuhao Ju Jie Gu

While neural network (NN) accelerators are being significantly developed in recent years, CPU is still essential for data management and pre-/post-processing of a commonly used heterogeneous architecture, which usually contains an NN accelerator processor core with transfer performed by direct memory access (DMA) engine. This work presents special processor, referred to as systolic (SNCPU), unified architecture combining deep learning general-purpose computing fifth-generation reduced...

10.1109/jssc.2022.3214170 article EN IEEE Journal of Solid-State Circuits 2022-11-04

33.2 A Sub-1μJ/class Headset-Integrated Mind Imagery and Control SoC for VR/MR Applications with Teacher-Student CNN and General-Purpose Instruction Set Architecture

OPENALEX - Publications

Zhiwei Zhong Yijie Wei Lance Christopher Go Jie Gu

Virtual Reality (VR) and Mixed (MR) systems, e.g., Meta Quest Apple Vision Pro, have recently gained significant interest in consumer electronics, creating a new wave of developments metaverse for gaming, social networking, workforce assistance, online shopping, etc. Strong technological innovations AI computing multi-modular human activity tracking control produced immersive virtual realistic user experiences. However, most existing VR headsets only rely on traditional joysticks or...

10.1109/isscc49657.2024.10454317 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2024-02-18

20.4 A 28nm Physics Computing Unit Supporting Emerging Physics-Informed Neural Network and Finite Element Method for Real-Time Scientific Computing on Edge Devices

OPENALEX - Publications

Yuhao Ju Ganqi Xu Jie Gu

The demand for real-time computing on edge devices from emerging applications, e.g. AI, has exploded in recent years. Lately, physics-based scientific also drawn significant interests driven by the growth of e.g., VR, IoT, robotics, etc. Fig. 20.4.1 shows examples computation including structural deformation photorealistic VR/MR, robot dynamic control, temperature monitoring additive manufacturing, and leak-gas tracking. Unfortunately, hardware support numerical is relatively poor, hindering...

10.1109/isscc49657.2024.10454502 article EN 2022 IEEE International Solid- State Circuits Conference (ISSCC) 2024-02-18

Coming Soon ...