- Advanced Memory and Neural Computing
- Ferroelectric and Negative Capacitance Devices
- Semiconductor materials and devices
- Advanced Neural Network Applications
- Parallel Computing and Optimization Techniques
- CCD and CMOS Imaging Sensors
- Advancements in Semiconductor Devices and Circuit Design
- Low-power high-performance VLSI design
- Neural Networks and Reservoir Computing
- Photonic and Optical Devices
- Neuroscience and Neural Engineering
- Analytic Number Theory Research
- Tensor decomposition and applications
- Advanced Mathematical Identities
- Real-Time Systems Scheduling
- Analog and Mixed-Signal Circuit Design
- Power Systems and Technologies
- Electronic and Structural Properties of Oxides
- Advanced Computational Techniques and Applications
- Proteins in Food Systems
- Advanced Combinatorial Mathematics
- Advancements in PLL and VCO Technologies
- Machine Learning in Materials Science
- Radio Frequency Integrated Circuit Design
- Transition Metal Oxide Nanomaterials
Southeast University
2021-2025
The Synergetic Innovation Center for Advanced Materials
2024
Ningxia University
2022
University of Electronic Science and Technology of China
2019-2022
National Tsing Hua University
2018-2022
Computation-in-memory (CIM) is a promising avenue to improve the energy efficiency of multiply-and-accumulate (MAC) operations in AI chips. Multi-bit CNNs are required for high inference accuracy in many applications [1–5]. There are challenges and tradeoffs in SRAM-based CIM: (1) the tradeoff between signal margin, cell stability, and area overhead; (2) process variation in the high-weighted bit dominates the end-result error rate; (3) the trade-off among input bandwidth, speed, and area. Previous SRAM CIM macros were limited to binary MAC operations for fully...
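For context on the step beyond binary MAC that these macros target, the sketch below models how a multi-bit MAC decomposes into binary dot products recombined by shift-and-add. It is a generic bit-serial scheme under illustrative bit widths, not the circuit of any specific macro; `multibit_mac` and its parameters are hypothetical names.

```python
# Minimal sketch (not from the paper): a multi-bit MAC decomposed into
# binary MAC operations, as a bit-serial SRAM-CIM macro would evaluate it.
import numpy as np

def multibit_mac(inputs, weights, in_bits=4, w_bits=4):
    """MAC of unsigned multi-bit inputs/weights via bitwise partial sums."""
    acc = 0
    for i in range(in_bits):            # one input bit-plane per cycle
        in_plane = (inputs >> i) & 1
        for j in range(w_bits):         # one weight bit-column per cell group
            w_plane = (weights >> j) & 1
            # binary MAC: one wordline/bitline dot product in the array
            partial = int(np.dot(in_plane, w_plane))
            acc += partial << (i + j)   # shift-add recombination
    return acc

rng = np.random.default_rng(0)
x = rng.integers(0, 16, size=8)
w = rng.integers(0, 16, size=8)
assert multibit_mac(x, w) == int(np.dot(x, w))
```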
For deep-neural-network (DNN) processors [1-4], the product-sum (PS) operation dominates the computational workload for both convolution (CNVL) and fully-connected (FCNL) neural-network (NN) layers. This hinders the adoption of DNNs on edge artificial-intelligence (AI) devices, which require low-power, low-cost, fast inference. Binary DNNs [5-6] are used to reduce the computation and hardware costs of AI devices; however, a memory bottleneck still remains. In Fig. 31.5.1, conventional PE arrays exploit...
Advanced AI edge chips require multibit input (IN), weight (W), and output (OUT) precision for CNN multiply-and-accumulate (MAC) operations to achieve an inference accuracy that is sufficient for practical applications. Computing-in-memory (CIM) is an attractive approach to improve the energy efficiency ($\mathrm{EF}_{\mathrm{MAC}}$) of MAC operations under a memory-wall constraint. Previous SRAM-CIM macros demonstrated...
Computation-in-memory (CIM) is a promising candidate to improve the energy efficiency of multiply-and-accumulate (MAC) operations in artificial intelligence (AI) chips. This work presents a static random access memory (SRAM) CIM unit-macro using: 1) compact-rule-compatible twin-8T (T8T) cells for weighted MAC operations to reduce area overhead and vulnerability to process variation; 2) an even–odd dual-channel (EODC) input mapping scheme to extend input bandwidth; 3) a two's complement weight-mapping (C2WM) scheme to enable MAC operations using positive...
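A minimal sketch of the two's-complement weight-mapping idea named in point 3, assuming only the standard identity that a two's-complement word equals its unsigned bit columns with the MSB weighted by $-2^{n-1}$; the function name and bit widths are illustrative, not the paper's exact scheme.

```python
# Hedged sketch of a C2WM-style signed MAC (illustrative, not the paper's
# circuit): signed weights are stored as unsigned two's-complement bit
# columns, and only the MSB column is recombined with a negative weight,
# so the array itself accumulates only positive bit-products.
import numpy as np

def c2wm_mac(inputs, weights, w_bits=4):
    """Signed MAC using two's-complement bit columns of the weights."""
    w_u = weights & ((1 << w_bits) - 1)      # two's-complement encoding
    acc = 0
    for j in range(w_bits):
        col = (w_u >> j) & 1                 # one stored bit column
        partial = int(np.dot(inputs, col))   # positive partial sum
        scale = -(1 << j) if j == w_bits - 1 else (1 << j)
        acc += scale * partial               # MSB column weighted negatively
    return acc

rng = np.random.default_rng(1)
x = rng.integers(0, 8, size=16)              # unsigned inputs
w = rng.integers(-8, 8, size=16)             # signed 4-bit weights
assert c2wm_mac(x, w) == int(np.dot(x, w))
```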
Many AI edge devices require local intelligence to achieve fast computing time ($t_{\mathrm{AC}}$), high energy efficiency (EF), and privacy. Transfer learning is a popular solution for such chips, wherein a model trained in the cloud is fine-tuned (re-trained) on the device over a few of its neural layers. This enables the dynamic incorporation of data from in-situ environments or private information. Computing-in-memory (CIM)...
Recent SRAM-based computation-in-memory (CIM) macros enable mid-to-high-precision multiply-and-accumulate (MAC) operations with improved energy efficiency using small/ultra-small-capacity (0.4-8 KB) memory devices. However, advanced CIM-based edge-AI chips favor multiple mid/large SRAM-CIM macros with high input (IN) and weight (W) capacity to reduce the frequency of data reloads from external DRAM and to avoid the need for additional SRAM buffers or ultra-large on-chip buffers. Enlarging capacity to increase throughput, though, increases delay...
Non-volatile computing-in-memory macros that are based on two-dimensional arrays of memristors are of use in the development of artificial intelligence edge devices. Scaling such systems to three-dimensional arrays could provide higher parallelism, capacity, and density for the necessary vector–matrix multiplication operations. However, scaling to three dimensions is challenging due to manufacturing and device-variability issues. Here we report a two-kilobit non-volatile computing-in-memory macro based on vertical resistive random-access...
Computing-in-memory (CIM) is a promising approach to reduce the latency and improve the energy efficiency of deep neural network (DNN) artificial intelligence (AI) edge processors. However, SRAM-based CIM (SRAM-CIM) faces practical challenges in terms of area overhead, performance, energy efficiency, and yield against variations in data patterns and transistor performance. This paper employs a circuit-system co-design methodology to develop an SRAM-CIM unit-macro for the binary-based fully connected neural network (FCNN) layers of DNN AI edge processors. The...
Computing-in-Memory (CIM) is a promising solution for energy-efficient neural network (NN) processors. Previous CIM chips [1], [4] mainly focus on the memory macro itself, lacking insight into overall system integration. Recently, a CIM-based processor [5] for speech recognition demonstrated high energy efficiency. No prior work systematically explores sparsity optimization in a CIM processor. Directly mapping sparse NN models onto regular CIM macros is ineffective, since the data are usually randomly distributed and cannot be...
This article presents a computing-in-memory (CIM) structure aimed at improving the energy efficiency of edge devices running multi-bit multiply-and-accumulate (MAC) operations. The proposed scheme includes a 6T SRAM-based CIM (SRAM-CIM) macro capable of: 1) weight-bitwise MAC (WbwMAC) operations to expand the sensing margin and improve readout accuracy for high-precision operations; 2) a compact local computing cell to perform multiplication with suppressed sensitivity to process variation; 3) an...
Previous SRAM-based computing-in-memory (SRAM-CIM) macros suffer from small read margins for high-precision operations, large cell-array area overhead, and limited compatibility with many input and weight configurations. This work presents a 1-to-8-bit configurable SRAM CIM unit-macro using: 1) a hybrid structure that combines 6T-SRAM-based in-memory binary product-sum (PS) operations with digital near-memory computing for multibit PS accumulation, to increase accuracy and reduce area overhead; 2) a column-based...
SRAM-based computing-in-memory (SRAM-CIM) has been intensively studied and developed to improve the energy and area efficiency of AI devices. SRAM-CIMs have effectively implemented high-precision integer (INT) multiply-and-accumulate (MAC) operations with sufficient inference accuracy for various image classification tasks [1]–[3], [5], [6]. To realize more complex tasks, such as detection and segmentation, and to support on-chip training for better accuracy, floating-point MAC (FP-MAC) operations with high energy efficiency are required. However,...
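One common way to reuse integer CIM hardware for FP-MAC is to align mantissas to a shared maximum exponent before integer accumulation; the sketch below shows that alignment arithmetic as a generic illustration (function name, mantissa width, and rounding are assumptions, not any cited macro's design).

```python
# Illustrative sketch of why FP-MAC is harder than INT MAC for CIM: each
# product's mantissa is right-shifted to a shared maximum exponent so the
# array can accumulate integers, at the cost of small alignment error.
import numpy as np

def aligned_fp_mac(x, w, man_bits=8):
    """Approximate FP dot product via max-exponent mantissa alignment."""
    prod = x * w
    exps = np.frexp(prod)[1]                 # exponent of each product
    e_max = exps.max()
    # quantize each product's mantissa relative to the shared exponent
    mants = np.round(prod / 2.0 ** e_max * (1 << man_bits)).astype(np.int64)
    acc = int(mants.sum())                   # integer accumulation in-array
    return acc * 2.0 ** e_max / (1 << man_bits)

rng = np.random.default_rng(2)
x = rng.normal(size=32)
w = rng.normal(size=32)
print(aligned_fp_mac(x, w), float(np.dot(x, w)))  # close; small shift error
```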
Recent advances in deep neural networks (DNNs) have shown that Binary Neural Networks (BNNs) are able to provide reasonable accuracy on various image datasets with a significant reduction in computation and memory cost. In this paper, we explore two BNNs: hybrid BNN (HBNN) and XNOR-BNN, where the weights are binarized to +1/-1 while the neuron activations are binarized to 1/0 and +1/-1, respectively. Two SRAM bit-cell designs are proposed, namely, a 6T cell for HBNN and a customized 8T cell for XNOR-BNN. In our design, the high-precision multiply-and-accumulate (MAC) is...
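The two binarization styles can be checked in a few lines: HBNN's {1,0} activations make the product a masked sum, while XNOR-BNN's {+1,-1} encoding reduces MAC to XNOR followed by a popcount. The bit encodings below are the usual ones; variable names are illustrative.

```python
# Hedged sketch of the two BNN MAC styles named above.
import numpy as np

def hbnn_mac(act01, w_pm1):
    """HBNN: activations in {1,0}, weights in {+1,-1}."""
    return int(np.dot(act01, w_pm1))

def xnor_mac(act_bits, w_bits):
    """XNOR-BNN: both operands as bits, 1 -> +1, 0 -> -1 (XNOR-popcount)."""
    n = len(act_bits)
    matches = int(np.sum(~(act_bits ^ w_bits) & 1))  # XNOR, then popcount
    return 2 * matches - n                           # map back to a +/-1 sum

rng = np.random.default_rng(3)
a = rng.integers(0, 2, size=64)
wb = rng.integers(0, 2, size=64)
w_pm1 = 2 * wb - 1
assert xnor_mac(a, wb) == int(np.dot(2 * a - 1, w_pm1))
print(hbnn_mac(a, w_pm1))
```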
This paper presents a half-select disturb-free 11T static random access memory (SRAM) cell for ultralow-voltage operations. The proposed SRAM is well suited to bit-interleaving architectures, which helps to improve soft-error immunity with error-correction coding. The read static noise margin (RSNM) and write margin (WM) are significantly improved due to its built-in write/read-assist scheme. Experimental results in 40-nm standard CMOS technology indicate that, at a 0.5-V supply voltage, the RSNM of...
Computing-in-memory (CIM) is a promising architecture for energy-efficient neural network (NN) processors. Several CIM macros have demonstrated high energy efficiency, while CIM-based systems-on-a-chip are not well explored. This work presents an NN processor, named STICKER-IM, which is implemented with sophisticated system integration. Three key innovations are proposed. First, a CIM-friendly block-wise sparsity (BWS) scheme is designed, enabling both activation-sparsity-aware acceleration and...
SRAM-based computation-in-memory (CIM) has shown great potential in improving the energy efficiency of edge-AI devices. Most CIM work [3–4] targets MAC operations with higher input (IN), weight (W), and output (OUT) precision, which is suitable for standard-convolution and fully-connected layers. Edge-AI neural networks trade off inference accuracy against the number of network parameters. Depthwise (DW) convolution support is essential for many light-CNN models, such as MobileNet-V2. However, when applying...
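As a reference for what DW-convolution support means, the sketch below is a plain NumPy depthwise convolution: each channel is convolved with its own single filter and there is no cross-channel accumulation, which is why a CIM column sized for standard K×K×Cin filters sits mostly idle on DW layers. This is an illustrative model, not the paper's dataflow.

```python
# Minimal depthwise (DW) convolution reference, as used in light-CNN
# models such as MobileNet-V2 (valid padding, stride 1, illustrative).
import numpy as np

def depthwise_conv2d(x, w):
    """x: (C, H, W) feature map; w: (C, k, k), one filter per channel."""
    C, H, W = x.shape
    k = w.shape[1]
    out = np.zeros((C, H - k + 1, W - k + 1))
    for c in range(C):                      # no cross-channel accumulation
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                out[c, i, j] = np.sum(x[c, i:i+k, j:j+k] * w[c])
    return out

x = np.random.default_rng(4).normal(size=(8, 6, 6))
w = np.random.default_rng(5).normal(size=(8, 3, 3))
print(depthwise_conv2d(x, w).shape)  # (8, 4, 4)
```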
Computing-in-memory (CIM) based on SRAM is a promising approach to achieving energy-efficient multiply-and-accumulate (MAC) operations in artificial intelligence (AI) edge devices; however, existing SRAM-CIM chips support only DNN inference. The flow of training data requires that CIM arrays perform convolutional computation using transposed weight matrices. This article presents a two-way transpose (TWT) multiply cell with high resistance to process variation and a novel read scheme that uses...
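Why training needs transposed-weight reads follows from backpropagation: the forward pass computes y = Wx, while the error flows back through the transpose. The snippet below illustrates this with a plain matrix; a two-way-readable cell lets both products come from the same stored bits (illustrative code, not the TWT circuit).

```python
# Hedged illustration of forward vs. backward MAC on one stored array.
import numpy as np

rng = np.random.default_rng(6)
W = rng.normal(size=(16, 32))      # one weight array stored in CIM

x = rng.normal(size=32)
y = W @ x                          # forward MAC: read along rows

dy = rng.normal(size=16)           # error from the next layer
dx = W.T @ dy                      # backward MAC: read along columns (W^T)

# With a transposable cell, no physical weight rewrite is needed
# between the two read directions.
assert y.shape == (16,) and dx.shape == (32,)
```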
Computing-in-memory (CIM) improves energy efficiency by enabling parallel multiply-and-accumulate (MAC) operations and reducing memory accesses [1-4]. However, today's typical neural networks (NNs) usually exceed on-chip memory capacity. Thus, a CIM-based processor may encounter a memory bottleneck [5]. Tensor-train (TT) is a tensor decomposition method that decomposes a d-dimensional tensor into d 4D tensor-cores (TCs: G...
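For reference, the sketch below reconstructs single elements of a weight matrix from TT-cores by chaining rank contractions; the core shapes follow the usual (r_{k-1}, m_k, n_k, r_k) convention, and all sizes here are illustrative rather than the chip's configuration.

```python
# Minimal tensor-train (TT) sketch: a large weight matrix is replaced by
# d small 4D tensor-cores, and any element is rebuilt by chained products.
import numpy as np

def tt_element(cores, row_idx, col_idx):
    """Rebuild one element W[(i1..id), (j1..jd)] from TT-cores."""
    v = np.ones((1, 1))
    for G, i, j in zip(cores, row_idx, col_idx):
        v = v @ G[:, i, j, :]        # contract ranks left to right
    return float(v[0, 0])

rng = np.random.default_rng(7)
m, n, r = (2, 3, 2), (2, 2, 3), (1, 4, 4, 1)   # mode sizes and TT-ranks
cores = [rng.normal(size=(r[k], m[k], n[k], r[k + 1])) for k in range(3)]
print(tt_element(cores, (1, 2, 0), (0, 1, 2)))
# Storage cost: sum of core sizes vs. prod(m) * prod(n) for the full matrix.
```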
Resistive memory (RRAM) provides an ideal platform for developing embedded non-volatile computing-in-memory (nvCIM). However, it faces several critical challenges, including device non-idealities, large DC currents, and small signal margins. To address these issues, we propose a voltage-division (VD)-based computing approach and its circuit implementation in two-transistor-two-resistor (2T2R) RRAM cell arrays, which can realize energy-efficient, sign-aware, and robust deep neural network (DNN)...
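An idealized behavioral model of the voltage-division readout: the two resistors of a 2T2R cell form a divider, so the sensed quantity is a bounded voltage rather than a summed DC current, and complementary resistance states encode a signed weight. The resistance and voltage values below are illustrative assumptions, not device data.

```python
# Behavioral sketch of voltage-division readout on one 2T2R cell
# (idealized divider model, not the paper's circuit).
def vd_readout(v_read, r_top, r_bottom):
    """Divider output between the two complementary RRAM resistances."""
    return v_read * r_bottom / (r_top + r_bottom)

V = 0.9                  # read voltage (V), illustrative
R_LRS, R_HRS = 6e3, 6e5  # low/high resistance states (ohms), illustrative

w_plus1 = vd_readout(V, R_LRS, R_HRS)   # +1: output pulled near v_read
w_minus1 = vd_readout(V, R_HRS, R_LRS)  # -1: output pulled near ground
print(round(w_plus1, 3), round(w_minus1, 3))  # ~0.891 V vs ~0.009 V
```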
Advances in static random access memory (SRAM)-CIM devices are meant to increase capacity while improving energy efficiency (EF) and reducing computing latency ($T_{\mathrm{AC}}$). This work presents a novel SRAM-CIM structure using: 1) a segmented-bitline charge-sharing (SBCS) scheme for multiply-and-accumulate (MAC) operations...
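A charge-sharing readout can be modeled as a capacitance-weighted average: shorting bitline segments of capacitance C_i precharged to V_i yields V = (Σ C_i·V_i) / (Σ C_i). The toy function below illustrates that relation only; it is not the SBCS circuit, and the values are made up.

```python
# Idealized charge-sharing model: total charge is conserved when
# precharged segments are shorted, so the shared voltage is the
# capacitance-weighted average of the segment voltages.
def charge_share(caps, volts):
    q = sum(c * v for c, v in zip(caps, volts))   # total charge
    return q / sum(caps)                          # shared voltage

print(charge_share([1.0, 1.0, 2.0], [0.8, 0.0, 0.4]))  # 0.4 (V)
```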