Songming Yu

ORCID: 0000-0002-6191-2682
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Memory and Neural Computing
  • Advanced Neural Network Applications
  • Numerical methods in engineering
  • Parallel Computing and Optimization Techniques
  • Composite Material Mechanics
  • Ferroelectric and Negative Capacitance Devices
  • Machine Learning and ELM
  • Ultrasonics and Acoustic Wave Propagation
  • Adhesion, Friction, and Surface Interactions
  • Fluid dynamics and aerodynamics studies
  • Human Pose and Action Recognition
  • Advanced Data Storage Technologies
  • Mechanical Behavior of Composites
  • Advancements in Semiconductor Devices and Circuit Design
  • Stochastic Gradient Optimization Techniques
  • Video Surveillance and Tracking Methods
  • Advancements in Photolithography Techniques
  • Fluid Dynamics and Mixing
  • Aerodynamics and Fluid Dynamics Research
  • Surface Modification and Superhydrophobicity
  • Diagnosis and treatment of tuberculosis
  • Fault Detection and Control Systems
  • Adversarial Robustness in Machine Learning
  • Rheology and Fluid Dynamics Studies
  • Fatigue and fracture mechanics

Tsinghua University
1996-2024

10.1016/j.ijsolstr.2012.03.017 article EN publisher-specific-oa International Journal of Solids and Structures 2012-03-24

10.1016/s0167-8442(96)00026-2 article EN Theoretical and Applied Fracture Mechanics 1996-11-01

10.1016/s0167-8442(96)00027-4 article EN Theoretical and Applied Fracture Mechanics 1996-11-01

Recently CNN-based methods have made remarkable progress in broad fields. Both network pruning algorithms and hardware accelerators been introduced to accelerate CNN. However, existing not fully studied the pattern method, current index storage scheme of sparse CNN is efficient. Furthermore, performance suffers from no-load PEs on networks. This work proposes a software-hardware co-design address these problems. The software includes an ADMM-based method which compresses patterns convolution...

10.1109/dac18072.2020.9218630 article EN 2020-07-01

This work presents an energy-efficient CIM SoC with heterogeneous CPU, CIM, SIMD, DMA and COMM cores. The main contributions include: 1) A producer-consumer instruction dependency controller (PCIDC) shared multi-port SRAM to reduce SoC-level data transfer. 2) An inner-pipelined read-free digital macro achieve higher frequency. 3) parallel-to-serial (PTS) sparse architecture utilizing the low activity of partial-sum accumulation. demonstrates first digital-CIM SoC. fabricated 55nm chip...

10.23919/vlsitechnologyandcir57934.2023.10185315 article EN 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits) 2023-06-11

Deploying neural network (NN) models on Internet-of-Things (IoT) devices is important to enable artificial intelligence (AI) the edge realizing AI-of-Things (AIoT). However, high energy consumption and bandwidth requirement of NN restricts AI applications battery-limited equipments. Compute-In-Memory (CIM), featured with efficiency, provides new opportunities for IoT deployment NN. design CIM-based full system still at early stage, lacking system-level demonstration vertical optimization...

10.1109/tcsii.2023.3249245 article EN IEEE Transactions on Circuits & Systems II Express Briefs 2023-02-27

In recent years, convolutional neural networks (CNNs) have achieved significant advancements in various fields. However, the computation and storage overheads of CNNs are overwhelming for Internet-of-Things devices. Both network pruning algorithms hardware accelerators been introduced to empower CNN inference at edge. Network reduce size computational cost by regularizing unimportant weights zeros. existing works lack intrakernel structured types tradeoff between sparsity efficiency, index...

10.1109/tcad.2022.3140730 article EN IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2022-01-06

Resistive random access memory (RRAM) is an emerging device for processing-in-memory (PIM) architecture to accelerate convolutional neural network (CNN). However, due the highly coupled crossbar structure in RRAM array, it difficult exploit CNN sparsity feature improve performance RRAM-based accelerator. To optimize weight mapping of sparse array and area energy efficiency, we propose a novel scheme corresponding accelerator based on pattern pruning operation unit(OU) mechanism. Experimental...

10.1109/nvmsa53655.2021.9628683 article EN 2021-08-18

Resistive Random Access Memory (RRAM) is an emerging device for processing-in-memory (PIM) architecture to accelerate convolutional neural network (CNN). However, due the highly coupled crossbar structure in RRAM array, it difficult exploit sparsity of RRAM-based CNN accelerator. To optimize weight mapping sparse array and achieve high area energy efficiency, we propose a novel scheme corresponding accelerator based on pattern pruning Operation Unit(OU) mechanism. Experimental results show...

10.48550/arxiv.2010.06156 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Compute-In-Memory (CIM), characterized by efficient matrix-vector multiplication, has been recognized as a promising candidate technology for edge AI computing. However, applying CIM in extreme scenarios, where power delivery is limited and unstable, still faces challenges. The relatively high memory write energy, compared with computing, prevents its further gains on ultra-lower-power devices. frequent backup/restore intermittent together the nonvolatile (NVM) even higher program escalates...

10.1109/tcsii.2023.3289493 article EN IEEE Transactions on Circuits & Systems II Express Briefs 2023-06-26

Bionic non-smooth structures on the bodies of revolution had characteristic drag reduction. In this paper, oil flow visualization test were made in low speed wind tunnel between two bionic surface models and smooth model. The results show that pattern is significantly difference according to configuration structures, obviously effect friction However, total coefficient BNNS model reduced. It was found BNSS can decrease pressure obviously. mechanism through sacrifice small viscous force,...

10.3968/j.ans.1715787020100302.028 article EN Advances in natural science/Advances in natural sciences 2010-09-03

Abstract Background Differentiating between ulcerative colitis (UC), Crohn’s disease (CD) and intestinal tuberculosis (ITB) is challenging under endoscopy. We aimed to realise automatic differential diagnosis among these diseases through machine learning algorithms. Methods A total of 6399 consecutive patients (5128 UC, 875 CD 396 ITB) who had taken colonoscopy examinations in Peking Union Medical College Hospital from January 2008 November 2018 was enrolled. The input the description...

10.1093/ecco-jcc/jjz203.342 article EN Journal of Crohn s and Colitis 2020-01-01
Coming Soon ...