NFDI4DS | UHH-SEMS - Publication Details

Guido Masera

ORCID: 0000-0003-2238-9443

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5074150684

Research Areas

Advanced Wireless Communication Techniques
Error Correcting Code Techniques
Advanced Data Compression Techniques
Low-power high-performance VLSI design
Digital Filter Design and Implementation
Embedded Systems Design Techniques
Advanced Memory and Neural Computing
Coding theory and cryptography
Video Coding and Compression Technologies
Parallel Computing and Optimization Techniques
Interconnection Networks and Systems
Image and Signal Denoising Methods
Wireless Communication Networks Research
Cooperative Communication and Network Coding
Semiconductor materials and devices
VLSI and FPGA Design Techniques
VLSI and Analog Circuit Testing
PAPR reduction in OFDM
Advanced Vision and Imaging
Advanced Neural Network Applications
Advancements in Semiconductor Devices and Circuit Design
Ferroelectric and Negative Capacitance Devices
Cryptographic Implementations and Security
Electromagnetic Compatibility and Noise Suppression
Adversarial Robustness in Machine Learning

Polytechnic University of Turin
2016-2025

ORCID
2021

Canadian Standards Association
2019

Institute of Electrical and Electronics Engineers
2018-2019

Turin Polytechnic University
2019

Huawei Technologies (France)
2018

Politecnico di Milano
2012

Consorzio Nazionale Interuniversitario per le Telecomunicazioni
2012

RWTH Aachen University
2006

Bilkent University
2006

An Updated Survey of Efficient Hardware Architectures for Accelerating Deep Convolutional Neural Networks

OPENALEX - Publications

Maurizio Capra Beatrice Bussolino Alberto Marchisio Muhammad Shafique Guido Masera and 1 more

Deep Neural Networks (DNNs) are nowadays a common practice in most of the Artificial Intelligence (AI) applications. Their ability to go beyond human precision has made these networks milestone history AI. However, while on one hand they present cutting edge performance, other require enormous computing power. For this reason, numerous optimization techniques at hardware and software level, specialized architectures, have been developed process models with high performance power/energy...

10.3390/fi12070113 article EN cc-by Future Internet 2020-07-07

Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead

OPENALEX - Publications

Maurizio Capra Beatrice Bussolino Alberto Marchisio Guido Masera Maurizio Martina and 1 more

Currently, Machine Learning (ML) is becoming ubiquitous in everyday life. Deep (DL) already present many applications ranging from computer vision for medicine to autonomous driving of modern cars as well other sectors security, healthcare, and finance. However, achieve impressive performance, these algorithms employ very deep networks, requiring a significant computational power, both during the training inference time. A single DL model may require billions multiply-and-accumulated...

10.1109/access.2020.3039858 article EN cc-by IEEE Access 2020-01-01

Edge Computing: A Survey On the Hardware Requirements in the Internet of Things World

OPENALEX - Publications

Maurizio Capra Riccardo Peloso Guido Masera Massimo Ruo Roch Maurizio Martina

In today’s world, ruled by a great amount of data and mobile devices, cloud-based systems are spreading all over. Such phenomenon increases the number connected broadcast bandwidth, information exchange. These fine-grained interconnected systems, which enable Internet connectivity for an extremely large facilities (far beyond current devices) go name Things (IoT). this scenario, devices have operating time is proportional to battery capacity, operations performed per cycle exchanged data....

10.3390/fi11040100 article EN cc-by Future Internet 2019-04-23

VLSI architectures for turbo codes

OPENALEX - Publications

Guido Masera Gianluca Piccinini Massimo Ruo Roch Maurizio Zamboni

A great interest has been gained in recent years by a new error-correcting code technique, known as "turbo coding", which proven to offer performance closer the Shannon's limit than traditional concatenated codes. In this paper, several very large scale integration (VLSI) architectures suitable for turbo decoder implementation are proposed and compared terms of complexity performance; impact on VLSI system parameters like state number, number iterations, rate evaluated different solutions....

10.1109/92.784098 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 1999-09-01

Implementation of a Flexible LDPC Decoder

OPENALEX - Publications

Guido Masera Federico Quaglio F. Vacca

Low-density parity-check codes (LDPC) are among the most powerful error correcting tools today available. For this reason they became very popular in several applications such as digital satellite broadcasting system (DVB-S2), wireless local area network (IEEE 802.11n) and metropolitan (802.16e). Whereas code-specific decoders have been proposed literature, implementation of a high performance yet flexible LDPC decoder still is challenging topic. This work presents novel formulation decoding...

10.1109/tcsii.2007.894409 article EN IEEE Transactions on Circuits and Systems II Analog and Digital Signal Processing 2007-06-01

CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphic Research Processor

OPENALEX - Publications

Alberto Viale Alberto Marchisio Maurizio Martina Guido Masera Muhammad Shafique

Autonomous Driving (AD) related features provide new forms of mobility that are also beneficial for other kind intelligent and autonomous systems like robots, smart transportation, industries. For these applications, the decisions need to be made fast in real-time. Moreover, quest electric mobility, this task must follow low power policy, without affecting much autonomy mean transport or robot. These two challenges can tackled using emerging Spiking Neural Networks (SNNs). When deployed on a...

10.1109/ijcnn52387.2021.9533738 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2021-07-18

VLSI Implementation of a Multi-Mode Turbo/LDPC Decoder Architecture

OPENALEX - Publications

Carlo Condo Maurizio Martina Guido Masera

Flexible and reconfigurable architectures have gained wide popularity in the communications field. In particular, for physical layer are an attractive solution not only to switch among different coding modes but also achieve interoperability. This work concentrates on design of a architecture both turbo LDPC codes decoding. The novel contributions this paper are: i) tackling reconfiguration issue introducing formal systematic treatment that, best our knowledge, was previously addressed ii)...

10.1109/tcsi.2012.2221216 article EN IEEE Transactions on Circuits and Systems I Regular Papers 2012-11-30

Multiplierless, Folded 9/7– 5/3 Wavelet VLSI Architecture

OPENALEX - Publications

Maurizio Martina Guido Masera

This brief proposes a multiplierless VLSI architecture for the famous 9/7 wavelet filters. The novelty of this is possibility to compute 5/3 results into data path with reduced number adders compared other solutions. has been characterized in terms performance through simulations JPEG2000 environment and Implementation on 0.13-mum standard cell technology shows that proposed architectures requires amount logic excellent performance.

10.1109/tcsii.2007.900354 article EN IEEE Transactions on Circuits and Systems II Analog and Digital Signal Processing 2007-09-01

Quantum Dot Cellular Automata Check Node Implementation for LDPC Decoders

OPENALEX - Publications

Muhammad Awais Marco Vacca Mariagrazia Graziano Massimo Ruo Roch Guido Masera

The quantum dot Cellular Automata (QCA) is an emerging nanotechnology that has gained significant research interest in recent years. Extremely small feature sizes, ultralow power consumption, and high clock frequency make QCA a potentially attractive solution for implementing computing architectures at the nanoscale. To be considered as suitable CMOS substitute, technology must able to implement complex real-time applications with affordable complexity. Low density parity check (LDPC)...

10.1109/tnano.2013.2251422 article EN IEEE Transactions on Nanotechnology 2013-05-01

High Speed Architectures for Finding the First two Maximum/Minimum Values

OPENALEX - Publications

Luca Amarù Maurizio Martina Guido Masera

High speed architectures for finding the first two maximum/minimum values are of paramount importance in several applications, including iterative (e.g., turbo and low-density-parity-check) decoders. In this brief, stemming from a previous work, based on radix-2 solutions, we propose higher mixed radix implementations that improve architecture latency. Post place route results 180-nm CMOS standard cell technology show proposed achieve lower latency than solutions with moderate area increase.

10.1109/tvlsi.2011.2174166 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2011-11-28

Adaptive Approximated DCT Architectures for HEVC

OPENALEX - Publications

Maurizio Masera Maurizio Martina Guido Masera

This paper proposes a flexible and efficient implementation of the 2D N-point discrete cosine transform (DCT) for High Efficiency Video Coding (HEVC) standard. The DCT is implemented through Walsh-Hadamard (WHT) followed by Givens rotations. scheme exploited to derive an adaptive algorithm, which allows computing four different approximations ranging from complete WHT, selectively skipping some shows statistical analysis usage derives precomputation mechanism adaptively skip Each...

10.1109/tcsvt.2016.2595320 article EN IEEE Transactions on Circuits and Systems for Video Technology 2016-01-01

On optimal and near-optimal turbo decoding using generalized max operator

OPENALEX - Publications

Stylianos Papaharalabos P. Takis Mathiopoulos Guido Masera Maurizio Martina

Motivated by a recently published robust geometric programming approximation, generalized approach for approximating efficiently the max* operator is presented. Using this approach, approximated means of generic and yet very simple max operator, instead using additional correction term as previous approximation methods require. Following that, several turbo decoding algorithms are obtained with optimal near-optimal bit error rate (BER) performance depending on single parameter, namely number...

10.1109/lcomm.2009.090537 article EN IEEE Communications Letters 2009-07-01

SwiftTron: An Efficient Hardware Accelerator for Quantized Transformers

OPENALEX - Publications

Alberto Marchisio Davide Dura Maurizio Capra Maurizio Martina Guido Masera and 1 more

Transformers' compute- intensive operations pose enormous challenges for their deployment in resource- constrained EdgeAI / tiny ML devices. As an established neural network compression technique, quantization reduces the hardware computational and memory resources. In particular, fixed-point is desirable to ease computations using lightweight blocks, like adders multipliers, of underlying hardware. However, deploying fully-quantized Transformers on existing general-purpose hardware, generic...

10.1109/ijcnn54540.2023.10191521 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2023-06-18

Wireless sensor networks

OPENALEX - Publications

Federico Quaglio Maurzio Martina F. Vacca Guido Masera A. Molino and 2 more

Wireless Sensor Networks are an emerging phenomenon in the research community. The design and development of network architectures nodes implementation fostering many activities. Due to their wide application fields pervasive employment possibilities, investigation novel classes wireless sensor is great concern. In this paper we presented a Power-Scalable Motion Estimation IP suitable for video-surveillance over Networks. proposed architecture can achieve low dynamic power, good video...

10.1145/611851.611878 article EN 2003-01-01

Low-Complexity, Efficient 9/7 Wavelet Filters VLSI Implementation

OPENALEX - Publications

Maurizio Martina Guido Masera

This brief proposes a novel low-complexity, efficient 9/7 wavelet filters VLSI architecture for image compression applications. The performance of hardware implementation the filter bank depends on accuracy coefficients representation. aim this work is to show that great complexity reduction with excellent can be achieved going through derivation taps values

10.1109/tcsii.2006.883092 article EN IEEE Transactions on Circuits and Systems II Analog and Digital Signal Processing 2006-11-01

Motion estimation and CABAC VLSI co-processors for real-time high-quality H.264/AVC video coding

OPENALEX - Publications

Sergio Saponara Maurizio Martina Michele Casula Luca Fanucci Guido Masera

10.1016/j.micpro.2010.06.003 article EN Microprocessors and Microsystems 2010-06-17

A High-Entropy True Random Number Generator with Keccak Conditioning for FPGA

OPENALEX - Publications

Valeria Piscopo Alessandra Dolmeta Mattia Mirigaldi Maurizio Martina Guido Masera

Any cryptographic system strongly relies on randomness to ensure robust encryption and masking methods. True Random Number Generators play a fundamental role in this context. The National Institute of Standards Technology (NIST) the Bundesamt für Sicherheit der Informationstechnik (BSI) provide guidelines for designing reliable entropy sources fuel Bit Generators. This work presents highly parameterized, open-source implementation TRNG based ring oscillators, complemented by an optimized...

10.3390/s25061678 article EN cc-by Sensors 2025-03-08

Scalable and RISC-V Programmable Near-Memory Computing Architectures for Edge Nodes

OPENALEX - Publications

Michele Caon Clément Choné Pasquale Davide Schiavone Alexandre Levisse Guido Masera and 2 more

10.1109/tetc.2025.3555869 article EN IEEE Transactions on Emerging Topics in Computing 2025-01-01

TYRCA: A RISC-V Tightly-Coupled Accelerator for Code-Based Cryptography

OPENALEX - Publications

Alessandra Dolmeta Stefano Di Matteo Emanuele Valea Mikael Carmona Antoine Loiseau and 2 more

10.23919/date64628.2025.10993202 article EN 2025-03-31

Architectural strategies for low-power VLSI turbo decoders

OPENALEX - Publications

Guido Masera Marco Mazza Gianluca Piccinini F. Viglione Maurizio Zamboni

The use of "turbo codes" has been proposed for several applications, including the development wireless systems, where highly reliable transmission is required at very low signal-to-noise ratios (SNR). problem extracting best coding gains from these kind codes deeply investigated in last years. Also hardware implementation turbo a challenging topic, mainly due to iterative nature decoding process, which demands an operating frequency much higher than data rate; case design constraints became...

10.1109/tvlsi.2002.1043330 article EN IEEE Transactions on Very Large Scale Integration (VLSI) Systems 2002-06-01

Turbo NOC: A Framework for the Design of Network-on-Chip-Based Turbo Decoder Architectures

OPENALEX - Publications

Maurizio Martina Guido Masera

This paper proposes a general framework for the design and simulation of network-on-chip-based turbo decoder architectures. Several parameters in space are investigated, namely, network topology, parallelism degree, rate at which messages sent by processing nodes over network, routing strategy. The main results this analysis as follows: 1) most suited topologies to achieve high throughput with limited complexity overhead generalized de Bruijn Kautz 2) depending on requirements, different...

10.1109/tcsi.2010.2046257 article EN IEEE Transactions on Circuits and Systems I Regular Papers 2010-05-10

Comparative Study of Keccak SHA-3 Implementations

OPENALEX - Publications

Alessandra Dolmeta Maurizio Martina Guido Masera

This paper conducts an extensive comparative study of state-of-the-art solutions for implementing the SHA-3 hash function. SHA-3, a pivotal component in modern cryptography, has spawned numerous implementations across diverse platforms and technologies. research aims to provide valuable insights into selecting optimizing Keccak implementations. Our encompasses in-depth analysis hardware, software, software–hardware (hybrid) solutions. We assess strengths, weaknesses, performance metrics each...

10.3390/cryptography7040060 article EN cc-by Cryptography 2023-11-20

Coming Soon ...