Wei-Yu Chen

ORCID: 0000-0001-8922-3988
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Parallel Computing and Optimization Techniques
  • Advanced Data Storage Technologies
  • Online Learning and Analytics
  • Embedded Systems Design Techniques
  • Distributed and Parallel Computing Systems
  • Intelligent Tutoring Systems and Adaptive Learning
  • Digital Rights Management and Security
  • Copyright and Intellectual Property
  • Interconnection Networks and Systems
  • Education and Learning Interventions
  • Online and Blended Learning
  • Mobile Learning in Education
  • Algorithms and Data Compression
  • VLSI and Analog Circuit Testing
  • Data Stream Mining Techniques
  • Integrated Circuits and Semiconductor Failure Analysis
  • Gaze Tracking and Assistive Technology
  • Distributed systems and fault tolerance
  • Radiation Effects in Electronics
  • Digital and Traditional Archives Management
  • Multimodal Machine Learning Applications
  • Educational Games and Gamification
  • Education, Safety, and Science Studies
  • Cognitive Functions and Memory
  • Cloud Computing and Resource Management

Shenzhen University
2024

Kyung Hee University
2023

Intel (United States)
2018-2021

Beijing Wuzi University
2015

National Center for High-Performance Computing
2011

National Chin-Yi University of Technology
2009

Chinese Culture University
2009

University of California, Berkeley
2003-2007

Lawrence Berkeley National Laboratory
2007

Partitioned Global Address Space (PGAS) languages combine the programming convenience of shared memory with locality and performance control message passing. One such language, Unified Parallel C (UPC) is an extension ISO defined by a consortium that boasts multiple proprietary open source compilers. Another PGAS Titanium, dialect JavaTM designed for high scientific computation. In this paper we describe some highlights two related projects, Titanium project centered at U.C. Berkeley UPC...

10.1145/1278177.1278183 article EN 2007-07-27

Transactional memory offers significant advantages for concurrency control compared to locks. This paper presents the design and implementation of transactional constructs in an unmanaged language. Unmanaged languages pose a unique set challenges - example, lack type safety, use function pointers, aliasing local variables, others. describes novel compiler runtime mechanisms that address these optimize performance transactions environment. We have implemented production-quality C...

10.1109/cgo.2007.4 article EN 2007-03-01

Unified Parallel C (UPC) is a parallel language that uses Single Program Multiple Data (SPMD) model of parallelism within global address space. The space used to simplify programming, especially on applications with irregular data structures lead fine-grained sharing between threads. Recent results have shown the performance UPC using commercial compiler comparable MPI [7]. In this paper we describe portable open source for UPC. Our goal achieve similar while enabling easy porting and...

10.1145/782814.782825 article EN 2003-06-23

We study learning outcome prediction for online courses. Whereas prior work has focused on semester-long courses with frequent student assessments, we focus short-courses that have single outcomes assigned by instructors at the end. The lack of performance data and generally small enrollments makes behavior learners, captured as they interact course content one another in Social Learning Networks (SLN), essential prediction. Our method defines several (machine) features based processing...

10.1109/tlt.2018.2793193 article EN IEEE Transactions on Learning Technologies 2018-01-15

Register allocation is a well-studied problem, but surprisingly little work has been published on assigning registers for GPU architectures. In this paper we present the register allocator in production compiler Intel HD and Iris Graphics. GPUs feature large byte-addressable file organized into banks, an expressive instruction set that supports variable SIMD-sizes divergent control flow, high spill overhead due to relatively long memory latencies. These distinctive characteristics impose...

10.1145/3168806 article EN 2018-02-24

Cloud computing can reduce mainframe management costs, so more and users choose to build their own cloud hosting environment. In computing, all the commands through network connection, therefore, information security is particularly important. this paper, we will explore types of intrusion detection systems, integration these types, provided an effective output reports, system administrators understand attacks damage quickly. With popularity log files are also increasing rapidly, effect...

10.1109/ccis.2011.6045076 article EN 2011-09-01

The Cray X1 was recently introduced as the first in a new line of parallel systems to combine high-bandwidth vector processing with an MPP system architecture. Alongside capabilities such automatic fine-grained data parallelism through use instructions, offers hardware support for transparent global-address space (GAS), which makes it interesting target GAS languages. In this paper, we describe our experience developing portable, open-source and high performance compiler Unified Parallel C...

10.1145/1006209.1006236 article EN 2004-06-26

In this paper we motivate the explicit validation of hold-time violations in silicon and propose a method for doing so. New failure model test pattern generation methodologies are defined.We outline conditions under which these tests can be applied reliably. We present results applying patterns on microprocessor discuss implications intermittent failures relevance hazards during timing analysis.

10.1145/1146909.1146994 article EN 2006-01-01

With increasing general purpose programming capability, GPUs have become the mainstay for a wide variety of compute intensive tasks from cloud to edge computing. Because its availability on nearly every desktop and mobile processor that Intel ships, integrated GPU offers plethora opportunities researchers application developers make significant real-world impact. In this paper we present Graphics Compiler (IGC), LLVM-based production compiler HD Iris graphics. IGC supports all major graphics...

10.1109/cgo.2019.8661189 article EN 2019-02-01

We study learning outcome prediction for online courses. Whereas prior work has focused on semester-long courses with frequent student assessments, we focus short-courses that have single outcomes assigned by instructors at the end. The lack of performance data makes behavior learners, captured as they interact course content and one another in Social Learning Networks (SLN), essential prediction. Our method defines several (machine) features based behaviors collected modes (human) a course,...

10.1109/infocom.2017.8057105 article EN IEEE INFOCOM 2022 - IEEE Conference on Computer Communications 2017-05-01

With increasing general purpose programming capability, GPUs have become the mainstay for a wide variety of compute intensive tasks from cloud to edge computing. Because its availability on nearly every desktop and mobile processor that Intel ships, integrated GPU offers plethora opportunities researchers application developers make significant real-world impact. In this paper we present Graphics Compiler (IGC), LLVM-based production compiler HD Iris graphics. IGC supports all major graphics...

10.5555/3314872.3314902 article EN Symposium on Code Generation and Optimization 2019-02-16

In this paper we motivate the explicit validation of hold-time violations in silicon and propose a method for doing so. New failure model test pattern generation methodologies are defined. We outline conditions under which these tests can be applied reliably. present results applying patterns on microprocessor discuss implications intermittent failures relevance hazards during timing analysis

10.1109/dac.2006.229279 article EN Proceedings - ACM IEEE Design Automation Conference 2006-01-01

The SIMT execution model is commonly used for general GPU development. CUDA and OpenCL developers write scalar code that implicitly parallelized by compiler hardware. On Intel GPUs, however, this abstraction has profound performance implications as the underlying ISA SIMD important hardware capabilities cannot be fully utilized. To close gap we introduce C- For- Metal (CM), an explicit programming framework designed to deliver close-to-the-metal on GPUs. CM language its vector/matrix types...

10.1109/cgo51591.2021.9370324 article EN 2021-02-27

Register allocation is a well-studied problem, but surprisingly little work has been published on assigning registers for GPU architectures. In this paper we present the register allocator in production compiler Intel HD and Iris Graphics. GPUs feature large byte-addressable file organized into banks, an expressive instruction set that supports variable SIMD-sizes divergent control flow, high spill overhead due to relatively long memory latencies. These distinctive characteristics impose...

10.1145/3179541.3168806 article EN 2018-01-01

Due to the advantages of acoustic sensing such as device ubiquity, hands-free interaction and privacy security, acoustic-based gesture input techniques have gained extensive at-tention many excellent works been proposed. However, these following shortcomings: high cost system construction, non-continuous input, degraded performance in cross-user scenarios. To overcome above shortcomings, we propose UltraWrite, a continuous that needs rather low construction cost, supports achieves...

10.1109/percom59722.2024.10494485 article EN 2024-03-11

The increasing popularity of e-learning has created demand for improving online education through techniques such as predictive analytics and content recommendations. In this paper, we study learner outcome predictions, i.e., predictions how they will perform at the end a course. We propose novel Two Branch Decision Network performance prediction that incorporates two important factors: learners progress course progresses combine clickstream features which log every action takes while...

10.48550/arxiv.2001.08328 preprint EN other-oa arXiv (Cornell University) 2020-01-01

The Puppet King website includes a large number of digital video and audio materials Taiwan's traditional puppet shows. These include pictures, videos,music right management system, which protects those from been infringed. This system can effectively protect contents using the Role-based Access Control mechanism, controls users' requests for accessing data.

10.1109/ncm.2009.398 article EN 2009-01-01

3G mobile devices will make the digital content becomes more compact and portable. The can be carried out at anytime, anywhere. In this study, we build system for portable platform of 3G, which readily achieved. contents read on a walk, students use video calls online discussed with their teachers anytime.

10.1145/1655925.1655999 article EN 2009-11-24

Continuing education is indispensable in the national scientific quality improvement, and socio-economic development.One of main functions local college continuing to cultivate practical compound skilled personnel for economic development. BOTTLENECKS IN THE DEVELOPMENT OF LOCAL COLLEGE CONTINUING EDUCATION The recognition not highContinuing gets sustained attention in."The outline National Medium long-term reform development (2010-2020)" puts forward that "We need promote different levels...

10.2991/ssemse-15.2015.147 article EN cc-by-nc 2015-01-01

MOOC is an open online education model, with free registration and massive learning resources sharing, which essentially different from the traditional network teaching.In continuing education, participant groups are more complex diverse, for they vary not only in age, occupation, but also ability, time, demand other aspects.As a new form of remarkable flexibility effectiveness, can better meet individual diversified bring development opportunities to strategic transformation our country as well.

10.2991/ssemse-15.2015.275 article EN cc-by-nc 2015-01-01
Coming Soon ...