- Parallel Computing and Optimization Techniques
- Advanced Data Storage Technologies
- Online Learning and Analytics
- Embedded Systems Design Techniques
- Distributed and Parallel Computing Systems
- Intelligent Tutoring Systems and Adaptive Learning
- Digital Rights Management and Security
- Copyright and Intellectual Property
- Interconnection Networks and Systems
- Education and Learning Interventions
- Online and Blended Learning
- Mobile Learning in Education
- Algorithms and Data Compression
- VLSI and Analog Circuit Testing
- Data Stream Mining Techniques
- Integrated Circuits and Semiconductor Failure Analysis
- Gaze Tracking and Assistive Technology
- Distributed systems and fault tolerance
- Radiation Effects in Electronics
- Digital and Traditional Archives Management
- Multimodal Machine Learning Applications
- Educational Games and Gamification
- Education, Safety, and Science Studies
- Cognitive Functions and Memory
- Cloud Computing and Resource Management
Shenzhen University
2024
Kyung Hee University
2023
Intel (United States)
2018-2021
Beijing Wuzi University
2015
National Center for High-Performance Computing
2011
National Chin-Yi University of Technology
2009
Chinese Culture University
2009
University of California, Berkeley
2003-2007
Lawrence Berkeley National Laboratory
2007
Partitioned Global Address Space (PGAS) languages combine the programming convenience of shared memory with locality and performance control message passing. One such language, Unified Parallel C (UPC) is an extension ISO defined by a consortium that boasts multiple proprietary open source compilers. Another PGAS Titanium, dialect JavaTM designed for high scientific computation. In this paper we describe some highlights two related projects, Titanium project centered at U.C. Berkeley UPC...
Transactional memory offers significant advantages for concurrency control compared to locks. This paper presents the design and implementation of transactional constructs in an unmanaged language. Unmanaged languages pose a unique set challenges - example, lack type safety, use function pointers, aliasing local variables, others. describes novel compiler runtime mechanisms that address these optimize performance transactions environment. We have implemented production-quality C...
Unified Parallel C (UPC) is a parallel language that uses Single Program Multiple Data (SPMD) model of parallelism within global address space. The space used to simplify programming, especially on applications with irregular data structures lead fine-grained sharing between threads. Recent results have shown the performance UPC using commercial compiler comparable MPI [7]. In this paper we describe portable open source for UPC. Our goal achieve similar while enabling easy porting and...
We study learning outcome prediction for online courses. Whereas prior work has focused on semester-long courses with frequent student assessments, we focus short-courses that have single outcomes assigned by instructors at the end. The lack of performance data and generally small enrollments makes behavior learners, captured as they interact course content one another in Social Learning Networks (SLN), essential prediction. Our method defines several (machine) features based processing...
Register allocation is a well-studied problem, but surprisingly little work has been published on assigning registers for GPU architectures. In this paper we present the register allocator in production compiler Intel HD and Iris Graphics. GPUs feature large byte-addressable file organized into banks, an expressive instruction set that supports variable SIMD-sizes divergent control flow, high spill overhead due to relatively long memory latencies. These distinctive characteristics impose...
Cloud computing can reduce mainframe management costs, so more and users choose to build their own cloud hosting environment. In computing, all the commands through network connection, therefore, information security is particularly important. this paper, we will explore types of intrusion detection systems, integration these types, provided an effective output reports, system administrators understand attacks damage quickly. With popularity log files are also increasing rapidly, effect...
The Cray X1 was recently introduced as the first in a new line of parallel systems to combine high-bandwidth vector processing with an MPP system architecture. Alongside capabilities such automatic fine-grained data parallelism through use instructions, offers hardware support for transparent global-address space (GAS), which makes it interesting target GAS languages. In this paper, we describe our experience developing portable, open-source and high performance compiler Unified Parallel C...
In this paper we motivate the explicit validation of hold-time violations in silicon and propose a method for doing so. New failure model test pattern generation methodologies are defined.We outline conditions under which these tests can be applied reliably. We present results applying patterns on microprocessor discuss implications intermittent failures relevance hazards during timing analysis.
With increasing general purpose programming capability, GPUs have become the mainstay for a wide variety of compute intensive tasks from cloud to edge computing. Because its availability on nearly every desktop and mobile processor that Intel ships, integrated GPU offers plethora opportunities researchers application developers make significant real-world impact. In this paper we present Graphics Compiler (IGC), LLVM-based production compiler HD Iris graphics. IGC supports all major graphics...
We study learning outcome prediction for online courses. Whereas prior work has focused on semester-long courses with frequent student assessments, we focus short-courses that have single outcomes assigned by instructors at the end. The lack of performance data makes behavior learners, captured as they interact course content and one another in Social Learning Networks (SLN), essential prediction. Our method defines several (machine) features based behaviors collected modes (human) a course,...
With increasing general purpose programming capability, GPUs have become the mainstay for a wide variety of compute intensive tasks from cloud to edge computing. Because its availability on nearly every desktop and mobile processor that Intel ships, integrated GPU offers plethora opportunities researchers application developers make significant real-world impact. In this paper we present Graphics Compiler (IGC), LLVM-based production compiler HD Iris graphics. IGC supports all major graphics...
In this paper we motivate the explicit validation of hold-time violations in silicon and propose a method for doing so. New failure model test pattern generation methodologies are defined. We outline conditions under which these tests can be applied reliably. present results applying patterns on microprocessor discuss implications intermittent failures relevance hazards during timing analysis
The SIMT execution model is commonly used for general GPU development. CUDA and OpenCL developers write scalar code that implicitly parallelized by compiler hardware. On Intel GPUs, however, this abstraction has profound performance implications as the underlying ISA SIMD important hardware capabilities cannot be fully utilized. To close gap we introduce C- For- Metal (CM), an explicit programming framework designed to deliver close-to-the-metal on GPUs. CM language its vector/matrix types...
Register allocation is a well-studied problem, but surprisingly little work has been published on assigning registers for GPU architectures. In this paper we present the register allocator in production compiler Intel HD and Iris Graphics. GPUs feature large byte-addressable file organized into banks, an expressive instruction set that supports variable SIMD-sizes divergent control flow, high spill overhead due to relatively long memory latencies. These distinctive characteristics impose...
Due to the advantages of acoustic sensing such as device ubiquity, hands-free interaction and privacy security, acoustic-based gesture input techniques have gained extensive at-tention many excellent works been proposed. However, these following shortcomings: high cost system construction, non-continuous input, degraded performance in cross-user scenarios. To overcome above shortcomings, we propose UltraWrite, a continuous that needs rather low construction cost, supports achieves...
The increasing popularity of e-learning has created demand for improving online education through techniques such as predictive analytics and content recommendations. In this paper, we study learner outcome predictions, i.e., predictions how they will perform at the end a course. We propose novel Two Branch Decision Network performance prediction that incorporates two important factors: learners progress course progresses combine clickstream features which log every action takes while...
The Puppet King website includes a large number of digital video and audio materials Taiwan's traditional puppet shows. These include pictures, videos,music right management system, which protects those from been infringed. This system can effectively protect contents using the Role-based Access Control mechanism, controls users' requests for accessing data.
3G mobile devices will make the digital content becomes more compact and portable. The can be carried out at anytime, anywhere. In this study, we build system for portable platform of 3G, which readily achieved. contents read on a walk, students use video calls online discussed with their teachers anytime.
Continuing education is indispensable in the national scientific quality improvement, and socio-economic development.One of main functions local college continuing to cultivate practical compound skilled personnel for economic development. BOTTLENECKS IN THE DEVELOPMENT OF LOCAL COLLEGE CONTINUING EDUCATION The recognition not highContinuing gets sustained attention in."The outline National Medium long-term reform development (2010-2020)" puts forward that "We need promote different levels...
MOOC is an open online education model, with free registration and massive learning resources sharing, which essentially different from the traditional network teaching.In continuing education, participant groups are more complex diverse, for they vary not only in age, occupation, but also ability, time, demand other aspects.As a new form of remarkable flexibility effectiveness, can better meet individual diversified bring development opportunities to strategic transformation our country as well.