NFDI4DS | UHH-SEMS - Publication Details

A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT

OPENALEX - Publications

Jules White Quchen Fu Sam Hays Michael Sandborn Carlos Olea and 4 more

Prompt engineering is an increasingly important skill set needed to converse effectively with large language models (LLMs), such as ChatGPT. Prompts are instructions given LLM enforce rules, automate processes, and ensure specific qualities (and quantities) of generated output. also a form programming that can customize the outputs interactions LLM. This paper describes catalog prompt techniques presented in pattern have been applied solve common problems when conversing LLMs. patterns...

10.48550/arxiv.2302.11382 preprint EN other-oa arXiv (Cornell University) 2023-01-01

ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design

OPENALEX - Publications

Jules White Sam Hays Quchen Fu Jesse Spencer-Smith Douglas C. Schmidt

This paper presents prompt design techniques for software engineering, in the form of patterns, to solve common problems when using large language models (LLMs), such as ChatGPT automate engineering activities, ensuring code is decoupled from third-party libraries and simulating a web application API before it implemented. provides two contributions research on LLMs engineering. First, catalog patterns that classifies according types they solve. Second, explores several have been applied...

10.48550/arxiv.2303.07839 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Towards a Catalog of Prompt Patterns to Enhance the Discipline of Prompt Engineering

OPENALEX - Publications

Douglas C. Schmidt Jesse Spencer-Smith Quchen Fu Jules White

The rapid advent of Large Language Models (LLMs), such as ChatGPT and Claude, is revolutionizing various fields, from education healthcare to the engineering reliable software systems. These LLMs operate through "prompts," which are natural language inputs that users employ query leverage models' capabilities. Given novelty LLMs, understanding how effectively use prompts remains largely anecdotal, based on isolated cases. This fragmented approach limits reliability utility especially when...

10.1145/3672359.3672364 article EN ACM SIGAda Ada Letters 2024-06-06

FastAudio: A Learnable Audio Front-End For Spoof Speech Detection

OPENALEX - Publications

Quchen Fu Zhongwei Teng Jules White Maria Powell Douglas C. Schmidt

Spoof speech can be used to try and fool speaker verification systems that determine the identity of based on voice characteristics. This paper compares popular learnable front-ends this task. We categorize by defining two generic architectures then analyze filtering stages both types in terms learning constraints. pro-pose replacing fixed filterbanks with a layer better adapt anti-spoofing tasks. The proposed FastAudio front-end is tested back-ends measure performance Logical Access track...

10.1109/icassp43922.2022.9746722 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Sketch2Vis: Generating Data Visualizations from Hand-drawn Sketches with Deep Learning

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Douglas C. Schmidt

Data visualization has become a vital tool to help people understand the driving forces behind real-world phenomena. Although learning curve of tools have been reduced, domain experts still often require significant amounts training use them effectively. To reduce this even further, paper proposes Sketch2Vis, novel solution using deep techniques and generate source code for multi-platform data visualizations automatically from hand-drawn sketches provided by experts, which is similar how an...

10.1109/icmla52953.2021.00141 article EN 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) 2021-12-01

SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Maria Powell Douglas C. Schmidt

Research in the past several years has boosted performance of automatic speaker verification systems and countermeasure to deliver low Equal Error Rates (EERs) on each system. However, research joint optimization both is still limited. The Spoofing-Aware Speaker Verification (SASV) 2022 challenge was proposed encourage development integrated SASV with new metrics evaluate model performance. This paper proposes an ensemble-free end-to-end solution, known as Spoof-Aggregated-SASV (SA-SASV)...

10.21437/interspeech.2022-11029 article EN Interspeech 2022 2022-09-16

A multi-stage transfer learning strategy for diagnosing a class of rare laryngeal movement disorders

OPENALEX - Publications

Yu Yao Maria Powell Jules White Jian Feng Quchen Fu and 2 more

10.1016/j.compbiomed.2023.107534 article EN Computers in Biology and Medicine 2023-09-29

NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands

OPENALEX - Publications

Mayank Agarwal Tathagata Chakraborti Quchen Fu David Gros Xi Victoria Lin and 4 more

The NLC2CMD Competition hosted at NeurIPS 2020 aimed to bring the power of natural language processing command line. Participants were tasked with building models that can transform descriptions line tasks in English their Bash syntax. This is a report on competition details task, metrics, data, attempted solutions, and lessons learned.

10.48550/arxiv.2103.02523 preprint EN other-oa arXiv (Cornell University) 2021-01-01

NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation

OPENALEX - Publications

Quchen Fu Zhongwei Teng Marco Georgaklis Jules White Douglas C. Schmidt

Translating natural language into Bash Commands is an emerging research field that has gained attention in recent years. Most efforts have focused on producing more accurate translation models. To the best of our knowledge, only two datasets are available, with one based other. Both involve scraping through known data sources (through platforms like stack overflow, crowdsourcing, etc.) and hiring experts to validate correct either English text or Commands. This paper provides contributions...

10.48550/arxiv.2302.07845 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Maria Powell Douglas C. Schmidt

An emerging trend in audio processing is capturing low-level speech representations from raw waveforms. These have shown promising results on a variety of tasks, such as recognition and separation. Compared to handcrafted features, learning features via backpropagation provides the model greater flexibility how it represents data for different tasks theoretically. However, empirical study shows that, some voice spoof detection, are more competitive than learned features. Instead evaluating...

10.48550/arxiv.2109.02773 preprint EN other-oa arXiv (Cornell University) 2021-01-01

A Transformer-based Approach for Translating Natural Language to Bash Commands

OPENALEX - Publications

Quchen Fu Zhongwei Teng Jules White Douglas C. Schmidt

This paper explores the translation of natural language into Bash Commands, which developers commonly use to accomplish command-line tasks in a terminal. In our approach terminal takes command as sentence plain English and translates it corresponding string Commands. The analyzes performance several architectures on this problem using data from NLC2CMD competition at NeurIPS 2020 conference. presented is best performing architecture date improves current state-of-the-art accuracy task 13.8% 53.2%.

10.1109/icmla52953.2021.00202 article EN 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) 2021-12-01

FastAudio: A Learnable Audio Front-End for Spoof Speech Detection

OPENALEX - Publications

Quchen Fu Zhongwei Teng Jules White Maria Powell Douglas C. Schmidt

Voice assistants, such as smart speakers, have exploded in popularity. It is currently estimated that the speaker adoption rate has exceeded 35% US adult population. Manufacturers integrated identification technology, which attempts to determine identity of person speaking, provide personalized services different members same family. Speaker can also play an important role controlling how used. For example, it not critical correctly identify user when playing music. However, reading user's...

10.48550/arxiv.2109.02774 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Performance Analysis of Line Laser Sensor for Aeroengine Blade Inspection

OPENALEX - Publications

Ruirui Song Li Ran Quchen Fu Cheng Jiang

Turbine blade is one of the most important structural components aeroengine, and it may be broken in process using. So meaningful to inspect blades accurately by optical method. In this paper, a laser scanning sensor designed for inspecting aeronautic casting turbine blade. First, some useful equations parameters performance are obtained through establishment line projection scheme. Second, which affect analyzed. Third, requirements that must obeyed given when designed. Furthermore,...

10.1145/3080845.3080878 article EN 2017-04-07

ARawNet: A Lightweight Solution for Leveraging Raw Waveforms in Spoof Speech Detection

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Maria Powell Douglas C. Schmidt

An emerging trend in audio processing is capturing low-level speech representations from raw waveforms. These have shown promising results on a variety of tasks, such as recognition and separation. Compared to handcrafted features, learning features via backpropagation can potentially provide the model greater flexibility how it represents data for different tasks. However, empirical studies show that, some spoof detection, still currently outperform learned features. Instead evaluating...

10.1109/icpr56361.2022.9956138 article EN 2022 26th International Conference on Pattern Recognition (ICPR) 2022-08-21

NL2CMD: An Updated Workflow for Natural Language to Bash Commands Translation

OPENALEX - Publications

Quchen Fu Zhongwei Teng Marco Georgaklis Jules White Douglas C. Schmidt

Translating natural language into Bash Commands is an emerging research field that has gained attention in recent years. Most efforts have focused on producing more accurate translation models. To the best of our knowledge, only two datasets are available, with one based other. Both involve scraping through known data sources (through platforms like stack overflow, crowdsourcing, etc.) and hiring experts to validate correct either English text or Commands. This paper provides contributions...

10.13052/jmltapissn.2023.002 preprint EN arXiv (Cornell University) 2023-02-15

Real-Time Speech Interruption Analysis: from Cloud to Client Deployment

OPENALEX - Publications

Quchen Fu Szu‐Wei Fu Yaran Fan Yu Wu Zhuo Chen and 2 more

Meetings are an essential form of communication for all types organizations, and remote collaboration systems have been much more widely used since the COVID-19 pandemic. One major issue with meetings is that it challenging participants to interrupt speak. We recently developed first speech interruption analysis model WavLM_SI, which detects failed interruptions, shows very promising performance, being deployed in cloud. To deliver this feature a cost-efficient environment-friendly way, we...

10.1109/icassp49357.2023.10095639 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Deep Learning Models on CPUs: A Methodology for Efficient Training

OPENALEX - Publications

Quchen Fu Ramesh Chukka Keith Achorn Thomas Atta-fosu Deepak R. Canchi and 3 more

GPUs have been favored for training deep learning models due to their highly parallelized architecture. As a result, most studies on optimization focus GPUs. There is often trade-off, however, between cost and efficiency when deciding how choose the proper hardware training. In particular, CPU servers can be beneficial if CPUs was more efficient, as they incur fewer update costs better utilizing existing infrastructure. This paper makes several contributions research using CPUs. First, it...

10.48550/arxiv.2206.10034 preprint EN other-oa arXiv (Cornell University) 2022-01-01

SA-SASV: An End-to-End Spoof-Aggregated Spoofing-Aware Speaker Verification System

OPENALEX - Publications

Zhongwei Teng Quchen Fu Jules White Maria Powell Douglas C. Schmidt

Research in the past several years has boosted performance of automatic speaker verification systems and countermeasure to deliver low Equal Error Rates (EERs) on each system. However, research joint optimization both is still limited. The Spoofing-Aware Speaker Verification (SASV) 2022 challenge was proposed encourage development integrated SASV with new metrics evaluate model performance. This paper proposes an ensemble-free end-to-end solution, known as Spoof-Aggregated-SASV (SA-SASV)...

10.48550/arxiv.2203.06517 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Real-time Speech Interruption Analysis: From Cloud to Client Deployment

OPENALEX - Publications

Quchen Fu Szu‐Wei Fu Yaran Fan Yu Wu Zhuo Chen and 2 more

Meetings are an essential form of communication for all types organizations, and remote collaboration systems have been much more widely used since the COVID-19 pandemic. One major issue with meetings is that it challenging participants to interrupt speak. We recently developed first speech interruption analysis model, which detects failed interruptions, shows very promising performance, being deployed in cloud. To deliver this feature a cost-efficient environment-friendly way, we reduced...

10.48550/arxiv.2210.13334 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Using LSTM Networks and Future Gradient Values to Forecast Heart Rate in Biking

OPENALEX - Publications

Henry Gilbert Jules White Quchen Fu Douglas C. Schmidt

10.5220/0011541800003321 article EN 2022-01-01

Deep Learning Models on CPUs: A Methodology for Efficient Training

OPENALEX - Publications

Quchen Fu Ramesh Chukka Keith Achorn Thomas Atta-fosu Deepak R. Canchi and 3 more

GPUs have been favored for training deep learning models due to their highly parallelized architecture. As a result, most studies on optimization focus GPUs. There is often trade-off, however, between cost and efficiency when deciding how choose the proper hardware training. In particular, CPU servers can be beneficial if CPUs was more efficient, as they incur fewer update costs better utilizing existing infrastructure. This paper makes several contributions research using CPUs. First, it...

10.13052/jmltapissn.2022.003 preprint EN arXiv (Cornell University) 2022-06-20