- Topic Modeling
- Natural Language Processing Techniques
- Computational Drug Discovery Methods
- Biomedical Text Mining and Ontologies
- Machine Learning in Healthcare
- Sentiment Analysis and Opinion Mining
- Pharmacovigilance and Adverse Drug Reactions
- Semantic Web and Ontologies
- Bioinformatics and Genomic Networks
- Software Engineering Research
- Gene Expression and Cancer Classification
- Advanced Text Analysis Techniques
- Molecular Biology Techniques and Applications
- Multi-Agent Systems and Negotiation
- Medical Coding and Health Information
- Mental Health via Writing
- Evolutionary Psychology and Human Behavior
- Intelligent Tutoring Systems and Adaptive Learning
- Chaos-based Image/Signal Encryption
- Machine Learning and Data Classification
- Advanced Steganography and Watermarking Techniques
- Mathematics Education and Pedagogy
- English Language Learning and Teaching
- Neuroscience and Music Perception
- Multimodal Machine Learning Applications
University of Edinburgh
2023-2025
Epigenetics and Cell Fate
2022-2023
Medigene (Germany)
2022
Binus University
2017-2021
Abstract. Background: Variability in datasets is not only the product of biological processes: it is also the product of technical biases. ComBat and ComBat-Seq are among the most widely used tools for correcting those biases, called batch effects, in microarray and RNA-Seq expression data, respectively. Results: In this note, we present a new Python implementation of ComBat-Seq. While the mathematical framework is strictly the same, we show here that our implementations: (i) have similar results in terms of batch effects correction; (ii) are as...
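The empirical-Bayes model behind ComBat is beyond a short sketch, but the basic idea of batch-effect correction (aligning each batch's per-gene location and scale to the pooled values) can be illustrated in a few lines. The function below is an illustrative toy, not the ComBat/ComBat-Seq algorithm:

```python
import numpy as np

def center_scale_by_batch(expr, batches):
    """Toy location-scale batch adjustment: align each batch's per-gene
    mean and standard deviation to the pooled (grand) values.
    Illustrates the idea of batch-effect correction; this is NOT the
    empirical-Bayes ComBat/ComBat-Seq model.

    expr: genes x samples matrix; batches: per-sample batch labels."""
    expr = np.asarray(expr, dtype=float)
    batches = np.asarray(batches)
    out = np.empty_like(expr)
    grand_mean = expr.mean(axis=1, keepdims=True)
    grand_std = expr.std(axis=1, keepdims=True)
    for b in np.unique(batches):
        cols = batches == b
        mu = expr[:, cols].mean(axis=1, keepdims=True)
        sd = expr[:, cols].std(axis=1, keepdims=True)
        sd[sd == 0] = 1.0  # avoid division by zero for flat genes
        out[:, cols] = (expr[:, cols] - mu) / sd * grand_std + grand_mean
    return out
```

After this adjustment, every batch shares the same per-gene mean, so a constant shift added to one batch is removed.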
Abstract. Objectives: The aim of this study was to investigate GPT-3.5 in generating and coding medical documents with International Classification of Diseases (ICD)-10 codes for data augmentation on low-resource labels. Materials and Methods: Employing GPT-3.5, we generated and coded 9606 discharge summaries based on lists of ICD-10 code descriptions of patients with infrequent (generation) codes within the MIMIC-IV dataset. Combined with the baseline training set, this formed an augmented training set. Neural coding models were trained and evaluated on a test set. We...
The impressive performance of modern Large Language Models (LLMs) across a wide range of tasks, along with their often non-trivial errors, has garnered unprecedented attention regarding the potential of AI and its impact on everyday life. While considerable effort has been and continues to be dedicated to overcoming the limitations of current models, the potentials and risks of human-LLM collaboration remain largely underexplored. In this perspective, we argue that enhancing the focus on human-LLM interaction should be a primary target for future...
Understanding time from visual representations is a fundamental cognitive skill, yet it remains a challenge for multimodal large language models (MLLMs). In this work, we investigate the capabilities of MLLMs in interpreting time and date through analogue clocks and yearly calendars. To facilitate this, we curated a structured dataset comprising two subsets: 1) ClockQA, which comprises various types of clock styles (standard, black-dial, no-second-hand, Roman numeral, and arrow-hand clocks) paired...
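The geometry that analogue-clock reading rests on can be made concrete with a small sketch. The helper below is a hypothetical illustration (not from the paper) that converts hand angles, measured clockwise from 12, into a time:

```python
def hands_to_time(hour_angle_deg, minute_angle_deg):
    """Convert analogue-clock hand angles (degrees clockwise from 12)
    to (hour, minute). The minute hand moves 6 degrees per minute; the
    hour hand moves 30 degrees per hour plus 0.5 degrees per minute."""
    minute = round(minute_angle_deg / 6) % 60
    # Remove the minute hand's contribution before reading the hour.
    hour = round((hour_angle_deg - minute * 0.5) / 30) % 12
    return (hour or 12, minute)  # report 0 o'clock as 12
```

For example, an hour hand at 195 degrees with a minute hand at 180 degrees reads 6:30.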
Large language models (LLMs) remain prone to factual inaccuracies and computational errors, including hallucinations and mistakes in mathematical reasoning. Recent work has augmented LLMs with tools to mitigate these shortcomings, but it often requires curated gold tool-use demonstrations. In this paper, we investigate whether LLMs can learn to use tools without demonstrations. First, we analyse zero-shot prompting strategies to guide LLMs in tool utilisation. Second, we propose a self-training method to synthesise tool-use traces using the LLM itself. We compare...
Large Language Models (LLMs) have transformed the Natural Language Processing (NLP) landscape with their remarkable ability to understand and generate human-like text. However, these models are prone to "hallucinations": outputs that do not align with factual reality or the input context. This paper introduces the Hallucinations Leaderboard, an open initiative to quantitatively measure and compare the tendency of each model to produce hallucinations. The leaderboard uses a comprehensive set of benchmarks focusing on different...
Large Language Models (LLMs) often hallucinate, producing unfaithful or factually incorrect outputs by misrepresenting the provided context or incorrectly recalling internal knowledge. Recent studies have identified specific attention heads within the Transformer architecture, known as retrieval heads, responsible for extracting relevant contextual information. We hypothesise that masking these retrieval heads can induce hallucinations and that contrasting the outputs of the base LLM and the masked LLM can reduce hallucinations. To this end, we...
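Contrasting a base model with a deliberately degraded variant can be sketched at the logit level. The scoring rule below is a generic contrastive-decoding sketch (amplify what the base model prefers relative to the degraded one); the exact formulation used in the paper may differ:

```python
def contrastive_next_token(logp_base, logp_masked, alpha=1.0):
    """Generic contrastive-decoding score over the vocabulary:
    score_i = (1 + alpha) * logp_base_i - alpha * logp_masked_i.
    logp_masked could come from, e.g., a model with retrieval heads
    masked; tokens the degraded model likes are penalised."""
    return [(1 + alpha) * b - alpha * m
            for b, m in zip(logp_base, logp_masked)]
```

In a two-token toy vocabulary where the base model prefers token 0 and the masked model prefers token 1, the contrast sharpens the preference for token 0.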
Large language models (LLMs) can store a significant amount of factual knowledge in their parameters. However, this parametric knowledge may conflict with the information provided in the context; this phenomenon, known as context-memory conflict, can lead to undesirable model behaviour, such as reliance on outdated or incorrect information. Analysing the internal activations of LLMs, we find that they internally register signals of knowledge conflict at mid-layers. Such signals allow us to detect whether a knowledge conflict occurs and to use inference-time...
Adapting pretrained language models to novel domains, such as clinical applications, traditionally involves retraining their entire set of parameters. However, this approach is increasingly proven to be impractical owing to the substantial computational requirements associated with training such large models. To address this issue, Parameter-Efficient Fine-Tuning (PEFT) techniques offer a viable solution by selectively fine-tuning a small subset of additional parameters, significantly reducing the computational requirements for domain...
Abstract. Argumentation mining is a research field which focuses on classifying sentences by their type of argumentation. Argumentative sentences are often used in daily communication and have an important role in every decision-making or conclusion-making process. The objective of this research is to observe the use of deep learning combined with an attention mechanism for argument annotation and analysis. Argument component classification assigns sentences from a certain discourse to several classes. The classes include major claim, premise, and non-argumentative. The analysis points...
We propose a novel approach to a dataset of argumentation relations. This task is intended to analyze the presence of a support relation between two sentences. To be able to identify relations between sentences or arguments, one is obliged to understand the nuance brought by both of them. Our models are modifications of siamese network architectures, in which we replace the feature extractor with a Long Short-Term Memory network and implement cosine distance as the energy function. The models take a pair of sentences as their input and try to determine whether a support relation holds between them or not. The primary...
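The cosine-distance energy function is simple to state. In the paper the two embeddings would come from the shared LSTM encoder of the siamese network; in this minimal sketch they are plain vectors:

```python
import math

def cosine_energy(u, v):
    """Cosine distance (1 - cosine similarity) between two embeddings,
    used as a siamese-network energy: low for related pairs, high for
    unrelated ones. u and v are non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)
```

Identical directions give energy 0, orthogonal vectors give 1, and opposite directions give 2, so training pushes supporting pairs toward 0.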
In our current time, the well-being of a person is not only determined by their physical health, but also by their mental health. A lot of focus and effort have been spent on raising awareness of this issue. One such effort comes from the field of computer science, utilizing data from social media to provide additional information for detecting these disorders. In this research, the authors proposed Bidirectional Encoder Representations from Transformers (BERT) with extractive summarization to preprocess data obtained from a popular platform such as Reddit...
Facial attractiveness classification has many uses, including photo editing, beautification, grading, and dataset labeling. While facial attractiveness seems to be related to personal preference, building a robust classifier is not impossible. Several studies have developed facial attractiveness classification systems using convolutional neural networks and achieved satisfactory results. ImageNet pre-trained models have been widely used in face-related research, yet none of them for attractiveness. This study...
Predicting personality is a growing topic in the field of natural language processing. Personality prediction has been shown by previous studies to benefit the development of recommender systems and automated assessments. Additionally, the widespread usage of social media in Indonesia, such as Twitter, serves as a potential source of data for developing such models. Existing models have explored the implementation of both traditional machine learning and deep learning models, with the latter performing better given more data. Despite this, there has not been much...
Objective: To investigate GPT-3.5 in generating and coding medical documents with ICD-10 codes for data augmentation on low-resource labels. Materials and Methods: Employing GPT-3.5, we generated and coded 9,606 discharge summaries based on lists of ICD-10 code descriptions of patients with infrequent (generation) codes within the MIMIC-IV dataset. Combined with the baseline training set, this formed an augmented training set. Neural coding models were trained and evaluated on a test set. We report micro- and macro-F1 scores on the full codeset, the generation codes, and their...
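Micro- and macro-averaged F1 differ in whether true/false positive counts are pooled across all labels before computing F1, or per-label F1 scores are averaged; the distinction matters precisely for the infrequent codes targeted here. A minimal multi-label implementation:

```python
def micro_macro_f1(gold, pred, labels):
    """Micro- and macro-averaged F1 for multi-label coding.
    gold, pred: lists of label sets, one set per document."""
    def f1(tp, fp, fn):
        return 2 * tp / (2 * tp + fp + fn) if tp else 0.0
    tp_all = fp_all = fn_all = 0
    per_label = []
    for lab in labels:
        tp = sum(1 for g, p in zip(gold, pred) if lab in g and lab in p)
        fp = sum(1 for g, p in zip(gold, pred) if lab not in g and lab in p)
        fn = sum(1 for g, p in zip(gold, pred) if lab in g and lab not in p)
        tp_all += tp; fp_all += fp; fn_all += fn
        per_label.append(f1(tp, fp, fn))
    micro = f1(tp_all, fp_all, fn_all)       # pool counts, then F1
    macro = sum(per_label) / len(per_label)  # F1 per label, then mean
    return micro, macro
```

A rare label that is always missed drags macro-F1 down by a full 1/|labels| while barely moving micro-F1, which is why augmentation for low-resource labels is evaluated with both.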
Mathematical reasoning remains a significant challenge for large language models (LLMs), despite progress in prompting techniques such as Chain-of-Thought (CoT). We present Chain of Mathematically Annotated Thought (CoMAT), which enhances reasoning through two stages: Symbolic Conversion (converting natural language queries into symbolic form) and Reasoning Execution (deriving answers from symbolic representations). CoMAT operates entirely with a single LLM and without external solvers. Across four LLMs, CoMAT outperforms...
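The two-stage split (symbolic conversion, then reasoning execution) can be illustrated with a toy arithmetic pipeline. The word-to-operator table and the left-to-right executor below are illustrative assumptions for this sketch, not CoMAT itself, where both stages are performed by the LLM:

```python
import operator

# Illustrative vocabulary mapping words to (symbol, function).
_OPS = {"plus": ("+", operator.add),
        "minus": ("-", operator.sub),
        "times": ("*", operator.mul)}

def symbolic_convert(query):
    """Stage 1 sketch: turn 'What is 10 minus 3 plus 4?' into a
    symbolic expression string plus its token list."""
    tokens = [t for t in query.lower().replace("?", "").split()
              if t.isdigit() or t in _OPS]
    symbols = [t if t.isdigit() else _OPS[t][0] for t in tokens]
    return " ".join(symbols), tokens

def reasoning_execute(tokens):
    """Stage 2 sketch: evaluate the symbols strictly left to right
    (a toy executor with no operator precedence)."""
    acc = int(tokens[0])
    for op, num in zip(tokens[1::2], tokens[2::2]):
        acc = _OPS[op][1](acc, int(num))
    return acc
```

The point of the separation is that the answer is derived from the explicit symbolic form, not free-form text, making the reasoning step checkable.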
Abstract. Accurate in-silico prediction of protein-ligand binding affinity is essential for efficient hit identification in large molecular libraries. Commonly used structure-based methods such as giga-docking often fail to rank compounds effectively, and free energy-based approaches, while accurate, are too computationally intensive for large-scale screening. Existing deep learning models struggle to generalize to new targets or drugs, and current evaluation practices do not reflect real-world performance...
Automatic Music Transcription (AMT) is becoming more and more popular by the day, and it has piqued interest well beyond academic research. A successful AMT system would be able to bridge a wide range of interactions between people and music, including music education. The goal of this research is to transcribe an audio input into notation. The research was conducted by training neural network architectures on different kinds of cases. The evaluation used two approaches, objective and subjective...
The NLI4CT task assesses Natural Language Inference systems in predicting whether hypotheses entail or contradict evidence from Clinical Trial Reports. In this study, we evaluate various Large Language Models (LLMs) with multiple strategies, including Chain-of-Thought, In-Context Learning, and Parameter-Efficient Fine-Tuning (PEFT). We propose a PEFT method to improve the consistency of LLMs by merging adapters that were fine-tuned separately using triplet and language modelling objectives. We found two...