NFDI4DS | UHH-SEMS - Publication Details

Bruno Martins

ORCID: 0000-0002-3856-2936

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5055101594

Research Areas

Natural Language Processing Techniques
Geographic Information Systems Studies
Topic Modeling
Data Management and Algorithms
Semantic Web and Ontologies
Web Data Mining and Analysis
Multimodal Machine Learning Applications
Advanced Text Analysis Techniques
Advanced Image and Video Retrieval Techniques
Human Mobility and Location-Based Analysis
Sentiment Analysis and Opinion Mining
Data-Driven Disease Surveillance
Information Retrieval and Search Behavior
Biomedical Text Mining and Ontologies
Data Quality and Management
Expert finding and Q&A systems
Data Visualization and Analytics
Domain Adaptation and Few-Shot Learning
Automated Road and Building Extraction
Motivation and Self-Concept in Sports
Recommender Systems and Techniques
Mobile Crowdsensing and Crowdsourcing
Text and Document Classification Technologies
Speech and dialogue systems
Context-Aware Activity Recognition Systems

University of Lisbon
2016-2025

Instituto de Engenharia de Sistemas e Computadores Investigação e Desenvolvimento
2016-2025

Instituto Superior Técnico
2012-2024

Artificial Intelligence in Medicine (Canada)
2024

Instituto Politécnico de Lisboa
2007-2023

Institute for Systems Engineering and Computers
2015-2023

Universidade Estadual Paulista (Unesp)
2022-2023

University of Copenhagen
2021-2023

Universidade Federal do Pará
2023

Universitat de les Illes Balears
2020-2022

Predicting future locations with hidden Markov models

OPENALEX - Publications

Wesley Mathew Ruben Raposo Bruno Martins

The analysis of human location histories is currently getting an increasing attention, due to the widespread usage geopositioning technologies such as GPS, and also online location-based services that allow users share this information. Tasks prediction movement can be addressed through these data, in turn offering support for more advanced applications, adaptive mobile with proactive context-based functions. This paper presents hybrid method predicting mobility on basis Hidden Markov Models...

10.1145/2370216.2370421 article EN 2012-09-05

Smallcap: Lightweight Image Captioning Prompted with Retrieval Augmentation

OPENALEX - Publications

Rita Ramos Bruno Martins Desmond Elliott Yova Kementchedjhieva

Recent advances in image captioning have focused on scaling the data and model size, substantially increasing cost of pretraining finetuning. As an alternative to large models, we present Smallcap, which generates a caption conditioned input related captions retrieved from datastore. Our is lightweight fast train, as only learned parameters are newly introduced cross-attention layers between pre-trained CLIP encoder GPT-2 decoder. Smallcap can transfer new domains without additional...

10.1109/cvpr52729.2023.00278 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Automated Geocoding of Textual Documents: A Survey of Current Approaches

OPENALEX - Publications

F. Galvão de Melo Bruno Martins

Abstract This survey article describes previous research addressing text‐based document geocoding, i.e. the task of predicting geospatial coordinates latitude and longitude, that best correspond to an entire document, based on its textual contents. We describe (1) early geocoding systems use heuristics over place names mentioned in text (e.g. cities states), (2) probabilistic language modeling approaches, where generative models are built for different regions world (usually considering a...

10.1111/tgis.12212 article EN Transactions in GIS 2016-06-17

Large Language Models for Captioning and Retrieving Remote Sensing Images

OPENALEX - Publications

João Daniel Silva João Avelar Magalhães Devis Tuia Bruno Martins

Image captioning and cross-modal retrieval are examples of tasks that involve the joint analysis visual linguistic information. In connection to remote sensing imagery, these can help non-expert users in extracting relevant Earth observation information for a variety applications. Still, despite some previous efforts, development application vision language models domain have been hindered by relatively small size available datasets used studies. this work, we propose RS-CapRet, Vision...

10.48550/arxiv.2402.06475 preprint EN arXiv (Cornell University) 2024-02-09

Adding geographic scopes to web resources

OPENALEX - Publications

Mário J. Silva Bruno Martins Marcírio Silveira Chaves Ana Paula Afonso Nuno Cardoso

10.1016/j.compenvurbsys.2005.08.003 article EN Computers Environment and Urban Systems 2006-05-09

Toponym matching through deep neural networks

OPENALEX - Publications

Rui Santos Patricia Murrieta‐Flores Pável Calado Bruno Martins

Toponym matching, i.e. pairing strings that represent the same real-world location, is a fundamental problemfor several practical applications. The current state-of-the-art relies on string similarity metrics, either specifically developed for matching place names or integrated within methods combine multiple metrics. However, these all rely common sub-strings in order to establish similarity, and they do not effectively capture character replacements involved toponym changes due...

10.1080/13658816.2017.1390119 article EN International Journal of Geographical Information Science 2017-10-31

Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text

OPENALEX - Publications

Francisco J. Duarte Bruno Martins Cátia Sousa Pinto Mário J. Silva

10.1016/j.jbi.2018.02.011 article EN publisher-specific-oa Journal of Biomedical Informatics 2018-02-26

Symbolic and subsymbolic GeoAI: Geospatial knowledge graphs and spatially explicit machine learning

OPENALEX - Publications

Gengchen Mai Yingjie Hu Song Gao Ling Cai Bruno Martins and 3 more

10.1111/tgis.13012 article EN Transactions in GIS 2022-12-01

Retrieval-augmented Image Captioning

OPENALEX - Publications

Rita Ramos Desmond Elliott Bruno Martins

Inspired by retrieval-augmented language generation and pretrained Vision Language (V&L) encoders, we present a new approach to image captioning that generates sentences given the input set of captions retrieved from datastore, as opposed alone. The encoder in our model jointly processes using V&L BERT, while decoder attends multimodal representations, benefiting extra textual evidence captions. Experimental results on COCO dataset show can be effectively formulated this perspective. Our...

10.18653/v1/2023.eacl-main.266 article EN cc-by 2023-01-01

Language identification in web pages

OPENALEX - Publications

Bruno Martins Mário J. Silva

This paper discusses the problem of automatically identifying language a given Web document. Previous experiments in guessing focused on analyzing "coherent" text sentences, whereas this work was validated texts from Web, often presenting harder problems. Our "guessing" software uses well-known n-gram based algorithm, complemented with heuristics and new similarity measure. Both fast robust, has been use for past two years, as part crawler search engine. Experiments show that it achieves...

10.1145/1066677.1066852 article EN 2005-03-13

Semi-Supervised Bootstrapping of Relationship Extractors with Distributional Semantics

OPENALEX - Publications

David S. Batista Bruno Martins Mário J. Silva

Semi-supervised bootstrapping techniques for relationship extraction from text iteratively expand a set of initial seed relationships while limiting the semantic drift.We research using word embeddings to find similar relationships.Experimental results show that relying on achieves better performance task extracting four types collection newswire documents when compared with baseline TF-IDF relationships.

10.18653/v1/d15-1056 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2015-01-01

Using machine learning methods for disambiguating place references in textual documents

OPENALEX - Publications

João Santos Ivo Anastácio Bruno Martins

10.1007/s10708-014-9553-y article EN GeoJournal 2014-05-06

Ensemble Named Entity Recognition (NER): Evaluating NER Tools in the Identification of Place Names in Historical Corpora

OPENALEX - Publications

Miguel Won Patricia Murrieta‐Flores Bruno Martins

The field of Spatial Humanities has advanced substantially in the past years. identification and extraction toponyms spatial information mentioned historical text collections allowed its use innovative ways, making possible application analysis mapping these places with Geographic Information Systems. For instance, automated place name is nowadays Named Entity Recognition (NER) systems. Statistical NER methods based on supervised learning, particular, are highly successful modern datasets....

10.3389/fdigh.2018.00002 article EN Frontiers in Digital Humanities 2018-03-09

Assessing flood severity from crowdsourced social media photos with deep neural networks

OPENALEX - Publications

Jorge Pereira João Monteiro Joel Silva Jacinto Estima Bruno Martins

10.1007/s11042-020-09196-8 article EN Multimedia Tools and Applications 2020-07-12

Multimorbidity in Heart Failure Patients: Application of Machine Learning Algorithms to Predict Imminent Health Outcomes

OPENALEX - Publications

Jorge Cerejo Rui Baeta Simão Gonçalves Bernardo Neves Pedro Sarmento and 5 more

10.5220/0013381800003911 article EN Proceedings of the 15th International Joint Conference on Biomedical Engineering Systems and Technologies 2025-01-01

Indexing and ranking in Geo-IR systems

OPENALEX - Publications

Bruno Martins Mário J. Silva Leonardo Andrade

This paper addresses document indexing and retrieval using geographical location. It discusses possible structures result ranking algorithms, surveying known approaches showing how they can be combined to build an effective Geo-IR system.

10.1145/1096985.1096993 article EN 2005-11-04

Resolving user identities over social networks through supervised learning and rich similarity features

OPENALEX - Publications

André Nunes Pável Calado Bruno Martins

This paper describes an approach for resolving user identifiers in the context of social networks, using techniques from area duplicate record detection [1]. We reduce identity resolution problem into a binary classification task, where goal is to classify pairs as either belonging same person or not. The are represented feature vectors that combine multiple sources similarity (e.g. between profile information, descriptions people's interests, and friend lists). report on thorough evaluation...

10.1145/2245276.2245413 article EN 2012-03-26

Learning to rank academic experts in the DBLP dataset

OPENALEX - Publications

Catarina Moreira Pável Calado Bruno Martins

Expert finding is an information retrieval task that concerned with the search for most knowledgeable people respect to a specific topic, and based on documents describe people's activities. The involves taking user query as input returning list of who are sorted by their level expertise query. Despite recent interest in area, current state-of-the-art techniques lack principled approaches optimally combining different sources evidence. This article proposes two frameworks multiple estimators...

10.1111/exsy.12062 article EN Expert Systems 2013-11-28

Learning to combine multiple string similarity metrics for effective toponym matching

OPENALEX - Publications

Rui Santos Patricia Murrieta‐Flores Bruno Martins

Several tasks related to geographical information retrieval and the sciences involve toponym matching, that is, problem of matching place names share a common referent. In this article, we present results wide-ranging evaluation on performance different string similarity metrics over task. We also report experiments involving usage supervised machine learning for combining multiple metrics, which has natural advantage avoiding manual tuning thresholds. Experiments with very large dataset...

10.1080/17538947.2017.1371253 article EN International Journal of Digital Earth 2017-09-06

INESC-ID: A Regression Model for Large Scale Twitter Sentiment Lexicon Induction

OPENALEX - Publications

Silvio Amir Ramón Fernández Astudillo Ling Wang Bruno Martins Mário J. Silva and 1 more

We present the approach followed by INESC-ID in SemEval 2015 Twitter Sentiment Analysis challenge, subtask E. The goal was to determine strength of association terms with positive sentiment.Using two labeled lexicons, we trained a regression model predict sentiment polarity and intensity words phrases.Terms were represented as word embeddings induced an unsupervised fashion from corpus tweets.Our system attained top ranking submission, attesting general adequacy proposed approach.

10.18653/v1/s15-2102 article EN cc-by Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022) 2015-01-01

Using Neural Encoder-Decoder Models With Continuous Outputs for Remote Sensing Image Captioning

OPENALEX - Publications

Rita Parada Ramos Bruno Martins

Remote sensing image captioning involves generating a concise textual description for an input aerial image. The task has received significant attention, and several recent proposals are based on neural encoder-decoder models. Most previous methods trained to generate discrete outputs corresponding word tokens that match the reference sentences word-by-word, thereby optimizing generation locally at token-level instead of globally sentence-level. This paper explores alternative method...

10.1109/access.2022.3151874 article EN cc-by IEEE Access 2022-01-01

A metadata geoparsing system for place name recognition and resolution in metadata records

OPENALEX - Publications

Nuno Freire José Borbinha Pável Calado Bruno Martins

This paper describes an approach for performing recognition and resolution of place names mentioned over the descriptive metadata records typical digital libraries. Our exploits evidence provided by existing structured attributes within to support name resolution, in order achieve better results than just using lexical from textual values these attributes. In records, is very often insufficient this task, since short sentences simple expressions are predominant. implementation uses a...

10.1145/1998076.1998140 article EN 2011-06-13

Coming Soon ...