- Semantic Web and Ontologies
- Web Data Mining and Analysis
- Natural Language Processing Techniques
- Topic Modeling
- Data Quality and Management
- Service-Oriented Architecture and Web Services
- Advanced Data Storage Technologies
- Advanced Database Systems and Queries
- Biomedical Text Mining and Ontologies
- Text and Document Classification Technologies
- Caching and Content Delivery
- Business Process Modeling and Analysis
- Advanced Text Analysis Techniques
- Manufacturing Process and Optimization
- Cloud Computing and Resource Management
- Cloud Data Security Solutions
- Web visibility and informetrics
- Blind Source Separation Techniques
- Data Management and Algorithms
- Speech and Audio Processing
- AI-based Problem Solving and Planning
- Data Stream Mining Techniques
- Face and Expression Recognition
- Scientific Computing and Data Management
- Geographic Information Systems Studies
Center for Research and Advanced Studies of the National Polytechnic Institute
2015-2024
Centro de Investigación en Materiales Avanzados
2020-2021
Instituto Politécnico Nacional
2011-2019
Tecnológico Nacional de México
2011-2014
Information Technology Laboratory
2008-2013
National University of Tres de Febrero
2009
Universitat Politècnica de Catalunya
2007
Software (Spain)
2007
Universitat Rovira i Virgili
2003-2004
We provide a comprehensive survey of the research literature that applies Information Extraction techniques in Semantic Web setting.Works intersection these two areas can be seen from overlapping perspectives: using resources (languages/ontologies/knowledge-bases/tools) to improve Extraction, and/or populate Web.In more detail, we focus on extraction and linking three elements: entities, concepts relations.Extraction involves identifying (textual) mentions referring such elements given...
The most common machine-learning methods solve supervised and unsupervised problems based on datasets where the problem’s features belong to a numerical space. However, many often include data categorical coexist, which represents challenge manage them. To transform into numeric form, preprocessing tasks are compulsory. Methods such as one-hot feature-hashing have been widely used encoding approaches at expense of significant increase in dimensionality dataset. This effect introduces...
Indoor navigation systems incorporating augmented reality allow users to locate places within buildings and acquire more knowledge about their environment. However, although diverse works have been introduced with varied technologies, infrastructure, functionalities, a standardization of the procedures for elaborating these has not reached. Moreover, while usually handle contextual information in proprietary formats, platform-independent model is desirable, which would encourage its access,...
The automatic extraction of geospatial information is an important aspect data mining. Computer systems capable discovering geographic from natural language involve a complex process called geoparsing, which includes two tasks: entity recognition and toponym resolution. first task could be approached through machine learning approach, in case model trained to recognize sequence characters (words) corresponding entities. second consists assigning such entities their most likely coordinates....
The Semantic Web provides guidelines for the representation of information about real-world objects (entities) and their relations (properties). This is helpful dissemination consumption by people applications. However, mainly contained within natural language sentences, which do not have a structure or linguistic descriptions ready to be directly processed computers. Thus, challenge identify extract elements that can represented. Hence, this article presents strategy from sentences its with...
Creating effective mechanisms to detect misogyny online automatically represents significant scientific and technological challenges. The complexity of recognizing through computer models lies in the fact that it is a subtle type violence, not always explicitly aggressive, can even hide behind seemingly flattering words, jokes, parodies, other expressions. Currently, difficult have an exact figure for rate misogynistic comments because, unlike types such as physical these events are...
Content delivery and sharing (CDS) is a popular cost effective cloud-based service for organizations to deliver/share contents to/with end-users, partners insider users. This type of improves the data availability I/O performance by producing distributing replicas shared contents. However, such technique increases overhead on storage/network resources. article introduces threefold methodology improve trade-off between capacity utilization cloud storage CDS services. includes: i) Definition...
Integrated health services are characterized by a high degree of collaboration and communication among professionals, as well merge political, administrative, technical actions, which can allow the sharing information healthcare team members (physicians, nurses, managers, other stakeholders) related to patient care, access hospital infrastructure technology, within patient-centered approach. In this paper, we propose technological solution based on software agents, allows supporting...
Information dispersal is a fault-tolerant technique where files of size |F| are split into n redundant pieces |F|/k that dispersed to different servers k suffice for recovering the original file whenever k<;n. This popular solution service providers withstand server failures and improve storage utilization. However, coding/decoding time produced by this as well management heterogeneous size, belong files, represent both challenge deployment on clouds clusters. paper presents design...
Data-driven diabetes research has increased its interest in exploring the heterogeneity of disease, aiming to support development more specific prognoses and treatments within so-called precision medicine. Recently, one these studies found five subgroups with varying risks complications treatment responses. Here, we tackle assessment different models for classifying Type 2 Diabetes (T2DM) subtypes through machine learning approaches, aim providing a performance comparison new insights on...
Most of the information on Web can be currently classified according to its (information) structure in three different forms: unstructured (plain text), semi-structured (XML files) and structured (tables a relational database). Currently search is primary way access massive information. Keyword also becomes an alternative querying over databases XML documents, which simple people who are familiar with use engines. There several approaches perform keyword such as Steiner Trees, Candidate...
File redundancy techniques have been very useful mechanisms for offering fault tolerance and data availability in anykind of storage. Cloud storage is not the exception. This paper presents an evaluation classical file redundancytechniques implemented two cloud-storage deployment models, private hybrid. A small prototype a privateand hybrid cloud was this evaluation. The performance impact when onlyapplied versus also distributed public (the model) analyzed.Additional to techniques,...