- Caching and Content Delivery
- Peer-to-Peer Network Technologies
- Topic Modeling
- Web Data Mining and Analysis
- Data Management and Algorithms
- Advanced Image and Video Retrieval Techniques
- Sentiment Analysis and Opinion Mining
- Advanced Text Analysis Techniques
- Recommender Systems and Techniques
- Text and Document Classification Technologies
- Semantic Web and Ontologies
- Advanced Database Systems and Queries
- Natural Language Processing Techniques
- Music and Audio Processing
- Complex Network Analysis Techniques
- Privacy, Security, and Data Protection
- Video Analysis and Summarization
- Expert finding and Q&A systems
- Opportunistic and Delay-Tolerant Networks
- Big Data and Business Intelligence
- Internet Traffic Analysis and Secure E-voting
- Advanced Data Storage Technologies
- User Authentication and Security Systems
- Algorithms and Data Compression
- Privacy-Preserving Technologies in Data
Free University of Bozen-Bolzano
2011-2022
Max Planck Institute for Informatics
2007-2009
IBM Research - Haifa
2008
Max Planck Institute for the Study of Societies
2007
Max Planck Society
2007
Université de Bourgogne
2005-2007
Laboratoire d’Électronique, Informatique et Image
2005-2006
Online communities have become popular for publishing and searching content, as well finding connecting to other users. User-generated content includes, example, personal blogs, bookmarks, digital photos. These items can be annotated rated by different users, these social tags derived user-specific scores leveraged relevant discovering subjectively interesting items. Moreover, the relationships among users also taken into consideration ranking search results, intuition being that you trust...
Online communities have recently become a popular tool for publishing and searching content, as well finding connecting to other users that share common interests. The content is typically user-generated includes, example, personal blogs, bookmarks, digital photos. A particularly intriguing type of annotations (tags) items, these concise string descriptions allow reasonings about the interests user who created but also generated annotations. This paper presents framework cast different...
Knowledge-sharing communities like Wikipedia and automated extraction methods those of DBpedia enable the construction large machine-processible knowledge bases with relational facts about entities. These endeavors lack multimodal data photos videos people places. While famous entities are abundant on Internet, they much harder to retrieve for less popular such as notable computer scientists or regionally interesting churches. Querying entity names in image search engines yields candidate...
Recent knowledge extraction methods are moving towards ternary and higher-arity relations to capture more information about binary facts. An example is include the time, location, duration of a specific fact. These can be even complex extract in advanced domains such as news, where events typically come with different facets including reasons, consequences, purposes, involved parties, related events. The main challenge consists first finding set each fact, second tagging those relevant category.
Diversifying search results of queries seeking for different view points about controversial topics is key to improving satisfaction users. The challenge finding opinions how maximize the number discussed arguments without being biased against specific sentiments. This paper addresses issue by first introducing a new model that represents patterns occurring in documents topics. Second, proposing an opinion diversification uses (1) relevance documents, (2) semantic capture and (3) sentiment...
News websites give their users the opportunity to participate in discussions about published articles, by writing comments. Typically, these comments are unstructured making it hard understand flow of user discussions. Thus, there is a need for organizing help (1) gain more insights news topics, and (2) have an easy access that trigger interests. In this work, we address above problem around entities aspects they discuss. More specifically, propose approach entity aspect extraction from...
Social networks are the fastest growing Internet applications. They offer possibility to get in touch with current friends, discover where old ones are, and make new ones. While these applications a great enabler for our social life, they also well known fall short on privacy. The lack of adequate privacy enhancing technology is particularly important due nature information deal with, fact that many users underage. This paper provides contribution this direction by presenting protocol,...
Multimedia centric P2P must take into consideration the main characteristics and complex relationships among multimedia objects. In this paper, we propose a cluster-based hybrid overlay network HON-P2P for sharing content. It consists in clustering peers with similar feature based or semantic properties. We define two types of methods corresponding to overlays: clustering. To improve information retrieval cache management methodology. Semantic are defined each type overlay. Moreover, study...
A query topic can be subjective involving a variety of opinions, judgments, arguments, and many other debatable aspects. Typically, search engines process queries independently from the nature their topics using relevance-based retrieval strategy. Hence, results about are often biased towards specific view point or version. In this demo, we shall present MOUNA, novel approach for opinion diversification. Given on topic, MOUNA ranks based three scores: (1) relevance documents, (2) semantic...
While images of famous people and places are abundant on the Internet, they much harder to retrieve for less popular entities such as notable computer scientists or regionally interesting churches. Querying entity names in image search engines yields large candidate lists, but often have low precision unsatisfactory recall. In this paper, we propose a principled model finding rare ambiguous named entities. We set efficient, light-weight algorithms identifying entity-specific keyphrases from...
Online communities like Flickr, del.icio.us and YouTube have established themselves as very popular powerful services for publishing searching contents, but also identifying other users who share similar interests. In these communities, data are usually annotated with carefully selected often semantically meaningful tags, collaboratively chosen by the user uploaded an item came across item. Items urls or videos typically retrieved issueing queries that consist of a set returning items been...
P2P systems represent a large portion of the Internet traffic which makes data discovery great importance to user and broad community. Hence, power system comes from its ability provide an efficient search service. In this paper we address problem similarity in Hybrid Overlay Network organizes peers high dimensional feature space. Data are described by set features clustered using density-based algorithm. We experimentally evaluate effectiveness similarity-search uniform zipf distribution.
The International Conference on Information Retrieval and Knowledge Management (CIKM) brings together three avenues of data-oriented research, namely, Database Management, Management. confluence these becomes evident also in the PhD theses doctoral students: Stream processing makes use knowledge representation techniques, linked data is emerging as a research topic that bridges information retrieval representation, new forms querying draw techniques from both databases. In this paper, we...
Databases and related fields such as Information Retrieval, Data Mining Knowledge Management offer many topics of interest for dissertation research. Specific areas include, instance, big data, social networks, Web question answering interactive knowledge discovery. In this article, we provide a summary critique research problems presented in these at workshop on proposals early doctoral
With the tremendous growth of published news articles, a key issue is how to help users find diverse and interesting stories. To this end, it crucial understand build accurate profiles for both and news articles. In paper, we define user profile based on (1) set entities she/he talked about in her/his comments (2) key-concepts related those which has expressed a viewpoint. The same information extracted from content each article create its profile. These are then matched purpose...
Similarity searching is particularly important in distributed networks such as P2P systems, which use various routing schemes to submit queries relevant peers. We investigate content-based information and retrieval using similarity search clustered overlay focus on their maintenance cost models performance issues. present a query model for cluster based network, called HON, where data peers are organized high dimensional feature spaces. show through extensive simulations that HON has low...