NFDI4DS | UHH-SEMS - Publication Details

SphereFace: Deep Hypersphere Embedding for Face Recognition

OPENALEX - Publications

Weiyang Liu Yandong Wen Zhiding Yu Ming Li Bhiksha Raj and 1 more

This paper addresses the deep face recognition (FR) problem under the open-set protocol, where ideal face features are expected to have a smaller maximal intra-class distance than minimal inter-class distance under a suitably chosen metric space. However, few existing algorithms can effectively achieve this criterion. To this end, we propose the angular softmax (A-Softmax) loss that enables convolutional neural networks (CNNs) to learn angularly discriminative features. Geometrically, A-Softmax can be viewed as imposing constraints...

10.1109/cvpr.2017.713 article EN 2017-07-01
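The abstract above is truncated, so the following is only a minimal NumPy sketch of an angular-margin softmax of the kind it describes, not the authors' implementation. It assumes the commonly cited formulation in which class weights are L2-normalized, biases are zero, and the target-class logit uses a piecewise function psi(theta) = (-1)^k cos(m * theta) - 2k on theta in [k*pi/m, (k+1)*pi/m]. The function name `a_softmax_loss` and the margin parameter `m` are illustrative choices.

```python
import numpy as np

def a_softmax_loss(x, W, labels, m=4):
    """Mean angular-margin softmax loss (sketch, assumed formulation).

    x: (n, d) features, W: (d, C) class weights, labels: (n,) int class ids.
    """
    W = W / np.linalg.norm(W, axis=0, keepdims=True)    # unit-norm class weights
    x_norm = np.linalg.norm(x, axis=1, keepdims=True)   # feature norms, shape (n, 1)
    cos = np.clip((x @ W) / x_norm, -1.0, 1.0)          # cosine of angle to each class
    rows = np.arange(x.shape[0])
    theta_y = np.arccos(cos[rows, labels])              # angle to the true class
    # Piecewise monotone margin function psi(theta) (an assumption, see lead-in).
    k = np.minimum(np.floor(theta_y * m / np.pi), m - 1)
    psi = (-1.0) ** k * np.cos(m * theta_y) - 2.0 * k
    logits = x_norm * cos
    logits[rows, labels] = x_norm[:, 0] * psi           # margin applies to target only
    logits -= logits.max(axis=1, keepdims=True)         # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_prob[rows, labels].mean()
```

With `m=1` the function reduces to a softmax over norm-scaled cosines; larger `m` lowers the target logit and therefore demands a smaller angle to the true class before the loss becomes small.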

Learning Combinatorial Optimization Algorithms over Graphs

OPENALEX - Publications

Hanjun Dai Elias B. Khalil Yuyu Zhang Bistra Dilkina Le Song

The design of good heuristics or approximation algorithms for NP-hard combinatorial optimization problems often requires significant specialized knowledge and trial-and-error. Can we automate this challenging, tedious process, and learn the algorithms instead? In many real-world applications, it is typically the case that the same optimization problem is solved again and again on a regular basis, maintaining the same problem structure but differing in the data. This provides an opportunity for learning heuristic algorithms that exploit the structure of such recurring problems. In this paper, we propose...

10.48550/arxiv.1704.01665 preprint EN other-oa arXiv (Cornell University) 2017-01-01

GRAM

OPENALEX - Publications

Edward Choi Mohammad Taha Bahadori Le Song Walter F. Stewart Jimeng Sun

Deep learning methods exhibit promising performance for predictive modeling in healthcare, but two important challenges remain: -

10.1145/3097983.3098126 article EN 2017-08-04

Neural Network-based Graph Embedding for Cross-Platform Binary Code Similarity Detection

OPENALEX - Publications

Xiaojun Xu Chang Liu Feng Qian Heng Yin Le Song and 1 more

The problem of cross-platform binary code similarity detection aims at detecting whether two binary functions coming from different platforms are similar or not. It has many security applications, including plagiarism detection, malware detection, vulnerability search, etc. Existing approaches rely on approximate graph matching algorithms, which are inevitably slow and sometimes inaccurate, and hard to adapt to a new task. To address these issues, in this work, we propose a novel neural network-based approach to compute the...

10.1145/3133956.3134018 article EN Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security 2017-10-27

Discriminative Embeddings of Latent Variable Models for Structured Data

OPENALEX - Publications

Hanjun Dai Bo Dai Le Song

Kernel classifiers and regressors designed for structured data, such as sequences, trees and graphs, have significantly advanced a number of interdisciplinary areas such as computational biology and drug design. Typically, kernels are designed beforehand for a data type which either exploit statistics of the structures or make use of probabilistic generative models, and then a discriminative classifier is learned based on the kernels via convex optimization. However, such an elegant two-stage approach also limited kernel methods from scaling up to...

10.48550/arxiv.1603.05629 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Supervised feature selection via dependence estimation

OPENALEX - Publications

Le Song Alex Smola Arthur Gretton Karsten Borgwardt Justin Bedő

We introduce a framework for filtering features that employs the Hilbert-Schmidt Independence Criterion (HSIC) as a measure of dependence between the features and the labels. The key idea is that good features should maximise such dependence. Feature selection for various supervised learning problems (including classification and regression) is unified under this framework, and the solutions can be approximated using a backward-elimination algorithm. We demonstrate the usefulness of our method on both artificial and real world datasets.

10.1145/1273496.1273600 article EN 2007-06-20
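The HSIC-based filtering described in the abstract above can be sketched roughly as follows. This is an illustrative NumPy version under stated assumptions, not the authors' code: it uses Gaussian kernels with a fixed bandwidth `sigma` on both features and labels, the biased empirical estimate trace(KHLH)/(n-1)^2, and removes one feature per round. The helper names (`rbf_kernel`, `hsic`, `backward_elimination`) are hypothetical.

```python
import numpy as np

def rbf_kernel(X, sigma=1.0):
    """Gaussian kernel matrix over the rows of X (n_samples x n_features)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-d2 / (2.0 * sigma ** 2))

def hsic(K, L):
    """Biased empirical HSIC between two kernel matrices of the same size."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n      # centering matrix
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def backward_elimination(X, y, n_keep, sigma=1.0):
    """Repeatedly drop the feature whose removal keeps HSIC with y highest."""
    L = rbf_kernel(y.reshape(-1, 1), sigma)  # kernel on the labels
    remaining = list(range(X.shape[1]))
    while len(remaining) > n_keep:
        scores = []
        for j in remaining:
            subset = [k for k in remaining if k != j]
            scores.append(hsic(rbf_kernel(X[:, subset], sigma), L))
        # The feature whose removal hurts dependence least is discarded.
        remaining.pop(int(np.argmax(scores)))
    return remaining
```

Calling `backward_elimination(X, y, n_keep=2)` returns the column indices that survive. Each round recomputes one kernel matrix per candidate feature, so this sketch is only practical for small feature counts.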

Variational Reasoning for Question Answering With Knowledge Graph

OPENALEX - Publications

Yuyu Zhang Hanjun Dai Zornitsa Kozareva Alexander J. Smola Le Song

Knowledge graph (KG) is known to be helpful for the task of question answering (QA), since it provides well-structured relational information between entities, and allows one to further infer indirect facts. However, it is challenging to build QA systems which can learn to reason over knowledge graphs based on question-answer pairs alone. First, when people ask questions, their expressions are noisy (for example, typos in texts, or variations in pronunciations), which is non-trivial for the QA system to match those mentioned...

10.1609/aaai.v32i1.12057 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-26

Iterative Learning with Open-set Noisy Labels

OPENALEX - Publications

Yisen Wang Weiyang Liu Xingjun Ma James Bailey Hongyuan Zha and 2 more

Large-scale datasets possessing clean label annotations are crucial for training Convolutional Neural Networks (CNNs). However, labeling large-scale data can be very costly and error-prone, and even high-quality datasets are likely to contain noisy (incorrect) labels. Existing works usually employ a closed-set assumption, whereby the samples associated with noisy labels possess a true class contained within the set of known classes in the training data. However, such an assumption is too restrictive for many applications, since the samples associated with noisy labels might in fact possess a true class that...

10.1109/cvpr.2018.00906 preprint EN 2018-06-01

Material structure-property linkages using three-dimensional convolutional neural networks

OPENALEX - Publications

Ahmet Cecen Hanjun Dai Yuksel C. Yabansu Surya R. Kalidindi Le Song

10.1016/j.actamat.2017.11.053 article EN publisher-specific-oa Acta Materialia 2017-12-21

Learning to Explain: An Information-Theoretic Perspective on Model Interpretation

OPENALEX - Publications

Jianbo Chen Le Song Martin J. Wainwright Michael I. Jordan

We introduce instancewise feature selection as a methodology for model interpretation. Our method is based on learning a function to extract a subset of features that are most informative for each given example. This feature selector is trained to maximize the mutual information between selected features and the response variable, where the conditional distribution of the response variable given the input is the model to be explained. We develop an efficient variational approximation to the mutual information, and show the effectiveness of our method on a variety of synthetic and real data sets using both...

10.48550/arxiv.1802.07814 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Adversarial Attack on Graph Structured Data

OPENALEX - Publications

Hanjun Dai Hui Li Tian Tian Xin Huang Lin Wang and 2 more

Deep learning on graph structures has shown exciting results in various applications. However, little attention has been paid to the robustness of such models, in contrast to the numerous research works on adversarial attack and defense for images or text. In this paper, we focus on adversarial attacks that fool the model by modifying the combinatorial structure of the data. We first propose a reinforcement learning based attack method that learns a generalizable attack policy, while only requiring prediction labels from the target classifier. Also, variants of genetic...

10.48550/arxiv.1806.02371 preprint EN other-oa arXiv (Cornell University) 2018-01-01

GeniePath: Graph Neural Networks with Adaptive Receptive Paths

OPENALEX - Publications

Ziqi Liu Chaochao Chen Longfei Li Jun Zhou Xiaolong Li and 2 more

We present GeniePath, a scalable approach for learning adaptive receptive fields of neural networks defined on permutation invariant graph data. In GeniePath, we propose an adaptive path layer that consists of two complementary functions designed for breadth and depth exploration respectively, where the former learns the importance of different sized neighborhoods, while the latter extracts and filters signals aggregated from neighbors of different hops away. Our method works in both transductive and inductive settings, and extensive experiments compared...

10.1609/aaai.v33i01.33014424 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

SphereFace: Deep Hypersphere Embedding for Face Recognition

OPENALEX - Publications

Weiyang Liu Yandong Wen Zhiding Yu Ming Li Bhiksha Raj and 1 more

This paper addresses the deep face recognition (FR) problem under the open-set protocol, where ideal face features are expected to have a smaller maximal intra-class distance than minimal inter-class distance under a suitably chosen metric space. However, few existing algorithms can effectively achieve this criterion. To this end, we propose the angular softmax (A-Softmax) loss that enables convolutional neural networks (CNNs) to learn angularly discriminative features. Geometrically, A-Softmax can be viewed as imposing constraints...

10.48550/arxiv.1704.08063 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Learning to Branch in Mixed Integer Programming

OPENALEX - Publications

Elias B. Khalil Pierre Le Bodic Le Song George L. Nemhauser Bistra Dilkina

The design of strategies for branching in Mixed Integer Programming (MIP) is guided by cycles of parameter tuning and offline experimentation on an extremely heterogeneous testbed, using the average performance. Once devised, these strategies (and their parameter settings) are essentially input-agnostic. To address these issues, we propose a machine learning (ML) framework for variable branching in MIP. Our method observes the decisions made by Strong Branching (SB), a time-consuming strategy that produces small search trees, collecting features...

10.1609/aaai.v30i1.10080 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2016-02-21

Hilbert space embeddings of conditional distributions with applications to dynamical systems

OPENALEX - Publications

Le Song Jonathan Huang Alex Smola Kenji Fukumizu

In this paper, we extend the Hilbert space embedding approach to handle conditional distributions. We derive a kernel estimate for the conditional embedding, and show its connection to ordinary embeddings. Conditional embeddings largely extend our ability to manipulate distributions in Hilbert spaces, and as an example, we derive a nonparametric method for modeling dynamical systems where the belief state of the system is maintained as a conditional embedding. Our method is very general in terms of both the domains and the types of distributions that it can handle, and we demonstrate the effectiveness of our method in various dynamical systems. We expect...

10.1145/1553374.1553497 article EN 2009-06-14
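As a rough illustration of the kernel estimate for a conditional embedding mentioned in the abstract above, the sketch below uses the commonly stated empirical form, in which the embedding of P(Y | x) is a weighted sum of training feature maps with weights beta(x) = (K + lambda * m * I)^{-1} k_x. The Gaussian kernel, the bandwidth `sigma`, the regularizer `lam`, and all function names are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def gaussian_gram(A, B, sigma=1.0):
    """Gaussian kernel matrix between the rows of A and the rows of B."""
    d2 = (np.sum(A ** 2, axis=1)[:, None]
          + np.sum(B ** 2, axis=1)[None, :]
          - 2.0 * A @ B.T)
    return np.exp(-np.maximum(d2, 0.0) / (2.0 * sigma ** 2))

def conditional_weights(X_train, X_query, lam=1e-3, sigma=1.0):
    """Weights beta(x) such that the embedding of P(Y|x) is sum_i beta_i(x) phi(y_i)."""
    m = X_train.shape[0]
    K = gaussian_gram(X_train, X_train, sigma)
    k_x = gaussian_gram(X_train, X_query, sigma)           # shape (m, n_query)
    return np.linalg.solve(K + lam * m * np.eye(m), k_x)   # shape (m, n_query)

def conditional_expectation(f_values, beta):
    """Estimate E[f(Y) | x] from f evaluated at the training outputs y_i."""
    return beta.T @ f_values
```

Given training pairs `(X_train, Y_train)`, calling `conditional_expectation(Y_train, conditional_weights(X_train, X_query))` estimates the conditional mean of Y at each query point; passing other functions of `Y_train` estimates other conditional expectations with the same weights.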

Deep Fried Convnets

OPENALEX - Publications

Zichao Yang Marcin Moczulski Misha Denil Nando de Freitas Le Song and 1 more

The fully-connected layers of deep convolutional neural networks typically contain over 90% of the network parameters. Reducing the number of parameters while preserving predictive performance is critically important for training big models in distributed systems and for deployment in embedded devices. In this paper, we introduce a novel Adaptive Fastfood transform to reparameterize the matrix-vector multiplication of fully connected layers. Reparameterizing a fully connected layer with d inputs and n outputs reduces the storage...

10.1109/iccv.2015.173 article EN 2015-12-01

Know-Evolve: Deep Temporal Reasoning for Dynamic Knowledge Graphs

OPENALEX - Publications

Rakshit Trivedi Hanjun Dai Yichen Wang Le Song

The availability of large scale event data with time stamps has given rise to dynamically evolving knowledge graphs that contain temporal information for each edge. Reasoning over time in such dynamic knowledge graphs is not yet well understood. To this end, we present Know-Evolve, a novel deep evolutionary knowledge network that learns non-linearly evolving entity representations over time. The occurrence of a fact (edge) is modeled as a multivariate point process whose intensity function is modulated by the score for that fact computed based on the learned entity embeddings. We...

10.48550/arxiv.1705.05742 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Syntax-Directed Variational Autoencoder for Structured Data

OPENALEX - Publications

Hanjun Dai Yingtao Tian Bo Dai Steven Skiena Le Song

Deep generative models have been enjoying success in modeling continuous data. However, it remains challenging to capture the representations for discrete structures with formal grammars and semantics, e.g., computer programs and molecular structures. How to generate both syntactically and semantically correct data still remains largely an open problem. Inspired by the theory of compilers, where the syntax and semantics check is done via syntax-directed translation (SDT), we propose a novel syntax-directed variational autoencoder (SD-VAE)...

10.48550/arxiv.1802.08786 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Variational Reasoning for Question Answering with Knowledge Graph

OPENALEX - Publications

Yuyu Zhang Hanjun Dai Zornitsa Kozareva Alexander J. Smola Le Song

Knowledge graph (KG) is known to be helpful for the task of question answering (QA), since it provides well-structured relational information between entities, and allows one to further infer indirect facts. However, it is challenging to build QA systems which can learn to reason over knowledge graphs based on question-answer pairs alone. First, when people ask questions, their expressions are noisy (for example, typos in texts, or variations in pronunciations), which is non-trivial for the QA system to match those mentioned...

10.48550/arxiv.1709.04071 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Kernel Embeddings of Conditional Distributions: A Unified Kernel Framework for Nonparametric Inference in Graphical Models

OPENALEX - Publications

Le Song Kenji Fukumizu Arthur Gretton

Many modern applications of signal processing and machine learning, ranging from computer vision to computational biology, require the analysis of large volumes of high-dimensional continuous-valued measurements. Complex statistical features are commonplace, including multimodality, skewness, and rich dependency structures. Such problems call for a flexible and robust modeling framework that can take into account these diverse statistical features. Most existing approaches, including graphical models, rely heavily on...

10.1109/msp.2013.2252713 article EN IEEE Signal Processing Magazine 2013-06-12