NFDI4DS | UHH-SEMS - Publication Details

Yi Zhang

ORCID: 0000-0003-4299-1511

About

Contact & Profiles

A5100388333

Research Areas

Topic Modeling
Natural Language Processing Techniques
Recommender Systems and Techniques
Speech and dialogue systems
Multimodal Machine Learning Applications
Gold and Silver Nanoparticles Synthesis and Applications
Advanced Image and Video Retrieval Techniques
Advanced Text Analysis Techniques
Image Retrieval and Classification Techniques
Advanced Computational Techniques and Applications
Data Management and Algorithms
Domain Adaptation and Few-Shot Learning
Spectroscopy Techniques in Biomedical and Chemical Research
Electrochemical sensors and biosensors
Web Data Mining and Analysis
Text Readability and Simplification
Higher Education and Teaching Methods
Neural Networks and Applications
Text and Document Classification Technologies
Advanced MEMS and NEMS Technologies
Semantic Web and Ontologies
Carbon Nanotubes in Composites
Thermal properties of materials
Speech Recognition and Synthesis
Plasmonic and Surface Plasmon Research

Yunnan University of Finance and Economics
2023-2025

Central South University
2021-2025

State Key Laboratory of Powder Metallurgy
2025

Inner Mongolia Electric Power (China)
2023-2024

University of California, Santa Cruz
2014-2024

Collaborative Innovation Center of Advanced Microstructures
2012-2024

Nanjing University
2012-2024

Chinese Academy of Sciences
2008-2024

Nanjing Audit University
2019-2024

Wuhan University of Technology
2011-2024

Explicit factor models for explainable recommendation based on phrase-level sentiment analysis

OPENALEX - Publications

Yongfeng Zhang Guokun Lai Min Zhang Yi Zhang Yiqun Liu and 1 more

Collaborative Filtering (CF)-based recommendation algorithms, such as Latent Factor Models (LFM), work well in terms of prediction accuracy. However, the latent features make it difficult to explain the results to users. Fortunately, with the continuous growth of online user reviews, the information available for training a recommender system is no longer limited to just numerical star ratings or user/item features. By extracting explicit opinions about various aspects of a product from the reviews, it is possible to learn more details...

10.1145/2600428.2609579 article EN 2014-07-03

Classifying Software Changes: Clean or Buggy?

OPENALEX - Publications

Sunghun Kim E. James Whitehead Yi Zhang

This paper introduces a new technique for finding latent software bugs called change classification. Change classification uses a machine learning classifier to determine whether a new software change is more similar to prior buggy changes or clean changes. In this manner, change classification predicts the existence of bugs in software changes. The classifier is trained using features (in the machine learning sense) extracted from the revision history of a software project, as stored in its software configuration management repository. The trained classifier can classify changes with 78% accuracy and 65% recall (on average). Change classification has several desirable...

10.1109/tse.2007.70773 article EN IEEE Transactions on Software Engineering 2008-03-01

Information Uncertainty and Expected Returns

OPENALEX - Publications

Guohua Jiang Charles M.C. Lee Yi Zhang

10.1007/s11142-005-1528-2 article EN Review of Accounting Studies 2005-07-05

The CoNLL-2009 shared task

OPENALEX - Publications

Jan Hajič Jan Štěpánek Pavel Straňák Mihai Surdeanu Nianwen Xue and 9 more

For the 11th straight year, the Conference on Computational Natural Language Learning has been accompanied by a shared task whose purpose is to promote natural language processing applications and evaluate them in a standard setting. In 2009, the shared task was dedicated to the joint parsing of syntactic and semantic dependencies in multiple languages. This shared task combines the shared tasks of the previous five years under a unique dependency-based formalism similar to the 2008 task. In this paper, we define the shared task, describe how the data sets were created and show their...

10.3115/1596409.1596411 article EN 2009-01-01

Conversational Recommender System

OPENALEX - Publications

Yueming Sun Yi Zhang

A personalized conversational sales agent could have much commercial potential. E-commerce companies such as Amazon, eBay, JD, Alibaba etc. are piloting this kind of agent with their users. However, the research on this topic is very limited, and existing solutions are based on either a single-round ad hoc search engine or a traditional multi-round dialog system. They usually utilize only user inputs in the current session, ignoring users' long-term preferences. On the other hand, it is well known that the conversion rate can be...

10.1145/3209978.3210002 article EN 2018-06-27

Harder than Diamond: Superior Indentation Strength of Wurtzite BN and Lonsdaleite

OPENALEX - Publications

Zicheng Pan Hong Sun Yi Zhang Changfeng Chen

Recent indentation experiments indicate that wurtzite BN (w-BN) exhibits surprisingly high hardness that rivals that of diamond. Here we unveil a novel two-stage shear deformation mechanism responsible for this unexpected result. We show by first-principles calculations that large normal compressive pressures under indenters can compel w-BN into a stronger structure through a volume-conserving bond-flipping structural phase transformation during indentation, which produces significant enhancement in its strength,...

10.1103/physrevlett.102.055503 article EN Physical Review Letters 2009-02-06

SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing

OPENALEX - Publications

Stephan Oepen Marco Kuhlmann Yusuke Miyao Daniel Zeman Dan Flickinger and 3 more

Stephan Oepen, Marco Kuhlmann, Yusuke Miyao, Daniel Zeman, Dan Flickinger, Jan Hajič, Angelina Ivanova, Yi Zhang. Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014). 2014.

10.3115/v1/s14-2008 article EN cc-by Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) 2014-01-01

Stable, high-performance sodium-based plasmonic devices in the near infrared

OPENALEX - Publications

Yang Wang Jianyu Yu Yifei Mao Ji Chen Suo Wang and 10 more

10.1038/s41586-020-2306-9 article EN Nature 2020-05-27

Personalized re-ranking for recommendation

OPENALEX - Publications

Changhua Pei Yi Zhang Yongfeng Zhang Fei Sun Lin Xiao and 6 more

Ranking is a core task in recommender systems, which aims at providing an ordered list of items to users. Typically, a ranking function is learned from the labeled dataset to optimize the global performance, which produces a ranking score for each individual item. However, it may be sub-optimal because the scoring function applies to each item individually and does not explicitly consider the mutual influence between items, as well as the differences in users' preferences or intents. Therefore, we propose a personalized re-ranking model for recommender systems. The...

10.1145/3298689.3347000 preprint EN 2019-09-10

Forecasting oil price volatility: Forecast combination versus shrinkage method

OPENALEX - Publications

Yaojie Zhang Yu Wei Yi Zhang Daxiang Jin

10.1016/j.eneco.2019.01.010 article EN Energy Economics 2019-01-28

Textbooks Are All You Need

OPENALEX - Publications

Suriya Gunasekar Yi Zhang Jyoti Aneja Caio César Teodoro Mendes Allie Del Giorno and 14 more

We introduce phi-1, a new large language model for code, with significantly smaller size than competing models: phi-1 is a Transformer-based model with 1.3B parameters, trained for 4 days on 8 A100s, using a selection of "textbook quality" data from the web (6B tokens) and synthetically generated textbooks and exercises with GPT-3.5 (1B tokens). Despite this small scale, phi-1 attains pass@1 accuracy of 50.6% on HumanEval and 55.5% on MBPP. It also displays surprising emergent properties compared to phi-1-base, our model before finetuning...

10.48550/arxiv.2306.11644 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Human capital quality and the regional economic growth: Evidence from China

OPENALEX - Publications

Yi Zhang Sanjay Kumar Xianhai Huang Yiming Yuan

10.1016/j.asieco.2023.101593 article EN Journal of Asian Economics 2023-02-13

Fintech development and green innovation: Evidence from China

OPENALEX - Publications

Jiangtao Liu Yi Zhang Jia Kuang

10.1016/j.enpol.2023.113827 article EN Energy Policy 2023-09-30

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

OPENALEX - Publications

Marah Abdin Sam Adé Jacobs Ammar Ahmad Awan Jyoti Aneja Ahmed Hassan Awadallah and 82 more

We introduce phi-3-mini, a 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 (e.g., phi-3-mini achieves 69% on MMLU and 8.38 on MT-bench), despite being small enough to be deployed on a phone. The innovation lies entirely in our dataset for training, a scaled-up version of the one used for phi-2, composed of heavily filtered web data and synthetic data. The model is also further...

10.48550/arxiv.2404.14219 preprint EN arXiv (Cornell University) 2024-04-22

A hydroquinone biosensor using modified core–shell magnetic nanoparticles supported on carbon paste electrode

OPENALEX - Publications

Yi Zhang Guangming Zeng Lin Tang Danlian Huang Xiaoyun Jiang and 1 more

10.1016/j.bios.2006.09.030 article EN Biosensors and Bioelectronics 2006-11-02

A non-negative matrix tri-factorization approach to sentiment classification with lexical prior knowledge

OPENALEX - Publications

Tao Li Yi Zhang Vikas Sindhwani

Sentiment classification refers to the task of automatically identifying whether a given piece of text expresses positive or negative opinion towards the subject at hand. The proliferation of user-generated web content such as blogs, discussion forums and online review sites has made it possible to perform large-scale mining of public opinion. Sentiment modeling is thus becoming a critical component of market intelligence and social media technologies that aim to tap into the collective wisdom of crowds. In this paper, we consider...

10.3115/1687878.1687914 article EN 2009-01-01

Efficient Bayesian hierarchical user modeling for recommendation system

OPENALEX - Publications

Yi Zhang Jonathan Koren

A content-based personalized recommendation system learns user-specific profiles from user feedback so that it can deliver information tailored to each individual user's interest. A system serving millions of users can learn a better profile for a new user, or a user with little feedback, by borrowing information from other users through the use of a Bayesian hierarchical model. Learning the model parameters to optimize the joint data likelihood is very computationally expensive. The commonly used EM algorithm converges slowly due to sparseness in IR...

10.1145/1277741.1277752 article EN 2007-07-23

Personalized interactive faceted search

OPENALEX - Publications

Jonathan Koren Yi Zhang Xue Liu

Faceted search is becoming a popular method to allow users to interactively search and navigate complex information spaces. A faceted search system presents users with key-value metadata that is used for query refinement. While popular in e-commerce and digital libraries, not much research has been conducted on which metadata to present to a user in order to improve the search experience. Nor are there repeatable benchmarks for evaluating a faceted search engine. This paper proposes the use of collaborative filtering and personalization to customize the search interface to each user's behavior. This paper also...

10.1145/1367497.1367562 article EN 2008-04-21

Experimental investigation of coal dust wetting ability of anionic surfactants with different structures

OPENALEX - Publications

Chaohang Xu Deming Wang Hetang Wang Liyang Ma Xiaolong Zhu and 3 more

10.1016/j.psep.2018.10.010 article EN Process Safety and Environmental Protection 2018-10-16

The determinants of international investment and attention allocation: Using internet search query data

OPENALEX - Publications

Jordi Mondria Thomas Wu Yi Zhang

10.1016/j.jinteco.2010.04.007 article EN Journal of International Economics 2010-05-06
