no code implementations • 27 Mar 2024 • Xiusi Chen, Hongzhi Wen, Sreyashi Nag, Chen Luo, Qingyu Yin, Ruirui Li, Zheng Li, Wei Wang
Such a constitution discovery pipeline can be run iteratively and automatically to discover new constitutions that specifically target the alignment gaps in the current LLM.
1 code implementation • 6 Mar 2024 • Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, Julian McAuley
This paper introduces BLaIR, a series of pretrained sentence embedding models specialized for recommendation scenarios.
no code implementations • 7 Feb 2024 • Yu Wang, Xiusi Chen, Jingbo Shang, Julian McAuley
Existing Large Language Models (LLMs) usually remain static after deployment, which might make it hard to inject new knowledge into the model.
no code implementations • 7 Feb 2024 • Yijun Tian, Yikun Han, Xiusi Chen, Wei Wang, Nitesh V. Chawla
To solve the problems and facilitate the learning of compact language models, we propose TinyLLM, a new knowledge distillation paradigm to learn a small student LLM from multiple large teacher LLMs.
no code implementations • 24 Jan 2024 • Haorui Wang, Rongzhi Zhang, Yinghao Li, Lingkai Kong, Yuchen Zhuang, Xiusi Chen, Chao Zhang
The teacher LLM generates problem-solving instructions and corrective principles based on the student LLM's errors.
no code implementations • 23 Oct 2023 • Yu Zhang, Yanzhen Shen, Xiusi Chen, Bowen Jin, Jiawei Han
As many academic conferences are overwhelmed by a rapidly increasing number of paper submissions, automatically finding appropriate reviewers for each submission becomes a more urgent need than ever.
1 code implementation • 11 Oct 2023 • Bowen Jin, Hansi Zeng, Guoyin Wang, Xiusi Chen, Tianxin Wei, Ruirui Li, Zhengyang Wang, Zheng Li, Yang Li, Hanqing Lu, Suhang Wang, Jiawei Han, Xianfeng Tang
Semantic identifier (ID) is an important concept in information retrieval that aims to preserve the semantics of objects such as documents and items inside their IDs.
no code implementations • 8 Oct 2023 • Xiusi Chen, Jyun-Yu Jiang, Wei-Cheng Chang, Cho-Jui Hsieh, Hsiang-Fu Yu, Wei Wang
Few-shot question answering (QA) aims at achieving satisfactory results on machine question answering when only a few training samples are available.
1 code implementation • 24 Jun 2023 • Yu Zhang, Bowen Jin, Xiusi Chen, Yanzhen Shen, Yunyi Zhang, Yu Meng, Jiawei Han
Instead of relying on human-annotated training samples to build a classifier, weakly supervised scientific paper classification aims to classify papers only using category descriptions (e.g., category names, category-indicative keywords).
1 code implementation • 7 Jun 2023 • Xiusi Chen, Yu Zhang, Jinliang Deng, Jyun-Yu Jiang, Wei Wang
Few-shot question answering (QA) aims at precisely discovering answers to a set of questions from context passages while only a few training samples are available.
no code implementations • 7 Jun 2023 • Xiusi Chen, Wei-Yao Wang, Ziniu Hu, Curtis Chou, Lam Hoang, Kun Jin, Mingyan Liu, P. Jeffrey Brantingham, Wei Wang
To accomplish reward-guided trajectory generation, conditional sampling is introduced to condition the diffusion model on the value function and conduct classifier-guided sampling.
1 code implementation • 22 May 2023 • Jinliang Deng, Xiusi Chen, Renhe Jiang, Du Yin, Yi Yang, Xuan Song, Ivor W. Tsang
The core issue in MTS forecasting is how to effectively model complex spatial-temporal patterns.
Ranked #1 on Time Series Forecasting on Weather (96)
1 code implementation • 7 Nov 2021 • Yu Zhang, Shweta Garg, Yu Meng, Xiusi Chen, Jiawei Han
We study the problem of weakly supervised text classification, which aims to classify text documents into a set of pre-defined categories with category surface names only and without any annotated training document provided.
1 code implementation • 2 Sep 2021 • Jinliang Deng, Xiusi Chen, Renhe Jiang, Xuan Song, Ivor W. Tsang
Therefore, there are two fundamental views which can be used to analyze MTS data, namely the spatial view and the temporal view.
no code implementations • 8 Aug 2021 • Yichao Zhou, Jyun-Yu Jiang, Xiusi Chen, Wei Wang
COVID-19 has caused lasting damage to almost every domain in public health, society, and economy.
1 code implementation • 26 Oct 2020 • Yu Zhang, Xiusi Chen, Yu Meng, Jiawei Han
Our experiments demonstrate a consistent improvement of HiMeCat over competitive baselines and validate the contribution of our representation learning and data augmentation modules.
2 code implementations • 22 Dec 2018 • Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian Sadler, Michelle Vanni, Jiawei Han
Our method, TaxoGen, uses term embeddings and hierarchical clustering to construct a topic taxonomy in a recursive fashion.
Databases
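The recursive construction mentioned in the TaxoGen entry can be illustrated with a minimal sketch. It is not the authors' implementation: the snippet does not say which clustering algorithm is used, so plain 2-means with deterministic seeding stands in for it here. The names `two_means`, `build_taxonomy`, `max_depth`, and `min_size` are hypothetical.

```python
import math


def _dist(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def _mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def two_means(terms, emb, iters=10):
    """Split terms into at most two groups (stand-in clustering step)."""
    # Deterministic seeding: the first term and the term farthest from it.
    first = terms[0]
    far = max(terms, key=lambda t: _dist(emb[t], emb[first]))
    centers = [emb[first], emb[far]]
    groups = [[], []]
    for _ in range(iters):
        groups = [[], []]
        for t in terms:
            d0 = _dist(emb[t], centers[0])
            d1 = _dist(emb[t], centers[1])
            groups[0 if d0 <= d1 else 1].append(t)
        if not groups[0] or not groups[1]:
            break  # no meaningful split was found
        centers = [_mean([emb[t] for t in g]) for g in groups]
    return [g for g in groups if g]


def build_taxonomy(terms, emb, max_depth=2, min_size=2):
    """Recursively cluster term embeddings into a nested topic tree."""
    node = {"terms": list(terms), "children": []}
    if max_depth == 0 or len(terms) <= min_size:
        return node  # leaf: too deep or too small to split further
    clusters = two_means(list(terms), emb)
    if len(clusters) < 2:
        return node  # leaf: clustering could not separate the terms
    node["children"] = [
        build_taxonomy(c, emb, max_depth - 1, min_size) for c in clusters
    ]
    return node


if __name__ == "__main__":
    toy = {"cnn": [1.0, 0.1], "rnn": [0.9, 0.0],
           "sql": [0.0, 1.0], "index": [0.1, 0.9]}
    print(build_taxonomy(["cnn", "rnn", "sql", "index"], toy, max_depth=1))
```

Each node keeps the terms it covers, and each child is built by clustering only the parent's terms, which is what makes the construction recursive.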
2 code implementations • 17 Nov 2017 • Chang Zhou, Jinze Bai, Junshuai Song, Xiaofei Liu, Zhengchao Zhao, Xiusi Chen, Jun Gao
Downstream applications can then use the user behavior vectors via vanilla attention.
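As a rough illustration of the last sentence, the sketch below pools a set of user behavior vectors with basic attention. The snippet does not define "vanilla attention", so scaled dot-product scoring against a single query vector is assumed here. The function names and the toy vectors are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def vanilla_attention(query, behavior_vectors):
    """Return the attention-weighted sum of behavior vectors and the weights.

    Assumption: scores are scaled dot products between the query and
    each behavior vector; other scoring functions are equally plausible.
    """
    scale = math.sqrt(len(query))
    scores = [
        sum(q * b for q, b in zip(query, vec)) / scale
        for vec in behavior_vectors
    ]
    weights = softmax(scores)
    dim = len(behavior_vectors[0])
    pooled = [
        sum(w * vec[i] for w, vec in zip(weights, behavior_vectors))
        for i in range(dim)
    ]
    return pooled, weights


if __name__ == "__main__":
    # Hypothetical query (e.g. a candidate item) and two behavior vectors.
    pooled, weights = vanilla_attention([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
    print(pooled, weights)
```

The behavior vector most similar to the query receives the largest weight, so the pooled vector leans toward the behaviors most relevant to the downstream task.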