Search Results for author: Wei Xia

Found 53 papers, 16 papers with code

Empirical Study of Large Language Models as Automated Essay Scoring Tools in English Composition__Taking TOEFL Independent Writing Task for Example

no code implementations7 Jan 2024 Wei Xia, Shaoguang Mao, Chanjing Zheng

The primary objective is to assess the capabilities and constraints of ChatGPT, a prominent representative of large language models, within the context of automated essay scoring.

Automated Essay Scoring Text Generation

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

2 code implementations7 Jan 2024 Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao

In this paper, we introduce DiarizationLM, a framework to leverage large language models (LLM) to post-process the outputs from a speaker diarization system.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Adapting Large Language Models for Education: Foundational Capabilities, Potentials, and Challenges

no code implementations27 Dec 2023 Qingyao Li, Lingyue Fu, Weiming Zhang, Xianyu Chen, Jingwei Yu, Wei Xia, Weinan Zhang, Ruiming Tang, Yong Yu

Online education platforms, leveraging the internet to distribute education resources, seek to provide convenient education but often fall short in real-time communication with students.

Question Answering

Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network

no code implementations15 Sep 2023 Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang

Whether it is the conventional modularized approach or the more recent end-to-end neural diarization (EEND), an additional automatic speech recognition (ASR) model and an orchestration algorithm are required to associate the speaker labels with recognized words.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

A Comprehensive Survey on Deep Learning Techniques in Educational Data Mining

no code implementations9 Sep 2023 Yuanguo Lin, Hong Chen, Wei Xia, Fan Lin, Zongyue Wang, Yong liu

With the increasing complexity and diversity of educational data, Deep Learning techniques have shown significant advantages in addressing the challenges associated with analyzing and modeling this data.

Knowledge Tracing

Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation

no code implementations7 Jun 2023 Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu

Noticing that existing approaches fail to consider the correlations of concepts in the path, we propose a novel framework named Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation (SRC), which formulates the recommendation task under a set-to-sequence paradigm.

Knowledge Tracing Recommendation Systems

DeSAM: Decoupling Segment Anything Model for Generalizable Medical Image Segmentation

1 code implementation1 Jun 2023 Yifan Gao, Wei Xia, Dingdu Hu, Xin Gao

In fully automatic mode, the presence of inevitable poor prompts (such as points outside the mask or boxes significantly larger than the mask) can significantly mislead mask generation.

Domain Generalization Image Segmentation +3

Rethinking k-means from manifold learning perspective

no code implementations12 May 2023 Quanxue Gao, Qianqian Wang, Han Lu, Wei Xia, Xinbo Gao

Although numerous clustering algorithms have been developed, many existing methods still leverage k-means technique to detect clusters of data points.

Clustering

Multi-View Clustering via Semi-non-negative Tensor Factorization

no code implementations29 Mar 2023 Jing Li, Quanxue Gao, Qianqian Wang, Wei Xia, Xinbo Gao

Multi-view clustering (MVC) based on non-negative matrix factorization (NMF) and its variants have received a huge amount of attention in recent years due to their advantages in clustering interpretability.

Clustering

Towards Regression-Free Neural Networks for Diverse Compute Platforms

no code implementations27 Sep 2022 Rahul Duggal, Hao Zhou, Shuo Yang, Jun Fang, Yuanjun Xiong, Wei Xia

With the shift towards on-device deep learning, ensuring a consistent behavior of an AI service across diverse compute platforms becomes tremendously important.

Neural Architecture Search regression

Attention and DCT based Global Context Modeling for Text-independent Speaker Recognition

no code implementations4 Aug 2022 Wei Xia, John H. L. Hansen

Second, a 2D-DCT based context model is proposed to improve model efficiency and examine the benefits of signal modeling.

Speaker Recognition Speaker Verification +1

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training

no code implementations12 May 2022 Yue Zhao, Yantao Shen, Yuanjun Xiong, Shuo Yang, Wei Xia, Zhuowen Tu, Bernt Schiele, Stefano Soatto

We present a method to train a classification system that achieves paragon performance in both error rate and NFR, at the inference cost of a single model.

MeMOT: Multi-Object Tracking with Memory

no code implementations CVPR 2022 Jiarui Cai, Mingze Xu, Wei Li, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto

We propose an online tracking algorithm that performs the object detection and data association under a common framework, capable of linking objects after a long time span.

Multi-Object Tracking Object +2

An Automatic Detection Method Of Cerebral Aneurysms In Time-Of-Flight Magnetic Resonance Angiography Images Based On Attention 3D U-Net

no code implementations26 Oct 2021 Chen Geng, Meng Chen, Ruoyu Di, Dongdong Wang, Liqin Yang, Wei Xia, Yuxin Li, Daoying Geng

Conclusions:Compared with the results of our previous studies and other studies, the method in this paper achieves a very competitive sensitivity with less training data and maintains a low false positive rate. As the only method currently using 3D U-Net for aneurysm detection, it proves the feasibility and superior performance of this network in aneurysm detection, and also explores the potential of the channel attention mechanism in this task.

Self-supervised Contrastive Attributed Graph Clustering

no code implementations15 Oct 2021 Wei Xia, Quanxue Gao, Ming Yang, Xinbo Gao

Thus, for the OOS nodes, SCAGC can directly calculate their clustering labels.

Attribute Clustering +3

Path Auxiliary Proposal for MCMC in Discrete Space

no code implementations ICLR 2022 Haoran Sun, Hanjun Dai, Wei Xia, Arun Ramamurthy

Energy-based Model (EBM) offers a powerful approach for modeling discrete structure, but both inference and learning of EBM are hard as it involves sampling from discrete distributions.

Scenario Aware Speech Recognition: Advancements for Apollo Fearless Steps & CHiME-4 Corpora

no code implementations23 Sep 2021 Szu-Jui Chen, Wei Xia, John H. L. Hansen

With additional techniques such as pronunciation and silence probability modeling, plus multi-style training, we achieve a +5. 42% and +3. 18% relative WER improvement for the development and evaluation sets of the Fearless Steps Corpus.

speech-recognition Speech Recognition

Effective and Efficient Graph Learning for Multi-view Clustering

no code implementations15 Aug 2021 Quanxue Gao, Wei Xia, Xinbo Gao, Xiangdong Zhang, Qin Li, DaCheng Tao

Despite the impressive clustering performance and efficiency in characterizing both the relationship between data and cluster structure, existing graph-based multi-view clustering methods still have the following drawbacks.

Clustering Graph Learning

Long Short-Term Transformer for Online Action Detection

2 code implementations NeurIPS 2021 Mingze Xu, Yuanjun Xiong, Hao Chen, Xinyu Li, Wei Xia, Zhuowen Tu, Stefano Soatto

We present Long Short-term TRansformer (LSTR), a temporal modeling algorithm for online action detection, which employs a long- and short-term memory mechanism to model prolonged sequence data.

Online Action Detection Playing the Game of 2048

Semi-TCL: Semi-Supervised Track Contrastive Representation Learning

no code implementations6 Jul 2021 Wei Li, Yuanjun Xiong, Shuo Yang, Mingze Xu, Yongxin Wang, Wei Xia

We design a new instance-to-track matching objective to learn appearance embedding that compares a candidate detection to the embedding of the tracks persisted in the tracker.

Multiple Object Tracking Object +1

Learning Hierarchical Graph Neural Networks for Image Clustering

2 code implementations ICCV 2021 Yifan Xing, Tong He, Tianjun Xiao, Yongxin Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto

Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level.

Clustering Face Clustering

Harnessing Unrecognizable Faces for Improving Face Recognition

no code implementations8 Jun 2021 Siqi Deng, Yuanjun Xiong, Meng Wang, Wei Xia, Stefano Soatto

The common implementation of face recognition systems as a cascade of a detection stage and a recognition or verification stage can cause problems beyond failures of the detector.

Face Recognition Quantization

Compatibility-aware Heterogeneous Visual Search

no code implementations CVPR 2021 Rahul Duggal, Hao Zhou, Shuo Yang, Yuanjun Xiong, Wei Xia, Zhuowen Tu, Stefano Soatto

Existing systems use the same embedding model to compute representations (embeddings) for the query and gallery images.

Neural Architecture Search Retrieval

Optical manipulation of electronic dimensionality in a quantum material

no code implementations21 Jan 2021 Shaofeng Duan, Yun Cheng, Wei Xia, Yuanyuan Yang, Fengfeng Qi, Tianwei Tang, Yanfeng Guo, Dong Qian, Dao Xiang, Jie Zhang, Wentao Zhang

Exotic phenomenon can be achieved in quantum materials by confining electronic states into two dimensions.

Strongly Correlated Electrons Materials Science Superconductivity

Learning Self-Consistency for Deepfake Detection

1 code implementation ICCV 2021 Tianchen Zhao, Xiang Xu, Mingze Xu, Hui Ding, Yuanjun Xiong, Wei Xia

We propose a new method to detect deepfake images using the cue of the source feature inconsistency within the forged images.

DeepFake Detection Face Swapping +2

Self-supervised Text-independent Speaker Verification using Prototypical Momentum Contrastive Learning

1 code implementation13 Dec 2020 Wei Xia, Chunlei Zhang, Chao Weng, Meng Yu, Dong Yu

First, we examine a simple contrastive learning approach (SimCLR) with a momentum contrastive (MoCo) learning framework, where the MoCo speaker embedding system utilizes a queue to maintain a large set of negative examples.

Clustering Contrastive Learning +2

DEAAN: Disentangled Embedding and Adversarial Adaptation Network for Robust Speaker Representation Learning

no code implementations12 Dec 2020 Mufan Sang, Wei Xia, John H. L. Hansen

Despite speaker verification has achieved significant performance improvement with the development of deep neural networks, domain mismatch is still a challenging problem in this field.

Disentanglement Domain Adaptation +1

Positive-Congruent Training: Towards Regression-Free Model Updates

no code implementations CVPR 2021 Sijie Yan, Yuanjun Xiong, Kaustav Kundu, Shuo Yang, Siqi Deng, Meng Wang, Wei Xia, Stefano Soatto

Reducing inconsistencies in the behavior of different versions of an AI system can be as important in practice as reducing its overall error.

Image Classification regression

SMOT: Single-Shot Multi Object Tracking

1 code implementation30 Oct 2020 Wei Li, Yuanjun Xiong, Shuo Yang, Siqi Deng, Wei Xia

We combine this scheme with SSD detectors by proposing a novel tracking anchor assignment module.

Multi-Object Tracking Object

3D-Aided Data Augmentation for Robust Face Understanding

no code implementations3 Oct 2020 Yifan Xing, Yuanjun Xiong, Wei Xia

Data augmentation has been highly effective in narrowing the data gap and reducing the cost for human annotation, especially for tasks where ground truth labels are difficult and expensive to acquire.

3D Face Modelling Data Augmentation +1

Open-set Short Utterance Forensic Speaker Verification using Teacher-Student Network with Explicit Inductive Bias

no code implementations21 Sep 2020 Mufan Sang, Wei Xia, John H. L. Hansen

In forensic applications, it is very common that only small naturalistic datasets consisting of short utterances in complex or unknown acoustic environments are available.

Inductive Bias Knowledge Distillation +1

Cross-domain Adaptation with Discrepancy Minimization for Text-independent Forensic Speaker Verification

no code implementations5 Sep 2020 Zhenyu Wang, Wei Xia, John H. L. Hansen

Forensic audio analysis for speaker verification offers unique challenges due to location/scenario uncertainty and diversity mismatch between reference and naturalistic field recordings.

Domain Adaptation Speaker Verification

Speaker Representation Learning using Global Context Guided Channel and Time-Frequency Transformations

no code implementations2 Sep 2020 Wei Xia, John H. L. Hansen

In this study, we propose the global context guided channel and time-frequency transformations to model the long-range, non-local time-frequency dependencies and channel variances in speaker representations.

Representation Learning Speaker Verification

6VecLM: Language Modeling in Vector Space for IPv6 Target Generation

no code implementations5 Aug 2020 Tianyu Cui, Gang Xiong, Gaopeng Gou, Junzheng Shi, Wei Xia

Fast IPv6 scanning is challenging in the field of network measurement as it requires exploring the whole IPv6 address space but limited by current computational power.

Language Modelling

Towards causal benchmarking of bias in face analysis algorithms

1 code implementation ECCV 2020 Guha Balakrishnan, Yuanjun Xiong, Wei Xia, Pietro Perona

To address this problem we develop an experimental method for measuring algorithmic bias of face analysis algorithms, which manipulates directly the attributes of interest, e. g., gender and skin tone, in order to reveal causal links between attribute variation and performance change.

Attribute Benchmarking +2

On Improving Temporal Consistency for Online Face Liveness Detection

no code implementations11 Jun 2020 Xiang Xu, Yuanjun Xiong, Wei Xia

In this paper, we focus on improving the online face liveness detection system to enhance the security of the downstream face recognition system.

Face Anti-Spoofing Face Recognition

Towards Backward-Compatible Representation Learning

3 code implementations CVPR 2020 Yantao Shen, Yuanjun Xiong, Wei Xia, Stefano Soatto

Backward compatibility is critical to quickly deploy new embedding models that leverage ever-growing large-scale training datasets and improvements in deep learning architectures and training methods.

Face Recognition Representation Learning

Sound Event Detection in Multichannel Audio using Convolutional Time-Frequency-Channel Squeeze and Excitation

no code implementations4 Aug 2019 Wei Xia, Kazuhito Koishida

In this study, we introduce a convolutional time-frequency-channel "Squeeze and Excitation" (tfc-SE) module to explicitly model inter-dependencies between the time-frequency domain and multiple channels.

Event Detection Sound Event Detection

Analyses of Multi-collection Corpora via Compound Topic Modeling

1 code implementation17 Jun 2019 Clint P. George, Wei Xia, George Michailidis

The usability study on some real-world corpora illustrates the superiority of cLDA to explore the underlying topics automatically but also model their connections and variations across multiple collections.

Topic Models Variational Inference

Learning Robust Search Strategies Using a Bandit-Based Approach

no code implementations10 May 2018 Wei Xia, Roland H. C. Yap

However, choosing or designing a good search heuristic is non-trivial and is often a manual process.

Correlation Heuristics for Constraint Programming

no code implementations6 May 2018 Ruiwei Wang, Wei Xia, Roland H. C. Yap

We evaluate our correlation heuristics with well known heuristics, namely, dom/wdeg, impact-based search and activity-based search.

CNN: Single-label to Multi-label

no code implementations22 Jun 2014 Yunchao Wei, Wei Xia, Junshi Huang, Bingbing Ni, Jian Dong, Yao Zhao, Shuicheng Yan

Convolutional Neural Network (CNN) has demonstrated promising performance in single-label image classification tasks.

Image Classification

Subcategory-Aware Object Classification

no code implementations CVPR 2013 Jian Dong, Wei Xia, Qiang Chen, Jianshi Feng, Zhongyang Huang, Shuicheng Yan

In this paper, we introduce a subcategory-aware object classification framework to boost category level object classification performance.

Classification General Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.