Based on this pipeline, a random face reference training method is further devised to precisely capture ID-relevant embeddings from reference images, thereby improving the fidelity and generalization capacity of our model for ID-specific video generation.
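A minimal sketch of how such random-reference sampling could look inside a training loop; the batch layout, `face_pool` structure, and model call are illustrative assumptions, not the authors' released code:

```python
import random

def sample_reference(face_pool, identity_id):
    """Hypothetical helper: pick a random face image of the same identity,
    rather than a frame from the target clip, so the reference shares only
    ID-relevant content (not pose, expression, or background) with the video."""
    return random.choice(face_pool[identity_id])

def training_step(model, optimizer, batch, face_pool):
    video, identity_id = batch["video"], batch["identity"]  # assumed layout
    reference = sample_reference(face_pool, identity_id)    # random reference
    loss = model(video, reference=reference)                # assumed API
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```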
We introduce XFT, a simple yet powerful training scheme that unleashes the performance limit of instruction-tuned code Large Language Models (LLMs) by merging upcycled Mixture-of-Experts (MoE) models.
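For context, "upcycling" here refers to initializing a sparse MoE layer from a trained dense model by cloning its feed-forward network (FFN) into each expert. The PyTorch sketch below shows that generic construction; the class name, top-k routing, and hyperparameters are illustrative assumptions, not the XFT recipe itself:

```python
import copy
import torch
import torch.nn as nn

class UpcycledMoE(nn.Module):
    """Sparse-upcycling sketch: a layer's dense FFN is cloned into
    num_experts experts, and a learned router picks top_k of them per token.
    dense_ffn is assumed to map (tokens, hidden_dim) -> (tokens, hidden_dim)."""
    def __init__(self, dense_ffn, hidden_dim, num_experts=8, top_k=2):
        super().__init__()
        self.experts = nn.ModuleList(copy.deepcopy(dense_ffn) for _ in range(num_experts))
        self.router = nn.Linear(hidden_dim, num_experts)
        self.top_k = top_k

    def forward(self, x):  # x: (tokens, hidden_dim)
        weights, idx = self.router(x).topk(self.top_k, dim=-1)
        weights = weights.softmax(dim=-1)
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e  # tokens routed to expert e at slot k
                if mask.any():
                    out[mask] += weights[mask, k].unsqueeze(-1) * expert(x[mask])
        return out
```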
This survey aims to fill this gap by primarily focusing on poisoning attacks and their countermeasures.
Moreover, the proposed framework is applicable to other NN-based choice models such as TasteNets.
The backbone is trained end-to-end using a novel differentiable solver for wide-baseline two-view pose.
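One common way to make a two-view pose solve differentiable is a weighted eight-point estimate of the essential matrix via SVD, through which autograd can backpropagate; the sketch below shows that generic construction, not necessarily the paper's solver:

```python
import torch

def essential_from_correspondences(x1, x2, w):
    """Differentiable weighted eight-point estimate of the essential matrix.

    x1, x2: (N, 2) normalized image coordinates in views 1 and 2.
    w:      (N,)  per-correspondence weights (e.g., predicted by the network).
    Gradients flow through torch.linalg.svd, so the solve is trainable end-to-end.
    """
    u1, v1 = x1[:, 0], x1[:, 1]
    u2, v2 = x2[:, 0], x2[:, 1]
    ones = torch.ones_like(u1)
    # Each row encodes the epipolar constraint x2^T E x1 = 0.
    A = torch.stack([u2 * u1, u2 * v1, u2, v2 * u1, v2 * v1, v2, u1, v1, ones], dim=1)
    A = w.unsqueeze(1) * A
    # E is the right singular vector of A with the smallest singular value.
    _, _, Vt = torch.linalg.svd(A)
    E = Vt[-1].reshape(3, 3)
    # Project onto the essential-matrix manifold: singular values (1, 1, 0)
    # (overall scale is unobservable in two-view geometry).
    U, S, Vt = torch.linalg.svd(E)
    S = torch.stack([torch.ones_like(S[0]), torch.ones_like(S[0]), torch.zeros_like(S[0])])
    return U @ torch.diag(S) @ Vt
```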
We apply our method to sparse-view CT for downstream radiotherapy planning and show 1) that metric-guided bounds have valid coverage for downstream metrics while conventional pixel-wise bounds do not, and 2) that the upper/lower bounds produced by metric-guided and pixel-wise methods differ anatomically.
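As a rough illustration of the metric-guided idea, split conformal prediction can calibrate a bound directly on a downstream metric rather than per pixel; the sketch below uses synthetic calibration errors and a placeholder metric, and is not the paper's exact procedure:

```python
import numpy as np

def calibrate_metric_bound(metric_errors, alpha=0.1):
    """Split-conformal bound for a scalar downstream metric.

    metric_errors: |metric(prediction) - metric(ground truth)| over a held-out
    calibration set. Returns a radius q such that [m_hat - q, m_hat + q]
    covers the true metric with probability >= 1 - alpha (under
    exchangeability of calibration and test cases).
    """
    n = len(metric_errors)
    level = np.ceil((n + 1) * (1 - alpha)) / n  # finite-sample correction
    return np.quantile(metric_errors, level, method="higher")

# Toy usage with synthetic errors standing in for, e.g., a mean-dose metric:
rng = np.random.default_rng(0)
cal_errors = np.abs(rng.normal(0.0, 1.0, size=500))
q = calibrate_metric_bound(cal_errors, alpha=0.1)
# For a new case with predicted metric m_hat, report [m_hat - q, m_hat + q].
```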
This paper presents an innovative disjoint sampling approach for training state-of-the-art (SOTA) models on hyperspectral image classification (HSIC) tasks.
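The essence of disjoint sampling is that every labeled pixel is assigned to exactly one of the train/validation/test splits, so no sample leaks across them; a simplified per-class sketch follows (the split fractions and data layout are illustrative assumptions):

```python
import numpy as np

def disjoint_split(labels, train_frac=0.1, val_frac=0.1, seed=0):
    """Per-class disjoint train/val/test split of labeled pixels.

    labels: (H, W) ground-truth map with 0 = unlabeled. Each labeled pixel
    lands in exactly one split, so no sample appears in two splits.
    Returns three (N_i, 2) arrays of pixel coordinates.
    """
    rng = np.random.default_rng(seed)
    train, val, test = [], [], []
    for c in np.unique(labels[labels > 0]):
        idx = np.argwhere(labels == c)   # coordinates of class-c pixels
        rng.shuffle(idx)                 # shuffle before slicing into splits
        n_tr = int(len(idx) * train_frac)
        n_va = int(len(idx) * val_frac)
        train.append(idx[:n_tr])
        val.append(idx[n_tr:n_tr + n_va])
        test.append(idx[n_tr + n_va:])
    return np.concatenate(train), np.concatenate(val), np.concatenate(test)
```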
Traditional Transformer models encounter challenges with variable-length input sequences, particularly in Hyperspectral Image Classification (HSIC), which raises efficiency and scalability concerns.
This paper presents the UniMER dataset, providing the first study of Mathematical Expression Recognition (MER) in complex real-world scenarios.
Summarizing comparative opinions about entities (e.g., hotels, phones) from a set of source reviews, often referred to as contrastive summarization, can considerably aid users in decision making.