Trending Research

DoRA: Weight-Decomposed Low-Rank Adaptation

NVlabs/DoRA • 14 Feb 2024

By employing DoRA, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.

0.29 stars / hour

A Survey on Deep Learning for Theorem Proving

zhaoyu-li/dl4tp • 15 Apr 2024

Theorem proving is a fundamental aspect of mathematics, spanning from informal reasoning in mathematical language to rigorous derivations in formal systems.

Automated Theorem Proving

0.29 stars / hour

TextHawk: Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models

yuyq96/texthawk • 14 Apr 2024

We conduct extensive experiments on both general and document-oriented MLLM benchmarks, and show that TextHawk outperforms the state-of-the-art methods, demonstrating its effectiveness and superiority in fine-grained document perception and general abilities.

0.28 stars / hour

RoadBEV: Road Surface Reconstruction in Bird's Eye View

ztsrxh/roadbev • 9 Apr 2024

This paper proposes two simple yet effective models for road elevation reconstruction in BEV, RoadBEV-mono and RoadBEV-stereo, which estimate road elevation from monocular and stereo images, respectively.

Autonomous Driving, Monocular Depth Estimation

0.28 stars / hour

NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving

wljungbergh/neuroncap • 11 Apr 2024

We present a versatile NeRF-based simulator for testing autonomous driving (AD) software systems, designed with a focus on sensor-realistic closed-loop evaluation and the creation of safety-critical scenarios.

Autonomous Driving

0.28 stars / hour

LayoutLLM: Layout Instruction Tuning with Large Language Models for Document Understanding

AlibabaResearch/AdvancedLiterateMachinery • 8 Apr 2024

The core of LayoutLLM is a layout instruction tuning strategy, which is specially designed to enhance the comprehension and utilization of document layouts.

Document Understanding

894 stars

0.28 stars / hour

HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images

faceonlive/ai-research • 14 Apr 2024

Benefiting from the developments in deep learning technology, deep-learning-based algorithms employing automatic feature extraction have achieved remarkable performance on the change detection (CD) task.

Ranked #1 on Change Detection on GoogleGZ-CD

Change Detection

131 stars

0.27 stars / hour
