KAN: Kolmogorov-Arnold Networks

Blealtan/efficient-kan 30 Apr 2024

Inspired by the Kolmogorov-Arnold representation theorem, we propose Kolmogorov-Arnold Networks (KANs) as promising alternatives to Multi-Layer Perceptrons (MLPs).

3,201
0.43 stars / hour

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

xu1868/sayself 31 May 2024

Large language models (LLMs) often generate inaccurate or fabricated information and generally fail to indicate their confidence, which limits their broader applications.

34
0.42 stars / hour

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

swc-17/sparsedrive 30 May 2024

To this end, we explore the sparse representation and review the task design for end-to-end autonomous driving, proposing a new paradigm named SparseDrive.

Attribute Autonomous Driving +1

65
0.42 stars / hour

Non-destructive Degradation Pattern Decoupling for Ultra-early Battery Prototype Verification Using Physics-informed Machine Learning

terencetaothucb/TBSI-Sunwoda-Battery-Dataset 1 Jun 2024

Manufacturing complexities and uncertainties have impeded the transition from material prototypes to commercial batteries, making prototype verification critical to quality assessment.

Attribute Physics-informed machine learning

16
0.40 stars / hour
205
0.40 stars / hour

SimPO: Simple Preference Optimization with a Reference-Free Reward

princeton-nlp/simpo 23 May 2024

Our top-performing model, built on Llama3-8B-Instruct, achieves a remarkable 44. 7 length-controlled win rate on AlpacaEval 2 -- surpassing Claude 3 Opus on the leaderboard, and a 33. 8 win rate on Arena-Hard -- making it the strongest 8B open-source model.

Instruction Following

393
0.39 stars / hour

Make Your LLM Fully Utilize the Context

hsiehjackson/ruler 25 Apr 2024

While many contemporary large language models (LLMs) can process lengthy input, they still struggle to fully utilize information within the long context, known as the lost-in-the-middle challenge.

4k Information Retrieval +1

261
0.37 stars / hour

Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling

black-yt/weathergft 22 May 2024

Data-driven artificial intelligence (AI) models have made significant advancements in weather forecasting, particularly in medium-range and nowcasting.

Weather Forecasting

60
0.36 stars / hour

SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models

guoyww/animatediff 28 Nov 2023

The development of text-to-video (T2V), i. e., generating videos with a given text prompt, has been significantly advanced in recent years.

Video Generation

9,419
0.35 stars / hour

APISR: Anime Production Inspired Real-World Anime Super-Resolution

kiteretsu77/apisr 3 Mar 2024

In addition, we identify two anime-specific challenges of distorted and faint hand-drawn lines and unwanted color artifacts.

Super-Resolution

696
0.35 stars / hour