DoRA: Weight-Decomposed Low-Rank Adaptation

NVlabs/DoRA 14 Feb 2024

By employing DoRA, we enhance both the learning capacity and training stability of LoRA while avoiding any additional inference overhead.

66
0.28 stars / hour

MEDITRON-70B: Scaling Medical Pretraining for Large Language Models

epfllm/meditron 27 Nov 2023

Large language models (LLMs) can potentially democratize access to medical knowledge.

 Ranked #1 on Multiple Choice Question Answering (MCQA) on MedMCQA (Dev Set (Acc-%) metric)

Conditional Text Generation Multiple Choice Question Answering (MCQA)

1,629
0.27 stars / hour

SUQL: Conversational Search over Structured and Unstructured Data with Large Language Models

stanford-oval/suql 16 Nov 2023

This paper presents the first conversational agent that supports the full generality of hybrid data access for large knowledge corpora, through a language we developed called SUQL (Structured and Unstructured Query Language).

Conversational Search In-Context Learning +1

77
0.27 stars / hour

OpenAgents: An Open Platform for Language Agents in the Wild

xlang-ai/xlang 16 Oct 2023

Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs).

2D Object Detection

3,546
0.27 stars / hour

EasyVolcap: Accelerating Neural Volumetric Video Research

zju3dv/easyvolcap 11 Dec 2023

Volumetric video is a technology that digitally records dynamic events such as artistic performances, sporting events, and remote conversations.

537
0.26 stars / hour

Advancing LLM Reasoning Generalists with Preference Trees

openbmb/eurus 2 Apr 2024

We introduce Eurus, a suite of large language models (LLMs) optimized for reasoning.

Benchmarking Code Generation +1

169
0.26 stars / hour

Hallucination of Multimodal Large Language Models: A Survey

showlab/awesome-mllm-hallucination 29 Apr 2024

By drawing the granular classification and landscapes of hallucination causes, evaluation benchmarks, and mitigation methods, this survey aims to deepen the understanding of hallucinations in MLLMs and inspire further advancements in the field.

Hallucination

126
0.25 stars / hour

Modeling Personalized Item Frequency Information for Next-basket Recommendation

HaojiHu/TIFUKNN 31 May 2020

NBR is in general more complex than the widely studied sequential (session-based) recommendation which recommends the next item based on a sequence of items.

Next-basket recommendation Session-Based Recommendations

92
0.25 stars / hour

Autonomous LLM-driven research from data to human-verifiable research papers

technion-kishony-lab/data-to-paper 24 Apr 2024

As AI promises to accelerate scientific discovery, it remains unclear whether fully AI-driven research is possible and whether it can adhere to key scientific values, such as transparency, traceability and verifiability.

23
0.25 stars / hour

OpenStreetView-5M: The Many Roads to Global Visual Geolocation

gastruc/osv5m 29 Apr 2024

Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms.

Memorization

24
0.25 stars / hour