OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

apple/corenet 22 Apr 2024

To this end, we release OpenELM, a state-of-the-art open language model.

Language Modelling

1,692
17.71 stars / hour

CyberSecEval 2: A Wide-Ranging Cybersecurity Evaluation Suite for Large Language Models

facebookresearch/purplellama 19 Apr 2024

We present BenchmarkName, a novel benchmark to quantify LLM security risks and capabilities.

1,817
3.46 stars / hour

QLoRA: Efficient Finetuning of Quantized LLMs

internlm/xtuner NeurIPS 2023

Our best model family, which we name Guanaco, outperforms all previous openly released models on the Vicuna benchmark, reaching 99. 3% of the performance level of ChatGPT while only requiring 24 hours of finetuning on a single GPU.

Chatbot Instruction Following +2

2,092
2.52 stars / hour

Improving Diffusion Models for Virtual Try-on

yisol/IDM-VTON 8 Mar 2024

Finally, we present a customization method using a pair of person-garment images, which significantly improves fidelity and authenticity.

Virtual Try-on

403
2.11 stars / hour

ID-Animator: Zero-Shot Identity-Preserving Human Video Generation

id-animator/id-animator 23 Apr 2024

Based on this pipeline, a random face reference training method is further devised to precisely capture the ID-relevant embeddings from reference images, thus improving the fidelity and generalization capacity of our model for ID-specific video generation.

Attribute Video Generation

73
1.67 stars / hour

ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation

hiyouga/llama-efficient-tuning 4 Aug 2023

Applying Reinforcement Learning (RL) to sequence generation models enables the direct optimization of long-term rewards (\textit{e. g.,} BLEU and human feedback), but typically requires large-scale sampling over a space of action sequences.

Abstractive Text Summarization Language Modelling +5

19,592
1.65 stars / hour

Dynamic Generation of Personalities with Large Language Models

hiyouga/llama-factory 10 Apr 2024

We propose a new metric to assess personality generation capability based on this evaluation method.

Personality Generation

19,575
1.64 stars / hour

AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation

ez-hwh/autocrawler 19 Apr 2024

We propose AutoCrawler, a two-stage framework that leverages the hierarchical structure of HTML for progressive understanding.

Action Generation

147
1.52 stars / hour

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

tencentarc/instantmesh 10 Apr 2024

We present InstantMesh, a feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability.

Image to 3D

1,389
1.32 stars / hour

SnapKV: LLM Knows What You are Looking for Before Generation

fasterdecoding/snapkv 22 Apr 2024

Specifically, SnapKV achieves a consistent decoding speed with a 3. 6x increase in generation speed and an 8. 2x enhancement in memory efficiency compared to baseline when processing inputs of 16K tokens.

16k

59
1.20 stars / hour