Chatbot
168 papers with code • 0 benchmarks • 8 datasets
Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.
Source: Open Data Chatbot
Benchmarks
These leaderboards are used to track progress in Chatbot
Libraries
Use these libraries to find Chatbot models and implementationsDatasets
Latest papers
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators
Even simple, known confounders such as preference for longer outputs remain in existing automated evaluation metrics.
Physics Event Classification Using Large Language Models
The 2023 AI4EIC hackathon was the culmination of the third annual AI4EIC workshop at The Catholic University of America.
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
Pornographic content occurring in human-machine interaction dialogues can cause severe side effects for users in open-domain dialogue systems.
Characteristic AI Agents via Large Language Models
In response to this research gap, we create a benchmark for the characteristic AI agents task, including dataset, techniques, and evaluation metrics.
DeepSeek-VL: Towards Real-World Vision-Language Understanding
The DeepSeek-VL family (both 1. 3B and 7B models) showcases superior user experiences as a vision-language chatbot in real-world applications, achieving state-of-the-art or competitive performance across a wide range of visual-language benchmarks at the same model size while maintaining robust performance on language-centric benchmarks.
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
To address this issue, we introduce Chatbot Arena, an open platform for evaluating LLMs based on human preferences.
Yi: Open Foundation Models by 01.AI
The Yi model family is based on 6B and 34B pretrained language models, then we extend them to chat models, 200K long context models, depth-upscaled models, and vision-language models.
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark
As language models are often deployed as chatbot assistants, it becomes a virtue for models to engage in conversations in a user's first language.
ASEM: Enhancing Empathy in Chatbot through Attention-based Sentiment and Emotion Modeling
Effective feature representations play a critical role in enhancing the performance of text generation models that rely on deep neural networks.
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs
We leverage LLMs to generate challenging tasks related to hypothetical phenomena, subsequently employing them as agents for efficient hallucination detection.