Search Results for author: Yixin Nie

Found 24 papers, 15 papers with code

Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning

no code implementations • 18 Mar 2024 • Rao Fu, Jingyu Liu, Xilun Chen, Yixin Nie, Wenhan Xiong

This paper introduces Scene-LLM, a 3D-visual-language model that enhances embodied agents' abilities in interactive 3D indoor environments by integrating the reasoning strengths of Large Language Models (LLMs).

Dense Captioning Language Modelling +1

Paper
Add Code

The Role of Chain-of-Thought in Complex Vision-Language Reasoning Task

no code implementations • 15 Nov 2023 • Yifan Wu, Pengchuan Zhang, Wenhan Xiong, Barlas Oguz, James C. Gee, Yixin Nie

The study explores the effectiveness of the Chain-of-Thought approach, known for its proficiency in language tasks by breaking them down into sub-tasks and intermediate steps, in improving vision-language tasks that demand sophisticated perception and reasoning.

Ranked #1 on Visual Reasoning on Winoground

Visual Reasoning

Paper
Add Code

Jointly Training Large Autoregressive Multimodal Models

1 code implementation • 27 Sep 2023 • Emanuele Aiello, Lili Yu, Yixin Nie, Armen Aghajanyan, Barlas Oguz

In recent years, advances in the large-scale pretraining of language and text-to-image models have revolutionized the field of machine learning.

Image Generation

Paper
Code

Llama 2: Open Foundation and Fine-Tuned Chat Models

14 code implementations • 18 Jul 2023 • Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, Dan Bikel, Lukas Blecher, Cristian Canton Ferrer, Moya Chen, Guillem Cucurull, David Esiobu, Jude Fernandes, Jeremy Fu, Wenyin Fu, Brian Fuller, Cynthia Gao, Vedanuj Goswami, Naman Goyal, Anthony Hartshorn, Saghar Hosseini, Rui Hou, Hakan Inan, Marcin Kardas, Viktor Kerkez, Madian Khabsa, Isabel Kloumann, Artem Korenev, Punit Singh Koura, Marie-Anne Lachaux, Thibaut Lavril, Jenya Lee, Diana Liskovich, Yinghai Lu, Yuning Mao, Xavier Martinet, Todor Mihaylov, Pushkar Mishra, Igor Molybog, Yixin Nie, Andrew Poulton, Jeremy Reizenstein, Rashi Rungta, Kalyan Saladi, Alan Schelten, Ruan Silva, Eric Michael Smith, Ranjan Subramanian, Xiaoqing Ellen Tan, Binh Tang, Ross Taylor, Adina Williams, Jian Xiang Kuan, Puxin Xu, Zheng Yan, Iliyan Zarov, Yuchen Zhang, Angela Fan, Melanie Kambadur, Sharan Narang, Aurelien Rodriguez, Robert Stojnic, Sergey Edunov, Thomas Scialom

In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters.

Ranked #2 on Question Answering on PubChemQA

Arithmetic Reasoning +5

52,595

Paper
Code

Text-guided 3D Human Generation from 2D Collections

no code implementations • 23 May 2023 • Tsu-Jui Fu, Wenhan Xiong, Yixin Nie, Jingyu Liu, Barlas Oğuz, William Yang Wang

To address this \texttt{T3H} task, we propose Compositional Cross-modal Human (CCH).

Ranked #1 on Text-to-3D-Human Generation on SHHQ

text-to-3d-human Text-to-3D-Human Generation

Paper
Add Code

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

1 code implementation • 18 Apr 2023 • Yuwei Yin, Jean Kaddour, Xiang Zhang, Yixin Nie, Zhenguang Liu, Lingpeng Kong, Qi Liu

In addition, generative data augmentation (GDA) has been shown to produce more diverse and flexible data.

Data Augmentation domain classification +1

Paper
Code

3DGen: Triplane Latent Diffusion for Textured Mesh Generation

no code implementations • 9 Mar 2023 • Anchit Gupta, Wenhan Xiong, Yixin Nie, Ian Jones, Barlas Oğuz

We take another step along this direction, combining these developments in a two-step pipeline consisting of 1) a triplane VAE which can learn latent representations of textured meshes and 2) a conditional diffusion model which generates the triplane features.

Image Generation Texture Synthesis

Paper
Add Code

CLIP-Layout: Style-Consistent Indoor Scene Synthesis with Semantic Furniture Embedding

no code implementations • 7 Mar 2023 • Jingyu Liu, Wenhan Xiong, Ian Jones, Yixin Nie, Anchit Gupta, Barlas Oğuz

Whether heuristic or learned, these methods ignore instance-level visual attributes of objects, and as a result may produce visually less coherent scenes.

Indoor Scene Synthesis Scene Generation

Paper
Add Code

TVLT: Textless Vision-Language Transformer

2 code implementations • 28 Sep 2022 • Zineng Tang, Jaemin Cho, Yixin Nie, Mohit Bansal

In this work, we present the Textless Vision-Language Transformer (TVLT), where homogeneous transformer blocks take raw visual and audio inputs for vision-and-language representation learning with minimal modality-specific design, and do not use text-specific modules such as tokenization or automatic speech recognition (ASR).

Automatic Speech Recognition (ASR) Image Retrieval +6

124,593

Paper
Code

MLP Architectures for Vision-and-Language Modeling: An Empirical Study

1 code implementation • 8 Dec 2021 • Yixin Nie, Linjie Li, Zhe Gan, Shuohang Wang, Chenguang Zhu, Michael Zeng, Zicheng Liu, Mohit Bansal, Lijuan Wang

Based on this, we ask an even bolder question: can we have an all-MLP architecture for VL modeling, where both VL fusion and the vision encoder are replaced with MLPs?

Language Modelling Visual Question Answering (VQA)

Paper
Code

Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference

1 code implementation • Findings (ACL) 2021 • Hai Hu, He Zhou, Zuoyu Tian, Yiwen Zhang, Yina Ma, Yanting Li, Yixin Nie, Kyle Richardson

These results, however, come with important caveats: cross-lingual models often perform best when trained on a mixture of English and high-quality monolingual NLI data (OCNLI), and are often hindered by automatically translated resources (XNLI-zh).

Cross-Lingual Transfer Natural Language Inference +2

Paper
Code

Distributed NLI: Learning to Predict Human Opinion Distributions for Language Reasoning

1 code implementation • Findings (ACL) 2022 • Xiang Zhou, Yixin Nie, Mohit Bansal

We introduce distributed NLI, a new NLU task with a goal to predict the distribution of human judgements for natural language inference.

Natural Language Inference

Paper
Code

Dynabench: Rethinking Benchmarking in NLP

no code implementations • NAACL 2021 • Douwe Kiela, Max Bartolo, Yixin Nie, Divyansh Kaushik, Atticus Geiger, Zhengxuan Wu, Bertie Vidgen, Grusha Prasad, Amanpreet Singh, Pratik Ringshia, Zhiyi Ma, Tristan Thrush, Sebastian Riedel, Zeerak Waseem, Pontus Stenetorp, Robin Jia, Mohit Bansal, Christopher Potts, Adina Williams

We introduce Dynabench, an open-source platform for dynamic dataset creation and model benchmarking.

Benchmarking

Paper
Add Code

To what extent do human explanations of model behavior align with actual model behavior?

no code implementations • EMNLP (BlackboxNLP) 2021 • Grusha Prasad, Yixin Nie, Mohit Bansal, Robin Jia, Douwe Kiela, Adina Williams

Given the increasingly prominent role NLP models (will) play in our lives, it is important for human expectations of model behavior to align with actual model behavior.

Natural Language Inference

Paper
Add Code

I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling

no code implementations • ACL 2021 • Yixin Nie, Mary Williamson, Mohit Bansal, Douwe Kiela, Jason Weston

To quantify how well natural language understanding models can capture consistency in a general conversation, we introduce the DialoguE COntradiction DEtection task (DECODE) and a new conversational dataset containing both human-human and human-bot contradictory dialogues.

Natural Language Understanding

Paper
Add Code

ConjNLI: Natural Language Inference Over Conjunctive Sentences

1 code implementation • EMNLP 2020 • Swarnadeep Saha, Yixin Nie, Mohit Bansal

Reasoning about conjuncts in conjunctive sentences is important for a deeper understanding of conjunctions in English and also how their usages and semantics differ from conjunctive and disjunctive boolean logic.

Natural Language Inference

Paper
Code

What Can We Learn from Collective Human Opinions on Natural Language Inference Data?

1 code implementation • EMNLP 2020 • Yixin Nie, Xiang Zhou, Mohit Bansal

Analysis reveals that: (1) high human disagreement exists in a noticeable amount of examples in these datasets; (2) the state-of-the-art models lack the ability to recover the distribution over human labels; (3) models achieve near-perfect accuracy on the subset of data with a high level of human agreement, whereas they can barely beat a random guess on the data with low levels of human agreement, which compose most of the common errors made by state-of-the-art models on the evaluation sets.

Natural Language Inference

Paper
Code

Simple Compounded-Label Training for Fact Extraction and Verification

no code implementations • WS 2020 • Yixin Nie, Lisa Bauer, Mohit Bansal

Automatic fact checking is an important task motivated by the need for detecting and preventing the spread of misinformation across the web.

Claim Verification Fact Checking +4

Paper
Add Code

The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions

1 code implementation • EMNLP 2020 • Xiang Zhou, Yixin Nie, Hao Tan, Mohit Bansal

For the first question, we conduct a thorough empirical study over analysis sets and find that in addition to the unstable final performance, the instability exists all along the training curve.

Model Selection Natural Language Inference +1

Paper
Code

Adversarial NLI: A New Benchmark for Natural Language Understanding

2 code implementations • ACL 2020 • Yixin Nie, Adina Williams, Emily Dinan, Mohit Bansal, Jason Weston, Douwe Kiela

We introduce a new large-scale NLI benchmark dataset, collected via an iterative, adversarial human-and-model-in-the-loop procedure.

Natural Language Understanding

381

Paper
Code

Revealing the Importance of Semantic Retrieval for Machine Reading at Scale

2 code implementations • IJCNLP 2019 • Yixin Nie, Songhe Wang, Mohit Bansal

In this work, we give general guidelines on system design for MRS by proposing a simple yet effective pipeline system with special consideration on hierarchical semantic retrieval at both paragraph and sentence level, and their potential effects on the downstream task.

Ranked #44 on Question Answering on HotpotQA

Fact Verification Information Retrieval +5

Paper
Code

Combining Fact Extraction and Verification with Neural Semantic Matching Networks

2 code implementations • 16 Nov 2018 • Yixin Nie, Haonan Chen, Mohit Bansal

The increasing concern with misinformation has stimulated research efforts on automatic fact checking.

Claim Verification Fact Checking +5

Paper
Code

Analyzing Compositionality-Sensitivity of NLI Models

1 code implementation • 16 Nov 2018 • Yixin Nie, Yicheng Wang, Mohit Bansal

Therefore, we propose a compositionality-sensitivity testing setup that analyzes models on natural examples from existing datasets that cannot be solved via lexical features alone (i. e., on which a bag-of-words model gives a high probability to one wrong label), hence revealing the models' actual compositionality awareness.

Natural Language Inference

Paper
Code

Shortcut-Stacked Sentence Encoders for Multi-Domain Inference

3 code implementations • WS 2017 • Yixin Nie, Mohit Bansal

We present a simple sequential sentence encoder for multi-domain natural language inference.

Ranked #62 on Natural Language Inference on SNLI

Natural Language Inference Sentence +1

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.