Search Results for author: Oana Ignat

Found 15 papers, 12 papers with code

MAiDE-up: Multilingual Deception Detection of GPT-generated Hotel Reviews

no code implementations19 Apr 2024 Oana Ignat, Xiaomeng Xu, Rada Mihalcea

Using this dataset, we conduct extensive linguistic analyses to (1) compare the AI fake hotel reviews to real hotel reviews, and (2) identify the factors that influence the deception detection model performance.

Deception Detection

Cross-cultural Inspiration Detection and Analysis in Real and LLM-generated Social Media Data

1 code implementation19 Apr 2024 Oana Ignat, Gayathri Ganesh Lakshmy, Rada Mihalcea

To this end, we compile and make publicly available the InspAIred dataset, which consists of 2, 000 real inspiring posts, 2, 000 real non-inspiring posts, and 2, 000 generated inspiring posts evenly distributed across India and the UK.

GPT-4

Towards Algorithmic Fidelity: Mental Health Representation across Demographics in Synthetic vs. Human-generated Data

1 code implementation25 Mar 2024 Shinka Mori, Oana Ignat, Andrew Lee, Rada Mihalcea

Using GPT-3, we develop HEADROOM, a synthetic dataset of 3, 120 posts about depression-triggering stressors, by controlling for race, gender, and time frame (before and after COVID-19).

Synthetic Data Generation

Annotations on a Budget: Leveraging Geo-Data Similarity to Balance Model Performance and Annotation Cost

1 code implementation12 Mar 2024 Oana Ignat, Longju Bai, Joan Nwatu, Rada Mihalcea

In this paper, we propose methods to identify the data to be annotated to balance model performance and annotation costs.

Bridging the Digital Divide: Performance Variation across Socio-Economic Factors in Vision-Language Models

1 code implementation9 Nov 2023 Joan Nwatu, Oana Ignat, Rada Mihalcea

Despite the impressive performance of current AI models reported across various tasks, performance reports often do not include evaluations of how these models perform on the specific groups that will be impacted by these technologies.

Language Modelling

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

1 code implementation5 Nov 2023 Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu

While we demonstrate our data augmentation method with MDETR framework, the proposed approach is applicable to common grounding-based vision and language tasks with other frameworks.

Data Augmentation Phrase Grounding +1

Human Action Co-occurrence in Lifestyle Vlogs using Graph Link Prediction

1 code implementation12 Sep 2023 Oana Ignat, Santiago Castro, Weiji Li, Rada Mihalcea

We create and make publicly available the ACE (Action Co-occurrencE) dataset, consisting of a large graph of ~12k co-occurring pairs of visual actions and their corresponding video clips.

Link Prediction

Scalable Performance Analysis for Vision-Language Models

1 code implementation30 May 2023 Santiago Castro, Oana Ignat, Rada Mihalcea

Joint vision-language models have shown great performance over a diverse set of tasks.

Detecting Inspiring Content on Social Media

1 code implementation6 Sep 2021 Oana Ignat, Y-Lan Boureau, Jane A. Yu, Alon Halevy

We release a dataset of 5, 800 inspiring and 5, 800 non-inspiring English-language public post unique ids collected from a dump of Reddit public posts made available by a third party and use linguistic heuristics to automatically detect which social media English-language posts are inspiring.

FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation Framework

1 code implementation ACL 2022 Santiago Castro, Ruoyao Wang, Pingxuan Huang, Ian Stewart, Oana Ignat, Nan Liu, Jonathan C. Stroud, Rada Mihalcea

We propose fill-in-the-blanks as a video understanding evaluation framework and introduce FIBER -- a novel dataset consisting of 28, 000 videos and descriptions in support of this evaluation framework.

Language Modelling Multiple-choice +4

Cannot find the paper you are looking for? You can Submit a new open access paper.