The goal of InfoLossQA is to generate a series of QA pairs that reveal to lay readers what information a simplified text lacks compared to its original.

We provide an annotated dataset in the domain of medical text simplification, specifically abstracts of Randomized Controlled Trials (RCTs). The abstracts were automatically simplified by an LLM (GPT-4). Then, three linguists annotated information loss and wrote the QA pairs.
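To make the description above concrete, here is a minimal sketch of how one annotated example could be held in memory. The class names, field names, and the `format_qa` helper are all assumptions chosen for illustration; they are not the dataset's actual schema, and the strings are placeholders rather than real data.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class QAPair:
    """One question-answer pair pointing at lost information (hypothetical layout)."""
    question: str
    answer: str


@dataclass
class InfoLossRecord:
    """One RCT abstract, its simplification, and its QA pairs (hypothetical layout)."""
    original: str
    simplified: str
    qa_pairs: List[QAPair] = field(default_factory=list)


def format_qa(record: InfoLossRecord) -> str:
    """Render the QA pairs as plain text a lay reader could skim."""
    lines = []
    for pair in record.qa_pairs:
        lines.append(f"Q: {pair.question}")
        lines.append(f"A: {pair.answer}")
    return "\n".join(lines)


# Placeholder content only; none of this comes from the dataset.
record = InfoLossRecord(
    original="<original abstract text>",
    simplified="<simplified abstract text>",
    qa_pairs=[QAPair(question="<question>", answer="<answer>")],
)
print(format_qa(record))
```

The real field names and file format should be taken from the project website.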

For more details about the dataset, see the project website: https://InfoLossQA.ikim.nrw/#/
