The goal of InfoLossQA is to generate a series of QA pairs that reveal to lay readers what information a simplified text lacks compared to its original.
We provide an annotated dataset in the domain of medical text simplification, specifically abstracts of randomized controlled trials (RCTs). The abstracts were simplified automatically by an LLM (GPT-4); three linguists then annotated the information loss and wrote the QA pairs.
For more details about the dataset, see the project website: https://InfoLossQA.ikim.nrw/#/