no code implementations • EMNLP 2021 • Alexander Spangher, Jonathan May, Sz-Rung Shiang, Lingjia Deng
As labeling schemas evolve over time, small differences can render datasets following older schemas unusable.
Ranked #1 on Text Classification on NewsDiscourse
1 code implementation • NAACL 2022 • Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng
News article revision histories provide clues to narrative and factual evolution in news articles.
no code implementations • 16 Nov 2023 • Alexander Spangher, Emilio Ferrara, Ben Welsh, Nanyun Peng, Serdar Tumgoren, Jonathan May
Journalists must find stories in huge amounts of textual data (e. g. leaks, bills, press releases) as part of their jobs: determining when and why text becomes news can help us understand coverage patterns and help us build assistive tools.
no code implementations • 30 Jun 2023 • Guillaume Sanchez, Honglu Fan, Alexander Spangher, Elad Levi, Pawan Sasanka Ammanamanchi, Stella Biderman
Classifier-Free Guidance (CFG) has recently emerged in text-to-image generation as a lightweight technique to encourage prompt-adherence in generations.
Ranked #1 on Text Generation on SciQ
1 code implementation • 24 May 2023 • Alexander Spangher, Nanyun Peng, Jonathan May, Emilio Ferrara
News articles are driven by the informational sources journalists use in reporting.
no code implementations • 5 Jan 2023 • Alexander Spangher, Xinyu Hua, Yao Ming, Nanyun Peng
While GPT-2 generates sentences that are remarkably human-like, longer documents can ramble and do not follow human-like writing structure.
no code implementations • 14 Jun 2022 • Alexander Spangher, Divya Choudhary
Developing and improving computational approaches to covering news can increase journalistic output and improve the way stories are covered.
1 code implementation • 14 Jun 2022 • Alexander Spangher, Xiang Ren, Jonathan May, Nanyun Peng
News article revision histories provide clues to narrative and factual evolution in news articles.
no code implementations • 20 Apr 2021 • Alexander Spangher, Jonathan May
In this work, we create a web application to highlight the output of NLP models trained to parse and label discourse segments in law text.
1 code implementation • 19 Apr 2021 • Alexander Spangher, Nanyun Peng, Jonathan May, Emilio Ferrara
Journalists publish statements provided by people, or \textit{sources} to contextualize current events, help voters make informed decisions, and hold powerful individuals accountable.
no code implementations • 19 Apr 2021 • Alexander Spangher, Nanyun Peng, Jonathan May, Emilio Ferrara
Journalists obtain "leads", or story ideas, by reading large corpora of government records: court cases, proposed bills, etc.
no code implementations • 19 Apr 2021 • Alexander Spangher, Jonathan May
In this work, we present, to our knowledge, the first publicly available dataset of news article revision histories, or NewsEdits.
no code implementations • 2 Jan 2021 • Alexander Spangher, Jonathan May, Sz-Rung Shiang, Lingjia Deng
Small class-imbalanced datasets, common in many high-level semantic tasks like discourse analysis, present a particular challenge to current deep-learning architectures.
3 code implementations • 18 Sep 2018 • Berk Ustun, Alexander Spangher, Yang Liu
We present integer programming tools to ensure recourse in linear classification problems without interfering in model development.
no code implementations • 16 Jan 2015 • Alexander Spangher
Using nonparametric methods has been increasingly explored in Bayesian hierarchical modeling as a way to increase model flexibility.