Search Results for author: Janek Bevendorff

Found 12 papers, 7 papers with code

Detecting Generated Native Ads in Conversational Search

2 code implementations7 Feb 2024 Sebastian Schmidt, Ines Zelch, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

In this paper, we thus take a first step to investigate whether LLMs can also be used as a countermeasure, i. e., to block generated native ads.

Conversational Search Sentence

The Information Retrieval Experiment Platform

1 code implementation30 May 2023 Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Simon Reich, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

Standardization is achieved when a retrieval approach implements PyTerrier's interfaces and the input and output of an experiment are compatible with ir_datasets and ir_measures.

Information Retrieval Retrieval

SMAuC -- The Scientific Multi-Authorship Corpus

no code implementations4 Nov 2022 Janek Bevendorff, Philipp Sauer, Lukas Gienapp, Wolfgang Kircheis, Erik Körner, Benno Stein, Martin Potthast

The rapidly growing volume of scientific publications offers an interesting challenge for research on methods for analyzing the authorship of documents with one or more authors.

FastWARC: Optimizing Large-Scale Web Archive Analytics

1 code implementation22 Nov 2021 Janek Bevendorff, Martin Potthast, Benno Stein

Web search and other large-scale web data analytics rely on processing archives of web pages stored in a standardized and efficient format.

The Impact of Main Content Extraction on Near-Duplicate Detection

no code implementations21 Nov 2021 Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, Martin Potthast

Commercial web search engines employ near-duplicate detection to ensure that users see each relevant result only once, albeit the underlying web crawls typically include (near-)duplicates of many web pages.

Information Retrieval Retrieval

Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis

no code implementations ACL 2020 Janek Bevendorff, Khalid Al Khatib, Martin Potthast, Benno Stein

This paper introduces the Webis Gmane Email Corpus 2019, the largest publicly available and fully preprocessed email corpus to date.

Heuristic Authorship Obfuscation

1 code implementation ACL 2019 Janek Bevendorff, Martin Potthast, Matthias Hagen, Benno Stein

Authorship verification is the task of determining whether two texts were written by the same author.

Authorship Verification

Bias Analysis and Mitigation in the Evaluation of Authorship Verification

1 code implementation ACL 2019 Janek Bevendorff, Matthias Hagen, Benno Stein, Martin Potthast

The PAN series of shared tasks is well known for its continuous and high quality research in the field of digital text forensics.

Authorship Verification Benchmarking

A Stylometric Inquiry into Hyperpartisan and Fake News

1 code implementation ACL 2018 Martin Potthast, Johannes Kiesel, Kevin Reinartz, Janek Bevendorff, Benno Stein

The articles originated from 9 well-known political publishers, 3 each from the mainstream, the hyperpartisan left-wing, and the hyperpartisan right-wing.

Authorship Verification Fake News Detection +1

Cannot find the paper you are looking for? You can Submit a new open access paper.