Prioritizing and Scheduling Conferences for Metadata Harvesting in dblp

17 Apr 2018  ·  Mandy Neumann, Christopher Michels, Philipp Schaer, Ralf Schenkel ·

Maintaining literature databases and online bibliographies is a core responsibility of metadata aggregators such as digital libraries. In the process of monitoring all the available data sources the question arises which data source should be prioritized. Based on a broad definition of information quality we are looking for different ways to find the best fitting and most promising conference candidates to harvest next. We evaluate different conference ranking features by using a pseudo-relevance assessment and a component-based evaluation of our approach.

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper