AskUbuntu question dataset is a preprocessed collection of questions taken from the AskUbuntu.com 2014 corpus dump. It also comes with 400*20 manual annotations, marking pairs of questions as "similar" or "non-similar".
Paper | Code | Results | Date | Stars |
---|