Search Results for author: Shotaro Ishihara

Found 4 papers, 0 papers with code

Nikkei at SemEval-2022 Task 8: Exploring BERT-based Bi-Encoder Approach for Pairwise Multilingual News Article Similarity

no code implementations SemEval (NAACL) 2022 Shotaro Ishihara, Hono Shirai

This paper describes our system in SemEval-2022 Task 8, where participants were required to predict the similarity of two multilingual news articles.

Sentence Translation

Quantifying Memorization of Domain-Specific Pre-trained Language Models using Japanese Newspaper and Paywalls

no code implementations26 Apr 2024 Shotaro Ishihara

In this study, we pre-trained domain-specific GPT-2 models using a limited corpus of Japanese newspaper articles and quantified memorization of training data by comparing them with general Japanese GPT-2 models.

Memorization Text Generation

Generating News-Centric Crossword Puzzles As A Constraint Satisfaction and Optimization Problem

no code implementations9 Aug 2023 Kaito Majima, Shotaro Ishihara

Crossword puzzles have traditionally served not only as entertainment but also as an educational tool that can be used to acquire vocabulary and language proficiency.

Training Data Extraction From Pre-trained Language Models: A Survey

no code implementations25 May 2023 Shotaro Ishihara

This study is the first to provide a comprehensive survey of training data extraction from PLMs.

Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.