no code implementations • 8 Feb 2015 • Seung-Hoon Na, In-Su Kang, Jong-Hyeok Lee
Although these document characteristics should be differently handled, all previous methods of term frequency normalization have ignored these differences and have used a simplified length-driven approach which decreases the term frequency by only the length of a document, causing an unreasonable penalization.