In its raw frequency form, tf is just the number of times "this" occurs in each document. The word "this" appears once in each document, but because document 2 contains more words, its relative frequency there is smaller.

The inverse document frequency of a term $t$ can be written as $\log \frac{N}{n_t} = -\log \frac{n_t}{N}$, where $N$ is the total number of documents and $n_t$ is the number of documents that contain $t$.
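
The calculation can be sketched in a few lines of Python. The text does not give the two documents, so the token lists below are assumptions chosen for illustration (a five-word document and a seven-word document, each containing "this" once). The helper names `term_frequency` and `inverse_document_frequency` are likewise hypothetical and do not come from any library.

```python
import math
from collections import Counter


def term_frequency(term, doc_tokens):
    """Relative frequency: occurrences of `term` divided by document length."""
    counts = Counter(doc_tokens)
    return counts[term] / len(doc_tokens)


def inverse_document_frequency(term, corpus):
    """log(N / n_t), where n_t is the number of documents containing `term`.

    Assumes the term occurs in at least one document (n_t > 0).
    """
    n_t = sum(1 for doc in corpus if term in doc)
    return math.log(len(corpus) / n_t)


# Assumed example documents (not taken from the text above).
doc1 = "this is a a sample".split()                                   # 5 words
doc2 = "this is another another example example example".split()     # 7 words
corpus = [doc1, doc2]

tf1 = term_frequency("this", doc1)   # 1 / 5
tf2 = term_frequency("this", doc2)   # 1 / 7, smaller because doc2 is longer
idf_this = inverse_document_frequency("this", corpus)  # log(2 / 2)

print(f"tf('this', doc1) = {tf1:.3f}")
print(f"tf('this', doc2) = {tf2:.3f}")
print(f"idf('this')      = {idf_this:.3f}")
```

Under these assumed documents, "this" appears in both, so its idf is log(2/2) = 0 and its tf-idf weight vanishes in either document, whatever the term frequency.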