<> "The repository administrator has not yet configured an RDF license."^^ . <> . . "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL"^^ . "This paper presents a simple unsupervised learning algorithm for recognizing synonyms, based on statistical data acquired by querying a Web search engine. The algorithm, called PMI-IR, uses Pointwise Mutual Information (PMI) and Information Retrieval (IR) to measure the similarity of pairs of words. PMI-IR is empirically evaluated using 80 synonym test questions from the Test of English as a Foreign Language (TOEFL) and 50 synonym test questions from a collection of tests for students of English as a Second Language (ESL). On both tests, the algorithm obtains a score of 74%. PMI-IR is contrasted with Latent Semantic Analysis (LSA), which achieves a score of 64% on the same 80 TOEFL questions. The paper discusses potential applications of the new unsupervised learning algorithm and some implications of the results for LSA and LSI (Latent Semantic Indexing). \n\n"^^ . "2001" . . . . "Springer-Verlag"^^ . . . . . . . . . . . . . . "Peter"^^ . "Turney"^^ . "Peter Turney"^^ . . "Luc"^^ . "De Raedt"^^ . "Luc De Raedt"^^ . . "Peter"^^ . "Flach"^^ . "Peter Flach"^^ . . . . . . "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL (Postscript)"^^ . . . . . . "ECML2001.ps"^^ . . . "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL (Image (PNG))"^^ . . . . . . "preview.png"^^ . . . "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL (PDF)"^^ . . . . . . . . . "ECML2001.pdf"^^ . . . "Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL (Indexer Terms)"^^ . . . . . . "indexcodes.txt"^^ . . "HTML Summary of #1796 \n\nMining the Web for Synonyms: PMI-IR versus LSA on TOEFL\n\n" . "text/html" . . . "Language" . . . "Machine Learning" . . . "Statistical Models" . .