www.collocations.de [dash] References
[separator bar]

Agresti, Alan (1990). Categorical Data Analysis. John Wiley & Sons, New York.

Agresti, Alan (1992). A survey of exact inference for contingency tables. Statistical Science 7(1), 131-153.

Baayen, Harald (2001). Word Frequency Distributions. Kluwer, Dordrecht.

Blaheta, Don and Johnson, Mark (2001). Unsupervised learning of multi-word verbs. In Proceedings of the ACL Workshop on Collocations, Toulouse, France, pages 54-60.

Church, Kenneth W. and Hanks, Patrick (1990). Word association norms, mutual information, and lexicography. Computational Linguistics 16(1), 22-29. (PostScript)

Church, Kenneth W.; Gale, William; Hanks, Patrick; Hindle, Donald (1991). Using statistics in lexical analysis. In Lexical Acquisition: Using On-line Resources to Build a Lexicon, Lawrence Erlbaum, pages 115-164. (PostScript)

Daille, Béatrice (1994). Approche mixte pour l'extraction automatique de terminologie: statistiques lexicales et filtres linguistiques. PhD thesis, Université Paris 7. (PostScript)

Dennis, Sally F. (1965). The construction of a thesaurus automatically from a sample of text. In Proceedings of the Symposium on Statistical Association Methods For Mechanized Documentation, Washington, DC, pages 61-148.

Dias, Gaël; Guilloré, Sylvie; Lopes, José G. P. (1999). Language independent automatic acquisition of rigid multiword units from unrestricted text corpora. In Proceedings of Traitement Automatique des Langues Naturelles (TALN), Cargèse, France.

Dunning, Ted (1993). Accurate methods for the statistics of surprise and coincidence. Computational Linguistics 19(1), 61-74.

Edmundson, H. P. (1965). A correlation coefficient for attributes or events. In Proceedings of the Symposium on Statistical Association Methods For Mechanized Documentation, Washington, DC, pages 41-44.

Evert, Stefan (2004). The Statistics of Word Cooccurrences: Word Pairs and Collocations. PhD dissertation, IMS, University of Stuttgart. Published in 2005, URN urn:nbn:de:bsz:93-opus-23714. (www.collocations.de)

Evert, Stefan and Krenn, Brigitte (2001). Methods for the qualitative evaluation of lexical association measures. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics. Toulouse, France, pages 188-195. (www.collocations.de)

Evert, Stefan and Krenn, Brigitte (2003). Computational approaches to collocations. Introductory course at the European Summer School on Logic, Language, and Information (ESSLLI 2003), Vienna. (www.collocations.de)

Evert, Stefan; Heid, Ulrich; Lezius, Wolfgang (2000). Methoden zum Vergleich von Signifikanzmaßen zur Kollokationsidentifikation. In Zühlke, Werner and Schukat-Talmazzini, Ernst G. (eds.), KONVENS-2000 Sprachkommunikation, VDE-Verlag, pages 215-220.

Firth, J. R. (1957). A synopsis of linguistic theory 1930-55. In Studies in Linguistic Analysis (special volume of the Philological Society), pages 1-32. The Philological Society, Oxford. [ Reprinted in: Palmer, F. R. (ed.) (1968). Selected Papers of J. R. Firth 1952-59, pages 168-205. Longmans, London. ]

Johnson, Mark (2001). Trading recall for precision with confidence sets. Unpublished technical report. (http://citeseer.nj.nec.com/378119.html)

Krenn, Brigitte (2000). The Usual Suspects: Data-Oriented Models for the Identification and Representation of Lexical Collocations. PhD Thesis, DFKI & Universität des Saarlandes, Saarbrücken.

Krenn, Brigitte and Evert, Stefan (2001). Can we do better than frequency? A case study on extracting PP-verb collocations. In Proceedings of the ACL Workshop on Collocations, Toulouse, France, pages 39-46. (www.collocations.de)

Kuhns, J. L. (1965). The continuum of coefficients of association. In Proceedings of the Symposium on Statistical Association Methods For Mechanized Documentation, Washington, DC, pages 33-39.

Liddell, Douglas (1976). Practical tests of 2 x 2 contingency tables. The Statistician 25(4), 295-304.

Manning, Christopher D. and Schütze, Hinrich (1999). Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA.

McEnery, Tony and Wilson, Andrew (2001). Corpus Linguistics, 2nd edition, Edinburgh University Press.

Pearce, Darren (2002). A comparative evaluation of collocation extraction techniques. In Third International Conference on Language Resources and Evaluation (LREC). Las Palmas, Spain. (PostScript)

Pedersen, Ted (1996). Fishing for exactness. In Proceedings of the South-Central SAS Users Group Conference. Austin, TX.

Pedersen, Ted and Bruce, Rebecca (1996). What to infer from a description. Technical Report 96-CSE-04, Southern Methodist University, Dallas, TX. (CiteSeer)

Quasthoff, Uwe and Wolff, Christian (2002). The Poisson collocation measure and its application. In Workshop on Computational Approaches to Collocations. Vienna, Austria. (Workshop homepage)

Siegel, Sidney (1956). Nonparametric Statistics for the Behavioral Sciences. McGraw-Hill Kogakusha, Tokyo.

Smadja, Frank (1993). Retrieving collocations from text: Xtract. Computational Linguistics 19(1), 143-177.

Smadja, Frank; McKeown, Kathleen R.; Hatzivassiloglou, Vasileios (1996). Translating collocations for bilingual lexicons: a statistical approach. Computational Linguistics 22(1), 1-38.

Stevens, M. E.; Giuliano, V. E.; Heilprin, L. B. (eds.) (1965). Proceedings of the Symposium on Statistical Association Methods For Mechanized Documentation, Washington 1964. Volume 269 of National Bureau of Standards Miscellaneous Publications.

Weeber, Marc; Vos, Rein; Baayen, R. Harald (2000). Extracting the lowest-frequency words: pitfalls and possibilities. Computational Linguistics 26(3), 301-317.

Weisstein, Eric W. (1999). Eric Weisstein's World of Mathematics. An on-line encyclopedia hosted by Wolfram Inc. (http://mathworld.wolfram.com/)

Yates, F. (1934). Contingency tables involving small numbers and the χ2 test. Supplement to the Journal of the Royal Statistical Society 1, 217-235.

Yates, F. (1984). Tests of significance for 2 x 2 contingency tables. Journal of the Royal Statistical Society, Series A, 147(3), 426-463.

Yeh, Alexander (2000). More accurate tests for the statistical significance of result differences. In Proceedings of the 18th International Conference on Computational Linguistics (COLING 2000). Saarbrücken, Germany. (CiteSeer)

[separator bar]
© 2004-2010 by Stefan Evert, Last Modified: Sun Sep 12 12:29:31 2010 (severt) — imprint & privacy