Language Identification: A Computational Linguistics Primer

April 25th, 2009 | by Will |

Slides and results from a talk I gave at Kalamazoo College on language identification.

My co-worker at Powerset, Chris Biemann, has a nice paper on Unsupervised Language Identification
.

  1. One Response to “Language Identification: A Computational Linguistics Primer”

  2. By Daniel Lemire on Apr 27, 2009 | Reply

    Great. Thanks for sharing.

    I did some vaguely related work hashing n-grams… you may appreciate it:

    Recursive n-gram hashing is pairwise independent, at best
    http://arxiv.org/abs/0705.4676

    (You provided the initial motivation of this paper a long, long, long time ago!)

Sorry, comments for this entry are closed at this time.