Language Identification: A Computational Linguistics Primer
April 25th, 2009 | by Will
Slides and results from a talk I gave at Kalamazoo College on language identification.
My co-worker at Powerset, Chris Biemann, has a nice paper on Unsupervised Language Identification.
One Response to “Language Identification: A Computational Linguistics Primer”
By Daniel Lemire on Apr 27, 2009
Great. Thanks for sharing.
I did some vaguely related work hashing n-grams… you may appreciate it:
Recursive n-gram hashing is pairwise independent, at best
http://arxiv.org/abs/0705.4676
(You provided the initial motivation of this paper a long, long, long time ago!)