Center for Research in Computational Linguistics
Who Says What Why When Where and How?
Mapping and Managing Comparative Data
The Mon-Khmer Languages Project addresses
wide-ranging questions regarding the development and divergence of languages in
At present, the MKLP has substantial resources (more than 1,000 citations each) for about fifty languages and dialects, and sizable samples (>500 items each) for dozens more. As these resources have grown, we have developed new approaches for tagging and storing data (Cooper 2007) and for phonetic approximation in searching (Cooper 2008).
We now discuss the combination of tabular and map-based displays we provide to help users understand the resources at their disposal: to inspect underlying data sets, to restrict the scope of queries to particular branches, sub-branches, and geographical regions, and to visualize, manage and make sense of the results that are returned. Rather than focusing solely on unifying the family’s origins, these tools help reveal the innovations and diversity of its many branches as well.
Cooper, Doug: "Data Sharing in the Mon-Khmer Languages Project." 3rd International Conference of Austroasiatic Linguistics, November 26-28, 2007
Doug: "Sound[s|ed] like ...? Approximate Phonetic Search in the Mon-Khmer
Languages Project." 18th Annual Conference of the Southeast Asian