Show simple item record

dc.contributor.authorMello, Heather Lee
dc.date.accessioned2014-03-04T21:13:34Z
dc.date.available2014-03-04T21:13:34Z
dc.date.issued2013-08
dc.identifier.othermello_heather_l_201308_phd
dc.identifier.urihttp://purl.galileo.usg.edu/uga_etd/mello_heather_l_201308_phd
dc.identifier.urihttp://hdl.handle.net/10724/29122
dc.description.abstractThis dissertation examines issues of units of meaning, word segmentation and language variation in a corpus of Vietnamese language blogs collected from publicly accessible internet sources originating in Viet Nam, the US, and Australia. Research using corpus linguistics techniques for study of the Vietnamese language have begun to proliferate in western sources in the past decade, however, studies using language-in-use data remain rare. Analysis of the corpus as a whole and by comments and blogs and Viet Nam, US, and Australia subcorpora used the Vietnamese syllable, or tiếng, as the basic unit of meaning, with subsequent iterations of one- through 5-tiếng. While results support previous research asserting the Vietnamese syllable as the basic distributional element in Vietnamese discourse, claims about Vietnamese as a monosyllabic language are not supported by results. Tiếng collocate and colligate meaningfully and regularly throughout the corpora in clusters larger than one syllable, indicating that syllable combinations, the union of tiếng (Nguyen, 1984), are also primary distributional patterns for the Vietnamese language. Varieties of Vietnamese by country show similarity in a variety of distributional patterns, including by a-curve (frequency of frequencies), structural, content, and units of meaning analyses. Variations of Vietnamese by country are primarily limited to collocational and colligational content and topical patterns.
dc.languageeng
dc.publisheruga
dc.rightspublic
dc.subjectVietnamese Language
dc.subjectCorpus Linguistics
dc.subjectSociolinguistics
dc.subjectWord Segmentation
dc.subjectUnit of Meaning
dc.subjectA-Curve
dc.subjectBlogs
dc.subjectInternet
dc.subjectDiaspora
dc.subjectLanguage Variety
dc.subjectTiếng
dc.titleAnalysis of language variation and word segmentation for a corpus of Vietnamese blogs
dc.title.alternativea sociolinguistic approach
dc.typeDissertation
dc.description.degreePhD
dc.description.departmentLinguistics Program
dc.description.majorLinguistics
dc.description.advisorWilliam Kretzschmar
dc.description.committeeWilliam Kretzschmar
dc.description.committeeLewis Howe
dc.description.committeeDezso Benedek


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record