Show simple item record

dc.contributor.authorHassell, Joseph Edward
dc.date.accessioned2014-03-04T01:09:35Z
dc.date.available2014-03-04T01:09:35Z
dc.date.issued2006-08
dc.identifier.otherhassell_joseph_e_200608_ms
dc.identifier.urihttp://purl.galileo.usg.edu/uga_etd/hassell_joseph_e_200608_ms
dc.identifier.urihttp://hdl.handle.net/10724/23392
dc.description.abstractPrecisely identifying entities in web documents is essential for document indexing, web search and data integration. Entity disambiguation is the challenge of determining the correct entity out of various candidate entities. Our novel method utilizes background knowledge in the form of a populated ontology. Additionally, it does not rely on the existence of any structure in a document or the appearance of data items that can provide strong evidence, such as e-mail addresses, for disambiguating authors for example. Originality of our method is demonstrated in the way it uses different relationships in a document as well as in the ontology to provide clues in determining the correct entity. We demonstrate the applicability of our method by disambiguating authors in a collection of DBWorld posts using a large scale, real-world ontology extracted from the DBLP. The precision and recall measurements provide encouraging results.
dc.languageeng
dc.publisheruga
dc.rightspublic
dc.subjectEntity disambiguation
dc.subjectontology
dc.subjectsemantic web.
dc.titleOntology-driven automatic entity disambiguation in unstructured text
dc.typeThesis
dc.description.degreeMS
dc.description.departmentComputer Science
dc.description.majorComputer Science
dc.description.advisorBudak Arpinar
dc.description.committeeBudak Arpinar
dc.description.committeeJohn Miller
dc.description.committeeAmit Sheth


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record