    Evolutionary instance resampling for difficult data sets

    Date
    2013-12
    Author
    Richardson, William Dale
    Abstract
    In the field of machine learning, properties of data sets such as class imbalance and overlap often pose difficulties for classifier algorithms. A number of methods alleviate these difficulties by adjusting the distribution of the training data prior to classifier construction. Resampling is typically effected by weighting, removing, or duplicating instances, but finding a good resampling for the data set is a nontrivial problem. Genetic algorithms are frequently used to search for solutions in large, difficult search spaces. In this thesis, four evolutionary approaches are applied to the problem of instance resampling across a variety of data sets and classifier paradigms. In many cases, evolutionary pre-processing is able to produce better classifiers. In particular, an integer-based, one-to-one representation and a cluster-based, real-valued weighting encoding are shown to improve classifier performance on difficult data sets.
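    To make the idea in the abstract concrete, the following is a minimal sketch of evolutionary instance resampling, not the thesis's implementation. It assumes an integer encoding in which each training instance gets a copy count (0 removes it, 1 keeps it, 2 or more duplicates it), a count-weighted nearest-centroid classifier, and balanced accuracy on a held-out validation set as the fitness. All function names, parameters, and the choice of classifier are illustrative assumptions.

```python
import random

def centroids(X, y, counts):
    """Count-weighted class centroids; a count of 0 drops an instance, >1 duplicates it."""
    sums, totals = {}, {}
    for xi, yi, c in zip(X, y, counts):
        if c == 0:
            continue
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for j, v in enumerate(xi):
            acc[j] += c * v
        totals[yi] = totals.get(yi, 0) + c
    return {k: [s / totals[k] for s in sums[k]] for k in sums}

def predict(cents, x):
    """Label of the centroid closest to x (squared Euclidean distance)."""
    return min(cents, key=lambda k: sum((a - b) ** 2 for a, b in zip(cents[k], x)))

def fitness(counts, X_tr, y_tr, X_val, y_val):
    """Balanced accuracy (mean per-class recall) on validation data.

    Assumed fitness for this sketch; it rewards minority-class recall,
    which matters for imbalanced data sets.
    """
    cents = centroids(X_tr, y_tr, counts)
    if not cents:  # every instance was removed
        return 0.0
    recalls = []
    for k in sorted(set(y_val)):
        idx = [i for i, yi in enumerate(y_val) if yi == k]
        hits = sum(1 for i in idx if predict(cents, X_val[i]) == k)
        recalls.append(hits / len(idx))
    return sum(recalls) / len(recalls)

def evolve(X_tr, y_tr, X_val, y_val, pop_size=20, generations=30,
           max_count=3, p_mut=0.1, seed=0):
    """Simple generational GA over per-instance copy counts (hypothetical settings)."""
    rng = random.Random(seed)
    n = len(X_tr)

    def score(ind):
        return fitness(ind, X_tr, y_tr, X_val, y_val)

    # Seed the population with the unmodified data set (all counts = 1),
    # so the search never returns something worse than no resampling.
    pop = [[1] * n] + [[rng.randint(0, max_count) for _ in range(n)]
                       for _ in range(pop_size - 1)]
    best = max(pop, key=score)
    for _ in range(generations):
        children = [best[:]]  # elitism: carry the best individual forward
        while len(children) < pop_size:
            # Binary tournament selection for each parent.
            a = max(rng.sample(pop, 2), key=score)
            b = max(rng.sample(pop, 2), key=score)
            # Uniform crossover, then per-gene random-reset mutation.
            child = [ai if rng.random() < 0.5 else bi for ai, bi in zip(a, b)]
            child = [rng.randint(0, max_count) if rng.random() < p_mut else g
                     for g in child]
            children.append(child)
        pop = children
        best = max(pop, key=score)
    return best, score(best)
```

    Calling `evolve(X_tr, y_tr, X_val, y_val)` with lists of feature vectors and labels returns the best count vector found and its validation fitness. A real-valued weighting encoding, such as the cluster-based one the abstract mentions, would replace the integer genes with floats; this sketch does not attempt to reproduce it.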
    URI
    http://purl.galileo.usg.edu/uga_etd/richardson_william_d_201312_ms
    http://hdl.handle.net/10724/30008
    Collections
    • University of Georgia Theses and Dissertations
