Evaluating Machine Learning Approaches for Structural Genomics

dc.contributorCheung, Margaret S.
dc.contributor.authorPickett, Jonathan
dc.date.accessioned2019-01-03T17:48:37Z
dc.date.available2019-01-03T17:48:37Z
dc.date.issued2018-10-18
dc.description.abstractModern molecular biology produces large amounts of data, which can be difficult to derive any useful information from. We are investigating correlations that exist between genetic annotations of human DNA and chromosome structural features. Chromatin Immuno-Precipitation Sequencing(ChIP-Seq) data tracks, made available through the ENCODE project, characterize the biochemical nature of chromosomal loci. Chromatin can be categorized into types that we call type A and type B which we further classify into chromatin sub-types(A1, A2, B1, B2, and B3). It has been previously shown that these chromatin structural types are strongly related to the overall genome architecture of cells. Machine learning algorithms have proven to be especially adept at “learning” from correlations in very large data sets. We constructed a number of machine learning models and tested how accurately each performed when identifying chromatin sub-types. Our best approach so far is a recurrent neural network which produced a total error of less than 28% when classifying chromatin sub-types.
dc.description.departmentHonors College
dc.description.departmentPhysics, Department of
dc.identifier.urihttp://hdl.handle.net/10657/3792
dc.language.isoen_US
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.titleEvaluating Machine Learning Approaches for Structural Genomics
dc.typePoster

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pickett_Jonathan_2018URD.pdf
Size:
6.83 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.76 KB
Format:
Plain Text
Description: