Show simple item record

dc.creatorCilingir, Gokcen
dc.creatorBroschat, Shira L.
dc.description.abstractSupervised machine learning algorithms are used by life scientists for a variety of objectives. Expert-curated public gene and protein databases are major resources for gathering data to train these algorithms. While these data resources are continuously updated, generally, these updates are not incorporated into published machine learning algorithms which thereby can become outdated soon after their introduction. In this paper, we propose a new model of operation for supervised machine learning algorithms that learn from genomic data. By defining these algorithms in a pipeline in which the training data gathering procedure and the learning process are automated, one can create a system that generates a classifier or predictor using information available from public resources. The proposed model is explained using three case studies on SignalP, MemLoci, and ApicoAP in which existing machine learning models are utilized in pipelines. Given that the vast majority of the procedures described for gathering training data can easily be automated, it is possible to transform valuable machine learning algorithms into self-evolving learners that benefit from the ever-changing data available for gene products and to develop new machine learning algorithms that are similarly capable.en_US
dc.publisherBioMed Research International
dc.rightsCreative Commons Attribution 4.0 International
dc.subjectMachine learning algorithms
dc.subjectGenomic data
dc.titleAutomated training for algorithms that learn from genomic data
dc.description.versionPublished copy
dc.description.citationCilingir, G., and S. L. Broschat, Automated training for algorithms that learn from genomic data, BioMed Research International, Vol. 2015, Article ID 234236, 9 pages, 2015. doi:10.1155/2015/234236.

Files in this item


This item appears in the following Collection(s)

  • Broschat, Shira
    This collection features research and educational materials by Shira Broschat, Professor and Curriculum Coordinator for the School of Electrical Engineering and Computer Science at Washington State University.

Show simple item record

Creative Commons Attribution 4.0 International
Except where otherwise noted, this item's license is described as Creative Commons Attribution 4.0 International