Selpi
An inductive logic programming approach to learning which uORFs regulate gene expression.
Selpi
Authors
Contributors
Chris Bryant
Supervisor
Professor John McCall j.mccall@rgu.ac.uk
Supervisor
Daniel Fredouille
Supervisor
Abstract
Some upstream open reading frames (uORFs) regulate gene expression (i.e. they are functional) and can play key roles in keeping organisms healthy. However, how uORFs are involved in gene regulation is not het fully understood. In order to get a complete view of how uORFs are involved in gene regulation, it is expected that a large number of functional uORFs are needed. Unfortunately , lab experiments to verify that uORFs are functional are expensive. In this thesis, for the first time, the use of inductive logic programming (ILP) is explored for the task of learning which uORFs regulate gene expression in the yeast Saccharomyces cerevisiae. This work is directed to help select sets of candidate functional uORFs for experimental studies. With limited background knowledge, ILP can generate hypotheses which make the search for novel functional uORFs 17 times more efficient than random sampling. Adding mRNA secondary structure to the background knowledge results in hypotheses with significantly increased performance. This work is the first machine learning work to study both uORFs and mRNA secondary structures in the context of gene regulation. Using a novel combination of knowledge about biological conservation, gene ontology annotations and genes' response to different conditions results in hypotheses that are simple, informative, have an estimated sensitivity of 81% and provide provisional insights into biological characteristics of functional uORFs. The hypotheses predict 299 further genes to have 450 novel functional uORFs. A comparison with a related study suggests that 8 of these predicted functional uORFs (from 8 genes) are strong candidates for experimental studies.
Citation
SELPI 2008. An inductive logic programming approach to learning which uORFs regulate gene expression. Robert Gordon University, PhD thesis.
Thesis Type | Thesis |
---|---|
Deposit Date | Oct 17, 2008 |
Publicly Available Date | Oct 17, 2008 |
Keywords | uORFs; Gene expression; Gene regulation; Upstream open reading frames; Inductive logic programming; ILP |
Public URL | http://hdl.handle.net/10059/224 |
Contract Date | Oct 17, 2008 |
Award Date | Mar 31, 2008 |
Files
SELPI 2008 An inductive logic programming
(2 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
Copyright Statement
© The Author.
You might also like
Two-layer ensemble of deep learning models for medical image segmentation.
(2024)
Journal Article
DEFEG: deep ensemble with weighted feature generation.
(2023)
Journal Article
A comparative study of anomaly detection methods for gross error detection problems.
(2023)
Journal Article
Downloadable Citations
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search