St Andrews Research Repository

St Andrews University Home
View Item 
  •   St Andrews Research Repository
  • Biology (School of)
  • Biology
  • Biology Theses
  • View Item
  •   St Andrews Research Repository
  • Biology (School of)
  • Biology
  • Biology Theses
  • View Item
  •   St Andrews Research Repository
  • Biology (School of)
  • Biology
  • Biology Theses
  • View Item
  • Login
JavaScript is disabled for your browser. Some features of this site may not work without it.

A computational approach to discovering p53 binding sites in the human genome

Thumbnail
View/Open
Ji-HyunLimPhDThesis.pdf (3.831Mb)
Date
06/2013
Author
Lim, Ji-Hyun
Supervisor
Barker, Daniel
Iggo, Richard
Funder
Biotechnology and Biological Sciences Research Council (BBSRC)
Keywords
p53
Regulatory regions
Bioinformatics
Logistic regression
Epigenetics
Metadata
Show full item record
Altmetrics Handle Statistics
Abstract
The tumour suppressor p53 protein plays a central role in the DNA damage response/checkpoint pathways leading to DNA repair, cell cycle arrest, apoptosis and senescence. The activation of p53-mediated pathways is primarily facilitated by the binding of tetrameric p53 to two 'half-sites', each consisting of a decameric p53 response element (RE). Functional REs are directly adjacent or separated by a small number of 1-13 'spacer' base pairs (bp). The p53 RE is detected by exact or inexact matches to the palindromic sequence represented by the regular expression [AG][AG][AG]C[AT][TA]G[TC][TC][TC] or a position weight matrix (PWM). The use of matrix-based and regular expression pattern-matching techniques, however, leads to an overwhelming number of false positives. A more specific model, which combines multiple factors known to influence p53-dependent transcription, is required for accurate detection of the binding sites. In this thesis, we present a logistic regression based model which integrates sequence information and epigenetic information to predict human p53 binding sites. Sequence information includes the PWM score and the spacer length between the two half-sites of the observed binding site. To integrate epigenetic information, we analyzed the surrounding region of the binding site for the presence of mono- and trimethylation patterns of histone H3 lysine 4 (H3K4). Our model showed a high level of performance on both a high-resolution data set of functional p53 binding sites from the experimental literature (ChIP data) and the whole human genome. Comparing our model with a simpler sequence-only model, we demonstrated that the prediction accuracy of the sequence-only model could be improved by incorporating epigenetic information, such as the two histone modification marks H3K4me1 and H3K4me3.
Type
Thesis, PhD Doctor of Philosophy
Collections
  • Biology Theses
URI
http://hdl.handle.net/10023/3388

Items in the St Andrews Research Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

Advanced Search

Browse

All of RepositoryCommunities & CollectionsBy Issue DateNamesTitlesSubjectsClassificationTypeFunderThis CollectionBy Issue DateNamesTitlesSubjectsClassificationTypeFunder

My Account

Login

Open Access

To find out how you can benefit from open access to research, see our library web pages and Open Access blog. For open access help contact: openaccess@st-andrews.ac.uk.

Accessibility

Read our Accessibility statement.

How to submit research papers

The full text of research papers can be submitted to the repository via Pure, the University's research information system. For help see our guide: How to deposit in Pure.

Electronic thesis deposit

Help with deposit.

Repository help

For repository help contact: Digital-Repository@st-andrews.ac.uk.

Give Feedback

Cookie policy

This site may use cookies. Please see Terms and Conditions.

Usage statistics

COUNTER-compliant statistics on downloads from the repository are available from the IRUS-UK Service. Contact us for information.

© University of St Andrews Library

University of St Andrews is a charity registered in Scotland, No SC013532.

  • Facebook
  • Twitter