Identification of transcription factor binding sites using Gaussian mixture models

0.00 Avg rating—0 Votes

Article ID:	iaor201522534
Volume:	31
Issue:	1
Start Page Number:	70
End Page Number:	80
Publication Date:	Feb 2014
Journal:	Expert Systems
Authors:	Karabulut Mustafa, Ibrikci Turgay
Keywords:	statistics: distributions

Abstract:

Identification of transcription factor binding sites still remains a challenging problem even though many computational tools have been proposed in the literature for this specific task. In this study, a method to discover such DNA subsequences, that is, motifs, is proposed. The method uses Gaussian mixture models with expectation‐maximization algorithm. In order to show the potential of the proposed method, experiments are conducted by use of data sets extracted from the DNA sequences of various organisms. The proposed method is also compared with four other methods: MEME, MDScan, SOMBRERO and the fuzzy C‐means based motif finder. As a result, the proposed method proves itself as a promising tool in identifying over‐represented DNA motifs.

Reviews

Required fields are marked *. Your email address will not be published.