Exploiting the Functional and Taxonomic Structure of Genomic Data by Probabilistic To
#1

Abstract

In this paper, we present a method that enable both homology-based approach and composition-based approach to further study the functional core (i.e., microbial core and gene core, correspondingly). In the proposed method, the identification of major functionality groups is achieved by generative topic modeling, which is able to extract useful information from unlabeled data. We first show that generative topic model can be used to model the taxon abundance information obtained by homology-based approach and study the microbial core. The model considers each sample as a “document,” which has a mixture of functional groups, while each functional group (also known as a “latent topic”) is a weight mixture of species. Therefore, estimating the generative topic model fortaxon abundance data will uncover the distribution over latent functions (latent topic) in each sample. Second, we show that, generative topic model can also be used to study the genome-level composition of “N-mer” features (DNA subreads obtained by composition-based approaches). The model consider each genome as a mixture of latten genetic patterns (latent topics), while each functional pattern is a weighted mixture of the “N-mer” features, thus the existence of core genomes can be indicated by a set of common N-mer features. After studying the mutual information between latent topics and gene regions, we provide an explanation of the functional roles of uncovered latten genetic patterns. The experimental results demonstrate the effectiveness of proposed method
Reply
#2

Approach based on homology and composition-based approach to further study the functional nucleus (ie, the microbial nucleus and nucleus of the gene, correspondingly). In the proposed method, the identification of the main groups of functionality is achieved by generative theme modelling, which is able to extract useful information from unlabelled data. We first demonstrate that generative topic model can be used to model the taxon abundance information obtained by homology-based approach and study the microbial nucleus. The model considers each sample as a "document", which has a mixture of functional groups, while each functional group (also known as "latent theme") is a mixture of species weight. Therefore, the estimation of the generative theme model for taxon abundance data will reveal the distribution over latent functions (latent theme) in each sample. Second, we show that, generative topic model can also be used to study genome-level composition of "N-Mer" characteristics (DNA subreads obtained by composition-based approaches). The model considers each genome as a mixture Of latten genetic patterns (latent themes), while each functional pattern is a weighted mixture of "N-mer" characteristics, so the existence of nucleus genomes can be indicated by a set of common N-mer characteristics. After studying the mutual information between the latent themes and the genetic regions, we offer an explanation of the functional roles of the latent genetic patterns discovered. The experimental results demonstrate the efficacy of the proposed method.
Reply

Important Note..!

If you are not satisfied with above reply ,..Please

ASK HERE

So that we will collect data for you and will made reply to the request....OR try below "QUICK REPLY" box to add a reply to this page
Popular Searches: qnx vs nucleus, primitive and non primitive data structure ppt, taxonomic classificationclassification of animals, genomic library and cdna library, neural networks and genomic engineering, nucleus vs qnx, pdf primitive and non primitive data structure,

[-]
Quick Reply
Message
Type your reply to this message here.

Image Verification
Please enter the text contained within the image into the text box below it. This process is used to prevent automated spam bots.
Image Verification
(case insensitive)

Possibly Related Threads...
Thread Author Replies Views Last Post
  A Link-Based Cluster Ensemble Approach for Categorical Data Clustering 1 1,071 16-02-2017, 10:51 AM
Last Post: jaseela123d
  Service-Oriented Architecture for Weaponry and Battle Command and Control Systems in 1 1,047 15-02-2017, 03:40 PM
Last Post: jaseela123d
  Remote Server Monitoring System For Corporate Data Centers smart paper boy 3 2,814 28-03-2016, 02:51 PM
Last Post: dhanabhagya
  Secured Data Hiding and Extractions Using BPCS project report helper 4 3,653 04-02-2016, 12:52 PM
Last Post: seminar report asees
  Data Hiding in Binary Images for Authentication & Annotation project topics 2 1,821 06-11-2015, 02:27 PM
Last Post: seminar report asees
  DATA LEAKAGE DETECTION project topics 16 13,057 31-07-2015, 02:59 PM
Last Post: seminar report asees
  Privacy Preservation in Data Mining sajidpk123 3 2,948 13-11-2014, 10:48 PM
Last Post: jaseela123d
  projects on data mining? shakir_ali 2 2,029 05-11-2014, 09:30 PM
Last Post: jaseela123d
  data mining full report project report tiger 25 171,111 07-10-2014, 09:10 PM
Last Post: ToPWA
  Data Security Using Honey Pot System computer science topics 5 6,685 11-09-2014, 07:45 PM
Last Post: erhhk

Forum Jump: