Friday, 13 July 2012 at 3:46 pm

Information theory as an interdiciplinary field consisting of engineering, physics, bioinformatics, mathematics, and many others started with Claude E. Shannon's 1948 paper, "A mathematical theory of communication". Advancements in this field have been instrumental in improving communication across the world from data storage on disc drives to satellite communcations. 

In this blog post, I will briefly go over what information theory is all about in an intuitve way and it's practical application to the field of bioinformatics and evolution. I am not an expert on information theory, so I welcome any corrections.

The blog is posted on my personal site. Please go here for the rest of the blog post.

read more


by Damian | permalink | Bioinformatics One comment


Tuesday, 10 April 2012 at 10:48 pm

I gave an overview of the 3 planarian transcriptomes in my last blog post. Since we do not have immediate access to all the raw data that went into the transcriptomes, we have to resort to merging the assembled transcriptomes according to their strengths and weaknesses.

Here are some thoughts on the 3 transcriptomes that'll need to be address when performing the merge:

  • The BIMSB transcriptome contains the most full length, non redundant set of transcripts, but it also has the least amount of transcripts.

  • The AAA and Heidelberg transcriptomes have better coverage of the transcriptome due to high depth of sequencing.

  • The BIMSB and AAA transcriptomes both have a small population of very short sequences. It looks like Heidelberg discarded transcripts below 100 base pairs.

  • The Heidelberg transcriptome contains a lot of Ns.

  • There maybe some isoform information contained in all the transcriptomes.

  • The AAA transcriptome may contain elements of both asexual and sexual sequences since the SOLiD reads were reference assembled.

  • Strandness in BIMSB and Heidelberg transcripts are mainly based on ORF or homology evidence.

Read the rest of this entry here.


by Damian | permalink | Bioinformatics No comments Used tags:


Tuesday, 10 April 2012 at 1:48 pm

This post is taken directly from my blog at blog.nextgenetics.net

In terms of sequences, there are currently quite a lot of data in the planarian (Schmidtea medterranea) field. We have an assembled genome from University of Washington's genome institute and various transcriptome assemblies using different sequencing platforms.


by Damian | permalink | Bioinformatics No comments Used tags: , ,



Contact
+441865271234
aziz.aboobaker"at"zoo.ox.ac.uk

Address
F18, Department of Zoology
Tinbergen Building
Oxford
OX1 3PS
UK