<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Searching for poly(A) tails</title>
	<atom:link href="http://blog.malde.org/index.php/2009/12/14/searching-for-polya-tails/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.malde.org/index.php/2009/12/14/searching-for-polya-tails/</link>
	<description>bioinformatics and haskell</description>
	<lastBuildDate>Sun, 04 Jul 2010 19:06:47 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: ketil</title>
		<link>http://blog.malde.org/index.php/2009/12/14/searching-for-polya-tails/comment-page-1/#comment-1131</link>
		<dc:creator>ketil</dc:creator>
		<pubDate>Fri, 18 Dec 2009 19:26:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.malde.org/?p=65#comment-1131</guid>
		<description>I think it should be fairly fast, although I haven&#039;t profiled it or optimized it (beyond compiling with -O :-)

I did a quick test now, and on my laptop (1.3GHz Core2), it took exactly one minute to process 30K ESTs (about 23MB of sequence, and 60MB quality data).  Memory use is constant at about 20MB, according to &#039;top&#039; (using lazy bytestrings for the sequence and qual data -&gt; nice streaming behavior)

So: about 500 ESTs/second at the moment.  With some tuning, and perhaps using multiple threads, it should be possible to make parsing integers in the quality data the bottleneck.  But this is fast enough for me.</description>
		<content:encoded><![CDATA[<p>I think it should be fairly fast, although I haven&#8217;t profiled it or optimized it (beyond compiling with -O <img src='http://blog.malde.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<p>I did a quick test now, and on my laptop (1.3GHz Core2), it took exactly one minute to process 30K ESTs (about 23MB of sequence, and 60MB quality data).  Memory use is constant at about 20MB, according to &#8216;top&#8217; (using lazy bytestrings for the sequence and qual data -> nice streaming behavior)</p>
<p>So: about 500 ESTs/second at the moment.  With some tuning, and perhaps using multiple threads, it should be possible to make parsing integers in the quality data the bottleneck.  But this is fast enough for me.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: orbitz</title>
		<link>http://blog.malde.org/index.php/2009/12/14/searching-for-polya-tails/comment-page-1/#comment-1130</link>
		<dc:creator>orbitz</dc:creator>
		<pubDate>Wed, 16 Dec 2009 13:08:59 +0000</pubDate>
		<guid isPermaLink="false">http://blog.malde.org/?p=65#comment-1130</guid>
		<description>Any thoughts on performance of this or is that a non-issue?</description>
		<content:encoded><![CDATA[<p>Any thoughts on performance of this or is that a non-issue?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
