<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Xfam Blog</title>
	<atom:link href="http://xfam.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://xfam.wordpress.com</link>
	<description>News about the Pfam and Rfam projects</description>
	<lastBuildDate>Wed, 15 May 2013 08:36:36 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='xfam.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://1.gravatar.com/blavatar/b4e6843655942d15fef21468c92d4907?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>Xfam Blog</title>
		<link>http://xfam.wordpress.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://xfam.wordpress.com/osd.xml" title="Xfam Blog" />
	<atom:link rel='hub' href='http://xfam.wordpress.com/?pushpress=hub'/>
		<item>
		<title>Case studies from the list of human regions not in Pfam 27.0.</title>
		<link>http://xfam.wordpress.com/2013/05/14/case-studies-from-the-list-of-human-regions-not-in-pfam-27-0/</link>
		<comments>http://xfam.wordpress.com/2013/05/14/case-studies-from-the-list-of-human-regions-not-in-pfam-27-0/#comments</comments>
		<pubDate>Tue, 14 May 2013 08:21:20 +0000</pubDate>
		<dc:creator>mpunta</dc:creator>
				<category><![CDATA[Pfam]]></category>
		<category><![CDATA[curation]]></category>
		<category><![CDATA[human]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[proteome]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=977</guid>
		<description><![CDATA[Following on from Jaina and Marco&#8217;s blog post last week about conserved Human regions not in Pfam, I would like to give you some examples of how we have used the regions identified to improve existing Pfam families, and to create new ones. When available, we use three-dimensional structures to guide the boundary definitions of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=977&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Following on from Jaina and Marco&#8217;s <a href="http://xfam.wordpress.com/2013/05/07/pfam-targets-conserved-human-regions/">blog post</a> last week about conserved Human regions not in Pfam, I would like to give you some examples of how we have used the regions identified to improve existing Pfam families, and to create new ones. When available, we use three-dimensional structures to guide the boundary definitions of our families. In cases where there is no available structure, either for the protein in question or for other proteins in the same Pfam family, we base boundary decisions on sequence conservation. The following paragraphs give three examples of cases I have looked at recently.</p>
<p><span id="more-977"></span></p>
<p><strong>Example 1</strong></p>
<p>The C-terminal region (residues 302 &#8211; 445) of Q9HAC7 (CaiB/baiF CoA-transferase family protein C7orf10) was a Human region identified as not covered by Pfam, that had &gt;3,000 <a href="http://hmmer.janelia.org/">phmmer</a> [1] hits in UniProtKB. Just over 40% of this protein was covered by PF02515 (CoA-transferase family III) in Pfam 27.0 (Figure 1A). There are several structures available for bacterial members of this family, which confirm that the domain boundaries on this family could be widened. I successfully extended the family at both the C-terminus and the N-terminus, and Pfam now covers 80% of Q9HAC7 (Figure 1B). The extended family will be released in Pfam 28.0.</p>
<p><a href="http://xfam.files.wordpress.com/2013/05/fig4.jpg"><img alt="Figure 4: Schematic representation of Q9HAC7 (CaiB/baiF CoA-transferase family protein C7orf10) before and after extension of CoA-transferase family III (PF02515)." src="http://xfam.files.wordpress.com/2013/05/fig4.jpg?w=300&#038;h=95" width="300" height="95" /></a></p>
<div>
<dl id="attachment_958">
<dd><strong>Figure 1</strong>: Schematic representation of Q9HAC7 (CaiB/baiF CoA-transferase family protein C7orf10) before (A) and after (B) extension of CoA-transferase family III (PF02515). Click on the figure to see a higher resolution image.</dd>
</dl>
</div>
<p><strong>Example 2</strong></p>
<p>Some cases are more complex and require improvements to be made to several different families. A region of P46063 (ATP-dependent DNA helicase Q1, RECQ1) covering residues 410 to 487 was identified as having no Pfam-A domain and had &gt;550 phmmer<i> </i>hits in UniProtKB. This protein is covered by three Pfam-A domains and a Pfam-B family in Pfam 27.0 (Figure 2A). Three-dimensional structures are available for this protein and for the <i>Escherichia coli </i>RecQ protein [2-3], and show RecQ to contain four distinct domains. The N-terminus of the protein is a DEAD/DEAH box helicase domain (PF00270), followed by a helicase conserved C-terminal domain (PF00271), then a zinc-binding domain, which was absent from Pfam 27.0, and, at the C-terminus, an RQC domain (PF09382). The Pfam alignment for the helicase conserved C-terminal domain was too short, so I extended this at the N-terminus to cover the region matched by the Pfam-B family. Extensions to large families such as this one (over 89,000 members in Pfam 27.0) help to increase the Pfam residue coverage of UniProtKB, something we are always striving to do. A new family, PF16124, was built for the zinc-binding domain, filling the gap between the helicase conserved C-terminal domain and the RQC domain, and the RQC domain was extended by a small amount at both ends. Figure 2B shows the new Pfam architecture of this protein, which is a far better reflection of it’s structure. The changes to PF00271 and PF09382, and the new family PF16124 will be available in Pfam 28.0.</p>
<div>
<dl id="attachment_959">
<dt><a href="http://xfam.files.wordpress.com/2013/05/fig5.jpg"><img alt="Figure 5: Schematic representation of P46063 (ATP-dependent DNA helicase Q1, RECQ1) before and after extension of PF00271 and PF09382 and building of PF16124" src="http://xfam.files.wordpress.com/2013/05/fig5.jpg?w=300&#038;h=72" width="300" height="72" /></a></dt>
<dd></dd>
<dd><strong>Figure 2</strong>: Schematic representation of P46063 (ATP-dependent DNA helicase Q1, RECQ1) before (A) and after (B) extension of PF00271 and PF09382 and building of PF16124. The striped box represents a Pfam-B family. Click on the figure to see a higher resolution image.</dd>
</dl>
</div>
<p><strong>Example 3</strong></p>
<p>In some cases there are no appropriate changes to be made to existing families or new families to build, an example of this is P01138 (Beta-nerve growth factor). The N-terminal region of this protein is not covered by Pfam and has over 600 hits to UniProtKB. However, closer examination revealed that the N-terminus of this protein consists of a signal peptide and a propeptide and it would therefore not be appropriate to include these regions in Pfam.</p>
<p><a href="http://database.oxfordjournals.org/content/2013/bat023.long">Our study on the human proteome</a> [4] has so far proved to be useful for selecting interesting regions to target for family building. We will continue to work on the long list of conserved Human regions over the next few months.</p>
<p>Posted by Ruth</p>
<p><strong>References</strong></p>
<p>[1] <a href="http://hmmer.janelia.org/">http://hmmer.janelia.org/</a></p>
<div>[2] Bernstein, D.A., Zittel, M.C. and Keck, J.L. (2003) <em>High-resolution structure of the E.coli RecQ helicase catalytic core</em>, Embo J, <b>22</b>, 4910-4921.</div>
<p>[3] Pike, A.C., <em>et al.</em> (2009) <em>Structure of the human RECQ1 helicase reveals a putative strand-separation pin</em>, Proc Natl Acad Sci U S A, <b>106</b>, 1039-1044.</p>
<p>[4] Mistry, J., <em>et al.</em> (2013) <em>The challenge of increasing Pfam coverage of the human proteome</em>, Database (Oxford), <b>2013</b>, bat023</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/curation/'>curation</a>, <a href='http://xfam.wordpress.com/tag/human/'>human</a>, <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/proteome/'>proteome</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/977/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/977/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=977&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2013/05/14/case-studies-from-the-list-of-human-regions-not-in-pfam-27-0/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/6ae39db985031ee98c1366277be72fdc?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">mpunta</media:title>
		</media:content>

		<media:content url="http://xfam.files.wordpress.com/2013/05/fig4.jpg?w=300" medium="image">
			<media:title type="html">Figure 4: Schematic representation of Q9HAC7 (CaiB/baiF CoA-transferase family protein C7orf10) before and after extension of CoA-transferase family III (PF02515).</media:title>
		</media:content>

		<media:content url="http://xfam.files.wordpress.com/2013/05/fig5.jpg?w=300" medium="image">
			<media:title type="html">Figure 5: Schematic representation of P46063 (ATP-dependent DNA helicase Q1, RECQ1) before and after extension of PF00271 and PF09382 and building of PF16124</media:title>
		</media:content>
	</item>
		<item>
		<title>Pfam targets conserved human regions</title>
		<link>http://xfam.wordpress.com/2013/05/07/pfam-targets-conserved-human-regions/</link>
		<comments>http://xfam.wordpress.com/2013/05/07/pfam-targets-conserved-human-regions/#comments</comments>
		<pubDate>Tue, 07 May 2013 10:20:04 +0000</pubDate>
		<dc:creator>jainamistry</dc:creator>
				<category><![CDATA[Pfam]]></category>
		<category><![CDATA[curation]]></category>
		<category><![CDATA[human]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[proteome]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=948</guid>
		<description><![CDATA[Recently, we have been looking at how much of the human proteome is covered by Pfam (release 27.0), and ways in which we can improve this coverage. We have even written an open access paper about it that you can read here [1]  that is part of the proceedings of the 2013 Biocuration conference. We used [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=948&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Recently, we have been looking at how much of the human proteome is covered by Pfam (release 27.0), and ways in which we can improve this coverage. We have even written an open access paper about it that you can read <a title="here" href="http://database.oxfordjournals.org/content/2013/bat023.long">here</a> [1]  that is part of the proceedings of the <a title="2013 Biocuration conference" href="http://www.ebi.ac.uk/biocuration2013/content/home">2013 Biocuration conference</a>. We used the human proteins in UniProtKB/Swiss-Prot [2] (~20,000 sequences) as our human proteome set, and found that while most of the sequences in this set have some Pfam annotation (90% have at least one Pfam domain), there is still much ground to cover before we have a complete map of all (conserved) human regions (HRs). Here, rather than repeating what we presented in the paper (did we mention it is open access? <img src='http://s0.wp.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> ), we would like to tell you more about the impact this study is having on our strategies for selecting target regions to be added to Pfam.</p>
<p><span id="more-948"></span></p>
<p>Within the human proteome, we identified almost 15,000 clusters of sequence regions that are not in Pfam-A or Pfam-B families, not predicted to be signal peptide regions, and that are at least 50 residues long. This accounts for close to 40% of all human residues (Figure 1, click on the figures to see a higher resolution version of them).</p>
<p><div id="attachment_957" class="wp-caption aligncenter" style="width: 310px"><a href="http://xfam.files.wordpress.com/2013/05/fig11.png"><img class=" wp-image-957    " title="Figure 1" alt="Figure 1" src="http://xfam.files.wordpress.com/2013/05/fig11.png?w=300&#038;h=257" width="300" height="257" /></a><p class="wp-caption-text"><strong>Figure 1</strong>: Proportion of human residues covered by Pfam, in signal peptide regions predicted by Phobius [3], and in small regions unlikely to harbour Pfam domains. Click on the figure to see a higher resolution image.</p></div>We ran <a title="phmmer" href="http://hmmer.janelia.org/">phmmer</a> [4] searches using the members of each of the 15,000 clusters as queries against the whole of UniProtKB, and obtained the following distribution of significant hits:</p>
<div id="attachment_956" class="wp-caption aligncenter" style="width: 390px"><a href="http://xfam.files.wordpress.com/2013/05/fig2.png"><img class="size-large wp-image-956" alt="Figure 2: Distribution of the mean number of significant hits found in each cluster after a phmmer search against UniProtKB by each cluster member. Each cluster (~15,000 in total) represents a set of homologous HRs that do not contain a Pfam-A domain." src="http://xfam.files.wordpress.com/2013/05/fig2.png?w=380&#038;h=258" width="380" height="258" /></a><p class="wp-caption-text"><strong>Figure 2</strong>: Distribution of the mean number of significant hits found in each cluster after a phmmer search against UniProtKB by each cluster member. Each cluster (~15,000 in total) represents a set of homologous HRs that do not contain a Pfam-A domain. Click on the figure to see a higher resolution image.</p></div>
<p>This histogram has two important features:</p>
<ol>
<li>most HRs have few hits in UniProtKB (we will comment more specifically on these regions in a future blog post), and</li>
<li>there is a long tail of regions with many hits in UniProtKB.</li>
</ol>
<p>Want to guess which regions Pfam will target first? Of course we want to go for regions that might give us maximum impact, so we will start by trying to incorporate into Pfam the HRs that have the highest number of hits in UniProtKB. In fact, add to this the plot in Figure 3 below and you have our current strategy for adding conserved HRs to Pfam.</p>
<div id="attachment_954" class="wp-caption aligncenter" style="width: 390px"><a href="http://xfam.files.wordpress.com/2013/05/fig3.png"><img class="size-large wp-image-954 " alt="Figure 2: Distribution of the mean number of significant hits found in each cluster after a phmmer search against UniProtKB by each cluster member. Each cluster (~15,000 in total) represents a set of homologous HRs that do not contain a Pfam-A domain." src="http://xfam.files.wordpress.com/2013/05/fig3.png?w=380&#038;h=380" width="380" height="380" /></a><p class="wp-caption-text">Figure 3: Distribution of the mean proportion of hits that have overlap to Pfam-A, for clusters with a mean phmmer output of &gt;500 hits in UniProtKB. Click on the figure to see a higer resolution image.</p></div>
<p>Figure 3 was drawn by calculating, for each cluster member, the percentage of its <i>phmmer</i> hits that matched an existing Pfam-A family, and averaging these values over all members in the cluster. We show only clusters that have &gt;500 <i>phmmer </i>hits in UniProtKB. We see that a large number of regions have a high percentage of such ‘overlaps’ with existing Pfam-A families. These HRs are likely to be regions that are on the periphery of existing families (outliers) and that have been narrowly missed by our profile-HMMs. Alternatively, they could be problem regions that generate overlaps with many different clans, such as is sometimes observed for coiled-coil regions (see our recent paper [5] on the subject). You will also notice, however, that a large proportion of HRs feature few or no overlaps to Pfam-A. These are the ones that we are currently trying to add to Pfam. Ruth Eberhardt, one of our curators, will give some examples of HRs that she has recently added to Pfam-A in another post shortly.</p>
<p>It&#8217;s still very early stages in this analysis and we have looked at only around 30 cases. About two thirds resulted in extensions to existing families, and 6 led to new families being built. In a number of instances, such as the ones that will be discussed in our next post, structural information was key to improving family annotation. In many other cases, the only information available was sequence conservation, and based on this our curators had to make a decision as to whether or not to extend an existing family or to create a new one.</p>
<p>Overall, it looks as if the analysis of conserved human regions has given us great potential not only for increasing Pfam coverage of human, but also for improving the boundaries of many of our families. We can imagine that in the future, a similar strategy could be applied to other sets of sequences such as those, for example, in the proteomes of model organisms or pathogens.</p>
<p>Posted by Jaina and Marco</p>
<p><strong>References</strong></p>
<p>[1] Mistry, J., et al. (2013) <em>The challenge of increasing Pfam coverage of the human proteome</em>, <em>Database (Oxford)</em>, <b>2013</b>, bat023.</p>
<p>[2] (2013) <em>Update on activities at the Universal Protein Resource (UniProt) in 2013, Nucleic Acids Res</em>, <b>41</b>, D43-47.</p>
<p>[3] Kall, L. <em>et al. (2004) A combined transmembrane topology and signal peptide prediction method</em>.<em> J Mol Biol,</em> 338, 1027-1036.</p>
<p>[4] <a href="http://hmmer.janelia.org/">http://hmmer.janelia.org/</a></p>
<p>[5] Mistry, J., et al. (2013) <em>Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions, Nucleic Acids Res</em>.</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/curation/'>curation</a>, <a href='http://xfam.wordpress.com/tag/human/'>human</a>, <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/proteome/'>proteome</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/948/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/948/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=948&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2013/05/07/pfam-targets-conserved-human-regions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/5df05397fec11ea44613816772d9b48b?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">jainamistry</media:title>
		</media:content>

		<media:content url="http://xfam.files.wordpress.com/2013/05/fig11.png" medium="image">
			<media:title type="html">Figure 1</media:title>
		</media:content>

		<media:content url="http://xfam.files.wordpress.com/2013/05/fig2.png?w=380" medium="image">
			<media:title type="html">Figure 2: Distribution of the mean number of significant hits found in each cluster after a phmmer search against UniProtKB by each cluster member. Each cluster (~15,000 in total) represents a set of homologous HRs that do not contain a Pfam-A domain.</media:title>
		</media:content>

		<media:content url="http://xfam.files.wordpress.com/2013/05/fig3.png?w=380" medium="image">
			<media:title type="html">Figure 2: Distribution of the mean number of significant hits found in each cluster after a phmmer search against UniProtKB by each cluster member. Each cluster (~15,000 in total) represents a set of homologous HRs that do not contain a Pfam-A domain.</media:title>
		</media:content>
	</item>
		<item>
		<title>TreeFam 9 is now available!</title>
		<link>http://xfam.wordpress.com/2013/05/03/treefam-9-is-now-available/</link>
		<comments>http://xfam.wordpress.com/2013/05/03/treefam-9-is-now-available/#comments</comments>
		<pubDate>Fri, 03 May 2013 08:10:01 +0000</pubDate>
		<dc:creator>fabsta</dc:creator>
				<category><![CDATA[Releases]]></category>
		<category><![CDATA[TreeFam]]></category>
		<category><![CDATA[dfam]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[production]]></category>
		<category><![CDATA[release]]></category>
		<category><![CDATA[rfam]]></category>
		<category><![CDATA[treefam]]></category>
		<category><![CDATA[website]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=924</guid>
		<description><![CDATA[We are happy to announce that TreeFam 9 is online and you can find it under http://www.treefam.org. TreeFam 9 now has 109 species (vs. 79 in TreeFam 8) and is based on data from Ensembl v69, Ensembl Genomes v16, Wormbase and JGI. This release marks an important step for TreeFam as it is the first [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=924&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We are happy to announce that TreeFam 9 is online and you can find it under <a href="http://www.treefam.org" rel="nofollow">http://www.treefam.org</a>.</p>
<p>TreeFam 9 now has 109 species (vs. 79 in TreeFam 8) and is based on data from Ensembl v69, Ensembl Genomes v16, Wormbase and JGI.</p>
<p>This release marks an important step for TreeFam as it is the first release build since TreeFam has been resurrected.<br />
Here is a list of the most important changes in TreeFam 9:</p>
<ul>
<li>New website layout (adopting the Pfam/Rfam/Dfam layout)</li>
<li>Infrastructure move of web servers and databases to the EBI</li>
<li>Sequence search against the library of TreeFam family profiles</li>
<li>new tree visualisations in pure javascript using D3, e.g. see the BRCA2 gene tree <a title="TreeFam BRCA2 gene tree" href="http://www.treefam.org/family/TF105041#tabview=tab1">here</a>.</li>
<li>Pairwise homology download</li>
</ul>
<p>We hope you find all the information you are looking for. If you don&#8217;t, please let us know so that we can include the information you want. The old website will remain online <a title="TreeFam 8 website" href="http://legacy.treefam.org" target="_blank">here</a>.</p>
<p>If you have questions, suggestions or find bugs, don&#8217;t hesitate to contact us through our new forum <a title="TreeFam forum" href="https://treefam.tenderapp.com/home" target="_blank">here</a>.</p>
<p>Happy treefamming,</p>
<p>the TreeFam team<br />
(Fabian, Mateus)</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/dfam-2/'>dfam</a>, <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/production-2/'>production</a>, <a href='http://xfam.wordpress.com/tag/release/'>release</a>, <a href='http://xfam.wordpress.com/tag/rfam/'>rfam</a>, <a href='http://xfam.wordpress.com/tag/treefam-2/'>treefam</a>, <a href='http://xfam.wordpress.com/tag/website/'>website</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/924/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/924/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=924&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2013/05/03/treefam-9-is-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/403d2796dc2e49deac3184aad0a0f496?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fabsta</media:title>
		</media:content>
	</item>
		<item>
		<title>Pfam 27.0 is now available!</title>
		<link>http://xfam.wordpress.com/2013/03/22/pfam-27-0-is-now-available/</link>
		<comments>http://xfam.wordpress.com/2013/03/22/pfam-27-0-is-now-available/#comments</comments>
		<pubDate>Fri, 22 Mar 2013 18:42:13 +0000</pubDate>
		<dc:creator>rdfinn</dc:creator>
				<category><![CDATA[Pfam]]></category>
		<category><![CDATA[Production]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[production]]></category>
		<category><![CDATA[release]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=926</guid>
		<description><![CDATA[In a blog post published just over a year ago, I proposed a number of changes to the content of Pfam to improve scalability and usability of the database.  These changes came into effect a few days ago, when we released Pfam 27.0.  This release of Pfam contains a total of 14831 families, with 1182 [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=926&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In a <a title="Changes blog" href="http://xfam.wordpress.com/2012/02/27/proposed-changes/">blog post</a> published just over a year ago, I proposed a number of <strong>changes to the content of Pfam</strong> to improve scalability and usability of the database.  These changes came into effect a few days ago, when we<strong> released Pfam 27.0</strong>.  This release of Pfam contains a total of <strong>14831 families</strong>, with 1182 new families and 22 families killed since release 26.0. <strong>80% of all proteins</strong> in UniProt contain a match to at least one Pfam domain, and 58% of all residues in the sequence database fall within a Pfam domain.<span id="more-926"></span></p>
<p>So what has changed?  To the user, hopefully, not a great deal has changed! Nevertheless, there has been a considerable amount of reorganization of the database production pipeline. The most notable loss of information is that we are <strong>no longer providing neighbour-joining</strong> trees for the Pfam full alignments.  If you want or care about this type data,  it will now be up to you to calculate it – however, if this data was important to you it should be recalculated using a more precise method anyway.  On the flip side, there are<strong> many new features</strong> that have been integrated into <a href="http://pfam.sanger.ac.uk">Pfam 27.0</a>.  Below is a brief list of the new developments that are now available:</p>
<ul>
<li>Real time <a href="http://pfam.sanger.ac.uk/search">searches</a> of <strong>DNA</strong>  sequences for matches to Pfam models</li>
<li>Use of<strong> Representative proteomes </strong>(1) sequence sets used to provide redundant views of the Pfam-A full alignments</li>
<li>Addition of <strong>disorder predictions</strong> to the repertoire of sequence feature annotations</li>
<li><strong>AntiFam </strong>(2) has been applied to the underlying sequence database to remove sequences believed to be <strong>spurious translations</strong></li>
<li><strong>Selectable</strong> sunbursts in the Pfam-A &#8216;species&#8217; distribution tab, allowing the generation of alignments or visualisation of sequences from a user defined taxonomic range.</li>
<li>New, faster <strong>keyword search</strong> using <a href="http://lucy.apache.org">Apache Lucy</a></li>
</ul>
<p>Blog posts describing these news developments in more detail<em> will</em> follow in the coming weeks.  In addition to the changes listed above, there have been <strong>many improvements to existing families</strong>, be it improving domain boundaries, expanding members or the generation of Wikipedia entries.  Many of the  new entries in Pfam have been built with the purpose of improving <strong>Human</strong> coverage.</p>
<p>Enjoy the release!</p>
<p>Posted by Rob</p>
<p><span style="text-decoration:underline;">References</span></p>
<p>1: Chen C, Natale DA, Finn RD, Huang H, Zhang J, Wu CH, Mazumder R. <a title="PLOS One article" href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0018910">Representative proteomes: a stable, scalable and unbiased proteome set for sequence analysis and functional annotation.</a> PLoS One. 2011 6(4):e18910</p>
<p>2: Eberhardt RY, Haft DH, Punta M, Martin M, O&#8217;Donovan C, Bateman A. <a title="Database article" href="http://database.oxfordjournals.org/content/2012/bas003.long">AntiFam: a tool to help identify spurious ORFs in protein annotation.</a> Database 2012:bas003.</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/production-2/'>production</a>, <a href='http://xfam.wordpress.com/tag/release/'>release</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/926/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/926/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=926&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2013/03/22/pfam-27-0-is-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/f68c4aebddc8549cf75776ec3b45b743?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">rdfinn</media:title>
		</media:content>
	</item>
		<item>
		<title>Dfam paper is an NAR &#8220;featured article&#8221;; RepeatMasker4 is out</title>
		<link>http://xfam.wordpress.com/2013/01/12/dfam-paper-intro/</link>
		<comments>http://xfam.wordpress.com/2013/01/12/dfam-paper-intro/#comments</comments>
		<pubDate>Sat, 12 Jan 2013 02:45:18 +0000</pubDate>
		<dc:creator>travisjwheeler</dc:creator>
				<category><![CDATA[Dfam]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[dfam]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=918</guid>
		<description><![CDATA[We are pleased to announce that the Dfam paper (&#8220;Dfam: a database of repetitive DNA based on profile hidden Markov models&#8220;) is now available in the 2013 NAR Database issue, and has been selected as a &#8220;featured article&#8220; (meaning the NAR editorial board thinks it is among &#8220;the top 5% of papers in terms of originality, significance and [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=918&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We are pleased to announce that the Dfam paper (&#8220;<a title="Dfam paper link" href="http://nar.oxfordjournals.org/content/41/D1/D70.full">Dfam: a database of repetitive DNA based on profile hidden Markov models</a>&#8220;) is now available in the 2013 NAR Database issue, and has been selected as a &#8220;<a title="featured article link" href="http://www.oxfordjournals.org/our_journals/nar/featured_articles.html">featured article</a>&#8220; (meaning the NAR editorial board thinks it is among &#8220;the top 5% of papers in terms of originality, significance and scientific excellence&#8221;).</p>
<p>In other exciting news, two members of the Dfam consortium, Arian Smit and Robert Hubley (Institute for Systems Biology, Seattle), just released <strong>RepeatMasker 4.0</strong>. This is a major update that, among other important improvements, adds support for searching with Dfam and nhmmer. Go get yourself a copy at <a href="http://www.repeatmasker.org/">http://www.repeatmasker.org/</a></p>
<p>Posted by Travis</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/dfam-2/'>dfam</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/918/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/918/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=918&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2013/01/12/dfam-paper-intro/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/ac2cd569ad62cd7a63ef508b3a6d0b2d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">travisjwheeler</media:title>
		</media:content>
	</item>
		<item>
		<title>TreeFam: What&#8217;s in the next release</title>
		<link>http://xfam.wordpress.com/2012/12/10/treefam-whats-in-the-next-release/</link>
		<comments>http://xfam.wordpress.com/2012/12/10/treefam-whats-in-the-next-release/#comments</comments>
		<pubDate>Mon, 10 Dec 2012 15:06:08 +0000</pubDate>
		<dc:creator>fabsta</dc:creator>
				<category><![CDATA[TreeFam]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=902</guid>
		<description><![CDATA[Behind the scenes we are working hard on building the next TreeFam release, which will be TreeFam 9. TreeFam 9 will have 109 species, that is a 37% increase over TreeFam 8. Most of the species come from EnsEMBL (v.69) and EnsEMBL genomes (v.16) with a few ones coming from JGI. Besides that &#8211; and probably most important [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=902&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Behind the scenes we are working hard on building the next TreeFam release, which will be TreeFam 9.</p>
<p>TreeFam 9 will have 109 species, that is a 37% increase over TreeFam 8. Most of the species come from EnsEMBL (v.69) and EnsEMBL genomes (v.16) with a few ones coming from JGI.</p>
<p>Besides that &#8211; and probably most important for the user &#8211; will be our new web site. Based on the success of other Xfam-databases like Pfam [5], Rfam [6] and -most recently- Dfam [7], we decided to give the TreeFam website a face lift by adapting it to the Xfam look&amp;feel.</p>
<p>So, there are great things to come and soon we will have our next blog post.<br />
The next TreeFam blog post will then be about TreeFam 9!</p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/902/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/902/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=902&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2012/12/10/treefam-whats-in-the-next-release/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/403d2796dc2e49deac3184aad0a0f496?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">fabsta</media:title>
		</media:content>
	</item>
		<item>
		<title>The Rfam NAR paper is now available!</title>
		<link>http://xfam.wordpress.com/2012/11/23/the-rfam-nar-paper-is-now-available/</link>
		<comments>http://xfam.wordpress.com/2012/11/23/the-rfam-nar-paper-is-now-available/#comments</comments>
		<pubDate>Fri, 23 Nov 2012 16:53:53 +0000</pubDate>
		<dc:creator>Sarah</dc:creator>
				<category><![CDATA[News]]></category>
		<category><![CDATA[Rfam]]></category>
		<category><![CDATA[NAR database issue]]></category>
		<category><![CDATA[papers]]></category>
		<category><![CDATA[rfam]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=895</guid>
		<description><![CDATA[For some light weekend reading, have a look at the latest Rfam paper, Rfam 11.0: 10 years of RNA Families.  It&#8217;s part of the 2013 Nucleic Acids Research Database issue, and you&#8217;ll find all the latest developments to Rfam mentioned, including the sunbursts, the Biomart and an update on the Wikipedia annotation effort. Tagged: NAR [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=895&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>For some light weekend reading, have a look at the <a title="Rfam NAR paper" href="http://nar.oxfordjournals.org/content/early/2012/11/02/nar.gks1005.long">latest Rfam paper,</a> Rfam 11.0: 10 years of RNA Families.  It&#8217;s part of the 2013 Nucleic Acids Research Database issue, and you&#8217;ll find all the latest developments to Rfam mentioned, including the sunbursts, the Biomart and an update on the Wikipedia annotation effort.</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/nar-database-issue/'>NAR database issue</a>, <a href='http://xfam.wordpress.com/tag/papers/'>papers</a>, <a href='http://xfam.wordpress.com/tag/rfam/'>rfam</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/895/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/895/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=895&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2012/11/23/the-rfam-nar-paper-is-now-available/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/d7e7c6d6acc0ce986feed010a85fc073?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">sarahburge01</media:title>
		</media:content>
	</item>
		<item>
		<title>R-chie arc diagrams now available in our secondary structure galleries</title>
		<link>http://xfam.wordpress.com/2012/11/19/r-chie-arc-diagrams-now-available-in-our-secondary-structure-galleries/</link>
		<comments>http://xfam.wordpress.com/2012/11/19/r-chie-arc-diagrams-now-available-in-our-secondary-structure-galleries/#comments</comments>
		<pubDate>Mon, 19 Nov 2012 11:20:56 +0000</pubDate>
		<dc:creator>Sarah</dc:creator>
				<category><![CDATA[Rfam]]></category>
		<category><![CDATA[R-chie]]></category>
		<category><![CDATA[rfam]]></category>
		<category><![CDATA[secondary structure]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=868</guid>
		<description><![CDATA[We are pleased to announce the inclusion of R-chie arc diagrams in the Rfam family secondary structure galleries. We think these images are beatiful and intuitive ways of visualising complex RNA secondary structures, and we hope that you find them as useful as we do. You can find the R-chie tab in the secondary structure [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=868&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We are pleased to announce the inclusion of R-chie arc diagrams in the Rfam family secondary structure galleries. We think these images are beatiful and intuitive ways of visualising complex RNA secondary structures, and we hope that you find them as useful as we do. You can find the R-chie tab in the secondary structure image gallery for each family; from there you can zoom in and out of the images, as well as viewing the image in a seperate window. The majority of Rfam families have R-chie images; those which don&#8217;t are families without secondary structure. Have a look at the <a title="U1" href="http://rfam.sanger.ac.uk/family/U1#tabview=tab3">U1 spliceosomal RNA</a>, or <a title="tRNA" href="http://rfam.sanger.ac.uk/family/RF00005#tabview=tab3">tRNA</a> for examples.</p>
<p>The R-chie diagrams are created using the R4RNA R package from <a title="e-rna" href="http://www.e-rna.org/">Irmtraud Meyer&#8217;s</a> group; be sure to check out the <a title="R-chie" href="http://dx.doi.org/10.1093/nar/gks241">R-chie paper</a>, as well as their own gallery of Rfam structures.</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/r-chie/'>R-chie</a>, <a href='http://xfam.wordpress.com/tag/rfam/'>rfam</a>, <a href='http://xfam.wordpress.com/tag/secondary-structure/'>secondary structure</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/868/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/868/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=868&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2012/11/19/r-chie-arc-diagrams-now-available-in-our-secondary-structure-galleries/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/d7e7c6d6acc0ce986feed010a85fc073?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">sarahburge01</media:title>
		</media:content>
	</item>
		<item>
		<title>Dfam 1.1 released</title>
		<link>http://xfam.wordpress.com/2012/11/15/dfam-1-1-released/</link>
		<comments>http://xfam.wordpress.com/2012/11/15/dfam-1-1-released/#comments</comments>
		<pubDate>Thu, 15 Nov 2012 14:13:07 +0000</pubDate>
		<dc:creator>travisjwheeler</dc:creator>
				<category><![CDATA[Dfam]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[dfam]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[rfam]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=846</guid>
		<description><![CDATA[We are pleased to announce that we&#8217;ve released Dfam 1.1. This version represents a few important changes from 1.0, including updated hit results, a new tab for each entry page showing relationships to other entries, and improved handling of redundant profile hits. New Hit Results The underlying database and set of entries have not changed from Dfam 1.0, but [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=846&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We are pleased to announce that we&#8217;ve released <a href="http://dfam.janelia.org">Dfam 1.1</a>. This version represents a few important changes from 1.0, including updated hit results, a new tab for each entry page showing relationships to other entries, and improved handling of redundant profile hits.</p>
<p><span id="more-846"></span></p>
<p><strong>New Hit Results</strong></p>
<p>The underlying database and set of entries have not changed from Dfam 1.0, but the seed alignments have been tweaked to add in some additional hits from chromosome Y and the sequence matches (and match boundaries) have changed slightly as a result of using a new version of nhmmer (snapshot <a href="ftp://selab.janelia.org/pub/dfam/Tools/hmmer3.1-snap20121016.1.tgz">here</a>) that better handles regions of low complexity. This allowed us to mask fewer low-complexity regions in models and decrease gathering thresholds (increasing sensitivity) while still slightly reducing false positive rates.</p>
<p><strong>Relationships tab</strong></p>
<p>In many cases, a single locus of DNA sequence might be matched by multiple Dfam models, due to similarity between models. For example, there are 37 <a href="http://dfam.janelia.org/search/keyword?keywords=alu">Alu subfamily models</a>, all representing minor variants of the prototypical Alu, so any Alu instance will likely be found by nearly all Alu models. Many models have a more complicated relationship, as is the case with Ricksha (which long ago picked up the 3&#8242; end of an <a href="http://dfam.janelia.org/entry/DF0000118">ERVL</a>, including its LTR, <a href="http://dfam.janelia.org/entry/DF0001029">MLTB2</a>), and <a href="http://dfam.janelia.org/entry/DF0001068">SVA</a> (which carries copies of both a portion of a <a href="http://dfam.janelia.org/entry/DF0000540">HERVK LTR</a> and two Alus in reverse orientation). To indicate these sorts of relationships between models, we have created a Relationships tab for each model, where we graphically represent related models, and provide a dot plot for each relationship that depicts the alignment between the related models.</p>
<p><strong>Redundant profile hit resolution</strong></p>
<p>We call these cases of a single locus matching multiple profile HMMs, &#8220;Redundant Profile Hits&#8221; (RPHs). When presenting hit counts and hit distributions both over the model (Coverage Plot on the Model tab) and over the genome (Hit tab), Dfam 1.0 only showed the total number of hits, so that an RPH would impact multiple model tallies. We now resolve RPHs, so that each locus is associated with only the model that best explains it (though in some cases, the resolution choice is unclear, and a couple of models might be assigned to the locus).</p>
<p><strong>And finally&#8230;..</strong></p>
<p>We have already started to receive positive feedback from some early adopters of the databases.  We really appreciate this and suggestions for new features, so if you have any suggestions or would like to contribute models to the database, please get in <a href="mailto:dfam@janelia.hhmi.org">contact</a>.  In the next release of Dfam, we aim to continue making improvements to the existing set of entries, before we pause and determine the next major milestone.</p>
<p>Posted by Travis and Rob</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/dfam-2/'>dfam</a>, <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/rfam/'>rfam</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/846/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/846/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=846&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2012/11/15/dfam-1-1-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/ac2cd569ad62cd7a63ef508b3a6d0b2d?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">travisjwheeler</media:title>
		</media:content>
	</item>
		<item>
		<title>What’s new in AntiFam?</title>
		<link>http://xfam.wordpress.com/2012/11/13/whats-new-in-antifam/</link>
		<comments>http://xfam.wordpress.com/2012/11/13/whats-new-in-antifam/#comments</comments>
		<pubDate>Tue, 13 Nov 2012 09:31:38 +0000</pubDate>
		<dc:creator>Ruth Eberhardt</dc:creator>
				<category><![CDATA[AntiFam]]></category>
		<category><![CDATA[News]]></category>
		<category><![CDATA[Releases]]></category>
		<category><![CDATA[antifam]]></category>
		<category><![CDATA[pfam]]></category>
		<category><![CDATA[release]]></category>
		<category><![CDATA[rfam]]></category>

		<guid isPermaLink="false">http://xfam.wordpress.com/?p=852</guid>
		<description><![CDATA[We have recently produced a new release of AntiFam, release 3.0. AntiFam has grown in size, and release 3.0 contains 54 entries – compared to just 23 when we last blogged about AntiFam (release 1.1).  Over 80 % of these new entries arise from translations of non-coding RNAs, including several families from translations of rRNA, [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=852&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>We have recently produced a new release of AntiFam, release 3.0. AntiFam has grown in size, and release 3.0 contains 54 entries – compared to just 23 when we last blogged about AntiFam (release 1.1).  Over 80 % of these new entries arise from translations of non-coding RNAs, including several families from translations of rRNA, tmRNA and RNaseP.</p>
<p><span id="more-852"></span></p>
<p><strong> New features</strong></p>
<p>AntiFam has several new features, which we hope our users will find useful. In release 2.0 we introduced a DO line, this is the line in the AntiFam.seed file which begins “#=GF DO”. This line gives a recommended course of action to take when a predicted protein sequence is recognised by an AntiFam HMM. In most cases this line recommends that the predicted protein is deleted.</p>
<p>In release 3.0 we have introduced a TX line, the line in the AntiFam.seed file which begins “#=GF TX”. This is used to indicate which superkingdoms predicted proteins recognised by AntiFam HMMs have been found in. We have also produced superkingdom-specific sets of HMMs for this release.  One AntiFam HMM may identify spurious proteins arising from multiple superkingdoms, and therefore there is some overlap between these sets.  There are 18 entries in the Eukaryota set, 47 in the Bacteria set, 4 in the Archaea set, 4 in the Virus set and 18 in the unidentified set; the latter includes proteins from unclassified organisms, such as those from metagenomics studies.</p>
<p><strong>Availability</strong></p>
<p>AntiFam 3.0 is available to download from our ftp site on <a href="ftp://ftp.sanger.ac.uk/pub/databases/Pfam/AntiFam/">ftp://ftp.sanger.ac.uk/pub/databases/Pfam/AntiFam/</a></p>
<p>AntiFam is made freely available under the Creative commons Zero (CC0) licence <a href="http://creativecommons.org/publicdomain/zero/1.0/">http://creativecommons.org/publicdomain/zero/1.0/</a></p>
<p>We welcome any suggestions for new families, and can be contacted by email <a href="mailto:ruthe@ebi.ac.uk">ruthe@ebi.ac.uk</a></p>
<p>Posted by Ruth</p>
<br /> Tagged: <a href='http://xfam.wordpress.com/tag/antifam-2/'>antifam</a>, <a href='http://xfam.wordpress.com/tag/pfam/'>pfam</a>, <a href='http://xfam.wordpress.com/tag/release/'>release</a>, <a href='http://xfam.wordpress.com/tag/rfam/'>rfam</a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/xfam.wordpress.com/852/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/xfam.wordpress.com/852/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=xfam.wordpress.com&#038;blog=6232182&#038;post=852&#038;subd=xfam&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://xfam.wordpress.com/2012/11/13/whats-new-in-antifam/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/a21789b9c60d41e9c1d7be6be982edef?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">ryeberhardt</media:title>
		</media:content>
	</item>
	</channel>
</rss>
