<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The OpenHelix Blog &#187; hapmap</title>
	<atom:link href="http://blog.openhelix.eu/?feed=rss2&#038;tag=hapmap" rel="self" type="application/rss+xml" />
	<link>http://blog.openhelix.eu</link>
	<description>at OpenHelix</description>
	<lastBuildDate>Fri, 03 Sep 2010 14:58:30 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Friday SNPpets</title>
		<link>http://blog.openhelix.eu/?p=5187</link>
		<comments>http://blog.openhelix.eu/?p=5187#comments</comments>
		<pubDate>Fri, 03 Sep 2010 12:57:16 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[SNPpets]]></category>
		<category><![CDATA[ensembl]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[metagenomics]]></category>
		<category><![CDATA[plos]]></category>
		<category><![CDATA[USGS]]></category>
		<category><![CDATA[whale shark]]></category>

		<guid isPermaLink="false">http://blog.openhelix.eu/?p=5187</guid>
		<description><![CDATA[]]></description>
			<content:encoded><![CDATA[<p>Welcome to our Friday feature link collection: SNPpets. During the    week we come across a lot of links and reads that we think are    interesting, but don&#8217;t make it to a blog post. Here they are for your    enjoyment&#8230;</p>
<ul>
<li><a href="http://bioinformatics.oxfordjournals.org/cgi/content/abstract/btq455v1" target="_blank">METAREP</a>: JCVI Metagenomics Reports &#8211; an open source tool for high-performance comparative metagenomics. Hat tip to <a href="http://twitter.com/kozo2/statuses/22303322025" target="_blank">@kozo2</a> [<em>Mary</em>]</li>
<li><a href="http://blogs.plos.org/plos/2010/09/announcing-plos-blogs/" target="_blank">PLoS now has a blogging network</a>. Science blogging continues to diversify [<em>Trey</em>]</li>
<li>USGS has a <a href="http://biology.usgs.gov/genetics_genomics/" target="_blank">genomics site</a>? I thought they just did <a href="http://earthquake.usgs.gov/" target="_blank">earthquakes</a>. From their page they refer to the sequencing of the fungus that&#8217;s affecting bat populations, and their team is involved with that. Neat. Plus lots of <a href="http://biology.usgs.gov/genetics_genomics/epidemiology_fish_wildlife_diseases_mammals.html" target="_blank">other interesting genetics of wildlife issues</a>. And a <a href="http://microbiology.usgs.gov/" target="_blank">microbiology component</a>. Wowsa. H/T to <a href="http://twitter.com/lakeganharris/status/22614366409" target="_blank">@lakeganharris</a> [<em>Mary</em>]</li>
<li>Crowd-sourcing a genome project: to raise funds for sequencing the whale shark genome, you can buy hair bands. I&#8217;m not sure I get the linkage&#8211;but if I still had long hair I&#8217;d probably do it&#8230; <a href="http://alistairdove.com/blog/2010/8/30/whale-shark-genome-schwag-has-been-unbridled.html" target="_blank">Whale shark genome schwag has been unbridled!</a> [<em>Mary</em>]</li>
<li>Speaking of crowds (crowds of variations?.. something <img src='http://blog.openhelix.eu/wp-includes/images/smilies/icon_biggrin.gif' alt=':D' class='wp-smiley' /> ), HapMap 3 paper just came out and <a href="http://blogs.discovermagazine.com/gnxp/2010/09/hapmap-3-more-people-more-genetic-variation/" target="_blank">Razib Khan has an excellent blog post</a> about it. [<em>Trey</em>]</li>
<li>Organizations that you might consider joining, or at least following: <a href="http://www.scientistswithoutborders.org" target="_blank">Scientists without  Borders</a>, <a href="http://www.unesco.org/" target="_blank">UNESCO</a>, and <a href="http://ways.org/en" target="_blank">WAY: World Association of Young Scientists &#8211; Science, Remixed</a>. [<em>Jennifer</em>]</li>
<li>I might have to read the full article, based on this summary: &#8220;<a href="http://www.jhoonline.org/content/3/1/19" target="_blank">Sweet fruit from a poisonous kiss</a>&#8221; by Delong Liu Full paper by Zhang et al at <a href="http://www.sciencemag.org/cgi/content/abstract/328/5975/240" target="_blank">Science. 2010 Apr 9;328(5975):240-3</a>. [<em>Jennifer</em>]</li>
<li>ensembl wants your feedback. Check out their survey:<a href="http://bit.ly/cTSXyu" target="_blank"> http://bit.ly/cTSXyu</a> [<em>Mary</em>]</li>
<li>Check it out &#8211; <a href="http://en.wordpress.com/tag/scio10-interviews/" target="_blank">the series of ScienceOnline interviews</a>, or <a href="http://blog.coturnix.org/2010/09/01/scienceonline2010-interview-%e2%80%93-jennifer-williams/" target="_blank">my specific interview</a> &#8211; you decide which to read&#8230; <img src='http://blog.openhelix.eu/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  [<em>Jennifer</em>]</li>
<li>more than 2 weeks of science fun throughout North Carolina and including the MythBusters guys: <a href="http://www.ncsciencefestival.org/" target="_blank">NC Science Festival 2010 &#8211; Life is Your Lab!</a> [<em>Jennifer</em>]</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=5187</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Guest Post: SNAP &#8212; Andrew Johnson</title>
		<link>http://blog.openhelix.eu/?p=4697</link>
		<comments>http://blog.openhelix.eu/?p=4697#comments</comments>
		<pubDate>Tue, 22 Jun 2010 18:01:07 +0000</pubDate>
		<dc:creator>Trey</dc:creator>
				<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[Guest Posts]]></category>
		<category><![CDATA[New Resource]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[SNAP]]></category>
		<category><![CDATA[snps]]></category>

		<guid isPermaLink="false">http://blog.openhelix.eu/?p=4697</guid>
		<description><![CDATA[This next post in our continuing semi-regular Guest Post series is from Andrew Johnson, one of the developers and the concept designer of SNAP, SNP Annotation and Proxy Search which is hosted at the Broad Institute. If you are a provider of a free, publicly available genomics tool, database or resource and would like to convey [...]]]></description>
			<content:encoded><![CDATA[<p><em>This next post in our continuing semi-regular </em><a href="http://blog.openhelix.eu/?cat=7" target="_blank"><em>Guest Post series</em></a><em> is from Andrew Johnson, one of the developers and the concept designer of </em><em><a href="http://www.broadinstitute.org/mpg/snap/" target="_blank">SNAP, SNP Annotation and Proxy Search</a> which is hosted at the <a href="http://www.broadinstitute.org/" target="_blank">Broad Institute</a></em><em>. If you are a provider of a free, publicly available genomics tool, database or resource and would like to convey something to users on our guest post feature, please feel free to contact us at wlathe AT openhelix DOT com or </em><a href="http://www.openhelix.com/cgi/contact.cgi" target="_blank"><em>the contact form</em></a><em> (write &#8216;guest post&#8217; as subject heading). We welcome introductions to your resource, information on updates, highlights of little known gems or opinion pieces on the state of genomic research and databases.</em></p>
<p id="zw-11">SNAP (<a id="zw-13" href="http://www.broadinstitute.org/mpg/snap/">http://www.broadinstitute.org/mpg/snap/</a>, Johnson et al. (2008) Bioinformatics 24(24): 2938), “SNP Annotation and Proxy search”, is a flexible, web-based tool that allows anyone in the world to quickly accomplish a range of SNP-related genetics and bioinformatics tasks. This post highlights some common questions andfeatures of SNAP, some more obscure uses, and recent and planned developments.</p>
<p id="zw-24"><strong><em>How did SNAP come about?</em></strong></p>
<p id="zw-26">The idea for SNAP was originally sparked by GWAS analysts within a large collaborative group (the Framingham Heart Study SHARe project). This was in the pre-imputation era when GWAS investigators from different groups using different SNP arrays often wanted to find best proxy SNPs based on HapMap for comparison when they didn’t have common genotyped SNPs across groups. We initially implemented local programs to lookup upHapMap LD and also consider the presence of query and proxy SNPs on different commercial genotyping arrays. We quickly realized this was a community-wide problem as we received requests from outside collaborators so we decided it was worth developing a public tool and approached investigators at the Broad Institute. Through collaboration with Paul de Bakker, Bob Handsaker and others at the Broad Institute we were able to add more features like plotting and build a nice, quick and accessible interface. Many people have contributed ideas, testingand improvements to SNAP, and Bob Handsaker and Pei Lin in particular continue to maintain and update SNAP.</p>
<p id="zw-45"><em><strong>What do you use SNAP for the most?</strong></em></p>
<p id="zw-51">The two major features of SNAP widely used 1) SNP LD queries, and 2) plotting of LD and association data. There are a number of flexible options for these functions. Beyond these, as a SNP bioinformatics specialist, I often use SNAP to rapidly retrieve information about a list of SNPs for other uses (see specialized queries below).</p>
<p id="zw-57"><strong><em>What are some commonly asked questions from users of SNAP?</em></strong></p>
<p><strong><em><span id="more-4697"></span><br />
</em></strong></p>
<p>Many of the common questions are covered in detail in the FAQ and/or Documentation available on the website. Here are some questions I commonly receive.</p>
<p id="zw-63">How do I return all LD proxy relationships within 500kb of my query SNPs? Change the r^2 threshold to “No Limit” and leave the distance limit at “500”.</p>
<p id="zw-67">Why doesn’t my favorite SNP appear when I make a query? Occasionally query and/or expected proxy SNPs will not be found. This could be a result of an error in your representation of the query SNPid. The most likely explanation for proxy SNP is that the filters you’ve selected (r^2 and distance limits) have caused a SNP not to be included. Another likely explanation is that the expected SNP is &gt;500kb apart from query SNPs (see below). Rarely SNAP may return an alias SNPid for a proxy rather than the one you expect since SNAP takes aliases into account in queries (see below).  Finally, a SNP(s) may not be included in the HapMap release you are querying.HapMap releases 21 and 22 differ among a small number of SNPs. HapMap release 3 differs from prior releases at a greater number of SNPs. If it is important you should try querying different HapMap releases to find a release(s) with your SNP(s) of interest. Alternatively, you can also try to find genotype data separately and load into a program such as Haploview to calculate LD metrics.</p>
<p id="zw-100">Can I generate plots based on my own data? Yes. You can upload both your own association data (-log P) and your own LD data, as you may have generated LD estimates within your own population and/or a larger sample than available in the HapMap. If you don’t specify your own LD data SNAP uses HapMap by default. If you don’t provide chromosome and position SNAP fills these in based on HapMap. If you have de novo markers that are not in HapMap you can also include these as long as you specify the chromosome and positions and LD to the target SNP.</p>
<p id="zw-113">Why do I observe different LD estimates for the same pair of SNPs in different HapMap releases? Identical SNP pairs generally have identical, or very similar, LD estimates in different releases of HapMap. If LD estimates differ slightly it is attributable to differences in genotypes in the releases.</p>
<p id="zw-128">What if I just want to query LD among a select group of SNPs? Click the “Pairwise LD” tab. Copy and paste yourSNPid list, or upload a file. Your LD queries will be limited to only your SNPs of interest rather than all HapMapSNPs that meet the filtering criteria.</p>
<p id="zw-143">What if I want to find SNPs genotyped only on a specific array or group of arrays? There is a rapid way to limit queries to specific arrays. Click the ‘+’ on the Filter By Array. Select those arrays you want.</p>
<p id="zw-146">What do if I want to calculate long range LD or trans-chromosomal LD? SNAP returns results for pairwise LD between SNPs with distance up to 500kb. This is greater than the default of HapMap pre-calculated data of 250kb. In some cases users may want to assess SNPs that are further apart. A few options exist including 1) downloading the HapMap genotypes and loading into Haploview while removing the pairwise distance limitation, 2) calculating using PLINK, or 3) querying with the GLIDERS website (<a id="zw-159" href="http://mather.well.ox.ac.uk/GLIDERS/">http://mather.well.ox.ac.uk/GLIDERS/</a>). GLIDERS returns extreme long range queries on chromosomes or trans-chromosomal queries.</p>
<p id="zw-164"><a name="OLE_LINK1"></a><a name="OLE_LINK2"></a><strong><em>What are some specialized queries I can conduct with SNAP?</em></strong></p>
<p id="zw-168">Find annotation for SNPs regardless of LD proxies. SNAP doesn’t have to be used as an LD querying tool. You can simply retrieve information about a list of SNPs. To do so load your SNPids. Under “Search Options” select Distance Limit as 0 instead of the default 500kb. With the default settings SNAP will now only return information for your query SNPs themselves. You can select additional options like GeneCruiser annotations and MAFs. This is an excellent way to rapidly answer questions like: would a SNP(s) be genotyped on my array(s) of interest?Which of my SNPs are nonsynonymous SNPs? What are the genomic coordinates for my list of SNPs in a specific genome build (just select the corresponding HapMap build – Release 21=hg17, Release 22/HapMap 3=hg18)?What are the HapMap MAFs for my list of SNPs? Of course, you can also turn on proxy querying and ask these same questions in relation to both query and/or proxy SNPs. For instance, of my significant GWAS SNPs are any of them in LD with r^2 &gt; 0.5 with a known nonsynonymous SNP?</p>
<p id="zw-203">Find alias or alternate SNPids for my SNPs of importance. Some people do not realize that SNPs can suffer from a historical aliasing problem just like gene names. If you are using SNPids to query bioinformatics tools or databases, or to conduct cross dataset queries, to be extra cautious you should rely on genome positions and allelesor account for potential alias IDs. SNAP allows querying to return alias SNPids. Click the tab “Map SNP IDs“. You can retrieve IDs for a SNP across all previous dbSNP builds or specify a specific build to target.</p>
<p id="zw-224"><em><strong>What are recently added features of planned future updates to SNAP?</strong></em></p>
<p id="zw-228">SNAP is in version 2.1. Recent updates have included the addition of 1) HapMap 3 release featuring 12 population groupings, 2) SNP information for 7 new commercial SNP arrays (25 arrays now listed), 3) and the ability to include HapMap major and minor alleles, frequencies and observed genotype counts in output. We welcome suggestions for additional features. Most of the added features in the past have come from suggestions by active SNAP users and testers. In the future we plan to include query options based on 1000 Genomes Project based LDdata. We also plan to keep up with additional SNP array releases as they come to our attention.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=4697</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Tip of the Week: HapMap data in Haploview</title>
		<link>http://blog.openhelix.eu/?p=3715</link>
		<comments>http://blog.openhelix.eu/?p=3715#comments</comments>
		<pubDate>Wed, 10 Mar 2010 05:01:17 +0000</pubDate>
		<dc:creator>Trey</dc:creator>
				<category><![CDATA[Tip of the Week]]></category>
		<category><![CDATA[haplotypes]]></category>
		<category><![CDATA[haploview]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[LD]]></category>
		<category><![CDATA[linkage disequilibrium]]></category>
		<category><![CDATA[snps]]></category>

		<guid isPermaLink="false">http://blog.openhelix.eu/?p=3715</guid>
		<description><![CDATA[HapMap has had a few minor updates to their browser, and importantly, new phase 3 data was released early last year (drafts of that data were released in 2008). Haploview, the downloaded software that allows the user to perform in depth LD and haplotype analysis, has been recently updated from version 4.1 to version 4.2. [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.openhelix.com/downloads/jing/haploview.swf"><img class="alignleft size-medium wp-image-3716" title="haploview_thumb" src="http://blog.openhelix.eu/wp-content/uploads/2010/03/haploview_thumb-300x216.png" alt="" width="300" height="216" /></a><a href="http://hapmap.ncbi.nlm.nih.gov/" target="_blank">HapMap</a> has had a few minor updates to their browser, and importantly, new phase 3 data was released early last year (<a href="http://hapmap.ncbi.nlm.nih.gov/old_news.html.en" target="_blank">drafts of that data </a>were released in 2008). <a href="http://www.broadinstitute.org/haploview/haploview-downloads" target="_blank">Haploview</a>, the downloaded software that allows the user to perform in depth LD and haplotype analysis, has been recently updated from version 4.1 to version 4.2. Haploview can be used with user data or data downloaded from the HapMap project. Though, version 4.1 did not work for phase III HapMap project data, so the user had to use phase I and II data if they wanted to use version 4.1. Haploview has now been updated to version 4.2, allowing the user to use HapMap phase III data.</p>
<p>That&#8217;s a lot of versions and phases <img src='http://blog.openhelix.eu/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . The short of it is, if you use Haploview 4.2, you can view and analyze data from any phase of the HapMap project.</p>
<p>Today&#8217;s tip briefly shows you how to download data from the HapMap project and view it in Haploview.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=3715</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Top SNPs of the year</title>
		<link>http://blog.openhelix.eu/?p=3325</link>
		<comments>http://blog.openhelix.eu/?p=3325#comments</comments>
		<pubDate>Mon, 11 Jan 2010 20:14:05 +0000</pubDate>
		<dc:creator>Trey</dc:creator>
				<category><![CDATA[General Science]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[SNPedia]]></category>
		<category><![CDATA[snps]]></category>

		<guid isPermaLink="false">http://blog.openhelix.eu/?p=3325</guid>
		<description><![CDATA[Interesting post from SNPedia blog (we mentioned being able to view SNPedia SNPS HapMap last year in a post) of the top 10 SNPs of the year. Of course, as they mention, it&#8217;s very subjective. Because they have chosen SNPs with serious health interest, I&#8217;ll semi-frivolously (because hey, no knowledge is necessarily &#8220;frivolous&#8221; nominate either: [...]]]></description>
			<content:encoded><![CDATA[<p>Interesting post from SNPedia blog (we mentioned<a href="http://blog.openhelix.eu/?p=82" target="_blank"> being able to view SNPedia SNPS HapMap </a>last year in a post) of the top 10 SNPs of the year.</p>
<p>Of course, as they mention, it&#8217;s very subjective.</p>
<p>Because they have chosen SNPs with serious health interest, I&#8217;ll semi-frivolously (because hey, no knowledge is necessarily &#8220;frivolous&#8221; <img src='http://blog.openhelix.eu/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  nominate either:</p>
<p>The <a href="http://www.snpedia.com/index.php/Rs17822931 " target="_blank">&#8220;ear wax&#8221; SNP</a> which determines whether you have &#8216;wet or dry&#8217; earwax, only because (yes, TMI) I have both, one in each ear so now I&#8217;m curious as to why.</p>
<p>and</p>
<p>The <a href="http://www.snpedia.com/index.php/Rs3057" target="_blank">&#8220;Perfect Musical Pitch&#8221; SNP</a>, only because my daughter and I seem to have that particular variation, and we know a few people who don&#8217;t <img src='http://blog.openhelix.eu/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> .</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=3325</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Important announcement from HapMap about data archiving</title>
		<link>http://blog.openhelix.eu/?p=1910</link>
		<comments>http://blog.openhelix.eu/?p=1910#comments</comments>
		<pubDate>Thu, 18 Jun 2009 16:11:19 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[biomart]]></category>
		<category><![CDATA[hapmap]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=1910</guid>
		<description><![CDATA[This is from the HapMap team for the human data (not the green HapMap I referenced recently). Older data is going to be removed from the HapMap.org browser and from BioMart. It will still be available in the ftp archive&#8211;but I just thought a heads-up was in order for folks who might not be on [...]]]></description>
			<content:encoded><![CDATA[<p>This is from the <a href="http://hapmap.org/" target="_blank">HapMap </a>team for the human data (not the <a href="http://www.openhelix.com/blog/?p=1843" target="_blank">green HapMap</a> I referenced recently). <strong> Older data is going to be removed from the HapMap.org browser and from <a href="http://www.biomart.org/" target="_blank">BioMart</a>. </strong>It will still be available in the ftp archive&#8211;but I just thought a heads-up was in order for folks who might not be on the mailing list:</p>
<blockquote><p>Beginning July 1st, HapMap data releases prior to October 2008 (or release #24) will no longer be available via the HapMap Genome Browser and HapMart utility.</p>
<p>The following three data releases will continue to be served via the HapMap Genome Browser:</p>
<p>* HapMap Release #24 &#8212; Latest HapMap (phase I+II) data release</p>
<p>http://www.hapmap.org/cgi-perl/gbrowse/hapmap24_B36/</p>
<p>* HapMap3 Draft #2 (hapmap3_r2) &#8212; Latest HapMap3 (phase III) data release</p>
<p>http://www.hapmap.org/cgi-perl/gbrowse/hapmap3r2_B36/</p>
<p>* HapMap Release #27 &#8212; Latest merged HapMap (phase I+II+III) data release</p>
<p>http://www.hapmap.org/cgi-perl/gbrowse/hapmap27_B36/</p>
<p>The following data release will continue to be served via HapMart:<br />
* HapMap Release #27 &#8212; Latest merged HapMap (phase I+II+III) data release</p>
<p>http://hapmart.hapmap.org/BioMart/martview</p>
<p>All other data releases will still be available for FTP download:</p>
<p>http://www.hapmap.org/downloads/index.html.en#bulk</p></blockquote>
<p>If you aren&#8217;t on the mailing list you can check it out and sign up here: <a href="http://osceola.cshl.edu/mailman/listinfo/announcements">http://osceola.cshl.edu/mailman/listinfo/announcements</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=1910</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tip of the Week: F-SNP</title>
		<link>http://blog.openhelix.eu/?p=1894</link>
		<comments>http://blog.openhelix.eu/?p=1894#comments</comments>
		<pubDate>Wed, 17 Jun 2009 05:01:30 +0000</pubDate>
		<dc:creator>Trey</dc:creator>
				<category><![CDATA[Tip of the Week]]></category>
		<category><![CDATA[dbSNP]]></category>
		<category><![CDATA[ensembl]]></category>
		<category><![CDATA[GeneSNPs]]></category>
		<category><![CDATA[gvs]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[polymorphisms]]></category>
		<category><![CDATA[SeattleSNPs]]></category>
		<category><![CDATA[snps]]></category>
		<category><![CDATA[UCSC Genome Browser]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=1894</guid>
		<description><![CDATA[There are a lot of databases to search for to find SNP data, HapMap, dbSNP, SeattleSNPs, Genome Variation Server and many more. I&#8217;m going to add one more to your data mining arsenal, F-SNP. F-SNP (described more fully here in the 2008 NAR Database issue), provides integrated information about the functional effects of SNPs obtained from [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.openhelix.com/downloads/jing/fsnp.swf" target="_blank"><img class="alignleft size-medium wp-image-1898" title="fsnp_thumb" src="http://www.openhelix.com/blog/wp-content/uploads/2009/06/fsnp_thumb-300x193.png" alt="fsnp_thumb" width="239" height="153" align="left" /></a>There are a lot of databases to search for to find SNP data, <a href="http://www.hapmap.org" target="_blank">HapMap</a>, <a href="http://ncbi.nlm.nih.gov/SNP" target="_blank">dbSNP</a>, SeattleSNPs, Genome Variation Server and many more. I&#8217;m going to add one more to your data mining arsenal, <a href="http://compbio.cs.queensu.ca/F-SNP/" target="_blank">F-SNP</a>. F-SNP (described more fully <a href="http://nar.oxfordjournals.org/cgi/content/abstract/gkm904v1" target="_blank">here in the 2008 NAR Database issue</a>),</p>
<blockquote><p>provides integrated information about the functional effects of SNPs obtained from <a href="http://compbio.cs.queensu.ca/F-SNP/main_files/resources.html#tools">16 bioinformatics tools and databases</a>. The functional effects are predicted and indicated at the splicing, transcriptional, translational, and post-translational level. As such, the F-SNP database helps identify and focus on SNPs with potential pathological effect to human health.</p></blockquote>
<p>&#8230;as they say in the introduction. It looks to be a good first stop to find SNPs of functional relevance. The databases they pull from to get their information include several I&#8217;ve mentioned above and also the <a href="http://genome.ucsc.edu" target="_blank">UCSC Genome database</a>, <a href="http://www.ensembl.org" target="_blank">Ensembl</a>, <a href="http://sift.jcvi.org/" target="_blank">SIFT</a> and <a href="http://genetics.bwh.harvard.edu/pph/data/index.html" target="_blank">PolyPhen</a> predictions and more. I&#8217;ve given a quick intro in the tip this week on how to get functional SNP information from F-SNP.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=1894</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>1001 Genomes: plant researchers raise by 1</title>
		<link>http://blog.openhelix.eu/?p=1843</link>
		<comments>http://blog.openhelix.eu/?p=1843#comments</comments>
		<pubDate>Tue, 09 Jun 2009 14:20:42 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[Genomics News]]></category>
		<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[New Resource]]></category>
		<category><![CDATA[Arabidopsis]]></category>
		<category><![CDATA[GBrowse]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[TAIR]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=1843</guid>
		<description><![CDATA[There is plenty of buzz out there for the big data biology projects&#8211;but usually the focus is the human data (with a few token model organisms thrown in).  But this week plant researchers renewed the call for big plant data.  I&#8217;m totally on board with that. The 1000 Genomes project to obtain more human variation [...]]]></description>
			<content:encoded><![CDATA[<p><img class="wp-image-1501" title="arabidopsis" src="http://www.openhelix.com/blog/wp-content/uploads/2009/04/arabidopsis-300x197.jpg" alt="arabidopsis" width="150" align="right" />There is plenty of <a href="http://scienceblogs.com/geneticfuture/2008/11/limitations_of_1000_genomes_pr.php" target="_blank">buzz </a>out there for the <a href="http://www.nature.com/news/specials/bigdata/index.html" target="_blank">big data biology</a> projects&#8211;but usually the focus is the human data (with a few token model organisms thrown in).  But this week plant researchers renewed the call for big plant data.  I&#8217;m totally on board with that.</p>
<p>The<a href="http://www.1000genomes.org/" target="_blank"> 1000 Genomes project</a> to obtain more human variation information is well underway, <a href="http://www.genome.gov/26524516" target="_blank">funded</a>, and has companies supporting it.  And that&#8217;s great&#8211;I&#8217;m all for this too!  But as someone who survives largely on the kindness of plants I want more plant research going on.  I want to see this funded and supported.  And as we face increasing stresses on resources from limitations like oil and water supplies to wacky climate conditions and environmental consequences I think we could well afford to spend less time gazing at our human genomic navels and devote more attention to the plants.</p>
<p>There is already some work on this Arabidopsis project.  The first paper with data on this effort <a href="http://genome.cshlp.org/content/18/12/2024" target="_blank">came out last fall</a>.  But the researchers are still having to go out and lobby for this project.  A new <a href="http://genomebiology.com/2009/10/5/107/abstract/" target="_blank">opinion piece</a> in Genome Biology calls out for awareness and support for this effort.</p>
<p>They have already done a <a href="http://www.sciencemag.org/cgi/content/abstract/317/5836/338" target="_blank">first generation</a> <span style="color: #008000;"><strong>green HapMap</strong></span>.  The <a href="http://genome.cshlp.org/content/18/12/2024.full" target="_blank">paper last fall </a>illustrated the feasibility of the project by looking at the reference Col-O (Columbia) and Bur-O and Tsu-1 strains.  The paper presents the process, compares their pipeline software with another package (SHORE that they developed and MAQ), They have a GBrowse <a href="http://gbrowse.weigelworld.org/cgi-bin/gbrowse/ath_reseq_1001/" target="_blank">installation </a>that presents the data  (and you can get <a href="http://openhelix.com/gbrowse" target="_blank">free training on GBrowse</a> here to effectively use the site).  They also provide data to <a href="http://arabidopsis.org/" target="_blank">TAIR</a>.</p>
<p>I think this is important and I hope it gets the same level of support and respect that 1000 humans will get.</p>
<p>1001 Genomes main site: <a href="http://1001genomes.org/" target="_blank">http://1001genomes.org/</a></p>
<p>1001 Genomes GBrowse: <a href="http://gbrowse.weigelworld.org/cgi-bin/gbrowse/ath_reseq_1001/" target="_blank">http://gbrowse.weigelworld.org/cgi-bin/gbrowse/ath_reseq_1001/</a><br />
<span style="float: left; padding: 5px;"><a href="http://www.researchblogging.org"><img style="border:0;" src="http://www.researchblogging.org/public/citation_icons/rb2_large_gray.png" alt="ResearchBlogging.org" /></a></span><br />
<span style="color: #008000;"><strong>References:</strong></span><br />
<span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Science&amp;rft_id=info%3Adoi%2F10.1126%2Fscience.1138632&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Common+Sequence+Polymorphisms+Shaping+Genetic+Diversity+in+Arabidopsis+thaliana&amp;rft.issn=0036-8075&amp;rft.date=2007&amp;rft.volume=317&amp;rft.issue=5836&amp;rft.spage=338&amp;rft.epage=342&amp;rft.artnum=http%3A%2F%2Fwww.sciencemag.org%2Fcgi%2Fdoi%2F10.1126%2Fscience.1138632&amp;rft.au=Clark%2C+R.&amp;rft.au=Schweikert%2C+G.&amp;rft.au=Toomajian%2C+C.&amp;rft.au=Ossowski%2C+S.&amp;rft.au=Zeller%2C+G.&amp;rft.au=Shinn%2C+P.&amp;rft.au=Warthmann%2C+N.&amp;rft.au=Hu%2C+T.&amp;rft.au=Fu%2C+G.&amp;rft.au=Hinds%2C+D.&amp;rft.au=Chen%2C+H.&amp;rft.au=Frazer%2C+K.&amp;rft.au=Huson%2C+D.&amp;rft.au=Scholkopf%2C+B.&amp;rft.au=Nordborg%2C+M.&amp;rft.au=Ratsch%2C+G.&amp;rft.au=Ecker%2C+J.&amp;rft.au=Weigel%2C+D.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology">Clark, R., Schweikert, G., Toomajian, C., Ossowski, S., Zeller, G., Shinn, P., Warthmann, N., Hu, T., Fu, G., Hinds, D., Chen, H., Frazer, K., Huson, D., Scholkopf, B., Nordborg, M., Ratsch, G., Ecker, J., &amp; Weigel, D. (2007). Common Sequence Polymorphisms Shaping Genetic Diversity in Arabidopsis thaliana <span style="font-style: italic;">Science, 317</span> (5836), 338-342 DOI: <a rev="review" href="http://dx.doi.org/10.1126/science.1138632">10.1126/science.1138632</a></span></p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Genome+Research&amp;rft_id=info%3Adoi%2F10.1101%2Fgr.080200.108&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=Sequencing+of+natural+strains+of+Arabidopsis+thaliana+with+short+reads&amp;rft.issn=1088-9051&amp;rft.date=2008&amp;rft.volume=18&amp;rft.issue=12&amp;rft.spage=2024&amp;rft.epage=2033&amp;rft.artnum=http%3A%2F%2Fwww.genome.org%2Fcgi%2Fdoi%2F10.1101%2Fgr.080200.108&amp;rft.au=Ossowski%2C+S.&amp;rft.au=Schneeberger%2C+K.&amp;rft.au=Clark%2C+R.&amp;rft.au=Lanz%2C+C.&amp;rft.au=Warthmann%2C+N.&amp;rft.au=Weigel%2C+D.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology">Ossowski, S., Schneeberger, K., Clark, R., Lanz, C., Warthmann, N., &amp; Weigel, D. (2008). Sequencing of natural strains of Arabidopsis thaliana with short reads <span style="font-style: italic;">Genome Research, 18</span> (12), 2024-2033 DOI: <a rev="review" href="http://dx.doi.org/10.1101/gr.080200.108">10.1101/gr.080200.108</a></span></p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.jtitle=Genome+Biology&amp;rft_id=info%3Adoi%2F10.1186%2Fgb-2009-10-5-107&amp;rfr_id=info%3Asid%2Fresearchblogging.org&amp;rft.atitle=The+1001+Genomes+Project+for+Arabidopsis+thaliana&amp;rft.issn=1465-6906&amp;rft.date=2009&amp;rft.volume=10&amp;rft.issue=5&amp;rft.spage=107&amp;rft.epage=0&amp;rft.artnum=http%3A%2F%2Fgenomebiology.com%2F2009%2F10%2F5%2F107&amp;rft.au=Weigel%2C+D.&amp;rft.au=Mott%2C+R.&amp;rfe_dat=bpr3.included=1;bpr3.tags=Biology">Weigel, D., &amp; Mott, R. (2009). The 1001 Genomes Project for Arabidopsis thaliana <span style="font-style: italic;">Genome Biology, 10</span> (5) DOI: <a rev="review" href="http://dx.doi.org/10.1186/gb-2009-10-5-107">10.1186/gb-2009-10-5-107</a></span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=1843</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Tip of the Week: Visualizing GWAS with HapMap tools</title>
		<link>http://blog.openhelix.eu/?p=731</link>
		<comments>http://blog.openhelix.eu/?p=731#comments</comments>
		<pubDate>Wed, 21 Jan 2009 11:21:45 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[General Science]]></category>
		<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[New Resource]]></category>
		<category><![CDATA[Tip of the Week]]></category>
		<category><![CDATA[GBrowse]]></category>
		<category><![CDATA[GWAS]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[UCSC Genome Browser]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=731</guid>
		<description><![CDATA[We are seeing a lot of interest in visualizing GWAS data lately.  We cover this a bit in our UCSC Genome Browser tutorial.  And we recently did a pretty popular post on a quick look at the NHGRI GWAS catalog data using the UCSC Genome Graphs tool. But as I was looking at the HapMap [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://openhelix.com/downloads/jing/hapmap_gwas1.swf" target="_blank"><img src="http://blog.openhelix.eu/wp-content/uploads/2009/01/hapmap_gwas_movie.jpg" alt="hapmap_gwas_movie.jpg" width="300" align="left" /></a>We are seeing a lot of interest in visualizing GWAS data lately.  We cover this a bit in our UCSC Genome Browser <a href="http://www.openhelix.com/ucsc" target="_blank">tutorial</a>.  And we recently did a pretty popular post on a quick look at the <a href="http://www.openhelix.com/blog/?p=670" target="_blank">NHGRI GWAS catalog data </a>using the <a href="http://genome.ucsc.edu/cgi-bin/hgGenome" target="_blank">UCSC Genome Graphs</a> tool.</p>
<p>But as I was looking at the <a href="http://www.hapmap.org/" target="_blank">HapMap</a> tools again recently, I noticed that they have a tool for this as well.  So today&#8217;s tip examines that tool for visualizing the NHGRI GWAS catalog data, and having a look at the <a href="http://gmod.org/wiki/Gbrowse" target="_blank">GBrowse </a>view of this data in genomic regions with the HapMap context.  In this movie I load up one of the sample data sets and move from that GWAS karyogram visualization to the HapMap GBrowse view.  Click the image to view the movie.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=731</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>How to pick a genome database platform</title>
		<link>http://blog.openhelix.eu/?p=619</link>
		<comments>http://blog.openhelix.eu/?p=619#comments</comments>
		<pubDate>Tue, 21 Oct 2008 15:09:15 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[New Resource]]></category>
		<category><![CDATA[flybase]]></category>
		<category><![CDATA[GBrowse]]></category>
		<category><![CDATA[gramene]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[James Watson]]></category>
		<category><![CDATA[MaizeGDB]]></category>
		<category><![CDATA[MGI]]></category>
		<category><![CDATA[wormbase]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=619</guid>
		<description><![CDATA[I was reading a newsletter I get from Biotechniques, and their WebWatch often has some fun items. (You may need to get a free login to see the WebWatch.) This week they referred to the MaizeGDB database in the post Amaizing Base. Although I had been aware of MaizeGDB before, it was a nice reminder [...]]]></description>
			<content:encoded><![CDATA[<p>I was reading a newsletter I get from <a href="http://www.biotechniques.com/" target="_blank">Biotechniques</a>, and their <a href="http://www.biotechniques.com/default.asp?page=current&amp;subsection=article_display&amp;display=full&amp;id=112966" target="_blank">WebWatch </a>often has some fun items.  (<em>You may need to get a free login to see the WebWatch.</em>) This week they referred to the <a href="http://www.maizegdb.org/" target="_blank">MaizeGDB </a>database in the post <strong>Amaizing Base</strong>.  Although I had been aware of MaizeGDB before, it was a nice reminder to go over and have a look to see what&#8217;s new.</p>
<p>When I went over there I was intrigued by the new browser they are about to launch (in mid-October).   The link says &#8220;<a href="http://www.maizegdb.org/genome_browser_survey.php" target="_blank">coming soon</a>&#8221; and I went to check out the information there.</p>
<p>Currently that link goes to a page that describes their move to a more sequence-centric representation of their data.  It was a fascinating look at their decision process to move to a new browser platform and what they decided to do.  For database geeks like me, seeing their ranking of the importance of various features was very compelling.</p>
<p>And what they decided?  <strong>GBrowse</strong>!</p>
<p>We have a <a href="http://www.openhelix.com/blog/?p=572" target="_blank">tutorial available</a> on GBrowse.  Usually we do tutorials on specific sites, but as we kept seeing GBrowse over and over at different sites we created a tutorial for that.  It helps me to understand the underlying basic browser when I visit any site that employs it.  Even though the wrappings and the data types will vary at different sites, understanding how it works makes it much easier to use at any new site that uses it.   <a href="http://www.hapmap.org/" target="_blank">HapMap</a>, <a href="http://www.informatics.jax.org/" target="_blank">MGI</a>, <a href="http://www.wormbase.org" target="_blank">WormBase</a>, <a href="http://www.flybase.org" target="_blank">FlyBase</a>, <a href="http://www.arabidopsis.org/" target="_blank">TAIR</a>, <a href="http://jimwatsonsequence.cshl.edu/cgi-perl/gbrowse/jwsequence/" target="_blank">Watson&#8217;s personal genome</a>, and a <a href="http://gmod.org/wiki/GMOD_Users" target="_blank">whole bunch of other sites</a> use the GBrowse software.</p>
<p>Looking forward to checking out the MaizeGDB GBrowse version when it launches!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=619</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Tip of the Week: Human Genome Structural Variation Viewer</title>
		<link>http://blog.openhelix.eu/?p=356</link>
		<comments>http://blog.openhelix.eu/?p=356#comments</comments>
		<pubDate>Wed, 21 May 2008 05:01:56 +0000</pubDate>
		<dc:creator>Mary</dc:creator>
				<category><![CDATA[General Science]]></category>
		<category><![CDATA[Genomics News]]></category>
		<category><![CDATA[Genomics Research]]></category>
		<category><![CDATA[Genomics Resource News]]></category>
		<category><![CDATA[New Resource]]></category>
		<category><![CDATA[Tip of the Week]]></category>
		<category><![CDATA[CNV]]></category>
		<category><![CDATA[deletion]]></category>
		<category><![CDATA[duplication]]></category>
		<category><![CDATA[ENCODE]]></category>
		<category><![CDATA[hapmap]]></category>
		<category><![CDATA[insertion]]></category>
		<category><![CDATA[nhgri]]></category>
		<category><![CDATA[snp]]></category>
		<category><![CDATA[UCSC Genome Browser]]></category>

		<guid isPermaLink="false">http://www.openhelix.com/blog/?p=356</guid>
		<description><![CDATA[Looking at the NHGRI News feed recently, I noticed this story (below) about a new genomic data collection that intrigued me. I found out about a new resource that I wanted to share as this week&#8217;s Tip of the Week. So this ~4 minute movie discusses my path to the Human Genome Structural Variation resource [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://openhelix.com/downloads/jing/structure_var_site.swf" target="_blank"><img src="http://www.openhelix.com/blog/wp-content/uploads/2008/05/nature_cnv_paper.thumbnail.jpg" alt="nature_cnv_paper.jpg" align="left" /></a>Looking at the <a href="http://www.genome.gov/10000009" target="_blank">NHGRI News feed</a> recently, I noticed this story (below) about a new genomic data collection that intrigued me.  I found out about a new resource that I wanted to share as this week&#8217;s Tip of the Week.  So this ~4 minute movie discusses my path to the Human Genome Structural Variation resource and a quick look at some of the data.  But the paper was so influential on my thinking about the genome that I wanted to cover that in more detail in text form as well.  So for a quick hit, watch the movie.  For more detail, check out the text and links below.  Quick trip to the database: <a href="http://hgsv.washington.edu/" target="_blank">http://hgsv.washington.edu</a></p>
<h3><a href="http://www.genome.gov/27026113" target="_blank">Researchers Produce First Sequence Map  of Large-Scale Structural Variation in Human Genome</a></h3>
<p><img src="http://www.openhelix.com/blog/wp-content/uploads/2008/05/chromosomes.jpg" alt="chromosomes.jpg" align="right" width="250" /></p>
<blockquote><p>&#8230;.Other recently created maps, such as the HapMap, have catalogued the patterns of small-scale variations in the genome that involve single DNA letters, or bases. However, the scientific community has been eagerly awaiting the creation of additional types of maps in light of findings that larger scale differences account for a great deal of the common genetic variation among individuals and between populations, and may account for a significant fraction of disease. While previous work has identified structural variation in the human genome, a sequence-based map provides much finer resolution and location information&#8230;.</p></blockquote>
<p>I spend a lot of time thinking about the official or &#8220;reference&#8221; human genome sequence.  This sequence&#8211;the one that was released to all that fanfare a few years back&#8211;is a <a href="http://www.genome.gov/11006943" target="_blank">composite of several people</a>. Rather like a &#8220;generic&#8221; genome.</p>
<p><span id="more-356"></span></p>
<blockquote>
<h4><em>Whose DNA was sequenced for the Human Genome Project?</em></h4>
<p>This is intentionally not known to protect the volunteers who provided DNA samples for sequencing. The sequence is derived from the DNA of several volunteers&#8230;.</p></blockquote>
<p>Of course, projects to investigate small variations (simple or single nucleotide polymorphisms, or SNPs) such as the large HapMap effort and many other projects have revealed a lot of important single A, T, C, and Gs that vary among us.   If you think of those as typos, you&#8217;ll have a sense of the scale of that variation.  However, it is clear that there are other genomic variations that may be important for our understanding of any one individual&#8217;s genome. Whole pages or even whole chapters missing or added, not just typos, that might be  an issue. Recently I got intrigued by that discovery of a <a href="http://www.openhelix.com/blog/?p=49" target="_blank">possible deletion or duplication that might be associated with autism</a>.  So this wider look of these types of structural changes&#8211;deletions, insertions, inversions&#8211;got me really thinking about how to interpret data differently than I usually do when I consider the reference genome.  It isn&#8217;t just the SNPs I need to keep in mind.</p>
<p><strong><em>On to the paper itself: </em></strong></p>
<p><strong>Mapping and sequencing of structural variation from eight human genomes, by Kidd et al. </strong></p>
<p><span style="padding: 5px; float: left"><a href="http://www.researchblogging.org"><img src="http://www.researchblogging.org/images/rbicons/ResearchBlogging-Medium-Trans.png" alt="ResearchBlogging.org" height="50" width="80" /></a></span>The paper in Nature is remarkable for many reasons, including the <em>sheer volume of data</em> it provides.   This team examined the individual genomes of 8 different people. These same 8 people were examined in the <a href="http://hapmap.org/" target="_blank">HapMap</a> project&#8211;so we can consider their genomes in the context of that variation project as well.  Further, 5 of them are also part of the <a href="http://www.openhelix.com/blog/?p=326" target="_blank">ENCODE</a> project&#8211;so we will also be able to look at those genomes with that set of flashlights as well.  I swear, those 5 people are going to be the most closely examined humans ever.</p>
<p>The researchers used several techniques to examine and confirm these structural variations in the genomes.  They created whole genomic libraries of clones for each person&#8211;with about a million clones each.  They sequenced each end to be able to map them to the reference genome.  Think about how much that means in both physical and electronic reagents&#8230;.And how much you can put in a figure on a regular page in a journal&#8230;.</p>
<p>They mapped these clones and found over 75,000+ that differed in length or orientation.  Some larger, some smaller, some flipped, some missing ends, etc. They whittled these down to around 3000 copy number variant candidates to pursue in more detail, and ended up with over 1,000 &#8220;non-redundant sites of copy-number variation&#8221; identified by restriction enzyme digestions.  They validated these with microarrays that they had designed, and rescued another ~200 sites of variation with that strategy.  Many of these could be additionally confirmed using SNP analyses as well.    Inversions are more challenging to confirm, but they were able to find some with sequencing strategies and in situ hybridizations.  End result: 1,695 sites of structural variation unearthed: 747 deletions, 724 insertions, 224 inversions.   They discuss the occurrence of these in the context of the reference genome sequence, with some intriguing conclusions.  You should go read the whole thing.</p>
<p>Of course, there is much more in the paper.  Novel human sequences inserted in the genomes.  Specific genes often affected by these changes.  Examinations around human diversity.  Very compelling stuff.</p>
<p>But there is just so much information in this paper&#8211;all those clones, all those individual implications&#8230;.it simply can&#8217;t be contained in just a few pages.  There is an extensive section of <a href="http://www.nature.com/nature/journal/v453/n7191/suppinfo/nature06862.html" target="_blank">supplements </a>for this paper. <em>But even that isn&#8217;t enough.  </em>You have to check out the web site where you can see and interact with all of this clone data: <a href="http://hgsv.washington.edu/" target="_blank">http://hgsv.washington.edu/</a></p>
<p>The <strong>Human Genome Structural Variation Project</strong> site is a mirror of the <a href="http://genome.ucsc.edu/" target="_blank">UCSC Genome Browser</a> that we know and love (<a href="http://www.openhelix.com/downloads/ucsc/ucsc_home.shtml" target="_blank">and provide training for</a>).  But they overlay all of that clone data on top of the reference genome sequence information, and present it <em>with </em>the other genomic context that you can find the UCSC Genome Browser. So you can have it all in your view&#8211;and in your head&#8211;at once.  It is a great way to go deeper with this data, and explore any genomic regions you might be interested in.</p>
<p>A traditional paper publication simply cannot contain all the data in a situation like this.   This is an excellent example of one way to use the UCSC Genome Browser framework to present custom data for a project.</p>
<p>But it is such a huge advance in our understanding of genome-wide variations in humans, beyond the individual SNPs that have been such a major focus lately.  We need to be thinking about the possible differences in each one of the pair of chromosomes that we carry.  We need to be thinking about entire chunks of DNA that can be there, not there, twice there, or completely turned around.  And if you think a personal genome scan for SNPs is going to provide all the information you may need to understand your biology, you may be missing some really crucial bits.  Just keep that in mind as we move forward in the personal genome era.  SNPs ain&#8217;t all, folks.</p>
<p><a href="http://www.genome.gov/pressDisplay.cfm?photoID=98" target="_blank"><em>Image from: Jane Ades, NHGRI</em></a></p>
<p><span class="Z3988" title="ctx_ver=Z39.88-2004&amp;rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&amp;rft.aulast=Kidd&amp;rft.aufirst=Jeffrey&amp;rft.aumiddle=M&amp;rft.au=Jeffrey+ Kidd&amp;rft.au=Gregory+M+Cooper&amp;rft.au=William+F+Donahue&amp;rft.au=Hillary+S+Hayden&amp;rft.au=Nick++Sampas&amp;rft.au=Tina++Graves&amp;rft.au=Nancy++Hansen&amp;rft.au=Brian++Teague&amp;rft.au=Can++Alkan&amp;rft.au=Francesca++Antonacci&amp;rft.au=Eric++Haugen&amp;rft.au=Troy++Zerr&amp;rft.au=N+Alice+Yamada&amp;rft.au=Peter++Tsang&amp;rft.au=Tera+L+Newman&amp;rft.au=Eray++T%C3%BCz%C3%BCn&amp;rft.au=Ze++Cheng&amp;rft.au=Heather+M+Ebling&amp;rft.au=Nadeem++Tusneem&amp;rft.au=Robert++David&amp;rft.au=Will++Gillett&amp;rft.au=Karen+A+Phelps&amp;rft.au=Molly++Weaver&amp;rft.au=David++Saranga&amp;rft.au=Adrianne++Brand&amp;rft.au=Wei++Tao&amp;rft.au=Erik++Gustafson&amp;rft.au=Kevin++McKernan&amp;rft.au=Lin++Chen&amp;rft.au=Maika++Malig&amp;rft.au=Joshua+D+Smith&amp;rft.au=Joshua+M+Korn&amp;rft.au=Steven+A+McCarroll&amp;rft.au=David+A+Altshuler&amp;rft.au=Daniel+A+Peiffer&amp;rft.au=Michael++Dorschner&amp;rft.au=John++Stamatoyannopoulos&amp;rft.au=David++Schwartz&amp;rft.au=Deborah+A+Nickerson&amp;rft.au=James+C+Mullikin&amp;rft.au=Richard+K+Wilson&amp;rft.au=Laurakay++Bruhn&amp;rft.au=Maynard+V+Olson&amp;rft.au=Rajinder++Kaul&amp;rft.au=Douglas+R+Smith&amp;rft.au=Evan+E+Eichler&amp;rft.title=Nature&amp;rft.atitle=Mapping+and+sequencing+of+structural+variation+from+eight+human+genomes&amp;rft.date=2008&amp;rft.volume=453&amp;rft.issue=7191&amp;rft.spage=56&amp;rft.epage=64&amp;rft.genre=article&amp;rft.id=info:DOI/10.1038%2Fnature06862"></span>Kidd, J.M., Cooper, G.M., Donahue, W.F., Hayden, H.S., Sampas, N., Graves, T., Hansen, N., Teague, B., Alkan, C., Antonacci, F., Haugen, E., Zerr, T., Yamada, N.A., Tsang, P., Newman, T.L., Tüzün, E., Cheng, Z., Ebling, H.M., Tusneem, N., David, R., Gillett, W., Phelps, K.A., Weaver, M., Saranga, D., Brand, A., Tao, W., Gustafson, E., McKernan, K., Chen, L., Malig, M., Smith, J.D., Korn, J.M., McCarroll, S.A., Altshuler, D.A., Peiffer, D.A., Dorschner, M., Stamatoyannopoulos, J., Schwartz, D., Nickerson, D.A., Mullikin, J.C., Wilson, R.K., Bruhn, L., Olson, M.V., Kaul, R., Smith, D.R., Eichler, E.E. (2008). Mapping and sequencing of structural variation from eight human genomes. <span style="font-style: italic">Nature, 453</span>(7191), 56-64. DOI: <a href="http://dx.doi.org/10.1038/nature06862" rev="review">10.1038/nature06862</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.openhelix.eu/?feed=rss2&amp;p=356</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
