<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><atom:link rel="hub" href="http://tumblr.superfeedr.com/" xmlns:atom="http://www.w3.org/2005/Atom"/><description>An offshoot of the Genome 10K project, and primarily organized by the UC Davis Genome Center, Assemblathons are  contests to assess state-of-the-art methods in the field of genome assembly. 

Assemblathon 1 occurred at the end of 2010 and the results were published in late 2011. A second Assemblathon, using real data from three vertebrate species, started in June 2011 and a manuscript has recently been submitted.</description><title>The Assemblathon</title><generator>Tumblr (3.0; @assemblathon)</generator><link>http://assemblathon.org/</link><item><title>Slides from an Assemblathon 2 talk</title><description>&lt;p&gt;Here are some slides that I made for a talk that I gave to a general audience at UC Davis. These slides were also used by Prof. Ian Korf for his presentation at the Genome 10K workshop (5/25/2013). Notes are included for each slide.&lt;/p&gt;
&lt;p&gt;The first part of this talk presents a simple overview of genome assembly and introduces many of the terms that are used. The next part of the talk discusses results from Assemblathon 2.&lt;/p&gt;
&lt;p&gt;&lt;iframe frameborder="0" height="356" marginheight="0" marginwidth="0" scrolling="no" src="http://www.slideshare.net/slideshow/embed_code/19974246" width="427"&gt; &lt;/iframe&gt;&lt;/p&gt;
&lt;div&gt;&lt;strong&gt; &lt;a href="http://www.slideshare.net/kbradnam/assemblathon-2-talk" title="Assemblathon 2 talk" target="_blank"&gt;Assemblathon 2 talk&lt;/a&gt; &lt;/strong&gt; from &lt;strong&gt;&lt;a href="http://www.slideshare.net/kbradnam" target="_blank"&gt;Keith Bradnam&lt;/a&gt;&lt;/strong&gt;&lt;/div&gt;</description><link>http://assemblathon.org/post/48865310097</link><guid>http://assemblathon.org/post/48865310097</guid><pubDate>Thu, 25 Apr 2013 14:14:59 -0400</pubDate><category>assemblathon 2</category></item><item><title>Assemblathon has moved to a new home</title><description>&lt;p&gt;Goodbye Posterous.&lt;/p&gt;
&lt;p&gt;Hello tumblr.&lt;/p&gt;
&lt;p&gt;Please let me know if you spot any errors or broken links. If you had any bookmarks to pages on the Assemblathon site, these most likely will need to be updated.&lt;/p&gt;</description><link>http://assemblathon.org/post/44436140243</link><guid>http://assemblathon.org/post/44436140243</guid><pubDate>Sun, 03 Mar 2013 01:10:00 -0500</pubDate></item><item><title>Feedback and analysis of the Assemblathon 2 pre-print</title><description>&lt;p&gt;&lt;span&gt;There has already been some discussion of the &lt;a href="http://arxiv.org/abs/1301.5406"&gt;pre-print&lt;/a&gt; of the Assemblathon 2 manuscript. Although a pre-print is not the same thing as a peer-reviewed, accepted paper — I don’t want us to get too ahead of ourselves! — I thought it useful to start collecting together some of the online commentaries:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;span&gt;&lt;a href="http://www.homolog.us/blogs/2013/01/23/assemblathon-2-paper-is-available-at-arxiv/"&gt;Homologus blog post 1&lt;/a&gt;:  highlights a few conclusions from the paper&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://www.homolog.us/blogs/2013/01/25/notes-on-assemblathon-paper/"&gt;Homologus blog post 2&lt;/a&gt;: delves into the results, and attempts to estimate some of the costs of genome assembly. Assemblathon  co-author Sébastien Boisvert adds some useful comments. &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://haldanessieve.org/2013/01/28/our-paper-making-pizzas-and-genome-assemblies/"&gt;Haldane’s Sieve post&lt;/a&gt;: an invited blog post by lead author Keith Bradnam, that summarizes what the Assemblathons are all about by way of a pizza-themed analogy&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0052210"&gt;Reevaluating Assembly Evaluations with Feature Response Curves: GAGE and Assemblathons&lt;/a&gt;: this is not a blog post, but a recently published paper that evaluates some of the Assemblathon 2 data&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://ivory.idyll.org/blog/thoughts-on-assemblathon-2.html"&gt;Thoughts on the Assemblathon 2 paper&lt;/a&gt;: by C. Titus Brown (a reviewer of the manuscript)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://www.homolog.us/blogs/2013/02/23/titus-browns-thoughts-on-the-assemblathon-2-paper/"&gt;Homologus blog post 3&lt;/a&gt;: reactions to the previous post by C. Titus Brown&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://ivory.idyll.org/blog/2013-assemblathon-review-i.html"&gt;Assemblathon 2 review, round 1, parts thereof&lt;/a&gt;: a concise version of C. Titus Brown’s formal manuscript review (minus the specific suggestions)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span&gt;&lt;a href="http://flxlexblog.wordpress.com/2013/02/26/on-assembly-uncertainty-inspired-by-the-assemblathon2-debate/"&gt;On assembly uncertainty (inspired by the Assemblathon 2 debate)&lt;/a&gt;: blog post by Lex Nederbragt in response to post by C. Titus Brown&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&lt;span&gt;Remember, the &lt;a href="https://twitter.com/assemblathon"&gt;Assemblathon twitter account &lt;/a&gt;has also tweeted about these pieces, as well as many other articles relating to genome assembly.&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431915644</link><guid>http://assemblathon.org/post/44431915644</guid><pubDate>Mon, 28 Jan 2013 16:55:00 -0500</pubDate><category>assemblathon 2</category></item><item><title>The Assemblathon 2 paper has been submitted!</title><description>&lt;p&gt;&lt;span&gt;The manuscript has been submitted to the &lt;a href="http://www.gigasciencejournal.com"&gt;GigaScience journal&lt;/a&gt; and a pre-print of the paper is &lt;a href="http://arxiv.org/abs/1301.5406"&gt;now available&lt;/a&gt; on arXiv.org. Additional data files that support the paper &lt;/span&gt;&lt;span&gt;are currently &lt;/span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Additional_files/"&gt;available here&lt;/a&gt;&lt;span&gt; (these are also part of the manuscript submission).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;Please note that the pre-print has not undergone peer review and we do not assume that the manuscript will be automatically accepted by the journal.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;If all goes well, the submitted Assemblathon 2 assemblies, along with sets of &lt;a href="http://korflab.ucdavis.edu/Datasets/cegma/"&gt;CEGMA&lt;/a&gt; gene predictions for each assembly, will be available from GigaScience’s &lt;a href="http://gigadb.org"&gt;GigaDB&lt;/a&gt; database. These are also &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/"&gt;currently available&lt;/a&gt; from the Korflab website.&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431916477</link><guid>http://assemblathon.org/post/44431916477</guid><pubDate>Thu, 24 Jan 2013 12:58:00 -0500</pubDate><category>assemblathon 2</category></item><item><title>Image taken...</title><description>&lt;img src="http://24.media.tumblr.com/d179740ff28484609d3960a80f03cb2e/tumblr_mj2jj7CLHT1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;p&gt;Image taken from &lt;a href="http://www.flickr.com/photos/incrediblehow/5577200173/"&gt;&lt;a href="http://www.flickr.com/photos/incrediblehow/5577200173/"&gt;http://www.flickr.com/photos/incrediblehow/5577200173/&lt;/a&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;Why we need the Assemblathon&lt;/h2&gt;
&lt;p&gt;&lt;span&gt;&lt;em&gt;This short essay was originally written in Fall 2011 for a Genome 10K newsletter that never came to pass so I’m posting it here. The views in this essay are my own and should not necessarily be taken to represent official views of the Assemblathon.&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;&lt;em&gt;Keith Bradnam, Jan 20th 2012&lt;/em&gt;&lt;/span&gt;&lt;/p&gt;
&lt;hr&gt;&lt;p&gt;&lt;span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;strong&gt;Note:&lt;/strong&gt; article updated 2/21/12 to include mention of Meraculous among the best performing assemblers in Assemblathon 1&lt;/p&gt;
&lt;p&gt;&lt;span&gt;There was a time when scientists were grateful for any sort of genome sequence no matter its level of quality or completeness (let alone its N50 length). In the late 1990s there were still only a handful of completed eukaryotic genome sequences in existence and many of these had been painstakingly completed using the ‘clone-by-clone’ sequencing approach. If you had a detailed genetic map for your species of interest – which is what the clone-by-clone approach ideally required – then you also had a good chance of knowing how complete your genome sequence might be. End users of these genome sequences were mostly grateful for what they had been given. Even a ‘working draft’ version of a genome which might contain lots of errors was better than no sequence at all.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;This era of ‘genomic innocence’ did not last long. As sequencing projects moved away from the clone-by-clone approach and instead started to employ the whole genome shotgun (WGS) strategy, it became clear that there are many different ways to put a genome sequence together. Sometimes you could produce a drastically different assembly from the same sequence data just by using a different &lt;em&gt;version&lt;/em&gt; of the same assembly program (e.g. the N50 length of scaffolds from the dog genome &lt;a href="http://www.nature.com/nature/journal/v438/n7069/full/nature04338.html"&gt;&lt;span&gt;was seen to double&lt;/span&gt;&lt;/a&gt; just by using a later version of the ARACHNE assembler).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;It also became clear that just because you can assemble a genome in a way that increases its N50 size (or whatever &lt;em&gt;metric du jour&lt;/em&gt; you care to use), it may not necessarily be a better genome. For example, the v1.95 assembly of the sea squirt &lt;em&gt;Ciona intestinalis&lt;/em&gt; had a N50 length of 234 Kbp. A later v2.0 assembly produced over a tenfold increase in this metric, but in the process the newer assembly &lt;a href="http://nar.oxfordjournals.org/content/37/1/289.full"&gt;&lt;span&gt;removed several highly conserved genes&lt;/span&gt;&lt;/a&gt; that were present in the earlier assembly. One step forward, two steps back.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;&lt;span&gt;These challenges were happening in an era when a genome sequence was something that represented a &lt;em&gt;species&lt;/em&gt; rather than an individual (let alone a tissue type); an era when sequence reads were typically longer than your email signature; and when a genome sequencing project was expected to take years not days. In short, these challenges took place &lt;/span&gt;&lt;span&gt;before the dark times, before the Empire&lt;/span&gt;&lt;span&gt; before next generation sequencing (NGS) technologies took off.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;How times change. Whereas the human genome project took about ten years before it was published – longer before all chromosomes were actually finished – we can now generate enough raw sequence data to produce a similar genome sequence in just ten &lt;em&gt;days&lt;/em&gt;. And that’s only considering the output from a single NGS machine. Of course that’s assuming that you can put it all together in the correct order. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;The question of ‘how do I assemble a genome from NGS data?’ is not always an easy question to answer. It can be made easier if you have some expectation of what the final genome should look like. However, it is one thing to try assembling a genome sequence with NGS data when you already have a completed sequence from a closely-related species to compare it to. It’s another thing altogether when there is no such ‘reference’ genome. Imagine trying to put a jigsaw puzzle together when there is no picture on the box to guide you, no edge pieces, and only a ‘best guess’ as to how many pieces there are. Oh, and there may be up to 100 copies of every piece. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;The biggest headache about dealing with NGS data is that things are probably going to get worse before they get better. It is true that read lengths are increasing year-on-year, and newer sequencing chemistries continue to claim ever greater accuracy. These improvements should make genome assembly a much easier problem in the very near future. However, we still have to deal with the explosion of sequence data that is happening &lt;em&gt;now&lt;/em&gt;. Everybody has genomes that need to be sequenced, and without genome assembly the raw output from a genome project is pretty much a meaningless set of very large files taking up space on a disk somewhere. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;Projects such as &lt;a href="http://genome10k.org/"&gt;&lt;span&gt;Genome 10K&lt;/span&gt;&lt;/a&gt; are aiming to sequence the genomes from – wait for it – 10,000 vertebrate species (the clue was in the name) and all those genomes will need assembling. It was thanks to Genome 10K that the &lt;a href="http://assemblathon.org/"&gt;&lt;span&gt;Assemblathon project&lt;/span&gt;&lt;/a&gt; was born. The Assemblathon is more than just a made-up word…it is an ongoing project that is attempting to help drive improvements in the field of genome assembly. It will try to do this by bringing together all of the major players in the genome assembly field together for regular ‘contests’. To date there has been one such contest that &lt;a href="http://genome.cshlp.org/content/early/2011/11/02/gr.126599.111.long"&gt;&lt;span&gt;has been completed&lt;/span&gt;&lt;/a&gt; which we oh-so-cleverly named ‘Assemblathon 1’. We  are currently in the middle of the inventively named ‘Assemblathon 2’. We plan – subject to our Assemblathon grant application being successful – to organize more Assemblathons in future. So how exactly can the Assemblathon help to assemble genomes?&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;In many ways, a more pertinent question to ask about genome assembly is not ‘how do I assemble a genome?’ but rather ‘how will I know if my assembly is any good?’. This is a very easy question to ask, but not such an easy question to answer. Genome assembly programs – like most bioinformatics software – will always spit out some output…the challenge is to make sense of that output. There are a range of fairly simple statistical metrics that are commonly used to describe genome assemblies. In most cases we like to believe that higher values for those metrics indicates a better genome assembly, but as the earlier example with &lt;em&gt;C. intestinalis &lt;/em&gt;showed, this is not always the case. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;Hopefully though, there are always some intelligent questions that we can ask of any genome assembly, and many of these will involve leveraging any other sequence information that may exist for that species (finished BACs, optical maps, ESTs etc.). For example, we should expect that a good genome assembly will contain most of the genes that are present in the genome. We often do not know what those genes will look like, but we know that a large handful of important genes remain essentially unchanged across millions of years of evolution and so we should be able to find those at least. This approach has &lt;a href="http://nar.oxfordjournals.org/content/37/1/289.short"&gt;&lt;span&gt;previously been used by our group&lt;/span&gt;&lt;/a&gt; to assess the ‘gene space’ in a variety of new genome sequences, and we are currently employing this technique as just one of many strategies in the evaluation of Assemblathon 2 entries.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;In the world of genome assembly, beauty is very much in the eye of the beholder. Some people consider a genome assembly that is full of genes to be a very beautiful thing. However, a genome assembly can theoretically contain 100% of the genes, but still only represent ~5–10% of the genome. Some researchers want genome assemblies that capture the maximum amount of the underlying genome. Other researchers don’t care for getting all of the genome – or even all of the genes – just as long as what they do get is highly accurate. If there is a lot of heterozygosity in your genome of interest, then you may prefer an assembly that attempts to best resolve that heterozygosity and produce two haplotype sequences. Or maybe your research concerns repetitive elements and you just want to assemble a genome in a way that still captures a lot of the hard-to-assemble repetitive regions.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;If you want the best genome assembly possible, you may have to accept some trade-offs. The assemblers that may perform well in one area may not perform as well in other areas. Everybody wants to be told ‘genome assembler X will give you the best assembly’ but at the moment it doesn’t seem fair to make such bold assertions. What we did find out from Assemblathon 1 was that a number of genome assemblers performed admirably across many, but not all, of the different metrics. For example, the assembler that did the best job at increasing &lt;em&gt;coverage&lt;/em&gt; (the amount of the known genome present in the assembly), ranked 9th when considering the number of substitution errors present in the assembly. Conversely, the assembler that did the best job at minimizing substitution errors ranked 8th in terms of coverage. You pays your money and you takes your choice. We should mention, however, that these two assemblers (&lt;a href="http://soap.genomics.org.cn/soapdenovo.html"&gt;&lt;span&gt;SOAPdenovo&lt;/span&gt;&lt;/a&gt; and &lt;a href="https://github.com/jts/sga"&gt;&lt;span&gt;SGA&lt;/span&gt;&lt;/a&gt;), along with &lt;a href="https://www.broadinstitute.org/scientific-community/science/programs/genome-sequencing-and-analysis/computational-rd/computational-"&gt;&lt;span&gt;ALLPATHS&lt;/span&gt;&lt;/a&gt; and &lt;a href="http://genome.jgi.doe.gov/programs/plants/assembly.jsf"&gt;Meraculous&lt;/a&gt; were consistently ranked highly for many of the metrics that we used and were the four best overall assemblers in Assemblathon 1. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;One of the unusual features of Assemblathon 1 was that it involved synthetic genome data. We actually made a small artificial genome, let it evolve for millions of years, and then produced synthetic reads from the final genome. We did this so that we would know what the final answer should look like (a luxury in the world of genome assembly). However, many people who are developing the latest and greatest algorithms for putting genomes together do not necessarily want to devote too much time to such ‘toy’ problems. They would much rather get their hands dirty with real world data, so their efforts can be of actual use to many researchers around the world. Ask, and you shall receive. In &lt;a href="http://assemblathon.org/a-parrot-a-fish-and-a-snake-walk-into-a-bar"&gt;&lt;span&gt;Assemblathon 2&lt;/span&gt;&lt;/a&gt; we provided real data from three (count ‘em!) vertebrate genomes: a snake, a fish, and a bird. For the latter species (a parrot), we actually provided sequencing data from three different technologies (Illumina, 454, and PacBio) to see if groups would try a ‘mix &amp; match’ strategy.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt;&lt;span&gt;Assemblathon 2 is now in the ‘evaluation phase’ where the 43 genome assemblies that we received are being prodded, poked, and subjected to expert scrutiny. We hope that we can produce new metrics to complement those that are currently used to assess genome assemblies. We also hope that we can provide accurate assessments of how good these assemblies are &lt;em&gt;with respect to what different users might want from a genome sequence&lt;/em&gt;. But most of all, we hope that we can continue to be useful by drawing the genome assembly community together – both physically and virtually – in order to share ideas and and to &lt;/span&gt;&lt;span&gt;start occasional arguments&lt;/span&gt;&lt;span&gt; promote vigorous discussion. The Assemblathon will be maximally useful if we help improve the state of genome assembly to the point where we no longer need to organize Assemblathons.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span&gt; &lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431925986</link><guid>http://assemblathon.org/post/44431925986</guid><pubDate>Fri, 20 Jan 2012 16:32:00 -0500</pubDate></item><item><title>Announcing the arrival of a beautiful, baby…mailing...</title><description>&lt;img src="http://24.media.tumblr.com/8f9cdc34141d499ba705de25e12c8a1e/tumblr_mj2jjdvtR61s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Announcing the arrival of a beautiful, baby…mailing list&lt;/h2&gt;&lt;p&gt;&lt;span style="font-size: x-small;"&gt;Image copied from &lt;a href="http://www.flickr.com/photos/mknott/4652064320/sizes/m/in/photostream/"&gt;&lt;a href="http://www.flickr.com/photos/mknott/4652064320/sizes/m/in/photostream/"&gt;http://www.flickr.com/photos/mknott/4652064320/sizes/m/in/photostream/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;At the recent &lt;a href="http://assemblathon.org/agenda-for-cshl-breakout-meeting-on-genome-as"&gt;CSHL Genome Assembly Workshop&lt;/a&gt;, one of the issues that was raised was for the need to develop a new file format that can capture and accurately represent all of the information inherent in a genome assembly. &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;For example, you might know that within a contig in your genome assembly, there is a particular subsequence that contains a run of 5–10 adenosine nucleotides. If you generate a FASTA file to represent this, you cannot specify such variation and have to choose a fixed number of As in your sequence. More fundamentally, a genome assembly is – or should be – attempting to represent sequences from two different haplotypes. In an ideal world,  the file format should be able to capture this variation.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;There has already been a lot of discussion over what such a file format might look like, and the Assemblathon would like to help move this process forward. We don’t want to dictate any standards ourselves, but rather just provide a suitable forum for intelligent discussion that might help drive things along. If the genome assembly community can agree on a new file format, then we will of course be happy to recognize that format and support it in future Assemblathons.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;If you want to be part of the conversation, head on over to our &lt;a href="http://assemblathon.org/pages/mailing-list"&gt;mailing list page&lt;/a&gt; and sign up for our new mailing list: &lt;strong&gt;assemblathon-file-format&lt;/strong&gt;. This will hopefully co-exist peacefully alongside our main mailing list and become a focused forum for this important issue.&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431931136</link><guid>http://assemblathon.org/post/44431931136</guid><pubDate>Tue, 15 Nov 2011 19:12:00 -0500</pubDate><category>JustMigrate</category><category>mailing list</category></item><item><title>Agenda for CSHL breakout meeting on Genome Assembly </title><description>&lt;p&gt;&lt;span style="font-size: medium;"&gt;Saturday November 5th – 1:00–6:00&lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="font-family: Helvetica; font-size: medium;"&gt;&lt;strong&gt;Agenda&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: Helvetica;"&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;1:00 - 1:10 General introduction&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;1:10 - 1:30 Groups and individuals&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;1:40 - 1:50 Evaluation update&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;1:50 - 2:50 Soapbox session&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;2:50 - 3:00 Group formation&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;3:00 - 4:30 Group discussion breakout&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;4:30 - 6:00 Group presentations&lt;/span&gt;&lt;/div&gt;

&lt;div style="text-align: center;"&gt;&lt;span style="font-size: medium;"&gt;&lt;strong&gt;Details&lt;/strong&gt;&lt;/span&gt;&lt;/div&gt;
&lt;p&gt;&lt;/p&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;&lt;strong&gt;Groups and individuals:&lt;/strong&gt; We are not a huge community, and while some of us know each other, it&amp;#8217;s not true of everyone. One of the goals of the Assemblathon is to foster a friendly, yet competitive, community that maximally advances the state of the art in the field of sequence assembly. We would like to hear a brief introduction from assemblers, data providers, and other interested parties.&lt;/span&gt;&lt;/div&gt;

&lt;div&gt;&lt;span style="font-size: medium;"&gt;&lt;strong&gt;Evaluation update:&lt;/strong&gt; The evaluation team will describe the Assemblathon 2 analyses that are currently underway and those that will begin in the very near future. Please don&amp;#8217;t expect any announcements of winners at this time. It&amp;#8217;s too early for that!&lt;/span&gt;&lt;/div&gt;
&lt;p&gt;&lt;/p&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;&lt;strong&gt;Soapbox session:&lt;/strong&gt; We want to give 10–12 people a short (~5 min) slot to present an assembly issue that is very important to them. This could be assembly file format, choices of genomes, new algorithm ideas, community resources, etc. This session will inform the next one.&lt;/span&gt;&lt;/div&gt;

&lt;div&gt;&lt;span style="font-size: medium;"&gt;&lt;strong&gt;Group work:&lt;/strong&gt; We will break up into groups to discuss various assembly issues. Such discussions may focus on, but are not limited to, the following topics:&lt;/span&gt;&lt;/div&gt;
&lt;p&gt;&lt;/p&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;A: Assembly file formats&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;B: Future Assemblathon targets (transcriptomes, metagenomes, diploid assembly etc.)&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;C: Assemblathon 2 evaluation&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;D: New evaluation procedures&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;E: Improving and merging Assemblathon 2 genome assemblies&lt;/span&gt;&lt;/div&gt;

&lt;div&gt;&lt;span style="font-size: medium;"&gt;At the end of the meeting we will have groups present the results of their discussions. We suggest that these presentations focus on the following key points:&lt;/span&gt;&lt;/div&gt;
&lt;p&gt;&lt;/p&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;1. Concise description of the problem.&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;2. What resources are required to solve the problem?&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;3. What are the achievable intermediate goals?&lt;/span&gt;&lt;/div&gt;
&lt;div&gt;&lt;span style="font-size: medium;"&gt;4. What is the timeline for achieving those goals?&lt;/span&gt;&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431931973</link><guid>http://assemblathon.org/post/44431931973</guid><pubDate>Thu, 03 Nov 2011 14:01:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 2</category><category>assemblathon 3</category></item><item><title>Assemblathon 2 basic assembly metrics</title><description>&lt;p&gt;&lt;span style="font-size: medium;"&gt;I’ve calculated a set of basic assembly metrics for each Assemblathon 2 submission using our &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/assemblathon_stats.pl"&gt;assemblathon_stats.pl&lt;/a&gt; script (requires &lt;a href="http://homepage.mac.com/iankorf/FAlite.pm"&gt;FAlite.pm&lt;/a&gt;). Results for individual assemblies are available below but you can also download this information as &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/all_species.txt"&gt;one combined text file&lt;/a&gt; or one &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/all_species.csv"&gt;CSV file&lt;/a&gt;.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;Most of these basic measures are hopefully self-explanatory, but here are some further notes: &lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;&lt;span style="font-size: medium;"&gt;All sequences in submitted assemblies are initially treated as ‘scaffolds’.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Each scaffold was split on runs of 25 or more ‘N’ characters to form ‘contigs’. Any contigs that could be split in this way were regarded as ‘scaffolded contigs’&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Scaffolds that didn’t have runs of 25 or more ‘N’ characters were also counted as ‘unscaffolded contigs’.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;N50 scaffold/contig length is calculated by summing lengths of scaffolds/contigs from the longest to the shortest and determining at what point you reach 50% of the total assembly size. The length of the scaffold/contig at that point is the N50 length.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;The L50 measure is the &lt;em&gt;number&lt;/em&gt; of scaffolds/contigs that are greater than, or equal to, the N50 length.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;The NG50 and LG50 measures are the same as the N50 and L50 measures except that rather than compare against the total assembly size, we compare against the estimated genome sizes of the three species. These estimates are crude approximations averaged from a combination of data in the published literature and calculations from Assemblathon 2 participants. The NG50/LG50 measures permit fairer comparisons between assemblies of different sizes. Though note that some assemblies were not big enough to be able to calculate these metrics.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;ul&gt;&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_1C.txt"&gt;bird_1C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_2C.txt"&gt;bird_2C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_3E.txt"&gt;bird_3E.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_4C.txt"&gt;bird_4C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_5C.txt"&gt;bird_5C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_6C.txt"&gt;bird_6C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_7C.txt"&gt;bird_7C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_8C.txt"&gt;bird_8C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_9C.txt"&gt;bird_9C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_10C.txt"&gt;bird_10C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_11C.txt"&gt;bird_11C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_12C.txt"&gt;bird_12C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_13E.txt"&gt;bird_13E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_14E.txt"&gt;bird_14E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/bird_15C.txt"&gt;bird_15C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_1C.txt"&gt;fish_1C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_2E.txt"&gt;fish_2E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_3C.txt"&gt;fish_3C.tx&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_4C.txt"&gt;fish_4C.tx&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_5C.txt"&gt;fish_5C.tx&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_6C.txt"&gt;fish_6C.tx&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_7C.txt"&gt;fish_7C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_8C.txt"&gt;fish_8C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_9C.txt"&gt;fish_9C.tx&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_10E.txt"&gt;fish_10E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_11E.txt"&gt;fish_11E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_12C.txt"&gt;fish_12C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_13C.txt"&gt;fish_13C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_14E.txt"&gt;fish_14E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_15E.txt"&gt;fish_15E.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/fish_16E.txt"&gt;fish_16E.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_1C.txt"&gt;snake_1C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_2C.txt"&gt;snake_2C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_3C.txt"&gt;snake_3C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_4C.txt"&gt;snake_4C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_5C.txt"&gt;snake_5C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_6C.txt"&gt;snake_6C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_7C.txt"&gt;snake_7C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_8C.txt"&gt;snake_8C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_9C.txt"&gt;snake_9C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_10C.txt"&gt;snake_10C.txt&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_11C.txt"&gt;snake_11C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon2/Basic_metrics/snake_12C.txt"&gt;snake_12C.txt&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;span style="font-size: medium;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;In addition to calculating NG50, you can also calculate NG60, NG70, NG80 etc. E.g. ‘NG25’ would refer to the length of the scaffold at which you have seen a total of 25% of the estimated genome size (assuming you sum all scaffold lengths starting from the longest). Plotting all such ‘N(X)’ values from NG1 through to NG100 allows you easily compare all assemblies against each other. This is shown in the graphs below. Note that the Y-axis is on a log scale.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile3.posterous.com/getfile/files.posterous.com/temp-2011-10-13/JnoACsDaHvGfsqCxfitvxBrAJdiikIqobCiGaHJJjqmqipwqsrwrbIsJcvtg/fish.png.scaled1000.png"&gt;&lt;img alt="Fish" src="http://getfile8.posterous.com/getfile/files.posterous.com/temp-2011-10-13/JnoACsDaHvGfsqCxfitvxBrAJdiikIqobCiGaHJJjqmqipwqsrwrbIsJcvtg/fish.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile6.posterous.com/getfile/files.posterous.com/temp-2011-10-13/AslynuqocExkhkrsyhmIwEochlhkxrhslppdGlDEfmyGzezIjnedrwbhqviD/bird.png.scaled1000.png"&gt;&lt;img alt="Bird" src="http://getfile0.posterous.com/getfile/files.posterous.com/temp-2011-10-13/AslynuqocExkhkrsyhmIwEochlhkxrhslppdGlDEfmyGzezIjnedrwbhqviD/bird.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile8.posterous.com/getfile/files.posterous.com/temp-2011-10-13/xAyszHqjnsdfmohDhmGbeDAqCwpuhhAFuqioJrsqybBqrrJoJyigsmIjhgmI/snake.png.scaled1000.png"&gt;&lt;img alt="Snake" src="http://getfile7.posterous.com/getfile/files.posterous.com/temp-2011-10-13/xAyszHqjnsdfmohDhmGbeDAqCwpuhhAFuqioJrsqybBqrrJoJyigsmIjhgmI/snake.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431933387</link><guid>http://assemblathon.org/post/44431933387</guid><pubDate>Thu, 13 Oct 2011 19:03:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 2</category></item><item><title>A parrot, a fish, and a snake walk into a bar... </title><description>&lt;p&gt;&lt;span style="font-size: medium;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;img alt="435293692_09243853c3" src="http://getfile6.posterous.com/getfile/files.posterous.com/temp-2011-10-03/CwywvggDentrkcwEFuDroxkxCpfxCgeAgllqyJvcgBwuJCGItFvgDfzutiet/435293692_09243853c3.jpg.scaled500.jpg"/&gt;&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: x-small;"&gt;Image from &lt;a href="http://www.flickr.com/photos/enchufado/435293692/"&gt;&lt;a href="http://www.flickr.com/photos/enchufado/435293692/"&gt;http://www.flickr.com/photos/enchufado/435293692/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: x-small;"&gt;&lt;br/&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;At midnight on Sunday we reached the deadline for Assemblathon 2 submissions. As scientists like to laugh in the face of deadlines, there was somewhat of a last minute rush to submit assemblies. The following graph shows a timeline of the entries received since the submission period started. Nearly a third of all submissions arrived in the last 6 hours of Saturday night!&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile7.posterous.com/getfile/files.posterous.com/temp-2011-10-03/ciatrAJhcdEflgcoxIrIlabctqzpAahsowxqFyJdjdjjJgCDhyxdBfFJwJhG/chart_1.png.scaled1000.png"&gt;&lt;img alt="Chart_1" src="http://getfile3.posterous.com/getfile/files.posterous.com/temp-2011-10-03/ciatrAJhcdEflgcoxIrIlabctqzpAahsowxqFyJdjdjjJgCDhyxdBfFJwJhG/chart_1.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;In total we received 43 assemblies which is fantastic. They are very evenly split between the three species:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt; &lt;/span&gt;&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile2.posterous.com/getfile/files.posterous.com/temp-2011-10-03/sCABzpzorFBEkaCGjtwEtmauEjmAaqmwpkBudIjGosgpCbroyjjDJicosfII/chart_2.png.scaled1000.png"&gt;&lt;img alt="Chart_2" src="http://getfile1.posterous.com/getfile/files.posterous.com/temp-2011-10-03/sCABzpzorFBEkaCGjtwEtmauEjmAaqmwpkBudIjGosgpCbroyjjDJicosfII/chart_2.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;Although we received 16 fish assemblies, 5 of these are for evaluation only; such entries are not eligible for any prizes, and we may not be able to assess them as comprehensively as the main entries (though we will try). Similarly, 3 of the 15 parrot assemblies are for evaluation, but all of the 12 snake assemblies are for the main competition.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;For the parrot assemblies, there were three available datasets (Illumina, 454, and PacBio), which participants were allowed to use in any combination that they desired. A few weeks ago, I asked people which combinations they think they would use. The actual usage in the submitted assemblies is a little bit different from what we were expecting:&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style="font-size: medium;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;a href="http://getfile4.posterous.com/getfile/files.posterous.com/temp-2011-10-03/IsFpbEqGjIvCwseJAFsEhehCyEAoBscrhHyuziDFbzGHebFrBDcoectyyzse/chart_3.png.scaled1000.png"&gt;&lt;img alt="Chart_3" src="http://getfile2.posterous.com/getfile/files.posterous.com/temp-2011-10-03/IsFpbEqGjIvCwseJAFsEhehCyEAoBscrhHyuziDFbzGHebFrBDcoectyyzse/chart_3.png.scaled500.png"/&gt;&lt;/a&gt;
&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;

&lt;p&gt; &lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;It will be interesting to have a discussion (perhaps in the comments below), as to why fewer teams combined sequences from the different datasets than what was expected.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;Prior to the competition, I also asked how many people intended to take part and 21 groups said &amp;#8216;yes&amp;#8217; and that is exactly how many ended up taking part. Countries represented include: Canada, China, France, Italy, Portugal, Russia, the UK, and the USA. The full list of participating teams is shown below, team name(s) are included in parenthenses:&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;span style="font-size: medium;"&gt;Wellcome Trust Sanger Institute (Phusion + sga)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Human Genome Sequencing Center, Baylor College of Medicine (BCS-HGSC)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Université Laval (Ray)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;National Research University of Information Technologies, Mechanics, and Optics (Computer Technologies Department)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;European Bioinformatics Institute (Curtain)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;I.G.A Applied Genomics Institute (GAM)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;UC Berkeley (MacManes)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;DoE Joint Genome Institute (meraculous)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;454 Life Sciences (Newbler+454)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Universidade de Lisboa (CoBiG² - Computational Biology &amp;amp; Population Genomics Group)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;University of Maryland (CBCB)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;IRISA -Institut de recherche en informatique et systèmes aléatoires (Symbiose)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;The Broad Institute (Allpaths)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Canada&amp;#8217;s Michael Smith Genome Sciences Centre (ABySS)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;CRACS - Center for Research in Advanced Computing Systems (CRACS)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;BGI - SOAPdenovo&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Cold Spring Harbor Laboratory (CSHL)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;University of Georgia (IOBUGA)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;UCSF&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: medium;"&gt;Wayne State University (AB2L)&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&lt;span style="font-size: medium;"&gt;I will soon be making all of the assemblies from these groups available for any participant to inspect (and for the evaluators to&amp;#8230;well, evaluate). In the interests of ensuring fairness, the identity of each assembly will be kept secret and will be known only to me until we finish evaluating the assemblies. You will of course, be able to work out which assemblies are yours by looking at the file sizes/md5 values/base count etc.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;Thank you to everyone who is participating. I hope the results of our analysis will become a useful resource to anyone interested in producing better genome assemblies (insert your own definition of &amp;#8216;better&amp;#8217; here).&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431934535</link><guid>http://assemblathon.org/post/44431934535</guid><pubDate>Mon, 03 Oct 2011 14:42:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 2</category></item><item><title>Hot off the presses: Assemblathon 1 results published!
Image...</title><description>&lt;img src="http://25.media.tumblr.com/9ec93687cfb592c32d3ff763bdf55496/tumblr_mj2jjkHxyD1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Hot off the presses: Assemblathon 1 results published!&lt;/h2&gt;
&lt;p&gt;&lt;span style="font-size: x-small;"&gt;Image reproduced from &lt;a href="http://www.flickr.com/photos/nitsrejk/126982680/sizes/z/in/photostream/"&gt;&lt;a href="http://www.flickr.com/photos/nitsrejk/126982680/sizes/z/in/photostream/"&gt;http://www.flickr.com/photos/nitsrejk/126982680/sizes/z/in/photostream/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;A preliminary version of the Assemblathon 1 paper is now available on &lt;a href="http://genome.cshlp.org/content/early/2011/09/16/gr.126599.111?top=1"&gt;Genome Research’s website&lt;/a&gt;. Congratulations to all involved! &lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431939852</link><guid>http://assemblathon.org/post/44431939852</guid><pubDate>Mon, 26 Sep 2011 20:11:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Assemblathon 2 overview</title><description>&lt;p&gt;&lt;span style="font-size: small;"&gt;With one month to go to the submission deadline, I recently asked the genome assembly field as to whether they would be participating in Assemblathon 2. Twenty-one people completed my short survey (&lt;a href="https://docs.google.com/spreadsheet/viewform?hl=en_US&amp;amp;formkey=dEdYWEZWUHk5Ti1FM2NjNFEwb1BMYmc6MQ#gid=0"&gt;you can still complete it too&lt;/a&gt; if you haven&amp;#8217;t done so). &lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;The first question was simply to ask whether people were going to take part:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;img alt="Chart" src="http://getfile0.posterous.com/getfile/files.posterous.com/temp-2011-09-06/gbpazEodcykkkfGdsexBsdExapIqgGqJGGnxypCqteJrBnbGChgduBqidwDF/chart.png.scaled500.png"/&gt;&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;The &amp;#8216;maybe&amp;#8217; category suggests that some groups don&amp;#8217;t yet know whether they will have enough time to complete their genome assemblies. By way of comparison, Assemblathon 1 had 17 entrants, so we are in the same ballpark as last time and may still exceed that figure. Question 2 asked which of the three possible species, groups would be submitting assemblies for:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;img alt="Chart-1" src="http://getfile7.posterous.com/getfile/files.posterous.com/temp-2011-09-06/wJGHmgxjgJgwHscklcvybuqjxCqoGuoHnDyyriymtBbIFFhgkrvHznmeddbr/chart-1.png.scaled500.png"/&gt;&lt;/div&gt;
&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;At the moment only four groups are submitting genome assemblies for all three species and eight groups are only submitting assemblies for a single species (though these numbers will probably change a little bit). Remember that each species is effectively a separate Assemblathon contest and groups are not required to submit assemblies for all species.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Parrot is currently proving most popular species, possibly because of the varied sequencing technologies that have been used to generate genome sequence data for this species. This leads into the last question, which tried to assess which combinations of sequencing technologies people would use to complete the parrot genome.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;div class="p_embed p_image_embed"&gt;
&lt;img alt="Chart-2" src="http://getfile1.posterous.com/getfile/files.posterous.com/temp-2011-09-06/kGsujzekwkGHwbyGDzqjtAGaBupxmyADFmxufsuuJaIDdqDqiahuburHjBIk/chart-2.png.scaled500.png"/&gt;&lt;/div&gt;
Six of the seven possible combinations will be represented with five groups looking to use a combination of Illumina, 454, and PacBio data. This diversity in how the input data are being used will make it interesting to see how the the final assemblies differ (assuming that they differ).&lt;/span&gt;&lt;/p&gt;

&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Update: 9:56 am&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Since writing the blog post an hour or so ago, a few more people have completed the feedback form. There are now 18 confirmed participants (and still 5 maybes).&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431940680</link><guid>http://assemblathon.org/post/44431940680</guid><pubDate>Tue, 06 Sep 2011 11:14:00 -0400</pubDate><category>JustMigrate</category></item><item><title>Assemblathon breakout meeting to follow CSHL Genome Informatics...</title><description>&lt;img src="http://25.media.tumblr.com/a926a085ed5cfb11c7125e9ea088f1c7/tumblr_mj2jjqLM0F1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Assemblathon breakout meeting to follow CSHL Genome Informatics meeting&lt;/h2&gt;&lt;p style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;Image taken from &lt;a href="http://www.flickr.com/photos/idiolector/100071458/sizes/m/in/photostream/"&gt;&lt;a href="http://www.flickr.com/photos/idiolector/100071458/sizes/m/in/photostream/"&gt;http://www.flickr.com/photos/idiolector/100071458/sizes/m/in/photostream/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: left;"&gt;&lt;span style="font-size: medium;"&gt;The &lt;a href="http://meetings.cshl.edu/meetings/info11.shtml"&gt;CSHL Genome Informatics meeting&lt;/a&gt; ends at lunchtime on Saturday 5th November. We are planning to hold an Assemblathon breakout meeting to take place on this afternoon (after lunch). This meeting will discuss issues relating to Assemblathon 2, and potentially cover other issues such as directions for future Assemblathons. We envisage that the meeting may last until 6:00 pm and this breakout session will be open to participants of the Genome Informatics meeting.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: medium;"&gt;More information will be posted on the Assemblathon &lt;a href="http://assemblathon.org/pages/timetable"&gt;timetable page&lt;/a&gt; as details emerge. Also, an announcement will hopefully be made a the start of the Genome Informatics meeting.&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431947019</link><guid>http://assemblathon.org/post/44431947019</guid><pubDate>Mon, 15 Aug 2011 13:06:00 -0400</pubDate><category>JustMigrate</category></item><item><title>Parrot genome in Assemblathon 2 will now be represented by a...</title><description>&lt;img src="http://24.media.tumblr.com/4cd12788a0cf8958fa11a0bfa49a4791/tumblr_mj2jjvF8BM1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Parrot genome in Assemblathon 2 will now be represented by a *third* sequencing technology&lt;/h2&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;span style="font-size: xx-small;"&gt;Image reproduced from &lt;a href="http://www.flickr.com/photos/kiyoshi_be/2436967848/sizes/m/in/photostream/"&gt;&lt;a href="http://www.flickr.com/photos/kiyoshi_be/2436967848/sizes/m/in/photostream/"&gt;http://www.flickr.com/photos/kiyoshi_be/2436967848/sizes/m/in/photostream/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Thanks to the generosity of &lt;a href="http://www.pacificbiosciences.com/" class="vt-p"&gt;Pacific Biosciences&lt;/a&gt;, we will soon be releasing a third set of sequences that assemblers can optionally use when assembling the parrot genome. Reads from a PacBio RS machine will be giving us 10X coverage from two libraries:&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-family: arial, sans-serif; font-size: small;"&gt;&lt;span style="background-color: #ffffff;"&gt; &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;div style="font-family: arial, sans-serif; font-size: 14px;"&gt;&lt;ol&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;7Kb insert library: average read sizes =  ~1.2kb.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;10Kb insert library: average read sizes = ~3Kb, with 5% of the reads over 6–8Kb&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;This dataset adds to the existing &lt;a href="http://bioshare.bioinformatics.ucdavis.edu/Data/hcbxz0i7kg/Parrot/README.txt" class="vt-p"&gt;219X&lt;/a&gt; coverage of the genome by Illumina HiSeq data, and the &lt;a href="http://bioshare.bioinformatics.ucdavis.edu/Data/hcbxz0i7kg/Parrot/README.txt" class="vt-p"&gt;16X&lt;/a&gt; of coverage by Roche 454 data. Assemblers are free to ‘mix and match’ from any of these three datasets when making their parrot genome assembly.&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;&lt;/ol&gt;&lt;/div&gt;
&lt;p&gt;&lt;span style="background-color: #ffffff; font-size: small;"&gt;  &lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431952690</link><guid>http://assemblathon.org/post/44431952690</guid><pubDate>Mon, 27 Jun 2011 17:47:37 -0400</pubDate><category>JustMigrate</category><category>assemblathon 2</category></item><item><title>Full Assemblathon 1 analysis by UCSC team now available</title><description>&lt;p&gt;&lt;span style="font-size: small;"&gt;The &lt;a href="http://www.cbse.ucsc.edu/people/hausslerlab" class="vt-p"&gt;Assemblathon group from UC Santa Cruz&lt;/a&gt; have now released their full analysis of the Assemblathon 1 genome assembly assement. Click on the following link to access documents, data and code:&lt;/span&gt;&lt;/p&gt;
&lt;p style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://compbio.soe.ucsc.edu/assemblathon1/" class="vt-p"&gt;Assemblathon 1 UCSC Materials&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431953552</link><guid>http://assemblathon.org/post/44431953552</guid><pubDate>Fri, 03 Jun 2011 19:19:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Assemblathon 2 begins today!
Today is the start date for...</title><description>&lt;img src="http://24.media.tumblr.com/513ad0332d3c4ec25cddf03a5cf6e044/tumblr_mj2jk04DPh1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Assemblathon 2 begins today!&lt;/h2&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Today is the start date for Assemblathon 2. There are next-generation datasets available for three species: a fish (a cichlid), a bird (budgerigar), and a reptile (snake). The cichlid and snake datasets are all based on Illumina data (mate pair + paired-read) and are &lt;a href="http://assemblathon.org/pages/download-data" class="vt-p"&gt;available to download now&lt;/a&gt;. The parrot data has a mixture of 454 and Illumina data (enough of each to make two separate assemblies if you so desired), and will hopefully be available on our website within a day or two.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;We are expecting up to 25 participants, and although people are free to assemble any or all of these datasets, we hope that participants will attempt all three if possible. To encourage this, we will hopefully have a prize for the team that produces the highest quality assemblies for all three species.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Initially, it might take some time (possibly up to a week) to download these datasets. We hope to be able to announce some mirror download sites in the coming days which might help things. The &lt;a href="http://assemblathon.org/pages/rules" class="vt-p"&gt;rules&lt;/a&gt; are simple. At the start of September we will require that each participating team would have uploaded &lt;em&gt;one&lt;/em&gt; assembly for each dataset that they are assembling. I.e. no more than three assemblies per team.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;We are extremely grateful to Illumina for generously providing the snake data, for the Broad Institute for providing the cichlid data, and for Erich Jarvis at Duke University &amp; the BGI for providing the parrot data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Good luck and happy assembling!&lt;/span&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431958425</link><guid>http://assemblathon.org/post/44431958425</guid><pubDate>Thu, 02 Jun 2011 00:08:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 2</category></item><item><title>Assemblathon 1 results</title><description>&lt;div style="color: #000000; font-family: Arial, Helvetica, sans-serif; background-color: #ffffff; margin: 8px;"&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;The majority of results were initially made available during the &lt;a href="http://www.genome10k.org/events/event/5" class="vt-p"&gt;Genome Assembly Workshop&lt;/a&gt;, though some were not fully available until after this meeting. The final publication describing the official set of Assemblathon 1 results was &lt;a href="http://genome.cshlp.org/content/21/12/2224.full?sid=cd5cab27-e336-48c9-9e4f-373581f946d3"&gt;published in Genome Research&lt;/a&gt;, in late 2011.&lt;/span&gt;&lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;Collectively, the Assemblathon evaluation groups at &lt;a href="http://www.cbse.ucsc.edu/people/hausslerlab" class="vt-p"&gt;UC Santa Cruz&lt;/a&gt; and &lt;a href="http://www.genomecenter.ucdavis.edu/" class="vt-p"&gt;UC Davis&lt;/a&gt; generated over 100 different genome assembly metrics, many of which make use of the fact that we had a definitive picture of what the genome of species A actually looks like.&lt;/span&gt;&lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;This page links to various results, submitted assemblies, and Assemblathon-related talks. We advise against focusing on a single metric in order to claim that any one assembly is &amp;#8216;better&amp;#8217; than another. As we will hopefully make clear, there are many complex ways of describing assemblies and sometimes assemblies perform very well when using one metric, but very badly when using a different – and equally valid – metric.&lt;/span&gt;&lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;Note that most of the files described below are contained in &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/" class="vt-p"&gt;one directory on the Korf Lab website&lt;/a&gt;. You might find it easier accessing them from this directory.&lt;/span&gt;&lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Assemblathon Talks&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;
&lt;div style="color: #000000; font-family: Arial, Helvetica, sans-serif; background-color: #ffffff; margin: 8px;"&gt;
&lt;ul&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span&gt;March 2011 - Genome Assembly Workshop&lt;/span&gt;: &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/assemblathon_talk.pdf" class="vt-p"&gt;UC Davis Assemblathon talk&lt;/a&gt;&lt;span&gt; - 1.8&amp;#160;MB PDF with notes&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span&gt;March 2011 - Genome Assembly Workshop&lt;/span&gt;: &lt;a href="http://hgwdev.cse.ucsc.edu/~dearl/assemblathon/gawMarch2011/" class="vt-p"&gt;UC Santa Cruz Assemblathon talk&lt;/a&gt; - 33&amp;#160;MB PDF&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;May 2011 - CSHL Biology of Genomes: &lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/biology_of_genomes_assemblathon_talk.pdf" class="vt-p"&gt;Assemblathon talk&lt;/a&gt; - 2.6&amp;#160;MB PDF with notes&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;

&lt;div style="color: #000000; font-family: Arial, Helvetica, sans-serif; background-color: #ffffff; margin: 8px;"&gt;
&lt;p style="font-size: 13px;"&gt; &lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;The Assemblies&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;Each submitted assembly is available here as a gzipped file of scaffolds:&lt;/span&gt;&lt;/p&gt;
&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/A1_scaffolds.fa.gz" class="vt-p"&gt;A1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/B1_scaffolds.fa.gz" class="vt-p"&gt;B1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/B2_scaffolds.fa.gz" class="vt-p"&gt;B2&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/C1_scaffolds.fa.gz" class="vt-p"&gt;C1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/C2_scaffolds.fa.gz" class="vt-p"&gt;C2&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/D1_scaffolds.fa.gz" class="vt-p"&gt;D1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/D2_scaffolds.fa.gz" class="vt-p"&gt;D2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/D3_scaffolds.fa.gz" class="vt-p"&gt;D3&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/D4_scaffolds.fa.gz" class="vt-p"&gt;D4&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/D5_scaffolds.fa.gz" class="vt-p"&gt;D5&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/E1_scaffolds.fa.gz" class="vt-p"&gt;E1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/E2_scaffolds.fa.gz" class="vt-p"&gt;E2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/E3_scaffolds.fa.gz" class="vt-p"&gt;E3&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/F1_scaffolds.fa.gz" class="vt-p"&gt;F1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/F2_scaffolds.fa.gz" class="vt-p"&gt;F2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/F3_scaffolds.fa.gz" class="vt-p"&gt;F3&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/F4_scaffolds.fa.gz" class="vt-p"&gt;F4&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/F5_scaffolds.fa.gz" class="vt-p"&gt;F5&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/G1_scaffolds.fa.gz" class="vt-p"&gt;G1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/H1_scaffolds.fa.gz" class="vt-p"&gt;H1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/H2_scaffolds.fa.gz" class="vt-p"&gt;H2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/H3_scaffolds.fa.gz" class="vt-p"&gt;H3&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/H4_scaffolds.fa.gz" class="vt-p"&gt;H4&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/H5_scaffolds.fa.gz" class="vt-p"&gt;H5&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/I1_scaffolds.fa.gz" class="vt-p"&gt;I1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/I2_scaffolds.fa.gz" class="vt-p"&gt;I2&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/J1_scaffolds.fa.gz" class="vt-p"&gt;J1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/K1_scaffolds.fa.gz" class="vt-p"&gt;K1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/K2_scaffolds.fa.gz" class="vt-p"&gt;K2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/K3_scaffolds.fa.gz" class="vt-p"&gt;K3&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/L1_scaffolds.fa.gz" class="vt-p"&gt;L1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/M1_scaffolds.fa.gz" class="vt-p"&gt;M1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/M2_scaffolds.fa.gz" class="vt-p"&gt;M2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/M3_scaffolds.fa.gz" class="vt-p"&gt;M3&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/M4_scaffolds.fa.gz" class="vt-p"&gt;M4&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/M5_scaffolds.fa.gz" class="vt-p"&gt;M5&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/N1_scaffolds.fa.gz" class="vt-p"&gt;N1&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/N2_scaffolds.fa.gz" class="vt-p"&gt;N2&lt;/a&gt; &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/N3_scaffolds.fa.gz" class="vt-p"&gt;N3&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/O1_scaffolds.fa.gz" class="vt-p"&gt;O1&lt;/a&gt; &lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/P1_scaffolds.fa.gz" class="vt-p"&gt;P1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/Entries/Q1_scaffolds.fa.gz" class="vt-p"&gt;Q1&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Results and reports&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/basic_assembly_metrics.txt" class="vt-p"&gt;Basic Assemblathon metrics&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;The Perl script that generated these results is &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/assemblathon_stats.pl" class="vt-p"&gt;available here&lt;/a&gt; (requires &lt;a href="http://homepage.mac.com/iankorf/FAlite.pm" class="vt-p"&gt;FAlite.pm&lt;/a&gt;). Most of these basic measures are hopefully self-explanatory, but here are some further notes:&lt;/span&gt;&lt;/p&gt;
&lt;ol style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;All sequences in submitted assemblies are initially treated as &amp;#8216;scaffolds&amp;#8217;.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Each scaffold was split on runs of 25 or more &amp;#8216;N&amp;#8217; characters to form &amp;#8216;contigs&amp;#8217;. Any contigs that could be split in this way were regarded as &amp;#8216;scaffolded contigs&amp;#8217;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Scaffolds that didn&amp;#8217;t have runs of 25 or more &amp;#8216;N&amp;#8217; characters were also counted as &amp;#8216;unscaffolded contigs&amp;#8217;.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;N50 scaffold/contig length is calculated by summing lengths of scaffolds/contigs from the longest to the shortest and determining at what point you reach 50% of the total assembly size. The length of the scaffold/contig at that point is the N50 length.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;The L50 measure is the number of scaffolds/contigs that are greater than, or equal to, the N50 length.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;The NG50 and LG50 measures are the same as the N50 and L50 measures except that rather than compare against the total assembly size, we compare against the known genome size of species A (using the average size of haplotypes 1 and 2). These measures permit fairer comparisons between assemblies of different sizes.&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/assemblathon_ucdavis_report.pdf" class="vt-p"&gt;Preliminary UC Davis report&lt;/a&gt; - version 0.6 (4/28/11)&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;This PDF contains provisional findings from some of the analysis performed by the Assemblathon group at the UC Davis Genome Center. It will be updated in subsequent days and some of this information will end up in the main Assemblathon paper that is being written.&lt;/span&gt;&lt;/p&gt;
&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://edhar.genomecenter.ucdavis.edu/~koadman/assemblathon.pdf" class="vt-p"&gt;Mauve analysis&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p style="font-size: 13px;"&gt;&lt;span style="font-size: small;"&gt;This PDF report was prepared by Aaron Darling and uses the &lt;a href="http://gel.ahabs.wisc.edu/mauve/" class="vt-p"&gt;Mauve&lt;/a&gt; multiple genome alignment software to map genome assemblies to the known genome of species A. As with the previous report, some of these results will be included in the main Assemblathon paper.&lt;/span&gt;&lt;/p&gt;
&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://hgwdev.cse.ucsc.edu/~dearl/assemblathon/gawMarch2011/" class="vt-p"&gt;UC Santa Cruz Assemblathon analysis&lt;/a&gt;  - &lt;/span&gt;&lt;span style="font-size: small;"&gt;A summary of all of their results, 16&amp;#160;MB PDF. N.B. still being updated!&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://compbio.soe.ucsc.edu/assemblathon1/" class="vt-p"&gt;UC Santa Cruz Assemblathon 1 materials&lt;/a&gt; - a comprehensive set of documents, data, and code&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;ul style="font-size: 13px;"&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://genome.cshlp.org/content/early/2011/09/16/gr.126599.111?top=1"&gt;The published Assemblathon 1 paper&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;
&lt;p&gt; &lt;/p&gt;</description><link>http://assemblathon.org/post/44431959830</link><guid>http://assemblathon.org/post/44431959830</guid><pubDate>Wed, 01 Jun 2011 14:28:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Assemblathon 1 rules</title><description>&lt;p&gt;&lt;p style="text-align: center;"&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Assemblathon &amp;#8216;1&amp;#8217; – 2010&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;ol&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;A total of &lt;span style="text-decoration: line-through;"&gt;three&lt;/span&gt; two genome sequences will be made available to participants.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span style="text-decoration: line-through;"&gt;One genome sequence will be a real set of Illumina reads from an unspecified organism.&lt;/span&gt;&lt;strong&gt; Update:&lt;/strong&gt; real Illumina data will now be available as part of &lt;a href="http://assemblathon.org/delays-to-real-data-component-of-the-assembla" class="vt-p"&gt;Assemblathon 2&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;The other two genomes will be derived from a pair of related &amp;#8216;virtual&amp;#8217; species whose genomes have been artificially evolved using the &lt;a href="http://www.drive5.com/evolver/" class="vt-p"&gt;EVOLVER&lt;/a&gt; program (Edgar, R.C., Asimenos, G., Batzoglou, S, and Sidow, A.). The estimated divergence time between the two virtual species is 100 million years.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;One synthetic genome will already be assembled and participants will be able to (optionally) use information from this &amp;#8216;sister&amp;#8217; species&amp;#8217; genome to guide the assembly of the other unassembled genome (which will exist as a set of synthetic Illumina reads).&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Reads from the virtual assembly will be made to simulate the dynamics of &amp;#8216;real&amp;#8217; Illumina reads as much as possible and will be derived from a mixture of Paired-reads and Mate Pairs from a mixture of insert sizes.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Genome data will be available to &lt;a href="http://assemblathon.org/pages/download-data" class="vt-p"&gt;download&lt;/a&gt; from December 1st, 2010.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Participants have until February 6th 2011 to submit their assemblies for the synthetic genome.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Assemblies will be assessed using a variety of metrics: one of the goals of the Assemblathon is to devise new ways of quantifying and qualifying genome assemblies.&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;Results from the Assemblathon will be discussed by invited participants at the &lt;a href="http://genome10k.org/events/event/5" class="vt-p"&gt;Genome Assembly Workshop&lt;/a&gt;, in March 2011&lt;/span&gt;&lt;/li&gt;
&lt;/ol&gt;&lt;/p&gt;</description><link>http://assemblathon.org/post/44431960689</link><guid>http://assemblathon.org/post/44431960689</guid><pubDate>Wed, 01 Jun 2011 14:04:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Assemblathon 1 participants</title><description>&lt;div style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; background-color: #ffffff; margin: 8px;"&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Here are the details of the groups participating in the Assemblathon. Please let us know if any details are incorrect, need amending, or whether you would like links directed to specific web pages of labs/groups. Also make a note of your &amp;#8216;team ID&amp;#8217; as we may use these in subsequent discussions about the Assemblathon. &lt;/span&gt;&lt;/p&gt;
&lt;/div&gt;

&lt;table border="1" align="center" style="cursor: default;"&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt;Team ID &lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt;Team name/details&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt; Affiliation(s)&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt;Principle contact &lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt;Number of entries &lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;strong&gt;Software tools used for assembly&lt;/strong&gt;&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; A&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; GIS_CMB1&lt;span style=""&gt; &lt;/span&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.a-star.edu.sg/" class="vt-p"&gt;Agency for Science, Technology and Research&lt;/a&gt;, Singapore&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Pramila Ariyaratne &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1 &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; PE-Assembler &lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; B&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Phusion&lt;span style=""&gt; &lt;/span&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.sanger.ac.uk/" class="vt-p"&gt;Wellcome Trust Sanger Institute&lt;/a&gt;, UK&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Zemin Ning&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 2&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Phusion2, phrap&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; C&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Ensembl Genomes&amp;#8217; Curtain&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.ebi.ac.uk/" class="vt-p"&gt;European Bioinformatics Institute&lt;/a&gt;, UK&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Matthias Haimel&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 2&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; SGA, BWA, Curtain, Velvet&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; D&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; sanger-sga&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.sanger.ac.uk/" class="vt-p"&gt;Wellcome Trust Sanger Insitute&lt;/a&gt;, UK&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Jared Simpson&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 4&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; SGA&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; E&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Borgs&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://cracs.fc.up.pt/" class="vt-p"&gt;CRACS&lt;/a&gt; (Center for Research in Advanced Computing Systems), Portugal&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Nuno Fonseca&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 3&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; ABySS&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; F&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; ABySS&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.bcgsc.bc.ca/" class="vt-p"&gt;BC Cancer Genome Sciences Centre&lt;/a&gt;, Canada&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Shaun Jackman&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 5&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.bcgsc.ca/platform/bioinfo/software/abyss" class="vt-p"&gt;ABySS&lt;/a&gt;, Anchor&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; G&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Plant Genome Assembly Group&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; DOE &lt;a href="http://www.jgi.doe.gov/" class="vt-p"&gt;Joint Genome Insititute&lt;/a&gt;, USA&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Jarrod Chapman &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Meraculous &lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; H&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Team Symbiose&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.irisa.fr/" class="vt-p"&gt;L&amp;#8217;IRISA&lt;/a&gt; (Institut de recherche en informatique et &lt;span style="font-family: Arial, Verdana, Helvetica, sans-serif;"&gt;systèmes aléatoires&lt;span style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span style="font-family: Arial, Verdana, Helvetica, sans-serif;"&gt;)&lt;/span&gt;, France&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Rayan Chikhi&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 5&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Monument&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; I&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Terrapins&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt;&lt;a href="http://www.cshl.edu/" class="vt-p"&gt;CSHL&lt;/a&gt; (Cold Spring Harbor Laboratory), USA &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Michael Schatz &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 2&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Quake, Celera, Bambus2&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; J&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt; &lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.cs.iastate.edu/" class="vt-p"&gt;Department of Computer Science&lt;/a&gt;, Iowa State University&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Xiaoqiu Huang&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; PCAP&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; K&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Super Dawgs&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://csbl.bmb.uga.edu/" class="vt-p"&gt;Computational Systems Biology Laboratory&lt;/a&gt;, University of Georgia, USA&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Wen-Chi Chou&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 3&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Seqclean, SOAPdenovo&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; L&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; PRICE @ &lt;a href="http://derisilab.ucsf.edu/" class="vt-p"&gt;deRisi Lab&lt;/a&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://ucsf.edu" class="vt-p"&gt;UC San Francicso&lt;/a&gt;, USA&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Graham Ruby&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; PRICE&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; M&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Softberry&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.rhul.ac.uk/home.aspx" class="vt-p"&gt;Royal Holloway, University of London&lt;/a&gt;, UK &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Victor Solovyev&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 5&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; OligoZip&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; N&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; TGAC/TSL/Oxford&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.tgac.bbsrc.ac.uk/" class="vt-p"&gt;The Genome Analysis Centre&lt;/a&gt;, &lt;a href="http://www.tsl.ac.uk/" class="vt-p"&gt;Sainsbury Laboratory&lt;/a&gt;, and &lt;a href="http://www.well.ox.ac.uk/home" class="vt-p"&gt;Wellcome Trust Centre for Human Genetics&lt;/a&gt;, UK&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Mario Caccamo&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 3&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Cortex_con_rp&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; O&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; KAS&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.cs.uchicago.edu/" class="vt-p"&gt;Department of Computer Science&lt;/a&gt;, University of Chicago, USA&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Fangfang Xia&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt;Kiki&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; P&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; BGI-Shenzhen&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.genomics.cn/en/index.php" class="vt-p"&gt;BGI&lt;/a&gt;, China&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; Zhenyu Li&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; SOAPdenovo&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;tr&gt;&lt;td style="text-align: center;"&gt;&lt;span style="font-size: x-small;"&gt; Q &lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; ALLPATHS Assembly Team&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; &lt;a href="http://www.broadinstitute.org/" class="vt-p"&gt;Broad Institute&lt;/a&gt;&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; David Jaffe&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; 1&lt;/span&gt;&lt;/td&gt;
&lt;td style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; cursor: text; margin: 8px;"&gt;&lt;span style="font-size: x-small;"&gt; ALLPATHS-LG&lt;/span&gt;&lt;/td&gt;
&lt;/tr&gt;&lt;/table&gt;</description><link>http://assemblathon.org/post/44431962184</link><guid>http://assemblathon.org/post/44431962184</guid><pubDate>Wed, 01 Jun 2011 13:58:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Assemblathon 1 data</title><description>&lt;div style="color: #000000; font-family: Arial, Helvetica, sans-serif; font-size: 13px; background-color: #ffffff; margin: 8px;"&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Currently there are 13 files available to download. The first 8 of these files contain synthetic Illumina reads for species &amp;#8216;A&amp;#8217;. The next 3 files contain: the complete (synthetic) genome of the related species &amp;#8216;B&amp;#8217;, a reference version of that genome, and annotations for the reference. Finally, a README file provides more detailed information about the contents of these files and a checksums file contains the &lt;a href="http://en.wikipedia.org/wiki/MD5" class="vt-p"&gt;MD5&lt;/a&gt; checksum value for each file. See the &lt;a href="http://assemblathon.org/pages/rules" class="vt-p"&gt;Rules&lt;/a&gt; page for more information about the Assemblathon and please &lt;a href="http://assemblathon.org/pages/contact-us" class="vt-p"&gt;contact us&lt;/a&gt; if you have any problems in downloading these data.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Please visit the &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/" class="vt-p"&gt;download website&lt;/a&gt; or use the direct links below for HTTP access to individual files.&lt;/span&gt;&lt;/p&gt;
&lt;ul&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;README.txt - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/README.txt" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesA_200i_40x.1.fastq.bz2 - 1.8&amp;#160;GB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_200i_40x.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesA_200i_40x.2.fastq.bz2 - 1.9&amp;#160;GB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_200i_40x.2.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesA_300i_40x.1.fastq.bz2 - 1.8&amp;#160;GB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_300i_40x.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesA_300i_40x.2.fastq.bz2 - 1.9&amp;#160;GB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_300i_40x.2.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_300i_40x.2.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;span&gt;speciesA_3000i_20x_r3.1.fastq.bz2 - 931&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_3000i_20x_r3.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_3000i_20x.1.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_3000i_20x_r3.2.fastq.bz2 - 946&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_3000i_20x_r3.2.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_3000i_20x.2.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_10000i_20x_r3.1.fastq.bz2 - 930&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_10000i_20x_r3.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_10000i_20x.1.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_10000i_20x_r3.2.fastq.bz2 - 945&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA_10000i_20x_r3.2.fastq.bz2" class="vt-p" style="font-size: small;"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesA.diploid.fa.bz2 - 62&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesA.diploid.fa.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt; &lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesB.diploid.fa.bz2 - 62&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesB.diploid.fa.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;speciesB.reference.fa.bz2 - 31&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesB.reference.fa.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesB.reference.fa.bz2" class="vt-p"&gt;&lt;/a&gt;speciesB.annotations.gff.bz2 - 11MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/speciesB.annotations.gff.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt; &lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;checksums.md5 - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/Assemblathon1/checksums.md5" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Important update - January 5th 2011&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;The original set of mate-pair files contained errors and have been replaced with new files. Please use the mate-pair files listed above as these&lt;span style="border-collapse: collapse; font-family: arial, sans-serif;"&gt;correct some problems in the orientation of the mate-pair reads (see below for more details). The paired-end libraries are unaffected by this issue.  &lt;/span&gt;&lt;span&gt;We apologize for any inconvenience this may cause you.&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;&lt;span style="border-collapse: collapse; font-family: arial, sans-serif;"&gt;Explanation of problem&lt;/span&gt;&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="border-collapse: collapse; font-family: arial, sans-serif; font-size: small;"&gt;In the previous dataset both reads were placed on the &lt;em&gt;same&lt;/em&gt; strand by mistake. A quick-fix would be to simply reverse complement all of the first reads and reverse the quality strings which would lead to the reads being in the proper orientation. However, fixing the reads in this way would result in the underlying error applied to the reads being done so in reverse. For example an observed error rate typical of the beginning of an Illumina read would show up at the end of the first reads. We have modified the sequencing simulator code so that it produces mate-pair reads in the proper orientation, and then used this modified code to generate two new replacement mate-pair libraries.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;After we fixed this initial problem (and made available release &amp;#8216;r2&amp;#8217; of the data), we discovered a subsequent problem. The non-biotin pull-down reads within the mate-pair library should have been orientated towards each other but were mistakenly orientated in the same way as the rest of the mate-pair library. This was subsequent fixed in a third release of the data (&amp;#8216;r3&amp;#8217;) available above. The files containing errors are listed below but we would strongly urge people to re-download the r3 files from above.&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;strong&gt;Older files containing errors&lt;/strong&gt;&lt;/span&gt;&lt;/p&gt;

&lt;ul&gt;&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span style="text-decoration: line-through;"&gt;&lt;span style="text-decoration: line-through;"&gt;&lt;span style="text-decoration: line-through;"&gt;speciesA_3000i_20x.1.fastq.bz2 - 931&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x.1.fastq.bz2" class="vt-p" style="font-size: small;"&gt;HTTP&lt;/a&gt; &lt;/span&gt;&lt;/span&gt;&lt;/span&gt; - &lt;strong&gt;DO NOT USE!&lt;/strong&gt; (see above)&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="text-decoration: line-through; font-size: small;"&gt;&lt;span style="text-decoration: line-through;"&gt;speciesA_3000i_20x.2.fastq.bz2 - 946&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x.2.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span style="text-decoration: line-through;"&gt;&lt;span style="text-decoration: line-through;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x.2.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="text-decoration: line-through;"&gt;speciesA_10000i_20x.1.fastq.bz2 - 930&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="font-size: small;"&gt;&lt;span style="text-decoration: line-through;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x.1.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span style="text-decoration: line-through;"&gt;speciesA_10000i_20x.2.fastq.bz2 - 945&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x.2.fastq.bz2" class="vt-p" style="font-size: small;"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="text-decoration: line-through; font-size: small;"&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x.2.fastq.bz2" class="vt-p" style="font-size: small;"&gt;&lt;/a&gt;&lt;span&gt;speciesA_3000i_20x_r2.1.fastq.bz2 - 931&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x_r2.1..fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="text-decoration: line-through; font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x.1.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_3000i_20x_r2.2.fastq.bz2 - 946&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x_r2.2.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="text-decoration: line-through; font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_3000i_20x.2.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_10000i_20x_r2.1.fastq.bz2 - 930&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x_r2.1.fastq.bz2" class="vt-p"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;li&gt;&lt;span style="text-decoration: line-through; font-size: small;"&gt;&lt;span&gt;&lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x.1.FIXED.fastq.bz2" class="vt-p"&gt;&lt;/a&gt;&lt;/span&gt;&lt;span&gt;speciesA_10000i_20x_r2.2.fastq.bz2 - 945&amp;#160;MB - &lt;a href="http://korflab.ucdavis.edu/Datasets/Assemblathon/speciesA_10000i_20x_r2.2.fastq.bz2" class="vt-p" style="font-size: small;"&gt;HTTP&lt;/a&gt;&lt;/span&gt;&lt;/span&gt;&lt;/li&gt;
&lt;/ul&gt;&lt;/div&gt;</description><link>http://assemblathon.org/post/44431963352</link><guid>http://assemblathon.org/post/44431963352</guid><pubDate>Wed, 01 Jun 2011 13:53:00 -0400</pubDate><category>JustMigrate</category><category>assemblathon 1</category></item><item><title>Setting out the stall for Assemblathon 2
Image taken...</title><description>&lt;img src="http://24.media.tumblr.com/c83cd6352c0a09a6fcc60a18f595fab6/tumblr_mj2jk9phgz1s7t9swo1_500.jpg"/&gt;&lt;br/&gt;&lt;br/&gt;&lt;h2&gt;Setting out the stall for Assemblathon 2&lt;/h2&gt;&lt;p&gt;&lt;span style="font-size: small;"&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: xx-small;"&gt;Image taken from &lt;a href="http://www.flickr.com/photos/lwr/101661613/"&gt;&lt;a href="http://www.flickr.com/photos/lwr/101661613/"&gt;http://www.flickr.com/photos/lwr/101661613/&lt;/a&gt;&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;As we finish up the analysis of Assemblathon 1, our thoughts turn to Assemblathon 2 and we’d like to ask for some guidance from the genome assembly community. Specifically, w&lt;span style="font-family: Arial;"&gt;e are seeking feedback as to what combination of data sets we should use for the next Assemblathon event (due to start in May). Currently we are planning to use parrot sequence data from the BGI and cichlid data from the Broad Institute. While there is Illumina data for both species the 454 data only exists for parrot at the moment, but this could change. &lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt;Therefore we have the option of using &lt;em&gt;only&lt;/em&gt; Illumina data or a mixture of Illumina &lt;em&gt;and&lt;/em&gt; 454 data (either as a combined genome assembly challenge or as separate challenges for each sequencing technology). Of course, increasing the number of challenges also greatly increases the workload for the you (the participants) and us (the evaluators).&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;span style="font-size: small;"&gt; &lt;/span&gt;&lt;span style="font-size: small;"&gt;If you entered Assemblathon 1 and/or if you plan to enter Assemblathon 2 please help us by completing the following form:&lt;/span&gt;&lt;/p&gt;
&lt;p class="p3"&gt;&lt;span class="s1" style="font-size: small;"&gt;&lt;a href="https://spreadsheets.google.com/viewform?formkey=dGtVU056b294V21TVXZ2NUpqbjBMRGc6MQ" class="vt-p"&gt;Assemblathon 2 datasets&lt;/a&gt;&lt;/span&gt;&lt;/p&gt;

&lt;p class="p3"&gt; &lt;/p&gt;</description><link>http://assemblathon.org/post/44431969904</link><guid>http://assemblathon.org/post/44431969904</guid><pubDate>Wed, 30 Mar 2011 14:37:00 -0400</pubDate><category>JustMigrate</category></item></channel></rss>
