Monday, November 2, 2015

Software from CSHL Genome Informatics 2015

I just returned from the Genome Informatics meeting at Cold Spring Harbor. This was, hands down, the best scientific conference I've been to in years. The quality of the talks and posters was excellent, and it was great meeting in person many of the scientists and developers whose tools and software I use on a daily basis. To get a sense of what the meeting was about, 140 characters at a time, you can access all the Tweets sent Oct 28-31 2015 tagged #gi2015 at this link.

Below is a very short list of software that was presented at GI2015. This is only a tiny slice of the tools and methods that were presented at the meeting, and the list is highly biased toward tools that I personally find interesting or useful to my own work (please don't be offended if I omitted your stuff, and feel free to mention it in the comments).

Monocle: Software for analyzing single-cell RNA-seq data
Paperhttp://www.nature.com/nbt/journal/v32/n4/full/nbt.2859.html
Softwarehttp://cole-trapnell-lab.github.io/monocle-release/

Kallisto: very fast RNA-seq transcript abundance estimation using pseudoalignment.
Preprinthttp://arxiv.org/abs/1505.02710
Softwarehttp://pachterlab.github.io/kallisto/about.html

Sleuth: R package for analyzing & reporting differential expression analysis from transcript abundances estimated with Kallisto.
Preprint: coming soon?
Softwarehttp://pachterlab.github.io/sleuth/about.html
See also: The bear's lair (http://lair.berkeley.edu/): reanalysis of published RNA-seq studies using kallisto+sleuth.

QoRTs: Quality of RNA-Seq Toolset. Toolkit for QC, gene/junction counting, and other miscellaneous downstream processing from RNA-seq alignments.

JunctionSeq: R package for testing differential junction usage with RNA-seq data.

HISAT2: RNA-seq alignment against populations of genomes (aligns DNA also).
Softwarehttp://ccb.jhu.edu/software/hisat2/index.shtml

Rail: software for aligning many-sample RNA-seq data, producing alignments, genome coverage bigWigs, and splice junction BED files.
Softwarehttp://rail.bio
Preprinthttp://biorxiv.org/content/early/2015/08/11/019067

LobSTR: genotype short tandem repeats from NGS data.
Softwarehttp://melissagymrek.com/lobstr-code/
Paperhttp://www.ncbi.nlm.nih.gov/pubmed/22522390

Basset: convolutional neural networks for learning functional/regulatory features of DNA sequence.
Softwarehttps://github.com/davek44/Basset
Preprinthttp://biorxiv.org/content/early/2015/10/05/028399

Genotype Query Tools (GQT): fast/efficient individual-level queries of large-scale variation data.
Softwarehttps://github.com/ryanlayer/gqt
Preprinthttp://biorxiv.org/content/early/2015/06/05/018259

Centrifuge: a metagenomics classifier.
Softwarehttps://github.com/infphilo/centrifuge
Posterhttp://www.ccb.jhu.edu/people/infphilo/data/Centrifuge-poster.pdf

Mash: MinHash-based method for rapidly estimating pairwise distances between genomes or metagenomes.
Softwarehttps://github.com/marbl/Mash
Docshttp://mash.readthedocs.org/en/latest/
Preprinthttp://biorxiv.org/content/early/2015/10/26/029827

VCFanno: ultrafast large-sample VCF annotation
Softwarehttps://github.com/brentp/vcfanno

Ginkgo: Interactive analysis and assessment of single-cell copy-number variations
Paper: http://www.nature.com/nmeth/journal/v12/n11/full/nmeth.3578.html
Software: https://github.com/robertaboukhalil/ginkgo

StringTie: RNA-seq transcript assembly+quantification, with or without a reference. See paper for comparison to existing tools.
Software: http://ccb.jhu.edu/software/stringtie/
Source: https://github.com/gpertea/stringtie
Poster: http://ccb.jhu.edu/software/stringtie/cshl2015.pdf
Paperhttp://www.nature.com/nbt/journal/v33/n3/full/nbt.3122.html




8 comments:

  1. Thank for the post Stephen. I tried to put together a list of all the #gi2015 tools I noticed, available here: https://github.com/galaxyproject/tools-iuc/issues/361

    ReplyDelete
  2. Thanks for the nice summary and all the links!

    ReplyDelete
  3. Thanks for your comments & glad you enjoyed the meeting. I agree, was an outstanding lineup this year.

    From my lab we also discussed:

    Ginkgo: Interactive analysis and assessment of single-cell copy-number variations
    Paper: http://www.nature.com/nmeth/journal/v12/n11/full/nmeth.3578.html
    Software: http://qb.cshl.edu/ginkgo/
    Sourcecode: https://github.com/robertaboukhalil/ginkgo
    Presented by Robert Aboukhalil http://robertaboukhalil.com/

    The other packages will be available soon

    Thank you!
    Mike

    ReplyDelete
  4. I've got at least one more for you:
    StringTie: a fast, accurate method for assembly and quantification of transcripts from RNA-seq reads, either with or without annotation as a guide. Version 1.1 has major memory usage improvements.
    Software: http://ccb.jhu.edu/software/stringtie/
    Github: https://github.com/gpertea/stringtie
    Poster: http://ccb.jhu.edu/software/stringtie/cshl2015.pdf

    ReplyDelete
  5. Thanks for your nice summary of tools at Genome Informatics. I just wanted to point out that HISAT2 is an alignment program for both DNA-seq and RNA-seq reads. Would you like to reflect this on the blog post?

    Thanks,
    Daehwan Kim

    ReplyDelete

Note: Only a member of this blog may post a comment.

Creative Commons License
Getting Genetics Done by Stephen Turner is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.