Due to my ongoing work with the snake Boa constrictor, I’ve read and reread Bradnam et al.’s Assemblathon 2 paper and have been using their best-rated snake assembly (7C – SGA team) successfully for a lot of my work. I’ve been gearing up to build on this work, providing the community with what I hope will be an even better genomic resource for this species.
As part of this process, there were a couple of loose ends that I needed some clarification on. Keith Bradnam promptly replied to my email for assistance, though he suggested I contact individuals with more knowledge of the matters in question. Below, I describe the two clarifications I needed and the ultimate answers I received (and verified), so that others may leverage this information for any related work they are doing.
A couple of years ago I led a paper in PLoS ONE outlining a reference-guided genome assembly approach that, more or less, simply maps reads from the target species to a high-quality guide genome and exports a consensus, producing a genome sequence for the target species of interest. As the paper pointed out, this has been applied between strains of a single species, but hasn’t really been tried across larger evolutionary distances. We found that with the 3-5x genome coverage we had, this approach produced a better genome assembly, based on several metrics, than simply performing de novo genome assembly, despite mapping to a guide genome that was quite distantly related in both cases. This was helped along by the fact that we were working with bird genomes, which are the best-case scenario for this type of approach in vertebrates. Obviously, we were assuming that enough synteny exists to prevent our method from producing spurious assemblies, and this assumption may or may not hold up as evidence emerges (so far there is none across broader taxonomic scales). Despite this and other shortcomings, I hold out some hope that this type of assembly approach will become more common as more and more high-quality reference genomes from across the tree of life become available. Moreover, I think that even in cases of high sequencing coverage, where de novo assembly is the norm, this type of approach could complement existing assembly methods to produce a better overall assembly. If you haven’t seen this, you can find it here.
Not many have taken notice of this work (at least based on citations), though I recently had an inquiry from someone who was trying to implement it. I’m also coming full circle and playing around with this approach again in some of the work I am currently doing. Admittedly, at the time I was quite new to graduate school and to handling and analyzing whole-genome sequencing data; being more seasoned now, I thought I would update my approach to one that makes more sense and uses some different tools. The basic pipeline from before (i.e., from the publication) was to use CLC Genomics Workbench to do the following:
- Quality trim raw Illumina reads
- Map quality-trimmed reads to a guide genome
- Export the consensus sequence for regions where reads mapped based on the level of read coverage
When someone asked for guidance with their own data, I was quick to provide what I think is a more sound (and certainly more “open”) pipeline, which was facilitated by all I’ve learned since publishing the original paper. It goes as follows:
- Quality trim raw Illumina reads: You can use just about anything, but I’m currently partial to Trimmomatic.
- Map quality-trimmed reads to a guide genome: I like BWA the most for this, but any mapper could work. This step can obviously vary a lot depending on the distance between the target and guide, and one has to balance high-quality mapping and sufficient coverage. I’m hoping to do a follow-up post about this.
- Call any SNPs or INDELs: The two leaders here are SAMtools and GATK, which produce an output VCF file of any variants. The former is more streamlined and quick to implement while the “Best Practices” approach of the latter is more refined (e.g., INDEL realignment), which might help in this type of application. I’m planning to write another post outlining the GATK approach in the context of reference-guided assembly.
- Filter genotype calls by quality: Again, one could use SAMtools or GATK. One can filter based on many things, and the filtering we performed before was based on coverage, but I would now recommend that one uses things like mapping quality and genotype probability/quality scores to perform filtering, since this essentially encompasses the sequencing depth. This produces a refined VCF file with the most confident variant calls.
- Export a consensus sequence: SAMtools or GATK will take the filtered VCF and the guide genome and produce a consensus sequence where all positions without a genotype call in the VCF file are output as ‘N’ ambiguities. This preserves some level of scaffolding, which is useful, though it is probably prudent to break longer stretches of Ns, as they aren’t founded on any empirical data derived from the target species and may incorrectly link contigs.
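Concretely, the five steps might look something like the sketch below. All file names, thresholds, and Trimmomatic settings are illustrative assumptions rather than recommendations, and I show the SAMtools/BCFtools route for calling, filtering, and consensus (GATK would be the alternative at those steps):

```shell
#!/bin/bash
# Sketch of the reference-guided pipeline; file names, thresholds, and
# trimming settings are placeholders, not tested recommendations.

# 1. Quality trim raw Illumina reads (Trimmomatic, paired-end mode)
java -jar trimmomatic.jar PE raw_R1.fastq.gz raw_R2.fastq.gz \
    trim_R1.fastq.gz unpaired_R1.fastq.gz \
    trim_R2.fastq.gz unpaired_R2.fastq.gz \
    SLIDINGWINDOW:4:20 MINLEN:36

# 2. Map quality-trimmed reads to the guide genome with BWA, then sort/index
bwa index guide.fa
bwa mem guide.fa trim_R1.fastq.gz trim_R2.fastq.gz \
    | samtools sort -o mapped.sorted.bam -
samtools index mapped.sorted.bam

# 3. Call SNPs and INDELs, producing a compressed VCF
bcftools mpileup -f guide.fa mapped.sorted.bam \
    | bcftools call -mv -Oz -o variants.vcf.gz

# 4. Filter on site quality and mapping quality rather than raw coverage
bcftools filter -i 'QUAL>=30 && MQ>=30' -Oz -o filtered.vcf.gz variants.vcf.gz
bcftools index filtered.vcf.gz

# 5. Export a consensus from the guide plus the filtered variants.
#    Masking positions without a confident call as 'N' requires a BED of
#    those sites passed via -m; building that BED is omitted here.
bcftools consensus -f guide.fa -m uncalled_sites.bed filtered.vcf.gz > consensus.fa
```

This is only a skeleton; in practice the mapping parameters in step 2 need tuning to the divergence between target and guide, as discussed above.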
Overall, I think this approach does a much better job: it filters genotype calls using probabilities rather than simply coverage, which is a more arbitrary criterion that does not hold up well as genome sequencing coverage varies. I’m aiming to have some follow-up posts discussing mapping parameter optimization and the details of mapping, variant calling, and variant filtering. I’m also hoping to implement this more refined approach in some upcoming work that will make its way into a manuscript.
It has been some time since I’ve posted anything, and I’m trying to start blogging regularly again. I’ve already described how to do a batch submission of data to the NCBI Sequence Read Archive, but today I was trying to do a batch download of a set of SRA sequence data for a project. Turns out, it can be a bit difficult to set up and use the SRA Toolkit, at least in my opinion, but it is certainly easier than uploading data. Fortunately, I found a nice Biostars thread that solved this issue for me, so I figured I’d reiterate it here for my own reference, and that of others.
You can find the thread by visiting https://www.biostars.org/p/111040/.
The command is a long pipe, as follows:
$ esearch -db sra -query <accession> | efetch -format runinfo | cut -d ',' -f 1 | grep SRR | xargs fastq-dump --split-files --bzip2
I’ll break this all down so it is apparent what this command is doing.
1. esearch, part of the NCBI Entrez Direct utilities, queries the chosen database (here, the SRA) with an accession (in the case of a batch, a BioProject accession is most appropriate).
2. The STDOUT from #1 is piped into efetch, which uses this metadata to format a report in the ‘runinfo’ format, a comma-separated table of information about the accession.
3. The STDOUT from #2 is then subset by cut, which splits the columns on commas and keeps only the first, corresponding to the SRR accession numbers.
4. The STDOUT from #3, a list of SRR accessions plus a header line you don’t want, is passed through grep so that only SRR accession numbers move along.
5. Finally, xargs takes the STDOUT from #4 and runs fastq-dump, from the NCBI SRA Toolkit, on each accession. The --split-files argument splits the paired reads into two files, instead of interleaving them, and the --bzip2 flag compresses the output fastq files (you could use --gzip instead).
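To see what the middle of the pipe is doing, here is a toy runinfo table (the SRR accessions are made up) pushed through the cut and grep steps:

```shell
# A fake two-run 'runinfo' table; note the header line that grep removes
printf 'Run,ReleaseDate,spots\nSRR0000001,2015-01-01,1000\nSRR0000002,2015-01-02,2000\n' \
    | cut -d ',' -f 1 \
    | grep SRR
# Prints:
# SRR0000001
# SRR0000002
```

With a real BioProject, xargs then hands each of those accessions to fastq-dump.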
It is worth pointing out that if you have correctly set up your SRA Toolkit configuration, you’ll notice some intermediate files being written to your /path/to/ncbi/sra directory, where /path/to usually equals $HOME. The final fastq(.gz/.bz2) files are written to the directory you called this command from, or to an output directory you can set using fastq-dump (see documentation). This will take some time to actually run on a larger set of files.
Recently, I began work on a project that required downloading the genomes of currently sequenced vertebrates. Most of these genomes are available through Ensembl, an online repository of genomes and a website where you can browse genome annotations. In the past, I’ve only had to download a genome or two and always used an ftp connection through the terminal. Here are the commands you can use to do this:
$ cd /path/to/store/data
$ ftp ftp.ensembl.org
# Username: anonymous
# Password: <email>
# can then move around hierarchy using cd and ls commands
# to download your target file, once you navigate to it, type:
$ mget <file>
# answer y(es) when the server prompts.
If instead you want to download many files at once, you need to take a different approach using a program called rsync (which should be installed on Linux/OSX). Here are the commands you can adapt for your purpose:
$ cd /path/to/store/data
$ rsync -av rsync://ftp.ensembl.org/ensembl/pub/path/to/folder/targetfile.ext ./
# downloads targetfile.ext to your current directory (which is the
# path where you want to store data)
$ rsync -av rsync://ftp.ensembl.org/ensembl/pub/path/to/*/*.ext ./
# downloads all files with a .ext extension in all directories
# within the path/to/ directory
The command I used to download the most current releases of all complete Ensembl genomes was:
$ rsync -av rsync://ftp.ensembl.org/ensembl/pub/current_fasta/*/dna/*.dna.toplevel.fa.gz ./
You can modify this command to download all soft-masked assemblies, all cDNA assemblies, and many other file types.
One should also note the root directory structure for the Ensembl Genomes database, which includes Metazoans, Protozoans, etc. For Metazoans, it is ‘ftp.ensemblgenomes.org/all/pub/metazoa/current/’ and this can be modified accordingly for the target genome files the user is seeking.
In all cases, these batch rsync downloads are much better than performing the above ftp steps 30+ times and will work in the background. Download rates will vary based on connection and data type, but I would expect to wait between 5 and 10 minutes per assembly file. Hopefully this helps save you some time as well.
Occasionally, I have come across sequencing read files (fastq) that are a bit screwy near the end and have what appears to be truncated text. This causes issues with programs used to analyze the data. I’m not sure if this was a problem with data transfer or what, but it is an easy fix to just trim off these troublesome reads, and then the file is usable again (each record is 4 lines for fastq, 2 lines for fasta). This isn’t trivial with giant sequencing files, which are impossible to just open in a text editor. Below I am giving a simple Unix command that will allow you to trim a set number of lines off the end of a large text file. This is mostly so I can reference it easily later, but maybe it will help others as well.
Command to trim x number of lines from the end of a file:
$ head -n -<#lines> <inputfile> > <outputfile>
On a related note, you can also trim the first x lines from a file. Note that tail -n +K outputs starting at line K, so to skip the first x lines you pass x+1:
$ tail -n +<#lines+1> <inputfile> > <outputfile>
Adapted from stackoverflow.com/questions/10460919/how-to-delete-first-two-lines-and-last-four-lines-from-a-test-file-with-bash
Edit: Just discovered that the above head command does not work on Apple OSX. The workaround for Mac users is as follows:
$ tail -r <inputfile> | tail -n +<#lines+1> | tail -r > <outputfile>
# command essentially reverses the file, then trims the first x lines (hence the +1), then reverses it back to the original order, which equates to trimming the last x lines.
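To sanity check both commands, here is a quick run on a small made-up file (file names are arbitrary; the negative head count requires GNU head, so Mac users need the workaround):

```shell
# Build a 10-line test file
printf 'line%d\n' 1 2 3 4 5 6 7 8 9 10 > example.txt

# Trim the last 2 lines
head -n -2 example.txt > trimmed.txt
wc -l < trimmed.txt                   # prints 8

# Trim the first 2 lines: tail -n +K starts output AT line K
tail -n +3 example.txt | head -n 1    # prints line3
```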
I’m excited to report the submission of my first first-author paper. After a few months of work, we submitted the manuscript to PLoS ONE today. The paper describes two new bird genomes – the Gunnison Sage-Grouse and the Clark’s Nutcracker – and empirically compares reference-guided and de novo assembly techniques. I won’t spill any more of the beans on the paper until it is formally accepted, but I wanted to brag a little bit about the accomplishment. Stay tuned!
Recently, I’ve submitted several sets of sequencing reads to NCBI’s Sequence Read Archive (SRA) in anticipation of papers we are publishing. I previously found that NCBI does a pretty bad job of explaining how to actually upload the files for submission and wrote this guide, which I subsequently edited to be more thorough and complete. The overall description of actually creating a submission isn’t great, as currently you not only create an SRA, but also a BioProject and a BioSample. I thought I would explain the basics of what I did for creating my submission online and then describe the process of actually uploading the files to the SRA so they can be associated with your submission (it is not a simple upload, like you may be accustomed to). I should note that the instructions I have here are based on Mac OS X, though they should be very similar (or identical) on Linux. Windows will likely be pretty similar too (except for determining the MD5 checksum), but you may have to use Google for a little help. Here we go:
- Log in to or sign up for an NCBI account.
- Go to the BioProject page and click ‘New Submission’.
- You will then be asked to fill in various fields over a range of pages, some required and some not required. The BioProject Help booklet explains what each of these fields means and I’ll leave you to fill them in for your submission. At this point you can leave the BioSample field clear, since you haven’t created a BioSample yet (we will do this next).
- If you are submitting a small number of BioSamples, you can do them individually through the web browser. First go to the BioSample page and click ‘New Submission’.
- On the ‘General info’ tab you will specify that you are submitting a single sample (you will repeat the necessary steps for the rest of your submissions). Otherwise, you will fill in various fields and the SRA Handbook will be your guide to their meaning.
- If you are submitting a large number of BioSamples, you should use the batch submission option to make your life a thousand times easier. Next to the ‘New Submission’ button there is a link called “Download batch template”. You will be prompted to specify the type of sample you are uploading, and upon doing so a template will be downloaded. This will be tab-delimited text, which can be easily edited and saved using Excel (but be sure to keep it as tab-delimited text). Fill in the required fields and whatever optional fields apply. Now you can click ‘New Submission’ and fill everything in as above (see the ‘General info’ step), but you will instead specify that you are submitting a batch of multiple BioSamples. NCBI will ask you to upload your completed template, will check that everything is filled in correctly, and will proceed in a logical manner.
- You should pause here and wait for NCBI to actually accept these submissions without error. It will be obvious when that occurs (you may have to refresh a few times). Note the accession numbers that are assigned to each. It only took a couple of minutes for each of the submissions I made today.
- Your actual SRA submission will also differ depending on whether you are submitting one or a few runs versus a batch of runs. To do the submission for one or a few samples, go to the SRA page, click ‘Submit to SRA’ and then ‘Create new submission’. Here, you just give the sample alias and perhaps a description.
- Next, you click on the ‘New Experiment’ button to proceed and enter data about the experiment. Again, I will leave it to you to fill in the relevant fields using the SRA Handbook. Most importantly, here is where you will associate the SRA entry with both the BioProject and BioSample using the accession numbers created for those entries.
- Once the experiment is complete you can then fill in the run details by clicking ‘New Run’. Fill in an appropriate alias and then you can finally designate the reads that you plan to upload.
- For each read file (paired-end data comes in two files), you should fill in a sample file type and sample name (including extension; e.g., fastq). NCBI will accept gzipped or bzipped files so make sure that designation is included where applicable. The sample name should exactly match the file you will upload.
- For each file, you also need to input an MD5 checksum that matches the file. To obtain this in Mac OS X you simply open the terminal and type:
$ md5 /path/to/file/filename
- For each file, take the outputted MD5 checksum and copy it exactly into the field on NCBI. Once you have done this for all read files, you can click save.
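A note for Linux users: there is typically no md5 command, but md5sum prints the same digest (shown here on an empty scratch file, whose MD5 is a well-known constant):

```shell
# Create an empty file and hash it; md5sum's first field is the digest
: > demo.bin
md5sum demo.bin | cut -d ' ' -f 1
# prints d41d8cd98f00b204e9800998ecf8427e
```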
- Alternatively, if you are doing a batch submission you will need to properly fill out an Excel template workbook. I’ve provided a version I received from NCBI here, as it doesn’t appear to be posted anywhere on NCBI. There is some basic information about the submission on the first sheet of the workbook, along with some basic instructions. The second sheet contains most of the fields that need to be completed while the third sheet has definitions for everything. Be sure that you enter the BioProject accession and individual BioSample accessions for each run (note that some BioSamples may have more than one run). NCBI will need these to properly link the SRA accessions it creates with the BioSample/BioProject accessions you already have. The other fields should be self-explanatory and see the steps above for determining MD5 checksums. Save a completed version of the template (as an Excel document) and email it to the SRA folks at NCBI using the email address firstname.lastname@example.org. You can briefly describe your data in the body of the message, but otherwise they shouldn’t need anything more than the attached template. You should be notified that they received your email soon after sending it and you can expect your request to be processed within a couple of days. You will not be able to proceed without confirmation that the accessions were created by an SRA curator.
- Once you have entered all of the run details individually on your browser or have sent the completed SRA template to NCBI and received confirmation that it was accepted, you are ready to proceed and actually upload your read data.
- If you did a small submission you may have noticed directions for using the SRA ftp on the run details page. If you submitted a batch template you can view these instructions by clicking the sample name link next to a run on the SRA submission summary page. Note the ftp domain, the username, and the password, as you will need these.
- You can now use the ftp program in the terminal to complete the process. First, I would make sure all read files for a given SRA submission are in the same directory, and then I would change directories (cd) into that directory.
- You can then initiate the upload using the following commands:
$ ftp -i
ftp> open <hostname>  # likely <host>.ncbi.nih.gov; some text should appear
                      # and you will be prompted for a username
> <username>          # probably 'sra'
ftp> mput *           # uploads each read file in the directory one-by-one
                      # until they are complete; be sure to modify the
                      # wildcard if you have more than just read files in
                      # your working directory
ftp> bye              # closes ftp
- Once the files are uploaded, NCBI will associate them with your SRA submission and the status should eventually indicate that the process is complete. NCBI states that this could take some time, so I would give it a day before emailing NCBI if the status does not show as complete.
That should take care of everything. One thing to note is that you can control the release date, in case you don’t want your data made public quite yet. Much of this information can probably be changed manually later, though I don’t have experience on how to do that. Hopefully this is useful to people, as the directions aren’t explicit on NCBI and you are left to fill in some of the blanks. There are other guides out there that I also came across, but I figured it might serve everyone well to make a current one, since things seem to have changed recently. And now I have a cheat sheet for when I need to do this again!