Follow the Data

A data driven blog

GitHub goodies

  • The first post from the brand new Nuts ‘n Bolts blog talks about hash kernels and how to use them to represent arbitrary input data in a format suitable for machine learning. There is a GitHub repo called hashkernel that demonstrates the approach. The tag line for the repo is great: A demonstration of how to use hash kernels for ridiculously unprincipled machine learning.
  • This iPython notebook shows how to write a (greedy, not de Bruijn) genome assembler using tools available at the Pacific Biosciences GitHub repo. Titus Brown also has a repo showing how to implement a de Bruijn graph based ASCII assembler on top of Bloom filters. 
Advertisements

Single Post Navigation

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: