snakemake - Snakemake is a workflow management system that aims to reduce the complexity of creating workflows by providing a fast and comfortable execution environment, together with a clean and modern specification language in python style. Build bioinformatics pipelines with Snakemake
toil - A scalable, efficient, cross-platform and easy-to-use workflow engine in pure Python
Ruffus - Ruffus is a Computation Pipeline library for python. It is open-sourced, powerful and user-friendly, and widely used in science and bioinformatics.
Dataset
awesome-public-datasets - An awesome list of (large-scale) public datasets on the Internet. (On-going collection)
Tools
csvkit - A suite of utilities for converting to and working with CSV, the king of tabular file formats. http://csvkit.rtfd.org/