Useful bioinformatics software packages in Linux
Go back to parent page Introduction to Linux for bioinformatics
Below I have compiled a non-exhaustive list of useful bioinformatics software packages. First check with your software center whether the packages are available. If not, follow the installation instructions on the the webpage of the package.
Contents
- 1 UGene, an integrated bioinformatics suite
- 2 R, the statistical language project
- 3 RStudio
- 4 IGV, the integrative genomics viewer
- 5 Qualimap suite to analyse quality metrics of NGS data
- 6 Cytoscape, a interactive network analysis tool
- 7 GNU Octave, the open-source matlab clone
- 8 Multi Experiment viewer, MeV
- 9 RapidMiner, data analysis toolkit
- 10 bedtools
- 11 SAMtools
UGene, an integrated bioinformatics suite
UGene is available via the Software Manager in Ubuntu-based distributions.
R, the statistical language project
R is the most widely used language for computing and statistical analysis. It can be extended via the CRAN repository. It has also a lot of bioinformatics extensions distributed via http://bioconductor.org/. Every bioinformatician should know at least the basics of R.
Be aware! Installing R on your own machine can take a while. Always use the latest version of R, and update frequently. It will break your scripts from time to time, but there is no reason to postpone upgrading, since the new R code is better, faster and stronger then before.
Be aware! Some extensions are not maintained well, and become obsolete while R advances to next versions.
RStudio
A very popular and powerful graphical interface to R. It comes with productivity enhancing extensions, such as RMarkdown and knitr extensions to document your code easily, help search screen, and more.
IGV, the integrative genomics viewer
See http://www.broadinstitute.org/igv/
The Integrative Genomics Viewer (IGV) is a high-performance visualization tool for interactive exploration of large genomic datasets.
Qualimap suite to analyse quality metrics of NGS data
See http://qualimap.bioinfo.cipf.es/
Qualimap is a platform-independent application written in Java and R that provides both a Graphical User Inteface (GUI) and a command-line interface to facilitate the quality control of alignment sequencing data.
Cytoscape, a interactive network analysis tool
Open-source platform for netwerk analysis and visualisation.
GNU Octave, the open-source matlab clone
See http://www.gnu.org/software/octave/
A powerful numerical computations language, with extensive graphical capabilities.
Multi Experiment viewer, MeV
See http://www.tm4.org/mev.html
MeV is a versatile microarray data analysis tool, incorporating sophisticated algorithms for clustering, visualization, classification, statistical analysis and biological theme discovery.
RapidMiner, data analysis toolkit
See http://rapid-i.com/content/view/181/
Open-source system for data mining. It is available as a stand-alone application for data analysis and as a data mining engine for the integration into own applications.
bedtools
See https://github.com/arq5x/bedtools2
BEDTools utilities address common genomics tasks such as finding feature overlaps and computing coverage.
SAMtools
SAM Tools provide various utilities for manipulating alignments in the SAM format, including sorting, merging, indexing and generating alignments in a per-position format.
Go back to parent page Introduction to Linux for bioinformatics