Bioinformatics support
Overview
UITS supports researchers who do biological computing, particularly in the areas of genomics, cell biology and molecular biology. Support is provided in the forms of application support, technical support, and software development. Application support includes testing and licensing of software that is broadly useful across academic departments that analyze molecular data, including biology, medicine, and anthropology. Technical support includes consulting with regard to data management and choice of algorithms and statistical methods of data analysis. Software development includes investment of time assisting researchers developing software that poses challenges for which they lack resources to complete in a timely fashion.
Services
Software on UITS computer systems
UITS provides bioinformatics software on the the Libra cluster (AIX) and on Big Red. The Libra cluster is for long-running single-processor programs, programs that require large amounts of memory, and for parallel programs that require memory that is shared by many processors (SMP). Big Red is configured for massively parallel computing. To request software for general use on research systems, submit a software request form.
The following bioinformatics software is available:
-
BLAST can be used to query hundreds to tens of
thousands of molecular sequences against sequence databases. For
details, see BLAST sequence searching on Big Red.
-
CAP3 is a sequence assembly program. For details,
see Using CAP3 on Big Red at IU.
-
CLUSTAL W aligns multiple sequences. A parallel
version is available on Big Red; see Running CLUSTAL W on Big Red.
-
EMBOSS/wEMBOSS is a set of sequence analysis
applications that use GCG libraries to extend the functionality of the
GCG Package. wEMBOSS is the web interface of the EMBOSS. EMBOSS
command line applications run on the Libra cluster. The wEMBOSS
interface is available on the Discern server
(
discern.uits.iu.edu). For details, see On Libra, what are EMBOSS and wEMBOSS, and how can I use them?
-
FASTA searches databases for sequences that are
similar to a set of query sequences. The parallel version of FASTA is
available on Big Red. For more, see FASTA programs on Big Red.
-
fastDNAml generates phylogenetic trees from
molecular data using maximum likelihood methods. The program
implements Felsenstein's likelihood methods and Olson's search
algorithm. This implementation was parallelized by the UITS High
Performance Applications team. For more, see Maximum likelihood analysis of phylogenetic data.
-
GARLI conducts phylogenetic searches on aligned
sequences using maximum likelihood criteria. It supports codon- and
amino-acid-based models in addition to standard nucleotide models. See
What is GARLI, and how can I use it on Quarry?
-
MEME detects patterns in nucleotide and protein
sequences and efficiently uses multiple processors. See Using MEME on Big Red at IU.
-
MIGRATE uses genotypes of individuals to form
clusters that represent populations, assign individuals to
populations, and describe genotypes of individuals in terms of
populations from which they were derived. It applies maximum
likelihood and Bayesian analysis. See What is MIGRATE, and how can I use it on Quarry?
-
MPI-HMMER creates and queries sets of hidden
Markov models that describe molecular sequences. This port of HMMER
includes parallel versions of hmmpfam and hmmsearch and is available
on Big Red. See Using MPI-HMMER on Big Red at IU.
-
MrBayes estimates evolutionary trees using
Bayesian methods. It has been compiled to use multiple processors. For
more, see On Big Red, how do I submit batch jobs that run MrBayes?
-
MULTICLUSTAL optimizes multiple sequence
alignments by searching for the best alignment parameters. See Running MULTICLUSTAL on Big Red at IU.
-
PAUP estimates evolutionary trees and is
available on the Libra cluster. For details, see Using PAUP on the Libra cluster at IU.
-
PHYLIP is a well known package for phylogenetic
inferences. It is available on the Libra cluster. For help, see the program
documentation and Using PHYLIP on Libra at IU.
-
structure is a program that can cluster
individuals based on their genotypes and can assign individuals to
clusters (populations) based on their genotypes. It is installed on
Quarry. For more, see What is structure, and how do I use it on Quarry?
- TREE-PUZZLE estimates phylogenetic trees from sequence data using maximum likelihood methods. A parallel version is available on Big Red. For details, see Using TREE-PUZZLE on Big Red at IU.
Data
Centralized Life Sciences Data (CLSD) provides an access point through which a number of public data sources can be queried programmatically using SQL. It is also possible to phrase NCBI BLAST searches as SQL queries. Information about CLSD is available at:
http://rtinfo.uits.iu.edu/clsd/Consulting is available to help researchers find
solutions to bioinformatics computing problems. Contact
bioinfor@iu.edu .
This document was developed with support from the National Science Foundation (NSF) under Grant No. 0503697 to the University of Chicago and subcontracted to Indiana University. Additional support was provided by IU through its participation in the TeraGrid, which is supported by the NSF under Grants No. 0833618, SCI451237, SCI535258, and SCI504075. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the NSF.
Also see:
- What statistical and mathematical software packages are available on IU's supercomputers?
- At IU, what supercomputer systems are available for academic research?
- At IU, what research computing services are available?
- On IU's TeraGrid-enabled system Big Red, what software is available to researchers with TeraGrid accounts?
- At IU, what software is available on the research computing systems, and how may I request that software be added?
Last modified on September 24, 2008.






