Software

Many of the research methods developed by Center investigators are packaged into software applications and computational services which are made freely available to the scientific community:

A B C D E F G H I J K L M N O P Q R S T U V W X Y Z

Software nameADOMETA
DescriptionADOMETA is a bioinformatics resource designed to predict genes for orphan metabolic activities - known biochemical activities not currently assigned to genes in some or all organisms. ADOMETA returns a ranked list of genes likely to catalyze a given metabolic activity in a selected organism.
URL to software pagehttp://vitkuplab.c2b2.columbia.edu/adometa/adometa.html
Lead investigator(s) Dennis Vitkup
Software nameARACNE
DescriptionARACNE is an algorithm for inferring gene regulatory networks from a set of microarray experiments. The method uses mutual information to identify genes that are co-expressed and then applies the data processing inequality to filter out interactions that are likely to be indirect.
URL to software pagehttp://wiki.c2b2.columbia.edu/califanolab/index.php/Software
Lead investigator(s) Andrea Califano
Software nameBiolearn 1.0
DescriptionBiolearn is a general package for applying probabilistic graphical models to biological applications. Biolearn release 1.0 is concentrated on structure learning for bayesian networks; future releases are expected to include other types of graphical models, support inference applications, and allow plug-and-play addition of new types of probability distributions, scoring functions and search algorithms.
URL to software pagehttp://www.c2b2.columbia.edu/danapeerlab/html/biolearn.html
Lead investigator(s)Dana Pe'er
Software nameB Cell Interactome
DescriptionThe B cell interactome (BCI) is a network of protein-protein, protein-DNA and modulatory interactions in human B cells. The network contains known interactions (reported in public databases) and predicted interactions by a Bayesian evidence integration framework which integrates a variety of generic and context specific experimental clues about protein-protein and protein-DNA interactions - such as a large collection of B cell expression profiles - with inferences from different reverse engineering algorithms, such as GeneWays and ARACNE. Modulatory interactions are predicted by MINDY, an algorithm for the prediction of modulators of transcriptional interactions.
URL to software pagehttp://wiki.c2b2.columbia.edu/califanolab/index.php/Software
Lead investigator(s)Andrea Califano
Software nameCamelot
DescriptionCAusal Modeling with Expression Linkage for cOmplex Traits (Camelot) provides a framework to both model complex traits and identify the potential underlying causal factors. Briefly, Camelot takes genotype, gene expression and phenotype data as input, and output a linear regression model that uses genotype and expression to predict phenotype. Powered by regularized linear regression, Camelot aims to choose features (either genotype or gene expression) that are predictive (and likely causal) to the phenotype. In addition, Camelot also provides functions to prioritize genes residing within a locus that is associated with the trait.
URL to software pagehttp://www.c2b2.columbia.edu/danapeerlab/html/camelot.html
Lead investigator(s)Dana Pe'er
Software nameCLEANER
DescriptionR-system software package for the assembly of informative, transcript-specific probe-clusters for Affymetrix expression microarrays.
URL to software pagehttp://wiki.c2b2.columbia.edu/califanolab/index.php/Cleaner
Lead investigator(s)Mariano Alvarez
Software nameCONDEX (condr)
DescriptionHMM Model to detect copy number variants in exome sequence data
URL to software pagehttp://code.google.com/p/condr/
Lead investigator(s)Itsik Pe'er
Software nameCONEXIC
DescriptionCOpy Number and EXpression In Cancer (CONEXIC) is an algorithm that integrates matched copy number (amplifications and deletions) and gene expression data from tumor samples to identify driving mutations and the processes they influence. CONEXIC is inspired by Module Networks (Segal et al, 2003), but has been augmented by a number of critical modifications that make it suitable for identifying drivers. CONEXIC uses a score-guided search to identify the combination of modulators that best explains the behavior of a gene expression module across tumor samples and searches for those with the highest score within the amplified or deleted region.
URL to software pagehttp://www.c2b2.columbia.edu/danapeerlab/html/conexic.html
Lead investigator(s)Dana Pe'er
Software namecryoEM Refinement Tools
DescriptionPrograms and protocols that can be used to refine atomic structures, either experimentally determined or computationally constructed, within cryo-electron microscopy (cryoEM) density maps of macromolecular complexes.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:cryoEM
Lead investigator(s)Barry Honig
Software nameDASH
DescriptionDASH (DASH Associates Shared Haplotypes) is a tool for detecting association to clusters of IBD segments detected by Germline. It builds upon pairwise IBD shared segments to infer clusters of IBD individuals.
URL to software pagehttp://www1.cs.columbia.edu/~gusev/dash/
Lead investigator(s)Itsik Pe'er
Software nameDelPhi
DescriptionDelPhi provides numerical solutions to the Poisson-Boltzmann equation (both linear and nonlinear form) for molecules of arbitrary shape and charge distribution. The current version is fast (the best relaxation parameter is estimated at run time), accurate (calculation of the electrostatic free energy is less dependent on the resolution of the lattice) and can handle extremely high lattice dimensions. It also includes flexible features for assigning different dielectric constants to different regions of space and treating systems containing mixed salt solutions.
URL to software page http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:DelPhi
Lead investigator(s)Barry Honig
Software nameEDGI
DescriptionEDGI (Enhancer Detection using only Genomic Information) integrates a traditional motif-discovery algorithm SPLASH with a novel local permutation-clustering algorithm PromoClust in order to identify large regulatory elements (e.g. enhancers) as evolutionarily conserved, order-independent clusters of short conserved motifs. EDGI relies on the evolutionary conservation of the binding sites for regulating transcription factors, without requiring knowledge of these transcription factors or their binding sites.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:EDGI
Lead investigator(s)Barry Honig
Software nameEVE
DescriptionEVE is a simulation framework for microbial populations in complex dynamic environments.
URL to software pagehttps://tavazoielab.c2b2.columbia.edu/EVE/
Lead investigator(s)Saeed Tavazoie
Software nameFASD
DescriptionFrequency Analysis of Sequence Data (FASD)
URL to software pagehttp://projects.dbmi.columbia.edu/fasd/index.php
Lead investigator(s)Raul Rabadan
Software nameFIRE
DescriptionFIRE is a motif discovery and characterization program based on mutual information.
URL to software pagehttps://tavazoielab.c2b2.columbia.edu/FIRE/
Lead investigator(s)Saeed Tavazoie
Software nameFIRE-pro
DescriptionFIRE-pro is a motif discovery and characterization program for proteins based on mutual information.
URL to software pagehttps://tavazoielab.c2b2.columbia.edu/FIRE-pro/
Lead investigator(s)Saeed Tavazoie
Software nameGenatomy
DescriptionGenatomy is a visualizing tool for biological data, such as gene expression, genotypes, growth curves, copy number variation and more. It can be used to analyze the data mathematically and to study the biological aspects of the data and the results.
URL to software pagehttp://www.c2b2.columbia.edu/danapeerlab/html/genatomy.html
Lead investigator(s)Dana Pe'er
Software nameGermline
DescriptionGERMLINE is an algorithm for discovering long shared segments of Identity by Descent (IBD) between pairs of (unrelated) individuals in a large population.
URL to software pagehttp://www1.cs.columbia.edu/~gusev/germline/
Lead investigator(s)Itsik Pe'er
Software namegeWorkbench
DescriptiongeWorkbench is a Java application that provides users with an integrated suite of genomics tools. It is built on an open-source, extensible architecture that promotes interoperability and simplifies the development of new as well as the incorporation of pre-existing components. The resulting system provides seamless access to a multitude of both local and remote data and computational services through an integrated environment that offers a unified user experience. Over 50 data analysis and visualization components have been developed for the framework, covering a wide range of genomics domains including gene expression, sequence, structure and network data.
URL to software page http://www.geworkbench.org
Lead investigator(s)Aris Floratos
Software nameGRASP
DescriptionA molecular visualization and analysis program. It is particularly useful for the display and manipulation of the surfaces of molecules and their electrostatic properties.
URL to software page http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:GRASP
Lead investigator(s)Anthony Nicholls, Barry Honig
Software nameGRASP2
DescriptionGRASP2 is an updated version of the GRASP program used for macromolecular structure and surface visualization, contains a large number of new features and scientific tools: Enhanced GUI; Structure alignment and domain database scanning; A gaussian surface generator and new surface coloring schemes; Sequence visualization and alignment; Completed work can be stored in project files; Among the many objects that can be stored in a project file are views of the structure; defined subsets, surfaces; Direct printing to printers at full printer resolution.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:GRASP2
Lead investigator(s)Donald Petrey and Barry Honig
Software nameHADiT
DescriptionHADiT (Haplotype Amplification Distortion in Tumors) is a tool for computing and visualizing allelic distortion in tumor SNP data. Implements the Amplification Distortion Test (ADT).
URL to software pagehttp://www.cs.columbia.edu/~itsik/HADiT/HADiT.html
Lead investigator(s)Itsik Pe'er
Software nameHATS
DescriptionHATS is a tool for calling the amplified alleles and constructing the amplified haplotype within called tumor amplicons.
URL to software pagehttp://www.cs.columbia.edu/~itsik/HATS/HATS.html
Lead investigator(s)Itsik Pe'er
Software nameHLAIBD
DescriptionThe Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self cells from non-self cells. HLA genes are hyper variable with nearly every locus containing over a dozen alleles. This variation plays an important role in autoimmune disease and organ transplantation. HLA typing by serological methods is time consuming and expensive. HLABID is a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from genotype data.
URL to software pagehttp://www.cs.columbia.edu/~itsik/hla_ibd/index.html
Lead investigator(s)Itsik Pe'er
Software nameInference of Modules associated to eQTLs
DescriptionA tool for constructing modules regulated by eQTLs using gene expression and SNP variation data across multiple individuals.
URL to software pagehttp://www.cs.columbia.edu/~itsik/IMR_eQTL2/
Lead investigator(s)Itsik Pe'er
Software nameINFOSTIP
DescriptionA tool for selecting INdividual FOr Sequencing by Total Information Potential, based on GERMLINE output.
URL to software pagehttp://www.cs.columbia.edu/~itsik/INFOSTIP/readme.html
Lead investigator(s)Itsik Pe'er
Software nameiPAGE
DescriptioniPAGE a functional and categorical enrichment program based on mutual information.
URL to software pagehttps://tavazoielab.c2b2.columbia.edu/iPAGE/
Lead investigator(s)Saeed Tavazoie
Software nameJACKAL
DescriptionJackal is a collection of programs designed for the modeling and analysis of protein structures. Its core program is a versatile homology modeling package nest. JACKAL has the following capabilities: 1) comparative modeling based on single, composite or multiple templates; 2) side-chain prediction; 3) modeling residue mutation, insertion or deletion; 4) loop prediction; 5) structure refinement; 6) reconstruction of protein missing atoms;7) reconstruction of protein missing residues; 8) prediction of hydrogen atoms; 9) fast calculation of solvent accessible surface area; 10) structure superimposition.
URL to software page http://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Jackal
Lead investigator(s)Z. Xiang and B. Honig
Software nameJISTIC
DescriptionJISTIC is a tool for analyzing datasets of genome-wide copy number variation to identify driver aberrations in cancer. JISTIC is an improvement over the widely used GISTIC algorithm.
URL to software pagehttp://www.c2b2.columbia.edu/danapeerlab/html/jistic.html
Lead investigator(s)Dana Pe'er
Software nameLitVAn
DescriptionLiterature Vector Analysis, is an algorithm that searches forover-represented terms in papers associated with genes in a gene set. LitVAn uses a manually curated database (NCBI Gene) to connect genes with terms from the complete text of more than 70,000 published scientific articles.
URL to software pagehttp://litvan.bio.columbia.edu/
Lead investigator(s)Dana Pe'er
Software nameMark-Us
DescriptionMark-Us is a web server to assist the assessment of the biochemical function for a given protein structure. Mark-Us identifies related protein structures and sequences, detects protein cavities, and calculates the surface electrostatic potentials and amino acid conservation profile. The results can be browsed by an interactive web interface that allows to integrate Gene Ontology terms, UniProt features, and the Enzyme Classification.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Mark-Us
Lead investigator(s)Barry Honig
Software nameMatrixREDUCE
DescriptionRegulation of gene expression by a transcription factor requires physical interaction between the factor and the DNA, which can be described by astatistical mechanical model. Based on this model, the MatrixREDUCE algorithm uses genome-wide occupancy data for a transcription factor (e.g.ChIP-chip or mRNA expression data) and associated nucleotide sequences to discover the sequence-specific binding affinity of the transcription factor. The sequence specificity of the transcription factor's DNA-binding domain is modeled using a position-specific affinity matrix (PSAM), representing the change in the binding affinity (Kd) whenever a specific position within a reference binding sequence is mutated. The PSAM can be transformed into affinity logo for visualization using the utility program AffinityLogo, and a MatrixREDUCE run can be summarized in an easy-to-navigate webpage using HTMLSummary.
URL to software pagehttp://www.bussemakerlab.org/software/MatrixREDUCE
Lead investigator(s)Harmen J. Bussemaker
Software nameMEDUSA and Gorgon
DescriptionMEDUSA is an algorithm for learning predictive models of transcriptional gene regulation from gene expression and promoter sequence data. By using a statistical learning approach based on boosting, MEDUSA learns cis regulatory motifs, condition-specific regulators, and regulatory programs that predict the differential expression of target genes. The regulatory program is specified as an alternating decision tree (ADT). The Java implementation of MEDUSA allows a number of visualizations of the regulatory program and other inferred regulatory information, implemented in the accompanying Gorgon tool, including hits of significant and condition-specific motifs along the promoter sequences of target genes and regulatory network figures viewable in Cytoscape.
URL to software pagehttp://cbio.mskcc.org/leslielab/software/medusa
Lead investigator(s)David Quigley, Manuel Middendorf, Steve Lianoglou, Anshul Kundaje, Yoav Freund, Chris Wiggins, Christina Leslie
Software nameMINDY
DescriptionGiven a transcription factor of interest, MINDY uses a large set of gene expression profile data to identify potential post-transcriptional modulators of the transcription factor's activity. MINDY is based on a three-way statistical interaction model that captures the post-transcriptional regulatory event where the ability of a transcription factor to activate/repress its target genes is monotonically controlled by a potential modulator gene.
URL to software pagehttp://wiki.c2b2.columbia.edu/califanolab/index.php/Software
Lead investigator(s)Andrea Califano
Software nameModel Quality Assessment: Global and Local Quality Assessment of Protein Models
DescriptionThree protocols have been implemented for model quality assessment (MQA), including direct assessment of local qualities using statistical potentials and two machine-learning based protocols. The statistical potentials used here include newly developed solvation potentials, environmental potentials, pairwise potentials, hydrogen bonding potentials and backbone dihedral potentials. The normalized version of these potentials can be directly used to evaluate local qualities by plotting against residue numbers.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software
Lead investigator(s)Barry Honig
Software nameMutaGeneSys
DescriptionMutaGeneSys uses genome-wide genotype data to estimate individual disease susceptibility. It integrates OMIM and HapMap-based single- and multi-marker predictors of causal SNPs to report the collection of associated alleles in a whole-genome-typed individual.
URL to software pagehttp://www1.cs.columbia.edu/~jds1/MutaGeneSys/
Lead investigator(s)Itsik Pe'er
Software nameNest
DescriptionModeling protein structure based on a sequence-template alignment. The current server works only for modeling with a single template. Part of jackal, which can be downloaded.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:nest
Lead investigator(s)Xiang, Z. and Honig, B.
Software nameOpera
DescriptionOPERA is a tool for power estimation and design of whole-genome resequencing projects aimed at rare variant associations.
URL to software pagehttp://www.cs.columbia.edu/~itsik/OPERA/
Lead investigator(s)Itsik Pe'er
Software namePoolDesign
DescriptionPoolDesign is a tool for designing overlapping pools for variant-carrier identification.
URL to software pagehttp://www.cs.columbia.edu/~itsik/Pooling/pooldesign.jar
Lead investigator(s)Itsik Pe'er
Software namePredUs
DescriptionPredUs (Prediction of Protein Interfaces Using Structural Alignment) is a tool to predict protein-protein interface based on interface conservation among structurally similar protein neighbors.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:PredUs
Lead investigator(s)Barry Honig
Software namePREPPI
DescriptionPREPPI is a database of predicted and experimentally determined protein-protein interactions (PPIs) for yeast and human. Predicted interactions are assigned a likelihood using a Bayesian framework that combines structural, functional, evolutionary and expression information. The database contains ~2 million predictions including 31,402 for yeast and 317,813 for human that are considered high confidence based on our analysis. Experimentally determined interactions are compiled from a set of public databases (e.g., DIP, IntAct, HPRD, etc).
URL to software pagehttp://bhapp.c2b2.columbia.edu/PREPPI/
Lead investigator(s)Barry Honig
Software namePrISM
DescriptionPrISM is an integrated computational system where computational tools are implemented for protein sequence and structure analysis and modeling.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:PrISM
Lead investigator(s)Barry Honig
Software nameProtein-DNA interface alignment
DescriptionThe protein-DNA alignment software allows one to align the interfacial amino acids from two protein-DNA complexes based on the geometric relationship of each amino acid to its local DNA.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Protein-DNA_Interface_Alignment
Lead investigator(s)Barry Honig
Software nameProtein-DNA Modeling Interface Software
DescriptionThe program can be used predict the conformation of the sidechains within a protein-DNA binding interface.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Protein-DNA_Modeling_Interface
Lead investigator(s)Barry Honig
Software nameREDUCE Suite
DescriptionThe REDUCE Suite is a set of software tools to model the regulation of gene expression by transcription factors (TF). By directly correlating genome-wide mRNA expression or TF binding data (e.g. ChIP-chip) with associated nucleotide sequences, the REDUCE Suite can discover the sequence-specific binding affinity of a TF from a single experiment, using all measurements simultaneously, and without using any "background" sequence model. The REDUCE suite has its origin in the REDUCE algorithm of Bussemaker, Li, and Siggia, Nature Genetics (2001), which pioneered the use of motif-based linear regression model to discover cis-regulatory elements (motifs) and infer condition-specific transcription factor activities from a single genomewide mRNA expression profile.
URL to software pagehttp://bussemaker.bio.columbia.edu/software/REDUCE/
Lead investigator(s)Harmen Bussemaker
Software nameSCREEN
DescriptionA web server tool for identifying protein cavities and computing cavity attributes that can be applied for classification and analysis.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:SCREEN
Lead investigator(s)Barry Honig
Software nameShareViz
DescriptionShareviz is a tool for graphical visualization of hidden relatedness based on GERMLINE output.
URL to software pagehttp://www.cs.columbia.edu/~itsik/sharevizWeb/shareviz.html
Lead investigator(s)Itsik Pe'er
Software nameSIXPAC
DescriptionSIXPAC (Search for Interactions is Probably Approximately Complete) is an efficient, scalable search algorithm that finds synergy between pairs of physically unlinked SNPs (genome-wide) in large case-control datasets.
URL to software pagehttp://www.cs.columbia.edu/~snehitp/sixpac/
Lead investigator(s)Itsik Pe'er
Software nameSPLASH
DescriptionSPLASH (Structural Pattern Localization Analysis by Sequential Histograms) is a deterministic pattern discovery algorithm which can find sparse amino or nucleic acid patterns matching identically or similarly in a set of protein or DNA sequences. Sparse patterns of any length, up to the size of the input sequence can be discovered without significant loss in performance. Splash performance is extremely efficient.
URL to software pagehttp://amdec-bioinfo.cu-genome.org/html/Splash.html
Lead investigator(s)Andrea Califano
Software nameSTING Millennium
DescriptionA web based suite of programs for comprehensive and simultaneous analysis of structure and sequence: nature and volume of atomic contacts of both intra- and inter- chain types, analysis of the quality of structure, residue conservation and its relationship with intra-chain contacts, functional parameters are also deciphered.
URL to software pagehttp://luna.bioc.columbia.edu/SMS/STINGm/
Lead investigator(s)Barry Honig
Software nameString kernel package
DescriptionThe string kernel package contains implementations for the mismatch and profile string kernels for use with support vector machine (SVM) classifiers for protein sequence classification. Both kernels compute similarity between protein sequences based on common occurrences of k-length subsequences (k-mers) counted with substitutions. Kernel functions for protein sequence data enable the training of SVMs for a range of prediction problems, in particular protein structural class prediction and remote homology detection. A version of the Spider MATLAB machine learning package is also bundled with the code, which allows users to train SVMs and evaluate performance on test sets with the packaged software.
URL to software pagehttp://svm-fold.c2b2.columbia.edu/
Lead investigator(s)Christina Leslie
Software nameSURFace
DescriptionSURFace algorithms are programs that calculate solvent accessible surface area and curvature corrected solvent accessible surface area
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:SURFace_Algorithms
Lead investigator(s)Barry Honig
Software nameT-profiler
DescriptionT-profiler is a web-based tool that uses the t-test to score changes in the average activity of pre-defined groups of genes. The gene groups are defined based on Gene Ontology categorization, ChIP-chip experiments, upstream matches to a consensus transcription factor binding motif, and location on the same chromosome, respectively. If desired, an iterative procedure can be used to select a single, optimal representative from sets of overlapping gene groups. A jack-knife procedure is used to make calculations more robust against outliers. T-profiler makes it possible to interpret microarray data in a way that is both intuitive and statistically rigorous, without the need to combine experiments or choose parameters.
URL to software pagehttp://www.t-profiler.org
Lead investigator(s)Harmen J. Bussemaker
Software nameTarget Explorer
DescriptionAutomated process of prediction of complex regulatory elements for specified set of transcription factors in Drosophila melanogaster genome. Target Explorer is a complex tool with user-friendly self-explanatory Web-interface that allows to user: 1. create customized library of TF binding site matrices based on user defined sets of training sequences; 2. search for new clusters of binding sites for specified set of TFs; 3.extract annotation for potential target genes.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Target_Explorer
Lead investigator(s)Barry Honig and Richard Mann
Software nameTEISER
DescriptionTEISER is a de-novo motif discovery tool for finding informative structural elements in RNA.
URL to software pagehttps://tavazoielab.c2b2.columbia.edu/TEISER/
Lead investigator(s)Saeed Tavazoie
Software nameTranscriptionDetector
DescriptionA tool for finding probes measuring significantly expressed loci in a genomic array experiment. Given expression data from some tiling array experiment, TranscriptionDetector decides the likelihood that a probe is detecting transcription from the locus in which it resides. Probabilities are assigned by making use of a background signal intensity distribution from a set of negative control probes. This tool is useful for the functional annotation of genomes as it allows for the discovery of novel transcriptional units independently of any genomic annotation.
URL to software pagehttp://www.bussemakerlab.org/software/TranscriptionDetector/
Lead investigator(s)Harmen J. Bussemaker
Software nameTransfactomeDB
DescriptionAccurate and comprehensive information about the nucleotide sequence specificity of trans-acting factors (TFs) is essential for computational and experimental analyses of gene regulatory networks. The Yeast Transfactome Database is a repository of sequence specificity models and condition-specific regulatory activities for a large number of DNA- and RNA-binding proteins in Saccharomyces cerevisiae. The sequence specificities in TransfactomeDB, represented as position-specific affinity matrices (PSAMs), are directly estimated from genomewide measurements of TF-binding using our previously published MatrixREDUCE algorithm, which is based on a biophysical model. For each mRNA expression profile in the NCBI Gene Expression Omnibus, we used sequence-based regression analysis to estimate the post-translational regulatory activity of each TF for which a PSAM is available. The trans-factor activity profiles across multiple experiments available in TransfactomeDB allow the user to explore potential regulatory roles of hundreds of TFs in any of thousands of microarray experiments.
URL to software pagehttp://bussemaker.bio.columbia.edu:8080/YeastTransfactomeDB/
Lead investigator(s)Harmen Bussemaker
Software nameVISTAL
DescriptionA two-dimensional visualization tool for structural alignments. VISTAL describes structures as a series of secondary structure elements, and places matched residues one on top of each other colored according to the three-dimensional distance of their Ca atoms.
URL to software pagehttp://wiki.c2b2.columbia.edu/honiglab_public/index.php/Software:Vistal
Lead investigator(s)Barry Honig
Software nameXplorigin
DescriptionA tool for parsing population ancestry of admixed individuals.
URL to software pagehttp://www.cs.columbia.edu/~itsik/Xplorigin/Xplorigin.htm
Lead investigator(s)Itsik Pe'er
Software nameZinfandel
DescriptionZinfandel is a tool for detecting CNVs from low-pass WGS data.
URL to software pagehttp://www.cs.columbia.edu/~itsik/zinfandel/
Lead investigator(s)Itsik Pe'er