Software Tools

For the purpose of organizing, visualizing, analyzing and modeling data from such sources we develop computational approaches which can assist experimental systems-biologists to form rational hypotheses for further experimentation. We analyze high-dimensional data collected for projects integrating results from multiple layers of regulation (genomics, transcriptomics and proteomics). In addition to computational approaches, we also develop software so that such methodologies can reach and impact the interested research community. Below are some of the software tools we developed:

 

x2k  

Expression2Kinases: Genome-wide mRNA profiling provides a snapshot of the global state of cells under different conditions. However, mRNA levels do not provide direct understanding of upstream regulatory mechanisms. Here, we present a new approach called Expression2Kinases (X2K) to identify upstream regulators likely responsible for observed patterns in genome-wide gene expression. By integrating ChIP-seq/chip and position-weight-matrices (PWMs) data, protein-protein interactions, and kinase-substrate phosphorylation reactions, we can better identify regulatory mechanisms upstream of genome-wide differences in gene expression. We validated X2K by applying it to recover drug targets of FDA approved drugs from drug perturbations followed by mRNA expression profiling; to map the regulatory landscape of 44 stem cells and their differentiating progeny; to profile upstream regulatory mechanisms of 327 breast cancer tumors; and to detect pathways from profiled hepatic stellate cells and hippocampal neurons. The X2K approach can advance our understanding of cell signaling and unravel drugs mechanisms of action.
Website: X2K | PMID: 22080467

 

 Lists2Networks

Lists2Networks (Integrated Analysis for Lists of Mammalian Genes) is a web-based system that allows users to upload and analyze lists of mammalian gene-sets in a client-server software application. Within their workspace users can examine the overlap among the lists they upload, manipulate lists with different set operations, expand lists using existing mammalian networks of protein-protein, co-expression correlations, or background knowledge annotation correlations, and apply simple gene-set enrichment analyses on many gene lists at once against a plethora of prior knowledge datasets.
Website: List2Networks | PMID: 20152038

 

 GATE

GATE (Grid Analysis of Time-series Expression) is a computational software platform for integrated visualization and analysis of expression time-series. Given a high-dimensional time-series dataset, GATE employs a clustering algorithm which creates movies of expression dynamics by assigning individual genes/proteins to hexagons on a hexagonal array and dynamically coloring each hexagon according to the expression level of the molecular species to which it is associated. Additionally, in order to infer potential regulatory control mechanisms from patterns of time-series correlations, GATE allows interactive interrogation of the movies with a wide variety of background knowledge datasets.
Website: GATE | PMID: 19892805

 

 KEA

KEA (Kinase Enrichment Analysis) is a web-based tool with an underlying database providing users with the ability to link lists of mammalian proteins/genes with the kinases that phosphorylate them. The system draws from several available kinase-substrate databases to computes kinase enrichment probability based on the distribution of kinase-substrate proportions in the background kinase-substrate database compared with kinases found to be associated with an input list of genes/proteins.
Websites: KEA web-interface | KEA command-line version | PMID: 19176546

 

 ChEA

ChEA (ChIP-X Enrichment Analysis) database contains manually extracted datasets of transcription-factor/target-gene interactions from over 100 experiments such as ChIP-chip, ChIP-seq, ChIP-PET applied to mammalian cells. We use the database to analyze mRNA expression data where we perform gene-list enrichment analysis as the prior biological knowledge gene-list library. The system is delivered as web-based interactive software. With this software users can input lists of mammalian genes for which the program computes over-representation of transcription factor targets from the ChEA database.
Websites: ChEA web interface | ChEA command-line version | PMID: 20709693

 

 Genes2Networks

Genes2Networks is a command-line software tool that can be used to place lists of mammalian genes in the context of a background mammalian signalome and interactome networks. The input to the program is a list of human Entrez Gene gene symbols and background networks in SIG format, while the output includes: (a) all identified interactions for the genes/proteins, (b) a subnetwork connecting the genes/proteins using intermediate components that are used to connect the genes, (c) ranking of the specificity of intermediate components to interact with the list of genes/proteins.
Websites: Genes2Networks command-line version | Genes2Networks web interface | PMID: 17916244

 

 AVIS

AVIS (AJAX Viewer for Signaling Networks) is a visualization tool for viewing and sharing intracellular signaling, gene regulation and protein interaction networks. AVIS is implemented as an AJAX enabled syndicated Google gadget. It allows any webpage to render an image from a text file representation of signaling, gene regulatory, or protein interaction networks.
Website: AVIS | PMID: 17855420

 

 SNAVI

SNAVI (Signaling Networks Analysis and Visualization) is a Windows-based desktop application that implements standard network analysis methods to compute the clustering, connectivity distribution, and detection of network motifs, as well as provides means to visualize networks and network motifs. SNAVI is capable of generating linked web pages from network datasets loaded in text format. SNAVI can also create networks from lists of gene or protein names. SNAVI is a useful tool for analyzing, visualizing and sharing cell signaling data. SNAVI is open source free software.
Website: SNAVI | PMID: 19154595

 

 Sig2BioPAX

Sig2BioPAX is a command-line Java program that can be used to convert structured text files describing molecular interactions into the BioPAX Level 3 standard format.
Website: Sig2BioPAX | PMID: 21418653

 

Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies. 
Website: Genes2WordCloud | PMID: 21995939

FNV (Flashed-based Network Viewer) is for the visualization of small to moderately sized biological networks and pathways. FVN can also be used to embed pathways inside PDF files for the communication of pathways in soft publication materials.
Website: FNV | PMID: 21349871

PubMed Alert Me! is a software utility that allows users to enter a list of PubMed queries. Once a list of queries is configured, the program runs either daily or weekly. It searches PubMed and if it finds new matching published papers, the program sends an e-mail notification with a list of links to the new articles.
Website: PubMed Alert Me! | PMID: 18402930

SequenceViewer is a software system that includes two bioinformatics tools: DNAViewer and RepeatsViewer. These Windows based computer programs create graphical images from text DNA sequences in FASTA format. The images created by the tools can help researchers identify repeating patterns, letter concentrations in different regions of DNA and possibly give clues about DNA physical structure as well as identifying novel promoter binding sites.
Website: SequenceViewer

 

Databases and Datasets

Additional databases and datasets generated by the lab are available for download from the Systems Biology Center New York's Databases and Datasets page.

Network Datasets for Download: in SIG file format

Neuronal Signaling Network - A massive literature search was applied to develop a large-scale mammalian neuronal cell signaling network. This network integrate cell signaling pathways specific to mammalian neurons. The network was analyzed for topological properties in the in this article published in Science. The most recent version of this network is available at: http://amp.pharm.mssm.edu/SAVI.txt.

Adhesome - A comprehensive literature derived biochemical network was developed in collaboration with Benny Geiger's Lab. The network is made of known interactions and cellular components composing the focal adhesion complex in mammalian cells. The Adhesome web-site provides a reference and supporting materials to the analysis published in Nature Cell Biology.


Contact Us

Avi Ma'ayan, PhD
Tel: 212-659-1739
Fax: 212-831-0114
Send e-mail

Icahn Medical Institute
1425 Madison Avenue
Room 12-78 (Office), 12-76 (Lab)
One Gustave L. Levy Place
Box 1215
New York, NY 10029

In the News

HIPK2 Regulator Protein Plays a Crucial Role in Kidney Fibrosis
Press release on News-Medical.net

New Database Could Speed Up Drug Discovery
Tech news feature on CNET

Animating Molecular Biology
Article in Biomedical Computation Review

Systematic Tracking of Cell Fate Changes
News and views article in Nature Biotechnology

Computational Honeycombs Drip with Data
News item in NIGMS Computing Life

Molecular Movies: New Software Animates Gene Expression Data
Technology observation on Scientific American Online

Stem Cells, Systems Biology and Human Feedback
News feature in Nature Reports Stem Cells