Software Tools
For the purpose of organizing, visualizing, analyzing and modeling data from such sources we develop computational approaches which can assist experimental systems-biologists to form rational hypotheses for further experimentation. We analyze high-dimensional data collected for projects integrating results from multiple layers of regulation (genomics, transcriptomics and proteomics). In addition to computational approaches, we also develop software so that such methodologies can reach and impact the interested research community. Below are some of the software tools we developed:
|
Expression2Kinases: Genome-wide mRNA profiling provides a snapshot of the global state of cells under different conditions. However, mRNA levels do not provide direct understanding of upstream regulatory mechanisms. Here, we present a new approach called Expression2Kinases (X2K) to identify upstream regulators likely responsible for observed patterns in genome-wide gene expression. By integrating ChIP-seq/chip and position-weight-matrices (PWMs) data, protein-protein interactions, and kinase-substrate phosphorylation reactions, we can better identify regulatory mechanisms upstream of genome-wide differences in gene expression. We validated X2K by applying it to recover drug targets of FDA approved drugs from drug perturbations followed by mRNA expression profiling; to map the regulatory landscape of 44 stem cells and their differentiating progeny; to profile upstream regulatory mechanisms of 327 breast cancer tumors; and to detect pathways from profiled hepatic stellate cells and hippocampal neurons. The X2K approach can advance our understanding of cell signaling and unravel drugs mechanisms of action.
|
|
| |
Lists2Networks (Integrated Analysis for Lists of Mammalian Genes) is a web-based system that allows users to upload and analyze lists of mammalian gene-sets in a client-server software application. Within their workspace users can examine the overlap among the lists they upload, manipulate lists with different set operations, expand lists using existing mammalian networks of protein-protein, co-expression correlations, or background knowledge annotation correlations, and apply simple gene-set enrichment analyses on many gene lists at once against a plethora of prior knowledge datasets.
|
![]() |
GATE (Grid Analysis of Time-series Expression) is a computational software platform for integrated visualization and analysis of expression time-series. Given a high-dimensional time-series dataset, GATE employs a clustering algorithm which creates movies of expression dynamics by assigning individual genes/proteins to hexagons on a hexagonal array and dynamically coloring each hexagon according to the expression level of the molecular species to which it is associated. Additionally, in order to infer potential regulatory control mechanisms from patterns of time-series correlations, GATE allows interactive interrogation of the movies with a wide variety of background knowledge datasets.
|
| |
KEA (Kinase Enrichment Analysis) is a web-based tool with an underlying database providing users with the ability to link lists of mammalian proteins/genes with the kinases that phosphorylate them. The system draws from several available kinase-substrate databases to computes kinase enrichment probability based on the distribution of kinase-substrate proportions in the background kinase-substrate database compared with kinases found to be associated with an input list of genes/proteins.
|
![]() |
ChEA (ChIP-X Enrichment Analysis) database contains manually extracted datasets of transcription-factor/target-gene interactions from over 100 experiments such as ChIP-chip, ChIP-seq, ChIP-PET applied to mammalian cells. We use the database to analyze mRNA expression data where we perform gene-list enrichment analysis as the prior biological knowledge gene-list library. The system is delivered as web-based interactive software. With this software users can input lists of mammalian genes for which the program computes over-representation of transcription factor targets from the ChEA database.
|
| |
Genes2Networks is a command-line software tool that can be used to place lists of mammalian genes in the context of a background mammalian signalome and interactome networks. The input to the program is a list of human Entrez Gene gene symbols and background networks in SIG format, while the output includes: (a) all identified interactions for the genes/proteins, (b) a subnetwork connecting the genes/proteins using intermediate components that are used to connect the genes, (c) ranking of the specificity of intermediate components to interact with the list of genes/proteins.
|
![]() |
AVIS (AJAX Viewer for Signaling Networks) is a visualization tool for viewing and sharing intracellular signaling, gene regulation and protein interaction networks. AVIS is implemented as an AJAX enabled syndicated Google gadget. It allows any webpage to render an image from a text file representation of signaling, gene regulatory, or protein interaction networks.
|
| |
SNAVI (Signaling Networks Analysis and Visualization) is a Windows-based desktop application that implements standard network analysis methods to compute the clustering, connectivity distribution, and detection of network motifs, as well as provides means to visualize networks and network motifs. SNAVI is capable of generating linked web pages from network datasets loaded in text format. SNAVI can also create networks from lists of gene or protein names. SNAVI is a useful tool for analyzing, visualizing and sharing cell signaling data. SNAVI is open source free software.
|
![]() |
Sig2BioPAX is a command-line Java program that can be used to convert structured text files describing molecular interactions into the BioPAX Level 3 standard format.
|
Genes2WordCloud is a word-cloud generator and a word-cloud viewer that is based on WordCram implemented using Java, Processing, AJAX, mySQL, and PHP. Text is fetched from several sources and then processed to extract the most relevant terms with their computed weights based on word frequencies.
Website: Genes2WordCloud | PMID: 21995939
FNV (Flashed-based Network Viewer) is for the visualization of small to moderately sized biological networks and pathways. FVN can also be used to embed pathways inside PDF files for the communication of pathways in soft publication materials.
Website: FNV | PMID: 21349871
PubMed Alert Me! is a software utility that allows users to enter a list of PubMed queries. Once a list of queries is configured, the program runs either daily or weekly. It searches PubMed and if it finds new matching published papers, the program sends an e-mail notification with a list of links to the new articles.
Website: PubMed Alert Me! | PMID: 18402930
SequenceViewer is a software system that includes two bioinformatics tools: DNAViewer and RepeatsViewer. These Windows based computer programs create graphical images from text DNA sequences in FASTA format. The images created by the tools can help researchers identify repeating patterns, letter concentrations in different regions of DNA and possibly give clues about DNA physical structure as well as identifying novel promoter binding sites.
Website: SequenceViewer
Databases and Datasets
Additional databases and datasets generated by the lab are available for download from the Systems Biology Center New York's Databases and Datasets page.
Network Datasets for Download: in SIG file format
Neuronal Signaling Network - A massive literature search was applied to develop a large-scale mammalian neuronal cell signaling network. This network integrate cell signaling pathways specific to mammalian neurons. The network was analyzed for topological properties in the in this article published in Science. The most recent version of this network is available at: http://amp.pharm.mssm.edu/SAVI.txt.
Adhesome - A comprehensive literature derived biochemical network was developed in collaboration with Benny Geiger's Lab. The network is made of known interactions and cellular components composing the focal adhesion complex in mammalian cells. The Adhesome web-site provides a reference and supporting materials to the analysis published in Nature Cell Biology.
Avi Ma'ayan, PhD
Tel: 212-659-1739
Fax: 212-831-0114
Send e-mail
Icahn Medical Institute
1425 Madison Avenue
Room 12-78 (Office), 12-76 (Lab)
One Gustave L. Levy Place
Box 1215
New York, NY 10029
HIPK2 Regulator Protein Plays a Crucial Role in Kidney Fibrosis
Press release on News-Medical.net
New Database Could Speed Up Drug Discovery
Tech news feature on CNET
Animating Molecular Biology
Article in Biomedical Computation Review
Systematic Tracking of Cell Fate Changes
News and views article in Nature Biotechnology
Computational Honeycombs Drip with Data
News item in NIGMS Computing Life
Molecular Movies: New Software Animates Gene Expression Data
Technology observation on Scientific American Online
Stem Cells, Systems Biology and Human Feedback
News feature in Nature Reports Stem Cells







