Arthur P Goldberg, PhD
- ASSOCIATE PROFESSOR | Genetics and Genomic Sciences
Research Topics:Autism, Autophagy, Bioinformatics, Cancer Genetics, Computational Biology, Computer Simulation, Genetics, Genomics, Human Genetics and Genetic Disorders
Professor Goldberg has a degree in Astrophysics from Harvard and a PhD in Computer Science from UCLA. He entered bioinformatics after a distinguished career researching distributed software at IBM Research and network protocol performance at NYU’s Courant Institute of Mathematical Sciences.
As a Research Scientist in Bioinformatics in the Plant Systems Biology Laboratory at NYU he co-designed and co-implemented the Virtual Plant System, which provides tools to conveniently visualize and analyze gene expression and network data about the model plant Arabidopsis and crops.
At Sloan-Kettering, Dr. Goldberg co-designed and co-developed the cBioPortal for Cancer Genomics, a website that enables researchers to visualize and analyze multiple types of cancer genetic data. The cBioPortal stores all of the NCI’s reduced Cancer Genome Atlas (TCGA) data, along with data from multiple other studies.
As a member of the Seaver Autism Center in the Icahn School of Medicine at Mount Sinai, Prof. Goldberg studies the genetic causes of autism. He is the founding Director of the Bioinformatic Hub for the NIMH-funded Autism Sequencing Consortium (ASC). The Hub stores published and unpublished sequence data and called variants from autism studies by ASC members and provides secure, shared data access and cluster computing to ASC members. Dr. Goldberg has built a three person bioinformatics team for the Seaver Center.
BA, Harvard College
PhD, University of California, Los Angeles
Software tools for studying heritable diseases
Professor Goldberg creates programs that organize and analyze genetic data. Recently developed tools include
- A program that identifies the transmission status of variants on the sex chromosomes in males and females.
- A pipeline that inputs variant call (vcf) and pedigree (PED) files and filters outlier subjects and common variants to produce input for the Transmission And De novo Analysis (TADA) gene risk statistical analysis. The analysis tallies transmitted, de novo, and case-control variants, while properly handling the sex chromosomes and integrating dependency analysis to perform well with over 10,000 subjects.
- A database that stores and analyzes identifier aliases and pedigree relationships in large samples of genetic subjects.
Gene discovery in complex heritable diseases such as autism
Prof. Goldberg and his collaborators work on identifying risk genes in Autism Spectrum Disorder (ASD).
Work with Christopher Poultney, Joseph Buxbaum and others found that small (1-30 kilobase) copy number variant deletions are enriched in ASD cases versus matched controls. Genes disrupted by these variants are overrepresented in the autophagy pathway, suggesting that improper pruning of neuronal interconnections during development may contribute to autism. This work won a top 10 paper of the year award from SFARI in 2013.
Work with many collaborators including Silvia De Rubeis, Xin He, Christopher Poultney, Menachem Fromer, Kathryn Roeder, Bernie Devlin, Mark Daly, Joseph Buxbaum, and other members of the ASC focused on discovering new autism risk genes on the autosomes. Employing a powerful new statistical model called Transmission and De novo Analysis (TADA) [He et. al., 2013] we implicate 22 autosomal genes at an FDR < 0.05. In this analysis, Dr. Goldberg developed and operated software that input VCF and pedigree files and extracted rare variants while filtering outlier samples. We are continuing this research with additional data and improved calls from the GATK Haplotype Caller.
De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Cicek AE, Kou Y, Liu L, Fromer M, Walker S, Singh T, Klei L, Kosmicki J, Shih-Chen F, Aleksic B, Biscaldi M, Bolton PF, Brownfeld JM, Cai J, Campbell NG, Carracedo A, Chahrour MH, Chiocchetti AG, Coon H, Crawford EL, Curran SR, Dawson G, Duketis E, Fernandez BA, Gallagher L, Geller E, Guter SJ, Hill RS, Ionita-Laza J, Jimenz Gonzalez P, Kilpinen H, Klauck SM, Kolevzon A, Lee I, Lei I, Lei J, Lehtimäki T, Lin CF, Ma'ayan A, Marshall CR, McInnes AL, Neale B, Owen MJ, Ozaki N, Parellada M, Parr JR, Purcell S, Puura K, Rajagopalan D, Rehnström K, Reichenberg A, Sabo A, Sachse M, Sanders SJ, Schafer C, Schulte-Rüther M, Skuse D, Stevens C, Szatmari P, Tammimies K, Valladares O, Voran A, Li-San W, Weiss LA, Willsey AJ, Yu TW, Yuen RK, Cook EH, Freitag CM, Gill M, Hultman CM, Lehner T, Palotie A, Schellenberg GD, Sklar P, State MW, Sutcliffe JS, Walsh CA, Scherer SW, Zwick ME, Barett JC, Cutler DJ, Roeder K, Devlin B, Daly MJ, Buxbaum JD. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 2014 Nov; 515(7526).
Poultney CS, Goldberg AP, Drapeau E, Kou Y, Harony-Nicolas H, Kajiwara Y, De Rubeis S, Durand S, Stevens C, Rehnström K, Palotie A, Daly MJ, Ma'ayan A, Fromer M, Buxbaum JD. Identification of small exonic CNV from whole-exome sequence data and application to autism spectrum disorder. American journal of human genetics 2013 Oct; 93(4).
Cerami E, Gao J, Dogrusoz U, Gross BE, Sumer SO, Aksoy BA, Jacobsen A, Byrne CJ, Heuer ML, Larsson E, Antipin Y, Reva B, Goldberg AP, Sander C, Schultz N. The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data. Cancer discovery 2012 May; 2(5).
Gaugler T, Klei L, Sanders SJ, Bodea CA, Goldberg AP, Lee AB, Mahajan M, Manaa D, Pawitan Y, Reichert J, Ripke S, Sandin S, Sklar P, Svantesson O, Reichenberg A, Hultman CM, Devlin B, Roeder K, Buxbaum JD. Most genetic risk for autism resides with common variation. Nature genetics 2014 Aug; 46(8).
Katari MS, Nowicki SD, Aceituno FF, Nero D, Kelfer J, Thompson LP, Cabello JM, Davidson RS, Goldberg AP, Shasha DE, Coruzzi GM, Gutiérrez RA. VirtualPlant: a software platform to support systems biology research. Plant physiology 2010 Feb; 152(2).
TCGA Consortium . Comprehensive molecular characterization of human colon and rectal cancer. Nature 2012 Jul; 487(7407).
Buxbaum JD, Daly MJ, Devlin B, Lehner T, Roeder K, State MW, ASC Members . The autism sequencing consortium: large-scale, high-throughput sequencing in autism spectrum disorders. Neuron 2012 Dec; 76(6).