Xinghua Mindy Shi
Postdoctoral Fellow in Machine Learning and Privacy
One postdoctoral position in Machine Learning and Privacy is available in the Department of Computer and Information Sciences at Temple University. The appointee will support and complement ongoing projects in developing new algorithms for machine learning and privacy-preserving modeling. The appointee will work closely with scientists in the Institute for Genomics and Evolutionary Medicine (iGEM) and Center for Data Analytics and Biomedical Informatics at Temple, other institutions in Philadelphia (e.g. Penn Medicine) and beyond (e.g. Stevens Institute of Technology) to develop privacy preserving machine learning methods.
Candidate must have or be close to obtaining a Ph.D. in quantitative fields (include but not limited to Computer Science, Statistics, Mathematics, and Physics) and have demonstrated a high level of research productivity through publication in peer-reviewed conferences and journals. Experiences with machine learning or data privacy is required. As a team player working closely with cross-institutional scientists with broad areas of expertise, strong interpersonal and communication skills are essential.
Postdoctoral Fellow in Computational Biology / Bioinformatics
One postdoctoral position in computational biology and bioinformatics is available in the Department of Computer and Information Sciences at Temple University. The appointee will support and complement ongoing projects in developing computational approaches/pipelines to integrate and analyze large-scale (epi)genomic, expression and interaction data sets for scientific discovery and validation. The appointee will work closely with scientists in the Institute for Genomics and Evolutionary Medicine (iGEM) and Center for Data Analytics and Biomedical Informatics at Temple, other institutions (e.g. Penn Medicine, Jackson Laboratory for Genomic Medicine) and International Consortium like The Human Genome Structural Variation Consortium (HGSVC) to develop new methods and pipelines toward precision medicine.
Candidate must have or be close to obtaining a Ph.D. in quantitative fields (include but not limited to Bioinformatics, Computational Biology, Computer Science, Statistics, Mathematics, and Physics) and have demonstrated a high level of research productivity through publication in peer-reviewed conferences and journals. Extensive experiences using scripting languages (e.g. Perl, Python) and statistical/mathematical toolkits (e.g. R, SAS, Matlab, Mathematica) are required. Knowledge of machine/statistical learning and mathematical modeling techniques is desirable. As a team player working closely with cross-institutional scientists with broad areas of expertise, strong interpersonal and communication skills are essential.
Recently celebrating its 50th year anniversary, CIS@Temple is one of the oldest computer science departments in the country and is experiencing tremendous growth in its research and academic programs. Temple University is a Carnegie R1 institution that serves more than 40,000 students and is ranked #44 among top public universities by the U.S. News & World Report. Located in the heart of Philadelphia (the 5th largest city in the United States, known for its arts, culture, history and affordable living), Temple University is in close proximity to many outstanding research centers and industry partners in information technology, healthcare, biotechnology, and finance.
We offer a collegial environment, excellent facilities, and a competitive salary commensurate with experience. Applicants should email a single PDF file including cover letter, CV, brief research statement and list of 3 references to Dr. Mindy Shi (email@example.com).
I am seeking for highly motivated graduate students with undergraduate training in computer science, statistics, mathematics, physics or quantitative biological sciences. Full assistantship will be provided upon admission. Please check out the admission information for CIS PhD and Bioinformatics PhD respectively. Please contact Dr. Mindy Shi (firstname.lastname@example.org) if you are interested.
Current Group Members
Chong Li (Bioinformatics PhD student, fall 2020)
Sky Gao (Bioinformatics PhD student, fall 2020)
Mohammad Erfan Molaei (CIS PhD student, summer 2021)
Chen Song (CIS PhD student, spring 2021)
Bin Li (CIS PhD student, spring 2021)
Zhengkang Fan (CIS PhD student, fall 2021)
Parisa Ghasemi (CIS PhD student, fall 2021)
AlumniListed by (Role, Duration in the lab, Last known position)
Postdocs: Dr. Junjie Chen (postdoctoral researcher, 2018-21, Assistant Professor of Computer Science at Harbin Institute of Technology, Shenzhen, China), Dr. Letu Qingge (postdoctoral researcher, 2018-19, Assistant Professor of Computer Science at North Carolina A&T University), Dr. Jia Wen (postdoctoral researcher, 2015-18, Postdoctoral Research Associate at University of North Carolina at Chapel Hill), Dr. Rubén Armañanzas (postdoctoral researcher, 2013, Research Assistant Professor at George Mason University)
PhD students: Conor Nodzak (PhD, 2016-19, Senior Data Technology Analyst at Bank of America), Andrew Quitadamo (PhD, 2013-18, Bioinformatics Scientist at Boston Children's Hospital), Sisi Yuan (PhD rotation, 2018, PhD student at the University of North Carolina at Charlotte), Benika Hall (PhD, 2014-2017, Senior Data Scientist at NVIDIA), James Johnson (PhD rotation, 2017, PhD student at the University of North Carolina at Charlotte)
Master, undergraduate students and interns: Yichen Du (Data Science undergraduate student, 2020), Jill Jenkins (PSM, 2019, Data Scientist at GAVS Technologies), Katherine Jensen (PSM and undergraduate student, 2018-19, PhD student at the University of North Carolina at Charlotte), Valentina Talevi (PSM, 2019, PhD Student at German Center for Neurodegenerative Disease), Angat Puri (PSM, 2017, Data Scientist at the Hartford), Lauren Gridley (PSM, 2017, Bioinformatician at RTI International), Seyed Nader Nazemi (PSM, 2016), Sasan Najar (PSM, 2016), Heena Desai (PSM, 2016, Bioinformatician at University of Pennsylvania School of Medicine), Dahlia Shvets (PSM, 2015, Data Scientist at Scripps Health), Lu Tian (PSM, 2015), Frederick Lin (PSM, 2014, Sr. Research Data Analyst at Northwestern University), Brianna Lam (PSM, 2014, Bioinformatician at Ayass BioScience), Swanthana Rekulapally (PSM, 2014, Research Associate at University of North Carolina at Chapel Hill), Shivam Patel (undergraduate, 2019, ), Erin Tsai (undergraduate, 2018), Alan King (undergraduate, 2016), Gang (Chris) Chen (intern, 2015), Ana C. Jacobs (intern, 2014).
We work at the intersection of data science, computer science and life sciences, focusing on the development of statistical and machine learning methods for biomedical research.
Ph.D. and M.S. Department of Computer Science , University of Chicago, Chicago, IL. M.Eng. and B. Eng. Department of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.
Associate Professor, Department of Computer & Information Sciences , Temple University , (2019-) Assistant Professor, Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, (2013-19) Postdoctoral Research Fellow and NIH T32 Genetics Fellow, Brigham and Women's Hospital, Harvard Medical School,(2009-12) Visiting Research Fellow, Broad Institute of MIT and Harvard, (2009-12)
Selected Publications (Full publications available at Google Scholar )(* Denote current or former trainees with Dr. Shi.)
- "PAR-GAN: Improving the Generalization of Generative Adversarial Networks Against Membership Inference Attacks.", Chen J*, Wang HW, Gao H, and Shi X, In Proceedings of The 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2021), August 14-18, 2021, Virtual. (Code for PAR-GAN is available at GitHub)
- "On the Convergence of Stochastic Compositional Gradient Descent Ascent Method.", Gao H, Wang X, Luo L, and Shi X, In Proceedings of The 30th International Joint Conference on Artificial Intelligence (IJCAI 2021), August 21-26, 2021, Virtual.
- "Haplotype-resolved diverse human genomes and integrated analysis of structural variation.", Ebert P, Audano PA, Zhu Q, Rodriguez-Martin B, Porubsky D, Bonder M, Sulovari A, Ebler J, Zhou W, Mari RS, Yilmaz F, Zhao X, Hsieh P, Lee J, Kumar S, Lin J, Rausch T, Chen Y, Ren J, Santamarina M, Hops W, Ashraf H, Chuang NT, Yang X, Munson KM, Lewis AP, Fairley S, Tallon LJ, Clarke WE, Basile AO, Byrska-Bishop M, Corvelo A, Chaisson MJP, Chen J*, Li C*, Brand H, Wenger AM, Ghareghani M, Harvey WT, Raeder B, Hasenfeld P, Regier A, Abel H, Hall I, Flicek P, Stegle O, Gerstein MB, Tubio JMC, Mu Z, Li YI, Shi X, Hastie AR, Ye K, Chong Z, Sanders AD, Zody MC, Talkowski ME, Mills RE, Devine SE, Lee C, Korbel JO, Marschall T, anb Eichler EE, Science, 25 Feb 2021. bioRxiv
- "Differential Privacy Protection against Membership Inference Attack on Genomic Data",Chen J*, Wang WH, and Shi X, In Proceedings of The 26th Pacific Symposium on Biocomputing (PSB 2021) , January 3-7, 2021, The Big Island of Hawaii. (Code for MIA-GAN is at GitHub) BioRxiv
- "Population-scale Genomic Data Augmentation Based on Conditional Generative Adversarial Networks",Chen J* and Shi X, In Proceedings of The 11th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB 2020), September 21-24, 2020, Virtual. (Code for PG-cGAN is available at GitHub)
- "A Parallelized Strategy for Epistasis Analysis Based on Empirical Bayesian Elastic Net Models", Wen J*, Ford CT, Janies D, and Shi X, Bioinformatics, 2020, 36(12), Pages 3803–3810. (Code for parEBEN is available at GitHub)
- "Association of CNVs with Methylation Variation", Shi X, Radhakrishnan S, Wen J*, Chen JY, Chen J*, Lam BA*, Mills RE, Stranger BE, Lee C, and Setlur SR, npj Genomic Medicine, 5, 41 (2020).
- "Sparse Convolutional Denoising Autoencoders for Genotype Imputation", Chen J* and Shi X, Genes, 2019, 10(9), 652. (Code for SCDA is available at GitHub)
- "A Sparse Convolutional Predictor with Denoising Autoencoders for Phenotype Prediction",Chen J* and Shi X , In Proceedings of The 10th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB 2019), September 7-10, 2019, Niagara Falls, NY. (Code for SCP_DAE is available at GitHub)
- "Multi-platform Discovery of Haplotype-resolved Structural Variation in Human Genomes",Chaisson MJP, Sanders A.D, Zhao X, Malhotra A, Porubsky D, Rausch T, Gardner EJ, Rodriguez O, Guo L, Collins RL, Fan X, Wen J*, Handsaker RE, Fairley S, Kronenberg ZN, Kong X, Hormozdiari F, Lee D, Wenger AM, Hastie A, Antaki D, Audano P, Brand H, Cantsilieris S, Cao H, Cerveira E, Chen C, Chen X, Chin C-S, Chong Z, Chuang NT, Church DM, Clarke L, Farrell A, Flores J, Galeev T, David G, Gujral M, Guryev V, Haynes-Heaton W, Korlach J, Kumar, S, Kwon JY, Lee JE, Lee J, Lee W-P, Lee SP, Marks P, Valud-Martinez K, Meiers S, Munson KM, Navarro F, Nelson BJ, Nodzak C*, Noor A, Kyriazopoulou-Panagiotopoulou S, Pang A, Qiu Y, Rosanio G, Ryan M, Stutz A, Spierings DCJ, Ward A, Welsch AE, Xiao M, Xu W, Zhang C, Zhu Q, Zheng-Bradley X, Jun G, Ding L, Koh CL, Ren B, Flicek P, Chen K, Gerstein MB, Kwok P-Y, Lansdorp PM, Marth G, Sebat J, Shi X, Bashir A, Ye K, Devine SE, Talkowski M, Mills RE, Marschall T, Korbel J, Eichler EE and Lee C, Nature Communications, 2019, 10:1784
- "A Deep Auto-encoder Model for Gene Expression Prediction", Xie R, Wen J*, Quitadamo A*, Cheng J, and Shi X, BMC Genomics, 2017, 18(Suppl 9):845.
- "Bayesian Hyperparameter Optimization for Machine Learning Based eQTL Analysis", Quitadamo A*, Johnson J*, and Shi X, In Proceedings of the 8th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM BCB 2017), pp 98-106, Boston, MA, August 20-23, 2017.
- "An Overview of Human Genetic Privacy", Shi X and Wu X, Annals of NY Academy of Sciences, 2016, DOI: 10.1111/nyas.13211.
- “An Integrated Map of Structural Variation in 2,504 Human Genomes”, Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, Zhang Y, Ye K, Jun G, Fritz M, Konkel MK, Malhotra A, Stütz AM, Shi X, Casale FP, Chen J, Hormozdiari F, Dayama G, Chen K, Malig M, Chaisson M, Walter K, Meiers S, Kashin S, Garrison E, Auton A, Lam H, Mu XJ, Alkan C, Antaki D, Bae T, Chines P, Chong Z, Clarke L, Dal E, Ding L, Emery S, Fan X, Gujral M, Kahveci F, Kidd JM, Kong Y, Lameijer E-W, McCarthy S, Flicek P, Gibbs RA, Marth G, Menelaou A, Muzny DM, Nelson BJ, Noor A, Parrish NF, Quitadamo A*, Raeder B, Schadt E, Schlattl A, Shabalin AA, Untergasser A, Walker JA, Wang M, Yu Y, Zhang C, Zhang J, Zheng-Bradley X, Zhou W, Zichner T, Sebat J, Batzer MA, McCarroll SA, The 1000 Genomes Project Consortium, Mills RE, Gerstein MB, Bashir A, Stegle O, Devine SE, Lee C, Eichler EE, Korbel JO, Nature, 2015. 562(7571): 75-81.
- "A Global Reference for Human Genetic Variation", The 1000 Genomes Project Consortium, Nature, 2015. 562(7571): 68-74.
- "An Integrated Map of Genetic Variation from 1,092 Human Genomes", The 1000 Genomes Project Consortium, Nature, 2012, 491: 56-65.
- "A Two-Graph Guided Multi-task Lasso Approach for eQTL Mapping", Chen X*, Shi X*, Xu X, Wang Z, Mills R, Lee C, Xu J, In Proceedings of the fifteenth international conference on Artificial Intelligence and Statistics (AISTATS 2012). JMLR W&CP, 2012, 22: 208-217.
|Last Modified: June 2, 2021|