The SSGG Member Engagement Committee presents a webinar on "Data Thinning to Overcome Double Dipping"
4/25/2024 | 4:00 - 5:00 PM (Eastern) / 1:00 - 2:00 PM (Pacific)
Registration (free but required): zoom.us/webinar/register/WN_YqpkeaPcTYmiGxQ3odHG_g
Abstract: We refer to the practice of using the same data to fit and validate a model as double dipping. Problems arise when standard statistical procedures are applied in settings that involve double dipping. To circumvent the challenges associated with double dipping, one approach is to fit a model on one dataset, and then validate the model on another independent dataset. When we only have access to one dataset, we typically accomplish this via sample splitting. Unfortunately, in many unsupervised problems, sample splitting does not allow us to avoid double dipping. In this talk, we are motivated by unsupervised problems that arise in the analysis of single-cell RNA sequencing data. We first propose Poisson count splitting, which splits a single observation drawn from a Poisson distribution into two independent components. We show that Poisson count splitting allows us to avoid double dipping in the context of our motivating problems. We next generalize the count splitting framework to a variety of distributions and refer to the generalized framework as data thinning. Data thinning is a very general alternative to sample splitting that is useful far beyond the context of single-cell RNA sequencing data, and, unlike sample splitting, can be applied in both supervised and unsupervised settings.
Speaker Bio: Dr. Anna Neufeld completed her Ph.D. in Statistics at the University of Washington in 2023, under the guidance of Professor Daniela Witten. She worked on problems related to "double dipping" and testing data-driven hypotheses, including problems related to the analysis of single-cell RNA sequencing data. She is now a postdoctoral research fellow at the Fred Hutch Cancer Center in Seattle, WA, working with Professor Jeff Leek.
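For readers who want a concrete picture of the Poisson count splitting mentioned in the abstract, here is a minimal sketch. It is not the speaker's code. It relies on the standard Poisson thinning fact: if X follows a Poisson distribution with mean lambda, and X_train given X is Binomial(X, eps), then X_train and X_test = X - X_train are independent Poisson variables with means eps * lambda and (1 - eps) * lambda. The function name `poisson_count_split`, the parameter `eps`, and the simulated data are illustrative assumptions.

```python
import numpy as np


def poisson_count_split(counts, eps=0.5, rng=None):
    """Split Poisson counts into two parts by binomial thinning.

    Hypothetical helper for illustration; `eps` is the share of each
    count assigned (in expectation) to the first part.
    """
    rng = np.random.default_rng() if rng is None else rng
    counts = np.asarray(counts)
    # Each unit of count goes to the first part with probability eps.
    train = rng.binomial(counts, eps)
    # The remainder forms the second part, so train + test == counts.
    test = counts - train
    return train, test


# Simulated data (an assumption): 100,000 draws from Poisson(5).
rng = np.random.default_rng(0)
x = rng.poisson(lam=5.0, size=100_000)
x_train, x_test = poisson_count_split(x, eps=0.5, rng=rng)

# The two parts add back up to the original counts, and their sample
# correlation should be close to zero when the data are truly Poisson.
print(x_train.mean(), x_test.mean())
print(np.corrcoef(x_train, x_test)[0, 1])
```

In this sketch a model could be fit on `x_train` and validated on `x_test`. The independence of the two parts depends on the Poisson assumption holding for the data.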
------------------------------------------------------------
Previous Events
The recordings for previous webinars and select short courses can be accessed through the members-only site. If you are not a section member yet, you may join us through the Join page of the Section on Statistics in Genomics and Genetics (amstat.org).
Webinar on The importance of genetic evidence to improve productivity in drug discovery & development
Dr. Matthew R. Nelson
Webinar on Translating polygenic risk scores in the clinic: promises and challenges
Dr. Bogdan Pasaniuc
Date/Time: Tuesday, January 23, 2024 4pm - 5pm EST.
Webinar on Rigor, Reproducibility, and Mindful Programming
Dr. Claudia Solís-Lemus & Dr. Stephanie Hicks
Date/Time: Nov 29 2023 11am - 12pm EST
Webinar: Time order structure and feature learning for trajectory modeling
Speaker: Benedict Anchang, MSc, PhD
Date/Time: Oct 23 2023, 2pm - 3pm EST.
Panel Discussion: Academic Careers and Job Search
The Membership Engagement Committee of the American Statistical Association’s Section of Statistical Genomics and Genetics (ASA SSGG) proudly presents a panel discussion on academic careers and job search with a particular focus on statistical genomics and genetics. The 1.5-hour session will be virtual on Tuesday, September 26, 2023, 2 - 3:30 PM EST. We are excited to have invited six panelists who represent various positions, career stages, and fields of research in academia: Dr. Alison A. Motsinger-Reif (NIEHS), Dr. Nianjun Liu (Indiana University Bloomington), Dr. Brooke Fridley (Moffitt Cancer Center), Dr. Shilin Li (Ohio State University), Dr. Jack O’Brien (Bowdoin College), and Dr. Kelsey Grinde (Macalester College).
SSGG JSM Business Meeting
All members are welcome to join our section’s business meeting at JSM.
Date: 8/8/2023. Time: 6-7pm
Location: InterContinental Toronto Centre Oakville Room
SSGG JSM Walk/Jog/Run
For this upcoming JSM, we would like to announce an SSGG Run/Jog/Walk along the lakefront on Tuesday, August 8, at 6:15 AM. Come and meet your colleagues in SSGG as we take a relaxing jog (or adrenaline-pumping run) along Lake Ontario! Let's gather at the east end of the Simcoe Wavedeck, directly on the lakefront, a 10-minute walk from the Metro Toronto Convention Center, between 6:00 and 6:15. We plan to take off at 6:15 and go westward along the Martin Goodman Trail. Depending on the number of participants and their interests, we expect to split into small groups dynamically. Some may run, some may jog slowly, and some may walk. Our plan is to make it back in one hour, by 7:15 (sessions start at 8:30), but this is an out-and-back trail that follows the lakefront, so you can go as far as you like.
SSGG JSM Dinner
The Membership Engagement Committee of the American Statistical Association's Section of Statistical Genomics and Genetics (ASA SSGG) is excited to invite you to join us for a Dinner-Together event (at your own expense) if you will be attending JSM. Let's gather and network with each other in a nice restaurant within walking distance of the Metro Toronto Convention Center.
Time: Sunday, August 6, 6:30-8:30 PM (EST)
Restaurant: Le Sélect Bistro (leselectbistro.com)
ASA SSGG MEC Contact: Judong Shen judong.shen@merck.com
SSGG Logo Design Competition
Calling all artists and designers! Showcase your talent and be part of ASA's SSGG logo design competition. We're in search of a captivating logo that represents the spirit of our section and resonates with our diverse community. Whether you're an experienced graphic designer or simply passionate about creating visual masterpieces, this is your chance to shine. Submit your unique designs and stand a chance to win exciting cash prizes. Don't miss out on this incredible opportunity to leave your artistic mark on ASA's SSGG section. Join the creative adventure today and make a lasting impression with your extraordinary skills!
Webinar: Tips for (Your First) JSM!
Nancy Zhang (Penn) and Michael Wu (Fred Hutch)
Time: Thursday, July 27, 1-2 PM PST / 4-5 PM EST.
Panel Discussion: Industry Careers and Internships
John Palcza (Merck), Audrey Y. Chu (GSK), Olukayode Sosina (Regeneron), Jingchunzi Shi (23andMe)
Thursday, May 25, 2023 11:00 AM - 12:00 PM EST
ASA-SSGG Short Course Series: Selective Introduction to Multi-Omics Analysis
Rick Chang, Sierra Niemiec, Dr. Jack Pattee, Wenjia Wang (Course Instructors)
Dr. George Tseng and Dr. Katerina Kechris (Course Directors)
Date/Time: April 11, 13, 18, 20, 2023 (4x90min live sessions over two weeks). 3:00-4:30pm (Eastern)
Webinar: Bayesian Methods for Spatially Resolved Transcriptomics Data Analysis
Qiwei Li (University of Texas at Dallas)
Date/Time: Mar 27, 2023 02:00 PM EST
Webinar: Recommendations on the use and reporting of race, ethnicity, and ancestry in genetic research
Alyna Khan (University of Washington)
Dr. Sarah C. Nelson (University of Washington)
Date/Time: 02/28/2023 | 1:00 - 2:00 PM (Eastern) / 10:00 - 11:00 AM (Pacific)
Panel Discussion: NIH Grant Funding and Grant Review
Dr. Katerina Kechris (University of Colorado)
Dr. Jingyi Jessica Li (University of California, Los Angeles)
Dr. Li-Xuan Qin (Memorial Sloan Kettering Cancer Center)
Dr. Grzegorz A. Rempala (The Ohio State University)
Date/Time: Jan 31, 2023. 02:30 - 03:30 PM in Eastern Time (US and Canada)
Webinar: Manuscript Writing Tips and Guidelines
Speakers: Drs. Mingyao Li and Sanjay Shete
Time: Nov 17, 2022 01:00 PM (Eastern)
Webinar: Multivariate Integration of Multi-Omics Data
Speaker: Dr. Kim-Anh Lê Cao
Time: Oct 24, 2022 05:00 PM (Eastern)
Webinar: Pharmacogenomics (PGx) at Merck: Strategy, Projects and Research
Speaker: Dr. Judong Shen (Merck Research Laboratories)
Time: Sep 26, 2022 03:00 PM (Eastern)
Career Panel: Grant Application Panel Discussion
Dr. Saonli Basu, University of Minnesota
Dr. Mingyao Li, University of Pennsylvania
Dr. Victoriya Volkova, NIH BMRD SRO
Dr. Judy Huixia Wang, NSF DMS PD
Date/Time: Jun 16, 2022 01:00 PM in Eastern Time (US and Canada)
Webinar: Using Genomic Data Repositories for Secondary Analysis: Promises and Challenges
Speaker: Dr. Saonli Basu
Time: Apr 25, 2022 02:00 PM in Eastern Time (US and Canada)
Webinar: Multi-Omics Integration: Problems, Potential and Promise
Speaker: Dr. Katerina Kechris (Colorado School of Public Health)
Time: March 21, 2022, 1-2 pm in Eastern Time (US and Canada)
Short Course: An Introduction to Deep Learning in Omics
Speaker: Dr. Wei Sun (Fred Hutchinson Cancer Research Center)
Dr. Nancy Zhang (the Wharton School at the University of Pennsylvania)
Time: Jan 11, 13, 18, and 20, 2022, 3pm-4:30 pm in Eastern Time (US and Canada)
Webinar: Leveraging Mentorship
Speaker: Dr. Ruth Gotian
Time: Feb 7, 2022, 1-2 pm in Eastern Time (US and Canada)
Webinar: Network-based methods for analysis of microbiome and metabolomic data
Speaker: Dr. Jing Ma (Fred Hutchinson Cancer Research Center)
Time: Nov 22, 2021, 12 pm in Eastern Time (US and Canada)
Webinar: Advanced statistical methods for genetic association studies
Speaker: Dr. Zhengzheng Tang (University of Wisconsin-Madison)
Time: Oct 25, 2021, 02:00PM in Eastern Time (US and Canada)
Webinar: Statistical Methods for Analysis of Heterogeneous Tumor Samples
Speaker: Dr. Wenyi Wang
Time: Sep 20, 2021, 02:00 PM in Eastern Time (US and Canada)
Annual Business Meeting
Date/Time: August 13th, 1:00-2:30 PST / 2:00-3:30 MST / 3:00-4:30 CST / 4:00-5:30 EST
Career Panel: Time Management, Research Strategy and Healthy Habits for Graduate Students
Moderator: Nicholas Weaver
Panelists:
Dr. Kelsey Grinde, Assistant Professor of Statistics, Macalester College
Dr. Rui Duan, Assistant Professor of Biostatistics, Harvard University
Dr. Danielle Braun, Senior Research Scientist in Biostatistics, Dana-Farber Cancer Institute
Date/Time: Jun 28, 2021 02:00 PM in Eastern Time (US and Canada)
Webinar: Statistical Genomics at Procter and Gamble
Speaker: Drs. Dionne Swift and Kellen Kresswell (Data and Modeling Sciences Organization)
Date/Time: Jun 24, 2021 01:00 PM in Eastern Time (US and Canada)
Webinar (jointly hosted with WNAR): Computational Analyses of Multi-Modal Single-Cell Data
Speaker: Dr. John Marioni (https://www.ebi.ac.uk/research/marioni)
Date/Time: May 28 2021 at 1 pm EDT
Webinar: Genetics of Within-Subject Variability and Diabetes Complications
Speaker: Dr. Jin Zhou, Associate Professor, University of Arizona
Date/Time: April 26 2021 at 4 pm EST
Webinar: Interrogating the Gut Microbiome: Estimation of Bacterial Growth Rate and Prediction of Biosynthetic Gene Clusters
Speaker: Dr. Hongzhe Li/Hongzhe Lee, Perelman Professor of Biostatistics, Epidemiology and Informatics; Director, Center for Statistics in Big Data, Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania.
Date/Time: March 22 2021 at 2 pm EST
Webinar: Addressing bias in genetic epidemiology for admixed populations
Speaker: Dr. Genevieve Wojcik, Assistant Professor of Epidemiology, Johns Hopkins Bloomberg School of Public Health.
Date/Time: February 22 2021 at 2 pm EST (11 am PST).