Breadcrumb
Data Science Bootcamps
Data Science Bootcamps – Fall 2025
Sponsored by Aegon Transamerica
The Department of Statistics and Actuarial Science is excited to announce this year’s Data Science Bootcamps schedule. Our distinguished presenters include Duncan Leaf, Congrui Yi, Jin Meng, and Kate Ralston, who will present on data analytics programming, business communication, and observational data analysis.
Dr. Duncan E. Leaf
Research Scientist, Center for Health Policy & Economics, University of Southern California
Dr. Leaf will give four presentations:
Tuesday, September 2, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1
Understanding an Observational Data-Generating Process
This lecture sets up the motivation for understanding observational data and how it is different from randomized experimental data. While randomized experiments and observational studies are usually treated as two separate areas of statistics, I will argue that data from randomized experiments may be observational until proven otherwise. Therefore, understanding bias and causal reasoning is also useful when working with experimental data. I will introduce our working example of analyzing diabetes in the NHANES data set.
Tuesday, September 9, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1
Complex Survey Sample Design and Analysis
This lecture walks through details of a complex survey design using NHANES as an example. We then look at weighting methods used to make a survey sample look like the population and variance estimation methods used to get valid inferences from survey samples, with R and SAS examples.
Tuesday, September 16, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1
Bias and Causal Inference
The primary goal of this lecture is understanding the fundamental problems of causal inference. Counterfactual reasoning is introduced using potential outcomes notation. The secondary goal is to introduce basic causal analysis methods: inverse probability of treatment weighting and instrumental variable regression. R examples will be provided.
Tuesday, September 23, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1
Project Presentations
Students will give two presentations of their NHANES data analysis project. The first presentation is non-technical and aimed at decision makers who have no knowledge or interest of statistical methods. The second presentation will explain your methods and results for a technical audience.
Dr. Congrui Yi
Senior Research Scientist, Meta
Dr. Yi will give two presentations:
Tuesday, September 30, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/99513585111
Introduction to Multi-Armed Bandits
Multi-armed bandit (MAB) is a simple but powerful modeling framework for making sequential decisions given distributional uncertainty and dynamic environments. Given the right to choose an action at a time and collect corresponding rewards, MAB algorithms aim to maximize cumulative rewards by balancing two competing objectives: exploration of new or less chosen actions, and exploitation of ones shown to be rewarding. This lecture introduces basic concepts including action, reward, regret, exploration-exploitation trade-off, and most common types of MAB algorithms, such as epsilon-greedy, Upper Confidence Bound (UCB), Thompson Sampling (TS), and Boltzmann Exploration. Additionally, it will include a hands-on Python coding exercise and a discussion of real-world applications.
Tuesday, October 7, 6:00-8:00pm CT
Zoom presentation: https://uiowa.zoom.us/j/99513585111
Introduction to Contextual Bandits
Following the introduction to MAB in lecture 1, this lecture continues to cover contextual bandits, which extends the classic MAB by incorporating contextual information, allowing us to handle more complex real world problems and make more informed decisions. We will go through several types of models including LinUCB (Linear Upper Confidence Bound), LinTS (Linear Thompson Sampling), BLIP-TS (Bayesian Linear Probit Regression with TS), Neural Contextual Bandits etc. Moreover, we will associate each type with examples of industrial applications in areas such as online advertising, recommender systems and dynamic pricing, with a focus on motivations and formulations.
Dr. Jin Meng
Senior Data Scientist, Zurich North America
Dr. Meng will give two presentations:
Tuesday, October 14, 6:00-8:00pm CT
In-person presentation: 107 EPB
Data Science Application in Insurance on Traditional Tabular Data
In the first lecture, we will cover the relatively traditional approach of analyzing tabular data in the insurance field. The overall objective of this lecture is to give students both conceptual understanding and practical experience with data science workflows in a real-world insurance context.
Tuesday, October 21, 6:00-8:00pm CT
In-person presentation: 107 EPB
Data Science Application in Insurance on Unstructured Textual Data
In the second lecture, we will explore more cutting-edge techniques in natural language processing (NLP) using large language models (LLMs). The focus will be on building a chatbot solution, as a data science proof-of-concept solution utilizing a commonly used LLM and a Retrieval-Augmented Generation (RAG) framework to provide accurate answers to user questions based on external domain knowledge documents provided to the LLM. The goal of this lecture is to expose students to data science technique stack used in developing end-to-end proof-of-concept or prototype solutions based on cutting edge techniques.
Dr. Kate Ralston
Director of Data Analytics, Enrollment Management, The University of Iowa
Dr. Ralston will give two presentations:
Tuesday, October 28, 6:00-8:00pm CT
In-person presentation: 107 EPB
Before You Build: How to Scope, Ask, and Align Like a Pro
The goal of this session is to introduce participants to early stages of project communication including gathering requirements, developing a project scope, timelines and communication plan. We will learn to distinguish between the business need, project goals and data questions, practice requirement gathering and communicating technical choices necessary to complete a project. In addition, we will discuss easy wins, common setbacks and safeguards that help project success.
**A homework will be assigned between these two sessions.**
Tuesday, November 4, 6:00-8:00pm CT
In-person presentation: 107 EPB
Making the Message Matter: Presenting Results with Purpose
During this session participants will learn to identify multiple audiences and their needs to construct a targeted and meaningful project deliverable. We will practice discussing projects in multiple formats: elevator pitch, visual presentation and executive summary report. Additionally, participants will have an opportunity to learn to handle questions and contentious project feedback with expertise and grace.
Past Data Science Bootcamps
Data Science Bootcamp 2024
We are excited to announce the data science bootcamp offered by the Department of Statistics and Actuarial Science: "Observational Studies: Design and Analysis." This bootcamp is open to all Data Science, Statistics, or Actuarial Science students.
On October 8th our speaker is Dr. Duncan E. Leaf, who is a research scientist at the USC Schaeffer Center for Health Policy & Economics. You can learn more about Duncan here: https://healthpolicy.usc.edu/staff/duncan-leaf-phd/
Dr. Leaf's research centers on understanding the economic impacts of health policy decisions, with a particular focus on exploring the effects of palliative care interventions on medical costs and quality of life as well as simulating the long-term effects of early-childhood education interventions. In his BootCamp talk, Dr. Leaf will share key insights on effectively communicating complex health policy analysis, translating technical research into actionable strategies for policymakers, and bridging the gap between statisticians, health economists, and decision-makers to drive impactful change. His vast experience at the intersection of Statistics, Health Policy, and Economics makes him an invaluable speaker for anyone interested in the critical role of analytics in shaping economic policy.
Bootcamp Overview:
- Title: Observational Studies: Design and Analysis
- Duration: 8 hours
- Dates: Tuesdays, October 8, 15, 22, and 29
- Time: 6:00 - 8:00 p.m. CST
Objectives:
Students will learn key concepts, including:
1. Introduction to observational data
2. Complex survey sample design and analysis
3. Addressing bias in observational studies
4. Causal inference methodologies
The bootcamp will feature exercises and group discussions to reinforce these concepts,
November 5, 2024
We are pleased to announce that Paul Hampton, Head of the Strategic Insights, Data, & Analytics team at Transamerica, will be our speaker for the upcoming Business Communications BootCamp - Part II. You can learn more about Paul here: https://www.linkedin.com/in/pauldhampton1/
Paul leads a team dedicated to developing cutting-edge data infrastructure and analytics solutions that play a pivotal role in shaping business decisions at Transamerica. In his second BootCamp talk, he will continue from the first part and share valuable insights on managing and expanding a data science team, effective strategies for communicating technical results to business leaders, and fostering collaboration between data scientists and decision-makers.
Paul will also discuss how clear and impactful communication of his team’s findings has contributed to both team growth and the overall success of the business.