Data Science Bootcamps – Fall 2025

Sponsored by Aegon Transamerica

The Department of Statistics and Actuarial Science is excited to announce this year’s Data Science Bootcamps schedule. Our distinguished presenters include Duncan Leaf, Congrui Yi, Jin Meng, and Kate Ralston, who will present on data analytics programming, business communication, and observational data analysis.

 

Dr. Duncan E. Leaf

Research Scientist, Center for Health Policy & Economics, University of Southern California

Dr. Leaf will give four presentations:

 

Tuesday, September 2, 6:00-8:00pm CT

Zoom presentation:  https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1

Understanding an Observational Data-Generating Process

This lecture sets up the motivation for understanding observational data and how it is different from randomized experimental data. While randomized experiments and observational studies are usually treated as two separate areas of statistics, I will argue that data from randomized experiments may be observational until proven otherwise. Therefore, understanding bias and causal reasoning is also useful when working with experimental data. I will introduce our working example of analyzing diabetes in the NHANES data set.

 

Tuesday, September 9, 6:00-8:00pm CT

Zoom presentation:  https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1

Complex Survey Sample Design and Analysis

This lecture walks through details of a complex survey design using NHANES as an example. We then look at weighting methods used to make a survey sample look like the population and variance estimation methods used to get valid inferences from survey samples, with R and SAS examples.

 

Tuesday, September 16, 6:00-8:00pm CT

Zoom presentation:  https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1

Bias and Causal Inference

The primary goal of this lecture is understanding the fundamental problems of causal inference. Counterfactual reasoning is introduced using potential outcomes notation. The secondary goal is to introduce basic causal analysis methods: inverse probability of treatment weighting and instrumental variable regression. R examples will be provided.

 

Tuesday, September 23, 6:00-8:00pm CT

Zoom presentation:  https://uiowa.zoom.us/j/97612545875?pwd=izZZhc35ABbIa1DgLvDSGtJRXKF3py.1

Project Presentations

Students will give two presentations of their NHANES data analysis project. The first presentation is non-technical and aimed at decision makers who have no knowledge or interest of statistical methods. The second presentation will explain your methods and results for a technical audience.

 

Dr. Congrui Yi

Senior Research Scientist, Meta

Dr. Yi will give two presentations:

 

Tuesday, September 30, 6:00-8:00pm CT

Zoom presentation: https://uiowa.zoom.us/j/99513585111

Introduction to Multi-Armed Bandits

Multi-armed bandit (MAB) is a simple but powerful modeling framework for making sequential decisions given distributional uncertainty and dynamic environments. Given the right to choose an action at a time and collect corresponding rewards, MAB algorithms aim to maximize cumulative rewards by balancing two competing objectives: exploration of new or less chosen actions, and exploitation of ones shown to be rewarding. This lecture introduces basic concepts including action, reward, regret, exploration-exploitation trade-off, and most common types of MAB algorithms, such as epsilon-greedy, Upper Confidence Bound (UCB), Thompson Sampling (TS), and Boltzmann Exploration. Additionally, it will include a hands-on Python coding exercise and a discussion of real-world applications.

 

Tuesday, October 7, 6:00-8:00pm CT

Zoom presentation: https://uiowa.zoom.us/j/99513585111

Introduction to Contextual Bandits

Following the introduction to MAB in lecture 1, this lecture continues to cover contextual bandits, which extends the classic MAB by incorporating contextual information, allowing us to handle more complex real world problems and make more informed decisions. We will go through several types of models including LinUCB (Linear Upper Confidence Bound), LinTS (Linear Thompson Sampling), BLIP-TS (Bayesian Linear Probit Regression with TS), Neural Contextual Bandits etc. Moreover, we will associate each type with examples of industrial applications in areas such as online advertising, recommender systems and dynamic pricing, with a focus on motivations and formulations.

 

Dr. Jin Meng

Senior Data Scientist, Zurich North America

Dr. Meng will give two presentations:

 

Tuesday, October 14, 6:00-8:00pm CT

In-person presentation: 107 EPB

Data Science Application in Insurance on Traditional Tabular Data

In the first lecture, we will cover the relatively traditional approach of analyzing tabular data in the insurance field. The overall objective of this lecture is to give students both conceptual understanding and practical experience with data science workflows in a real-world insurance context.

 

Tuesday, October 21, 6:00-8:00pm CT

In-person presentation: 107 EPB

Data Science Application in Insurance on Unstructured Textual Data

In the second lecture, we will explore more cutting-edge techniques in natural language processing (NLP) using large language models (LLMs). The focus will be on building a chatbot solution, as a data science proof-of-concept solution utilizing a commonly used LLM and a Retrieval-Augmented Generation (RAG) framework to provide accurate answers to user questions based on external domain knowledge documents provided to the LLM. The goal of this lecture is to expose students to data science technique stack used in developing end-to-end proof-of-concept or prototype solutions based on cutting edge techniques.

 

Dr. Kate Ralston

Director of Data Analytics, Enrollment Management, The University of Iowa

Dr. Ralston will give two presentations:

 

Tuesday, October 28, 6:00-8:00pm CT

In-person presentation:  107 EPB

Before You Build: How to Scope, Ask, and Align Like a Pro

The goal of this session is to introduce participants to early stages of project communication including gathering requirements, developing a project scope, timelines and communication plan. We will learn to distinguish between the business need, project goals and data questions, practice requirement gathering and communicating technical choices necessary to complete a project. In addition, we will discuss easy wins, common setbacks and safeguards that help project success.

 

**A homework will be assigned between these two sessions.**

 

Tuesday, November 4, 6:00-8:00pm CT

In-person presentation:  107 EPB

Making the Message Matter: Presenting Results with Purpose

During this session participants will learn to identify multiple audiences and their needs to construct a targeted and meaningful project deliverable. We will practice discussing projects in multiple formats: elevator pitch, visual presentation and executive summary report. Additionally, participants will have an opportunity to learn to handle questions and contentious project feedback with expertise and grace.

 

 

 


Past Data Science Bootcamps

Data Science Bootcamp 2024

We are excited to announce the data science bootcamp offered by the Department of Statistics and Actuarial Science: "Observational Studies: Design and Analysis." This bootcamp is open to all Data Science, Statistics, or Actuarial Science students.

On October 8th our speaker is Dr. Duncan E. Leaf, who is a research scientist at the USC Schaeffer Center for Health Policy & Economics. You can learn more about Duncan here: https://healthpolicy.usc.edu/staff/duncan-leaf-phd/

Dr. Leaf's research centers on understanding the economic impacts of health policy decisions, with a particular focus on exploring the effects of palliative care interventions on medical costs and quality of life as well as simulating the long-term effects of early-childhood education interventions. In his BootCamp talk, Dr. Leaf will share key insights on effectively communicating complex health policy analysis, translating technical research into actionable strategies for policymakers, and bridging the gap between statisticians, health economists, and decision-makers to drive impactful change. His vast experience at the intersection of Statistics, Health Policy, and Economics makes him an invaluable speaker for anyone interested in the critical role of analytics in shaping economic policy.


Bootcamp Overview:


- Title: Observational Studies: Design and Analysis
- Duration: 8 hours
- Dates: Tuesdays, October 8, 15, 22, and 29
- Time: 6:00 - 8:00 p.m. CST

Objectives:
Students will learn key concepts, including:
1. Introduction to observational data
2. Complex survey sample design and analysis
3. Addressing bias in observational studies
4. Causal inference methodologies

The bootcamp will feature exercises and group discussions to reinforce these concepts,

 

November 5, 2024

We are pleased to announce that Paul Hampton, Head of the Strategic Insights, Data, & Analytics team at Transamerica, will be our speaker for the upcoming Business Communications BootCamp - Part II. You can learn more about Paul here: https://www.linkedin.com/in/pauldhampton1/

Paul leads a team dedicated to developing cutting-edge data infrastructure and analytics solutions that play a pivotal role in shaping business decisions at Transamerica. In his second BootCamp talk, he will continue from the first part and share valuable insights on managing and expanding a data science team, effective strategies for communicating technical results to business leaders, and fostering collaboration between data scientists and decision-makers. 

Paul will also discuss how clear and impactful communication of his team’s findings has contributed to both team growth and the overall success of the business.