Category: Uncategorized

  • Causal Inference Notation and Concepts

    In this post, I’m going to expand on the basics I laid out more informally here. I’m going to define some commonly used notation and key terms that I plan to use as other posts, so that this post can serve as a reference in the future. I may also add to it over time…

  • Explaining vs. Predicting

    Explaining vs. Predicting

    I came across two nice written pieces recently. These and other closely related points have been made for a long time, and I’ve made related points before. I think it’s an underappreciated and somewhat counterintuitive point, so I like to harp on this a little. When people seek to answer questions from data by using…

  • Confidence Intervals

    Confidence Intervals

    On Friday, I responded to a prompt on the platform formerly known as Twitter asking for controversial statistics opinions. I offered one of my own: This take did prove controversial—I saw people some people I consider very smart who agreed with me and some who disagreed. In light of the discussions, I think I have…

  • Causal Inference Basics

    Causal Inference Basics

    The goal of causal inference is to understand the effect of an intervention. We want to estimate the difference between what happens if people receive a treatment compared to what would have happened if they hadn’t been treated. Some examples of this include: Why Is Estimating Causal Effects Tricky? Let’s focus on the multivitamin example.…

  • Workshop for Ukraine: Conducting Synthetic Data Experiments in R

    Workshop for Ukraine: Conducting Synthetic Data Experiments in R

    On Thursday, I led a workshop on conducting synthetic data experiments (simulation studies) in R to raise money to support Ukraine. It was a lot of fun walking through what simulations studies are, why you might want to conduct one, and how to do the process from start to finish using the R simulator package.…

  • New Draft of “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions”

    New Draft of “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions”

    I’m excited to share that I posted an update to “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions” on arXiv. Mainly what’s new in this draft is added theory, but there are some other minor changes. The most notable change to the exposition (in my opinion) is that I revised my introduction of…

  • Presentation on Fused Extended Two-Way Fixed Effects

    Presentation on Fused Extended Two-Way Fixed Effects

    On Thursday, I was lucky to present Fused Extended Two-Way Fixed Effects to the causal inference reading group at USC. I’m very grateful to Angela Zhou, Zijun Gao, and Dennis Shen for hosting me, Jacob Bien for putting me in touch with the hosts and coordinating, and everyone who came for their attention and insightful…

  • New Paper: “Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions”

    New Paper: “Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions”

    I have a new paper on arXiv (link) that proposes a novel machine learning estimator for difference-in-differences with staggered adoptions, fused extended two-way fixed effects (FETWFE). Its main advantage over existing methods is that it is more efficient. Unlike existing methods, it leverages our knowledge that treatment effects for nearby times are likely to be…

  • Presentation at Data Con LA 2023 on PRESTO

    Presentation at Data Con LA 2023 on PRESTO

    On Saturday I gave the presentation “Predicting Purchases, Rare Diseases, and More: Using Ordinal Regression to Estimate Rare Event Probabilities” at Data Con LA 2023. I discussed using the proportional odds ordinal regression model to improve the estimation of probability estimates in classification with class imbalance. I built up to discussing PRESTO, the method developed…

  • How to conduct a synthetic data experiment

    How to conduct a synthetic data experiment

    Simulation studies (sometimes called synthetic data experiments or Monte Carlo simulations) are useful tools for generating evidence about whether a statistical claim is true. For example: Here’s the idea: Recently I taught a tutorial on the basics on simulation studies for undergraduate students as a part of the USC JumpStart program. I taught the basics…