Blog

Confidence Intervals

Jul 28, 2024

—

by

gfaletto

in Uncategorized

On Friday, I responded to a prompt on the platform formerly known as Twitter asking for controversial statistics opinions. I offered one of my own: This take did prove controversial—I saw people some people I consider very smart who agreed…
Causal Inference Basics

Jul 8, 2024

—

by

gfaletto

in Uncategorized

The goal of causal inference is to understand the effect of an intervention. We want to estimate the difference between what happens if people receive a treatment compared to what would have happened if they hadn’t been treated. Some examples…
Workshop for Ukraine: Conducting Synthetic Data Experiments in R

May 26, 2024

—

by

gfaletto

in Uncategorized

On Thursday, I led a workshop on conducting synthetic data experiments (simulation studies) in R to raise money to support Ukraine. It was a lot of fun walking through what simulations studies are, why you might want to conduct one,…
New Draft of “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions”

Apr 29, 2024

—

by

gfaletto

in Uncategorized

I’m excited to share that I posted an update to “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions” on arXiv. Mainly what’s new in this draft is added theory, but there are some other minor changes. The most…
Presentation on Fused Extended Two-Way Fixed Effects

Feb 11, 2024

—

by

gfaletto

in Uncategorized

On Thursday, I was lucky to present Fused Extended Two-Way Fixed Effects to the causal inference reading group at USC. I’m very grateful to Angela Zhou, Zijun Gao, and Dennis Shen for hosting me, Jacob Bien for putting me in…
New Paper: “Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions”

Dec 13, 2023

—

by

gfaletto

in Uncategorized

I have a new paper on arXiv (link) that proposes a novel machine learning estimator for difference-in-differences with staggered adoptions, fused extended two-way fixed effects (FETWFE). Its main advantage over existing methods is that it is more efficient. Unlike existing…
Presentation at Data Con LA 2023 on PRESTO

Aug 14, 2023

—

by

gfaletto

in Uncategorized

On Saturday I gave the presentation “Predicting Purchases, Rare Diseases, and More: Using Ordinal Regression to Estimate Rare Event Probabilities” at Data Con LA 2023. I discussed using the proportional odds ordinal regression model to improve the estimation of probability…
How to conduct a synthetic data experiment

Jul 8, 2023

—

by

gfaletto

in Uncategorized

Simulation studies (sometimes called synthetic data experiments or Monte Carlo simulations) are useful tools for generating evidence about whether a statistical claim is true. For example: Here’s the idea: Recently I taught a tutorial on the basics on simulation studies…
PRESTO accepted to ICML 2023

Apr 27, 2023

—

by

gfaletto

in Uncategorized

I’m excited to announce that “Predicting Rare Events by Shrinking Towards Proportional Odds” has been accepted to the Fortieth International Conference on Machine Learning (ICML 2023)! In the paper, we propose PRESTO, a novel method for improving classification in the…
cssr R Package

Mar 13, 2023

—

by

gfaletto

in Uncategorized

In a 2022 research paper that I wrote with my advisor Jacob Bien, we proposed a novel feature selection method called cluster stability selection. Cluster stability selection is a method for identifying features that are useful for predicting a response…

About the author

Hi, I’m Greg Faletto. I’m a data scientist working on getting key metrics from messy real-world data and using causal inference to measure the impact (lift) of new features and product launches. Previously I worked on measuring the impact of linear and streaming television ad campaigns at VideoAmp, developed a methodology for A/B testing at Google, worked on machine learning at ZipRecruiter and Live Nation, and taught “Data Analysis for Decision Making” at the University of Southern California Marshall School of Business. I completed my Ph.D. in Statistics at the University of Southern California, where I wrote my dissertation on machine learning and causal inference. At USC I developed new machine learning methods for causal inference, estimating probabilities of rare events, and identifying the most important variables for predicting an outcome and wrote open-source software. My research has been published at the International Conference on Machine Learning and in the Proceedings of the National Academy of Sciences.