Author: gfaletto

  • New Draft of “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions”

    New Draft of “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions”

    I’m excited to share that I posted an update to “Fused Extended Two-Way Fixed Effects for Difference-in-Differences With Staggered Adoptions” on arXiv. Mainly what’s new in this draft is added theory, but there are some other minor changes. The most notable change to the exposition (in my opinion) is that I revised my introduction of…

  • Presentation on Fused Extended Two-Way Fixed Effects

    Presentation on Fused Extended Two-Way Fixed Effects

    On Thursday, I was lucky to present Fused Extended Two-Way Fixed Effects to the causal inference reading group at USC. I’m very grateful to Angela Zhou, Zijun Gao, and Dennis Shen for hosting me, Jacob Bien for putting me in touch with the hosts and coordinating, and everyone who came for their attention and insightful…

  • New Paper: “Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions”

    New Paper: “Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions”

    I have a new paper on arXiv (link) that proposes a novel machine learning estimator for difference-in-differences with staggered adoptions, fused extended two-way fixed effects (FETWFE). Its main advantage over existing methods is that it is more efficient. Unlike existing methods, it leverages our knowledge that treatment effects for nearby times are likely to be…

  • Presentation at Data Con LA 2023 on PRESTO

    Presentation at Data Con LA 2023 on PRESTO

    On Saturday I gave the presentation “Predicting Purchases, Rare Diseases, and More: Using Ordinal Regression to Estimate Rare Event Probabilities” at Data Con LA 2023. I discussed using the proportional odds ordinal regression model to improve the estimation of probability estimates in classification with class imbalance. I built up to discussing PRESTO, the method developed…

  • How to conduct a simulation study

    How to conduct a simulation study

    Simulation studies (sometimes called synthetic data experiments or Monte Carlo simulations) are useful tools for generating evidence about whether a statistical claim is true. For example: Here’s the idea: Recently I taught a tutorial on the basics on simulation studies for undergraduate students as a part of the USC JumpStart program. I taught the basics…

  • PRESTO accepted to ICML 2023

    PRESTO accepted to ICML 2023

    I’m excited to announce that “Predicting Rare Events by Shrinking Towards Proportional Odds” has been accepted to the Fortieth International Conference on Machine Learning (ICML 2023)! In the paper, we propose PRESTO, a novel method for improving classification in the class imbalance setting. You can read my brief summary of the paper on Twitter.

  • cssr R Package

    cssr R Package

    In a 2022 research paper that I wrote with my advisor Jacob Bien, we proposed a novel feature selection method called cluster stability selection. Cluster stability selection is a method for identifying features that are useful for predicting a response variable. It has applications in medical research (including genomics and genetics), economics, analyzing survey data,…

  • My Math Review Notes

    My Math Review Notes

    I recently took the first-year screening exam for Ph.D. students in the statistics group in the Department of Data Sciences and Operations at Marshall. Since I started applying to grad school, I’ve been writing up review notes to help me with math. Originally the purpose of the notes was to help me study for the…

  • Our Entry in the OCRUG Hackathon 2019

    Our Entry in the OCRUG Hackathon 2019

    I was a part of Team Save the WoRld along with Faizan Haque, Javier Orraca, Sam Park, and Shruhi Desai in the OCRUG Hackathon 2019 held at UC Irvine on May 18th and 19th. (In fact, I am writing this blog post at the tail end of our time before we present our results!) The…

  • Presentation on Multi-Task Learning

    Presentation on Multi-Task Learning

    Today I gave an in-class presentation at USC on two papers in multi-task learning (or multivariate regression–linear regression when the response is a vector rather than one number). You could simply train a separate model for each response, but when the responses are related, there are advantages to considering them all at the same time…