I'm interested in computer vision, deep learning, and autonomous systems. Most of my research highlights that less is more: rather than scale up existing frameworks for marginal gains, I hope to participate in research using novel data pipelines (event cameras, sensor fusion) and leveraging latent data structure (robust optical flow, vision-informed poststratified polling) to produce exceptional real-world outcomes for autonomous systems and robotics. Some papers are highlighted.
Intersection Safety Systems: Classification and Prediction Across Different Road Users and Conditions Stephen Scarano,
Anand Seshadri ASCE ICTD, Upcoming 2026
Evaluating detection, classification, localization, and path prediction performance of ML-based intersection systems across lighting, occlusion, and turning conditions for different road users and sensor combinations.
Multi-Tag Visual Localization and Gripper-Based Autonomous Package Retrieval Arav Singh,
Tony Gurney,
Mohammad Goli,
Stephen Scarano,
TBD, Upcominng 2026
We produce an modular drone system for multi-tag detection, localization, and extraction of a labeled package.
By leveraging a novel generalization of the Hough Transform on optical flow vectors, we present an approach to estimating camera rotation in crowded, real-world scenes from handheld monocular video that outcompetes the SOTA on speed and accuracy.
Leveraging vision-language inference over Twitter profiles, we correct social poll respondent biases on Twitter (now X) and produce useful social polling signals of the 2020 U.S Presidental Election. This work won the Best Paper Finalist Award at ICWSM and our has gone on to outcompete mainstream polling averages for the 2024 U.S Presidential Election prediction.
We provide and analyze a novel dataset of Twitter (now X) polls and respondents, including gender, age, location, and partisanship information inferred by a vision-language model and corrobrated by human coders. This paper provides insights into sample biases in social polling, and was later developed into a demographically post-stratified model for election prediction (see above).
Feel free to steal this website's source code. Do not scrape the HTML from this page itself, as it includes analytics tags that you do not want on your own website — use the github code instead. Also, consider using Leonid Keselman's Jekyll fork of this page.