Anita Madani

I am a PhD student in Electrical & Computer Engineering at Johns Hopkins University. I work with Professor Rama Chellappa and Dr Vishal Patel in AIEM and VIU Labs.

My research lies at the intersection of multimodal foundation models, large vision–language models, generative models, and geometric computer vision. I develop correspondence-aware architectures that combine semantic priors from LLMs with flow-based and implicit representations, enabling precise registration, structural reasoning, and robust alignment and temporal modeling in remote sensing and world-modeling applications.

Since 2024, I have also been part of the WRIVA program at JHU, working on wide-area visual recognition, localization, and 3D reconstruction.

Outside of research, I enjoy film, yoga, creative writing, and learning new languages.

Anita Madani

News

Research

Morphing Through Time
Anita Madani, Vishal M. Patel — WACV 2026 (to appear)
Remote sensing change detection often suffers from severe misalignment when image acquisitions are separated by large seasonal or multi-year gaps. This paper introduces a diffusion-based semantic morphing pipeline that synthesizes intermediate “bridging” frames between bi-temporal images, enabling robust, stepwise correspondence estimation. The generated morphs guide RoMa-based dense registration, followed by a lightweight U-Net that produces a high-fidelity warp preserving true structural changes. Experiments on LEVIR-CD, WHU-CD, and DSIFN-CD demonstrate consistent improvements in both registration accuracy and downstream change detection across multiple backbones, highlighting the generality and effectiveness of diffusion-assisted temporal alignment.
Paper thumbnail
co-authors, Anita Madani — WACV 2026
DiffRegCD leverages diffusion models to predict dense flow fields between bi-temporal satellite images and warps multi-scale features before change prediction. The approach yields improved alignment in complex urban scenes while preserving genuine appearance changes and hard boundary details.

Selected Projects

WRIVA project thumbnail
WRIVA: Wide-Area Registration, Localization, and 3D Reconstruction
Johns Hopkins University / IARPA — 2024–Present
Contributing to the IARPA WRIVA program with research spanning image registration, visual localization, and large-scale 3D reconstruction. Developed correspondence and flow-based algorithms robust to extreme viewpoint shifts, doppelgänger scenes, and heterogeneous camera models across ground, UAV, and satellite imagery. Built multi-institution pipelines using Airflow, AWS, and Docker for automated evaluation and scalable benchmarking on real-world datasets.
Semantic graph thumbnail
Search Engine Optimization via Persian Semantic Graphs
2021–2022
Built a large-scale semantic graph over 2M+ Persian documents and applied GNN models (GCN, GAT) to obtain improved text embeddings. Increased retrieval quality using spectral clustering and PageRank-based ranking, reducing query latency by 23% and improving mean reciprocal rank (MRR) by 17%, enabling a significantly more responsive and accurate search engine pipeline.
Swarm learning thumbnail
Financial Deep Learning with Swarm Learning
2021
Implemented decentralized regression for financial time-series using swarm learning, combining LSTM and TCN architectures. Achieved a 7% reduction in RMSE while preserving institution-level privacy through federated-style weight sharing. Demonstrated the effectiveness of distributed learning under non-IID conditions.
Speech dataset thumbnail
Persian Speech Attribute Classification
Asr-e-Gooyesh Pardaz — Summer 2022
Constructed a 50k-utterance Persian speech dataset and trained deep neural networks for gender and age classification using TensorFlow and PyTorch. Reduced prediction error by 32% through dataset optimization, feature engineering, and architectural improvements specific to noisy, real-world speech signals.

Teaching

I enjoy teaching, mentoring, and supporting students across computer vision, imaging, and systems courses.

Service

Community and reviewing.