Tushar Lab

Research

We develop data-centric AI frameworks for healthcare and medicine — building tools, data, and evaluation methods that are clinically grounded, rigorously tested, and worthy of real-world trust.

What We Work On

Research Areas

Our research spans four interconnected themes — from curating better data and building intelligent clinical tools, to leveraging simulation and synthetic data to advance healthcare AI.

Data-Centric AI

Advancing AI by improving the data it learns from — through better curation, quality assessment, diversity analysis, and rigorous benchmarking for clinical reliability.

Intelligent Clinical Tools

Designing AI-powered tools that integrate meaningfully into clinical workflows — reducing burden, supporting decision-making, and adapting to real-world healthcare settings.

Simulation & Virtual Trials

Leveraging physics-based simulation and virtual patient models to evaluate AI rigorously, safely, and at scale — where real-world clinical testing alone is limited or costly.

Synthetic & Generative Data

Creating controllable, privacy-preserving synthetic datasets and generative models for AI training, evaluation, and reproducible experimentation in medicine.

Flagship Work

Featured Projects

Selected projects that represent our lab's core contributions — advancing how medical AI is built, evaluated, and trusted in clinical practice.

HAID

Health AI Data Resource

An open-access repository providing 13+ standardized medical imaging datasets covering lung cancer, COVID-19, and universal lesions with expert-verified annotations, preprocessed files, and train-test splits to democratize access to high-quality clinical data for AI/ML development.

Why It Matters

Democratizes access to high-quality, standardized clinical data — accelerating reproducible AI research across multiple disease domains.

View Project →

TriAnnot

Multi-Stage AI Pipeline for Lung Nodule Annotation

A comprehensive, freely available pipeline integrating lung segmentation, nodule detection, and malignancy classification into a unified tri-stage workflow designed to prioritize sensitivity while reducing annotator burden for first-pass lung nodule annotation in screening CT.

Why It Matters

Reduces manual annotation burden and improves consistency in lung nodule identification — enabling faster, more reliable screening workflows.

View Project →

All Projects

Research Portfolio

A broader view of ongoing and published work across the lab — from tools and datasets to evaluation frameworks and generative methods.

VLST

Virtual Lung Screening Trial

A large-scale virtual imaging trial simulating low-dose CT lung cancer screening across diverse patient populations, scanner models, and acquisition protocols.

View Project →

AI in Lung Health

Benchmarking Detection and Diagnostic Models Across Multiple CT Scan Datasets

A public, reproducible multi-dataset benchmark for lung nodule detection and cancer classification, demonstrating that detection performance is strongly driven by dataset characteristics.

View Project →

NoMAISI

Nodule-Oriented Medical AI for Synthetic Imaging

A generative framework using flow-based diffusion and ControlNet conditioning to create realistic synthetic CT images with AI-generated lung nodules for data augmentation.

View Project →

HAID

Health AI Data Resource

An open-access repository providing 13+ standardized medical imaging datasets with expert-verified annotations and preprocessed files for AI/ML development.

View Project →

TriAnnot

Multi-Stage AI Pipeline for Lung Nodule Annotation

A freely available pipeline integrating lung segmentation, nodule detection, and malignancy classification into a unified tri-stage workflow for screening CT.

View Project →

Collaboration

Interested in collaborating?

We welcome collaborations with clinical, industry, and academic partners. Reach out to discuss potential projects.

Get in Touch