back to projects
arXiv preprint · MITml research
NeuroLens
Adversarial ML toolkit with a cross-modal CLIP → ResNet transfer study.
ResNet-18 clean acc.≥93%CIFAR-10 target
CLIP-lite R@1≥60%Flickr8k retrieval target
StatusarXiv preprintUnder review
The problem
Adversarial robustness research has focused on unimodal classifiers, but CLIP-style vision-language models create a new attack surface. If perturbations crafted against a multimodal model transfer to a downstream unimodal classifier, deploying CLIP in a pipeline could silently compromise everything downstream.
Architecture
Models built from scratch in PyTorch (ResNet-18 on CIFAR-10, Transformer on AG News, CLIP-lite on Flickr8k — no torchvision pretrained imports)→
optimize perturbation δ against CLIP-lite contrastive loss→
apply δ to standalone ResNet-18→
measure transfer rate vs. direct PGD attack as the upper bound→
defenses with certified guarantees as baselines→
W&B for experiment tracking→
writeup as arXiv preprint.
Key decisions
PythonPyTorchFGSMPGDCLIP-liteResNet-18