Apply for a GPU community grant: Academic project

#1
by jiang-cc - opened

AD-Copilot is an open-source vision-language assistant for industrial anomaly detection (IAD), presented in our paper "AD-Copilot: A Vision-Language Assistant for Industrial Anomaly Detection via Visual In-context Comparison" (arXiv:2603.13779, under review at IEEE Transactions on Image Processing).

What it does: Given a normal reference image and a query image of an industrial product, AD-Copilot detects, classifies, describes, and localizes anomalies through natural language interaction โ€” functioning as an AI-powered industrial inspector.

Key contributions:

  • A novel Comparison Encoder using cross-attention between paired image features, enabling fine-grained visual comparison at the encoding stage rather than only in the language space
  • Chat-AD: a large-scale multimodal IAD dataset with 620k+ samples across 327 industrial categories - MMAD-BBox: a new benchmark for fine-grained anomaly localization with bounding-box evaluation
  • Achieves 82.3% accuracy on the MMAD benchmark, surpassing all proprietary and open-source models (including GPT-4o, Gemini 1.5 Pro) and exceeding ordinary human performance, without any data leakage
  • 3.35ร— improvement over baseline on anomaly localization (MMAD-BBox)

Why GPU is needed: The model (7B, ~16GB bfloat16) requires GPU for inference. We are building an interactive Gradio demo on HuggingFace Spaces (jiang-cc/AD-Copilot) so researchers and industrial practitioners can try AD-Copilot directly in the browser.

Open resources:

This is an academic project from Southern University of Science and Technology (SUSTech). We would greatly appreciate ZeroGPU access to make our demo publicly available for the community.

Hi @jiang-cc , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of this page so your Space can run on ZeroGPU.
If you can, we ask that you upgrade to Pro ($9/month) to enjoy higher ZeroGPU quota and other features like Dev Mode, Private Storage, and more: hf.co/pro

Sign up or log in to comment