This repository is the official implementation of VL-SAE, which helps users to understand the vision-language alignment of VLMs via concepts. We present the demo of VL-SAE with OpenCLIP and LLaVA 1.5 ...