Large Multimodal Model Compression via Efficient Pruning and Distillation

Publication
WWW 2024 (Oral)