Aim SmartyCam vs GoPro

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning

Large language models (LLMs) have enabled the creation of multi-modal LLMs that exhibit strong comprehension of visual data such as images and videos. However, these models usually rely on extensive ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

AIM: Adaptive Inference of Multi-Modal LLMs via Token Merging and Pruning

Trending now