标签: 多模态 | 小嗷犬

多模态

2024

【论文笔记】Towards Privacy-Aware Sign Language Translation at Scale

【论文笔记】Towards Privacy-Aware Sign Language Translation at Scale21

大模型论文笔记手语翻译多模态

2024-11-16

【论文笔记】SCOPE: Sign Language Contextual Processing with Embedding from LLMs

【论文笔记】SCOPE: Sign Language Contextual Processing with Embedding from LLMs22

大模型论文笔记手语翻译多模态

2024-11-16

【论文笔记】Wings: Learning Multimodal LLMs without Text-only Forgetting

【论文笔记】Wings: Learning Multimodal LLMs without Text-only Forgetting23

大模型论文笔记多模态

2024-11-09

【论文笔记】VCoder: Versatile Vision Encoders for Multimodal Large Language Models

【论文笔记】VCoder: Versatile Vision Encoders for Multimodal Large Language Models24

大模型论文笔记多模态

2024-11-08

【论文笔记】Dense Connector for MLLMs

【论文笔记】Dense Connector for MLLMs25

大模型论文笔记多模态

2024-11-03

【论文笔记】Attention Prompting on Image for Large Vision-Language Models

【论文笔记】Attention Prompting on Image for Large Vision-Language Models26

大模型论文笔记多模态

2024-11-02

【论文笔记】Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining

【论文笔记】Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining27

论文笔记手语翻译多模态

2024-10-29

【论文笔记】C$^2$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval

【论文笔记】C$^2$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval28

论文笔记手语翻译多模态

2024-10-29

【论文笔记】Perceiver: General Perception with Iterative Attention

【论文笔记】Perceiver: General Perception with Iterative Attention29

论文笔记多模态

2024-10-27

【论文笔记】xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

【论文笔记】xGen-MM (BLIP-3): A Family of Open Large Multimodal Models30

大模型论文笔记多模态

2024-10-27