Multimodal
2024
[Paper Notes] Improving Gloss-free Sign Language Translation by Reducing Representation Density
[Paper Notes] LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
[Paper Notes] Towards Privacy-Aware Sign Language Translation at Scale
[Paper Notes] SCOPE: Sign Language Contextual Processing with Embedding from LLMs
[Paper Notes] Wings: Learning Multimodal LLMs without Text-only Forgetting
[Paper Notes] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
[Paper Notes] Dense Connector for MLLMs
[Paper Notes] Attention Prompting on Image for Large Vision-Language Models
[Paper Notes] Gloss-free Sign Language Translation: Improving from Visual-Language Pretraining
[Paper Notes] C$^2$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval