論文まとめ:Beyond Aesthetics: Cultural Competence in Text-to-Image Models
214{icon} {views} タイトル:Beyond Aesthetics: Cultural Competence in Text-to-Image Models 著者:Nithish Kannen, Arif […]...
論文まとめ:Unveiling Encoder-Free Vision-Language Models
695{icon} {views} タイトル:Unveiling Encoder-Free Vision-Language Models 著者:Haiwen Diao, Yufeng Cui, Xiaotong Li, […]...
論文まとめ:MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
282{icon} {views} タイトル:MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures URL:https://mixeval.g […]...
論文まとめ:Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
536{icon} {views} * タイトル:Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text […]...
論文まとめ:ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimodal Models
275{icon} {views} 論文タイトル:ConTextual: Evaluating Context-Sensitive Text-Rich Visual Reasoning in Large Multimod […]...
論文まとめ:COLE: A Hierarchical Generation Framework for Graphic Design
398{icon} {views} * タイトル:COLE: A Hierarchical Generation Framework for Graphic Design * 著者:Peidong Jia, Chenxu […]...
論文まとめ:Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
688{icon} {views} 論文タイトル:Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4 著者:Sond […]...
論文まとめ:Gemini: A Family of Highly Capable Multimodal Models
538{icon} {views} タイトル:Gemini: A Family of Highly Capable Multimodal Models 著者:Gemini Team((842 additional aut […]...
論文まとめ:Weak to Strong Generalization: Eliciting Strong Capabilities with Weak SUPERVISION
535{icon} {views} タイトル:Weak to Strong Generalization: Eliciting Strong Capabilities with Weak SUPERVISION 著者:O […]...
論文まとめ:Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
1k{icon} {views} タイトル:Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets 著者:Stabl […]...