HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
Title: HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems […]
Paper summary: OmniGen: Unified Image Generation
Title: OmniGen: Unified Image Generation Authors: Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan […]
Paper summary: SAM 2: Segment Anything in Images and Videos
Title: SAM 2: Segment Anything in Images and Videos Authors: Nikhila Ravi, Valentin Gabeur, Yuan-Tin […]
Paper summary: RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
Title: RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation Authors: D […]
Paper summary: Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Title: Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Authors: Bin Xia […]
Paper summary: LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
Title: LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control Authors: Ji […]
Paper summary: Beyond Aesthetics: Cultural Competence in Text-to-Image Models
Title: Beyond Aesthetics: Cultural Competence in Text-to-Image Models Authors: Nithish Kannen, Arif […]
Paper summary: Unveiling Encoder-Free Vision-Language Models
Title: Unveiling Encoder-Free Vision-Language Models Authors: Haiwen Diao, Yufeng Cui, Xiaotong Li, […]
Paper summary: MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures
Title: MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures URL: https://mixeval.g […]
Paper summary: Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis
Title: Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text […]