論文まとめ:Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
7{icon} {views} タイトル:Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction 著者:Ke […]...
CVPRの論文の被引用数を分析・予測してみた
59{icon} {views} CVPR2022・2023で採択された論文の被引用数を分析し、GitHubリポジトリやArxivでの公開が引用数に与える影響を調査しました。結果、これらの要因が引用数の増加に有意に寄与し […]...
論文まとめ:Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling
19{icon} {views} タイトル:Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and […]...
YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
116{icon} {views} タイトル:YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information 著者:Chie […]...
論文まとめ:SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
63{icon} {views} タイトル:SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling 著者 […]...
論文まとめ:LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
73{icon} {views} 論文タイトル:LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation 著者:Weiquan Huang […]...
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems
200{icon} {views} タイトル:HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems […]...
論文まとめ:OmniGen: Unified Image Generation
129{icon} {views} タイトル:OmniGen: Unified Image Generation 著者:Shitao Xiao, Yueze Wang, Junjie Zhou, Huaying Yuan […]...
論文まとめ:SAM 2: Segment Anything in Images and Videos
398{icon} {views} タイトル:SAM 2: Segment Anything in Images and Videos 著者:Nikhila Ravi, Valentin Gabeur, Yuan-Tin […]...
論文まとめ:RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
293{icon} {views} タイトル:RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation 著者:D […]...