論文まとめ:EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
1.8k{icon} {views} タイトル:EVA: Exploring the Limits of Masked Visual Representation Learning at Scale 著者:Yuxin F […]...
論文まとめ:ControlVideo: Training-free Controllable Text-to-Video Generation
639{icon} {views} タイトル:ControlVideo: Training-free Controllable Text-to-Video Generation 著者:Yabo Zhang, Yuxian […]...
論文まとめ:Guided Image Synthesis via Initial Image Editing in Diffusion Model
857{icon} {views} タイトル:Guided Image Synthesis via Initial Image Editing in Diffusion Model 著者:Jiafeng Mao, Xue […]...
論文まとめ:Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
5.6k{icon} {views} タイトル:Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection […]...
論文まとめ:Sentence Simplification via Large Language Models
543{icon} {views} 論文:Sentence Simplification via Large Language Models 著者:Yutao Feng, Jipeng Qiang, Yun Li, Yu […]...
論文まとめ:Flamingo: a Visual Language Model for Few-Shot Learning
1.9k{icon} {views} タイトル:Flamingo: a Visual Language Model for Few-Shot Learning 著者:Jean-Baptiste Alayrac, Jeff […]...
論文まとめ:Zero-1-to-3: Zero-shot One Image to 3D Object
2.5k{icon} {views} タイトル:Zero-1-to-3: Zero-shot One Image to 3D Object 著者:Ruoshi Liu, Rundi Wu, Basile Van Hoor […]...
論文まとめ:Zero-shot Image-to-Image Translation
4.3k{icon} {views} タイトル:Zero-shot Image-to-Image Translation 著者:Gaurav Parmar, Krishna Kumar Singh, Richard Zh […]...
論文まとめ:Generalized Decoding for Pixel, Image, and Language
824{icon} {views} タイトル:Generalized Decoding for Pixel, Image, and Language 著者:Xueyan Zou, Zi-Yi Dou, Jianwei Y […]...
論文まとめ:BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
9.6k{icon} {views} タイトル:BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large […]...