論文まとめ:Zero-1-to-3: Zero-shot One Image to 3D Object
2.5k{icon} {views} タイトル:Zero-1-to-3: Zero-shot One Image to 3D Object 著者:Ruoshi Liu, Rundi Wu, Basile Van Hoor […]...
論文まとめ:Zero-shot Image-to-Image Translation
4.2k{icon} {views} タイトル:Zero-shot Image-to-Image Translation 著者:Gaurav Parmar, Krishna Kumar Singh, Richard Zh […]...
論文まとめ:Generalized Decoding for Pixel, Image, and Language
809{icon} {views} タイトル:Generalized Decoding for Pixel, Image, and Language 著者:Xueyan Zou, Zi-Yi Dou, Jianwei Y […]...
論文まとめ:BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
9.4k{icon} {views} タイトル:BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large […]...
論文まとめ:InstructPix2Pix: Learning to Follow Image Editing Instructions
1.3k{icon} {views} タイトル:InstructPix2Pix: Learning to Follow Image Editing Instructions 著者:Tim Brooks, Aleksand […]...
論文まとめ:StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthesis
934{icon} {views} タイトル:StyleTTS: A Style-Based Generative Model for Natural and Diverse Text-to-Speech Synthes […]...
論文まとめ:OCR-free Document Understanding Transformer
3.4k{icon} {views} タイトル:OCR-free Document Understanding Transformer 著者:Geewook Kim, Teakgyu Hong, Moonbin Yim, […]...
論文まとめ:Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
657{icon} {views} タイトル:Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval 著者:F […]...
論文まとめ:Large Language Models are Zero-Shot Reasoners
6.8k{icon} {views} タイトル:Large Language Models are Zero-Shot Reasoners 著者:Takeshi Kojima, Shixiang Shane Gu, Ma […]...
論文まとめ:Extremely Simple Activation Shaping for Out-of-Distribution Detection
1.2k{icon} {views} タイトル:Extremely Simple Activation Shaping for Out-of-Distribution Detection 著者:Andrija Djuri […]...