論文まとめ:OCR-free Document Understanding Transformer
3k{icon} {views} タイトル:OCR-free Document Understanding Transformer 著者:Geewook Kim, Teakgyu Hong, Moonbin Yim, J […]...
論文まとめ:Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval
527{icon} {views} タイトル:Lightweight Attentional Feature Fusion: A New Baseline for Text-to-Video Retrieval 著者:F […]...
論文まとめ:Large Language Models are Zero-Shot Reasoners
6k{icon} {views} タイトル:Large Language Models are Zero-Shot Reasoners 著者:Takeshi Kojima, Shixiang Shane Gu, Mach […]...
論文まとめ:Extremely Simple Activation Shaping for Out-of-Distribution Detection
1k{icon} {views} タイトル:Extremely Simple Activation Shaping for Out-of-Distribution Detection 著者:Andrija Djurisi […]...
論文まとめ:Domino: Discovering Systematic Errors with Cross-Modal Embeddings
288{icon} {views} タイトル:Domino: Discovering Systematic Errors with Cross-Modal Embeddings 著者:Sabri Eyuboglu, Ma […]...
論文まとめ:Exploring Visual Prompts for Adapting Large-Scale Models
1.5k{icon} {views} タイトル:Exploring Visual Prompts for Adapting Large-Scale Models 著者:Hyojin Bahng, Ali Jahanian […]...
論文まとめ:Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images
494{icon} {views} タイトル:Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound […]...
論文まとめ:Imagen Video: High Definition Video Generation with Diffusion Models
960{icon} {views} タイトル:Imagen Video: High Definition Video Generation with Diffusion Models 著者:Jonathan Ho*, W […]...
論文まとめ:High-Resolution Image Synthesis with Latent Diffusion Models
3.2k{icon} {views} タイトル:High-Resolution Image Synthesis with Latent Diffusion Models 著者:Robin Rombach, Andreas […]...
論文まとめ:Text2Human: Text-Driven Controllable Human Image Generation
677{icon} {views} * タイトル:Text2Human: Text-Driven Controllable Human Image Generation * 論文URL:https://arxiv.org […]...