Paper summary: Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets
572 views. Title: Stable Video Diffusion: Scaling Latent Video Diffusion Models to Large Datasets Authors: Stab […]...
Paper summary: Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
795 views. Paper URL: Video-LLaVA: Learning United Visual Representation by Alignment Before Projection Authors: B […]...
Paper summary: LCM-LoRA: A Universal Stable-Diffusion Acceleration Module
1.6k views. Title: LCM-LoRA: A Universal Stable-Diffusion Acceleration Module Paper URL: https://arxiv.org/abs […]...
Paper summary: Improving Image Generation with Better Captions
1.6k views. Title: Improving Image Generation with Better Captions Authors: James Betker, Gabriel Goh, et al. (people at OpenAI […]...
Paper summary: SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
918 views. * Title: SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis * Authors: Dust […]...
Paper summary: RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose
2.1k views. Title: RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose Authors: Tao Jiang, Peng Lu, […]...
Paper summary: Generating Images with Multimodal Language Models
200 views. Title: Generating Images with Multimodal Language Models Authors: Jing Yu Koh, Daniel Fried, Ruslan […]...
Paper summary: Visual Programming: Compositional visual reasoning without training
339 views. Title: Visual Programming: Compositional visual reasoning without training Authors: Tanmay Gupta, An […]...
Paper summary: Evaluating and Inducing Personality in Pre-trained Language Models
409 views. Title: Evaluating and Inducing Personality in Pre-trained Language Models Authors: Guangyuan Jiang, […]...
Paper summary: UniVTG: Towards Unified Video-Language Temporal Grounding
371 views. Title: UniVTG: Towards Unified Video-Language Temporal Grounding Authors: Kevin Qinghong Lin, Pengch […]...