Daily Papers
1.Make-A-Character: High Quality Text-to-3D Character Generation within Minutes ( paper | webpage )
2.Audiobox: Unified Audio Generation with Natural Language Prompts ( paper | demo )
3.Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases ( paper | repo )
4.One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications ( paper )
5.RichDreamer: A Generalizable Normal-Depth Diffusion Model for Detail Richness in Text-to-3D ( papaer | webpage )
AI News
1.stanford CS25: Transformers United V3 ( link )
2.Microsoft Copilot App Available on Play Store ( link )
3.Autonomous chemical research with large language models( link )
AI Repos
1.microchain:function calling-based LLM agents. ( repo )
2.Implementation of "SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing" ( repo )
3.FastGPT is a knowledge-based QA system built on the LLM, offers out-of-the-box data processing and model invocation capabilities, allows for workflow orchestration through Flow visualization! ( repo )