Period: 2023.5.29-2023.6.5
This week's highlights
OpenAI tackles GPT-4's mathematical reasoning and releases the paper and dataset
Title: Let's Verify Step by Step
Paper: arxiv.org
Dataset: github.com
Latest techniques:
Role-Play with Large Language Models
Paper: arxiv.org
PandaGPT: One Model To Instruction-Follow Them All
Paper: arxiv.org
Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing
Paper: arxiv.org
Backpack Language Models
Paper: arxiv.org
Training Socially Aligned Language Models in Simulated Human Society
Paper: arxiv.org
Code: github.com
OlaGPT: Empowering LLMs With Human-like Problem-Solving Abilities
Paper: arxiv.org
Break-A-Scene: Extracting Multiple Concepts from a Single Image
Paper: arxiv.org
Any-to-Any Generation via Composable Diffusion
Paper: arxiv.org
READ: Recurrent Adaptation of Large Transformers
Paper: arxiv.org
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust
Paper: arxiv.org
The Impact of Positional Encoding on Length Generalization in Transformers
Paper: arxiv.org
BiomedGPT: A Unified and Generalist Biomedical Generative Pre-trained Transformer for Vision, Language, and Multimodal Tasks
Paper: arxiv.org
Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
Paper: arxiv.org
Fine-Tuning Language Models with Just Forward Passes
Paper: arxiv.org
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training
Paper: arxiv.org
Bytes Are All You Need: Transformers Operating Directly On File Bytes
Paper: arxiv.org
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper: arxiv.org
SQL-PaLM: Improved Large Language Model Adaptation for Text-to-SQL
Paper: arxiv.org
CodeTF: One-stop Transformer Library for State-of-the-art Code LLM
Paper: arxiv.org
Segment Anything in High Quality
Paper: arxiv.org
Code: github.com
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs
Paper: arxiv.org
Code: github.com
StyleDrop: Text-to-Image Generation in Any Style
Paper: arxiv.org
Large Language Models as Tool Makers
Paper: arxiv.org
ReviewerGPT? An Exploratory Study on Using Large Language Models for Paper Reviewing
Paper: arxiv.org
PHOTOSWAP: Personalized Subject Swapping in Images
High-Fidelity Image Compression with Score-based Generative Models
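Paper: arxiv.org
Courses:
How to train language models with Megatron-LM
Techniques and lessons learned in large-model pre-training and fine-tuning
An expert-level tutorial on Stable Diffusion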
Long Term Memory in AI - Vector Search and Databases
Andrew Ng's DeepLearning.AI launches new short courses:
How Diffusion Models Work
LangChain for LLM Application Development
Building Systems with the ChatGPT API
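Business:
Wang Huiwen raises another 1.6 billion! His large-model startup becomes a unicorn within 100 days
MiraclePlus 2023 Spring Startup Camp Demo Day: 60 projects, 100% technology-driven (mp.weixin.qq.com)
Stability AI: a testing ground for commercializing open-source AI. Can a Killer Model grow into a Killer App?
Inflection founder: from DeepMind to Pi, how AI agents will reach their Cambrian explosion
NVIDIA launches ACE for Games, a custom AI solution for games that enables intelligent NPCs, dialogue, and animation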
Examples:
Drag3D: DragGAN meets GET3D
AI "clone" of a real-life girlfriend goes viral: an overseas developer open-sources GirlfriendGPT