Date: 2023.7.3-2023.7.9
Latest technology:
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models
Paper: arxiv.org
Lost in the Middle: How Language Models Use Long Contexts
Paper: arxiv.org
A Survey on Evaluation of Large Language Models
Paper: arxiv.org
LongNet: Scaling Transformers to 1,000,000,000 Tokens
Paper: arxiv.org
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
Paper: arxiv.org
InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback
Demystifying GPT Self-Repair for Code Generation
Paper: arxiv.org
Conformer LLMs -- Convolution Augmented Large Language Models
Paper: arxiv.org
Segment Anything Meets Point Tracking
Paper: arxiv.org
GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents
Paper: arxiv.org
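A quick overview of the past half-year of RLHF work
A guide to fine-tuning open-source LLMs: how to build your own LLM
A survey of large models
Courses:
Retrieval-based LMs and applications
TensorRT tutorial
Able to "talk" and "draw": VisCPM, a SOTA open-source Chinese multimodal large model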
LangChain: Chat with Your Data
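Business:
Zhu Xiaohu is right: large models have no investment value
The AI Agents explosion: Software 2.0 takes early shape, and OpenAI's next step
The evolutionary trajectory of large language models
Nature: Is bigger always better for large models?
Examples: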
ControlVideo demo
DragGAN with support for in-the-wild editing
ChatPDF: Chat with PDF