Large Language Models
1 OpenAI
https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf
https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf
https://arxiv.org/abs/2005.14165
https://arxiv.org/abs/2107.03374
https://arxiv.org/abs/2203.02155
https://arxiv.org/abs/2303.08774
https://openai.com/index/chatgpt/
https://openai.com/index/hello-gpt-4o/
https://openai.com/index/introducing-openai-o1-preview/
https://openai.com/index/deliberative-alignment/
2 Claude & Gemini
https://www-cdn.anthropic.com/de8ba9b01c9ab7cbabf5c33b80b7bbc618857627/Model_Card_Claude_3.pdf
https://arxiv.org/abs/2312.11805
3 LLAMA family
https://arxiv.org/abs/2302.13971
https://arxiv.org/abs/2307.09288
https://arxiv.org/abs/2407.21783
https://arxiv.org/abs/2310.06825
https://arxiv.org/abs/2401.04088
https://arxiv.org/abs/2410.07073
4 DeepSeek & Qwen
https://arxiv.org/abs/2401.02954
https://arxiv.org/abs/2401.14196
https://arxiv.org/abs/2401.06066
https://arxiv.org/abs/2405.04434
https://github.com/deepseek-ai/DeepSeek-V3
https://arxiv.org/abs/2409.12191
https://arxiv.org/abs/2409.12186
https://arxiv.org/abs/2412.15115
5 Apple & Industry
https://arxiv.org/abs/2407.21075
https://github.com/huggingface/smollm/
https://arxiv.org/abs/2412.08905
Large Language Model Evaluation
https://arxiv.org/abs/2009.03300
https://arxiv.org/abs/2310.16049
https://arxiv.org/abs/2103.03874
https://arxiv.org/abs/2311.07911
https://arcprize.org/arc
Prompting, ICL & Chain of Thought
https://arxiv.org/abs/2406.06608
https://arxiv.org/abs/2201.11903
https://arxiv.org/abs/2305.10601
https://aclanthology.org/2021.emnlp-main.243/
https://arxiv.org/abs/2211.01910
RAG
https://nlp.stanford.edu/IR-book/information-retrieval-book.html
https://arxiv.org/abs/2005.11401
https://arxiv.org/abs/2210.07316
https://arxiv.org/pdf/2404.16130
https://arxiv.org/abs/2309.15217
https://docs.llamaindex.ai/en/stable/understanding/rag/
https://python.langchain.com/docs/tutorials/rag/
Agents
https://arxiv.org/abs/2310.06770
https://arxiv.org/abs/2405.15793
https://arxiv.org/abs/2410.03859
https://arxiv.org/abs/2210.03629
https://arxiv.org/abs/2310.08560
https://arxiv.org/abs/2305.16291
https://www.anthropic.com/research/building-effective-agents
Code Generation
https://arxiv.org/abs/2211.15533
https://huggingface.co/datasets/bigcode/the-stack-v2
https://arxiv.org/abs/2402.19173
https://arxiv.org/abs/2401.14196
https://arxiv.org/abs/2409.12186
https://ai.meta.com/research/publications/code-llama-open-foundation-models-for-code/
https://arxiv.org/abs/2107.03374
https://arxiv.org/abs/2401.08500
https://criticgpt.org/criticgpt-openai/
Vision
https://github.com/ultralytics/ultralytics
https://arxiv.org/abs/2304.08069
https://arxiv.org/abs/2301.12597
https://arxiv.org/abs/2412.03555
https://arxiv.org/abs/2401.06209
https://arxiv.org/abs/2304.02643
https://arxiv.org/abs/2408.00714
https://github.com/IDEA-Research/GroundingDINO
https://arxiv.org/abs/2304.08485
https://huyenchip.com/2023/10/10/multimodal.html
https://arxiv.org/abs/2405.09818
https://arxiv.org/abs/2411.14402
Speech
https://arxiv.org/abs/2212.04356
https://arxiv.org/abs/2407.21783
https://arxiv.org/abs/2205.04421
https://www.latent.space/p/realtime-api
Diffusion
1 Latent Diffusion
https://arxiv.org/abs/2112.10752
https://stability.ai/news/stable-diffusion-v2-release
https://arxiv.org/abs/2307.01952
https://arxiv.org/abs/2403.03206
https://github.com/black-forest-labs/flux
2 DALL-E
https://arxiv.org/abs/2102.12092
https://arxiv.org/abs/2204.06125
https://cdn.openai.com/papers/dall-e-3.pdf
3 Imagen
https://arxiv.org/abs/2205.11487
https://deepmind.google/technologies/imagen-2/
https://arxiv.org/abs/2408.07009
4 Consistency Model
https://arxiv.org/abs/2303.01469
5 Sora
https://openai.com/index/sora/
Fine-tuning
https://arxiv.org/abs/2106.09685
https://arxiv.org/abs/2305.14314
https://arxiv.org/abs/2305.18290
https://arxiv.org/abs/2404.03592
https://www.microsoft.com/en-us/research/blog/orca-agentinstruct-agentic-flows-can-be-effective-synthetic-data-generators/
https://www.interconnects.ai/p/openais-reinforcement-finetuning
https://arxiv.org/abs/2305.20050