AI工程师必读论文:链接汇总

文摘   2025-01-02 14:14   上海  

大语言模型

1 OpenAI

https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf

https://cdn.openai.com/better-language-models/language_models_are_unsupervised_multitask_learners.pdf

https://arxiv.org/abs/2005.14165

https://arxiv.org/abs/2107.03374

https://arxiv.org/abs/2203.02155

https://arxiv.org/abs/2303.08774

https://openai.com/index/chatgpt/

https://openai.com/index/hello-gpt-4o/

https://openai.com/index/introducing-openai-o1-preview/

https://openai.com/index/deliberative-alignment/

2 Claude & Gemini

https://www-cdn.anthropic.com/de8ba9b01c9ab7cbabf5c33b80b7bbc618857627/Model_Card_Claude_3.pdf

https://arxiv.org/abs/2312.11805

3 LLAMA family

https://arxiv.org/abs/2302.13971

https://arxiv.org/abs/2307.09288

https://arxiv.org/abs/2407.21783

https://arxiv.org/abs/2310.06825

https://arxiv.org/abs/2401.04088

https://arxiv.org/abs/2410.07073

4 DeepSeek & Qwen

https://arxiv.org/abs/2401.02954

https://arxiv.org/abs/2401.14196

https://arxiv.org/abs/2401.06066

https://arxiv.org/abs/2405.04434

https://github.com/deepseek-ai/DeepSeek-V3

https://arxiv.org/abs/2409.12191

https://arxiv.org/abs/2409.12186

https://arxiv.org/abs/2412.15115

5 Apple & 工业界

https://arxiv.org/abs/2407.21075

https://github.com/huggingface/smollm/

https://arxiv.org/abs/2412.08905


大语言模型评测

https://arxiv.org/abs/2009.03300

https://arxiv.org/abs/2310.16049

https://arxiv.org/abs/2103.03874

https://arxiv.org/abs/2311.07911

https://arcprize.org/arc


Prompting, ICL & Chain of Thought

https://arxiv.org/abs/2406.06608

https://arxiv.org/abs/2201.11903

https://arxiv.org/abs/2305.10601

https://aclanthology.org/2021.emnlp-main.243/

https://arxiv.org/abs/2211.01910


RAG

https://nlp.stanford.edu/IR-book/information-retrieval-book.html

https://arxiv.org/abs/2005.11401

https://arxiv.org/abs/2210.07316

https://arxiv.org/pdf/2404.16130

https://arxiv.org/abs/2309.15217

https://docs.llamaindex.ai/en/stable/understanding/rag/

https://python.langchain.com/docs/tutorials/rag/


Agents

https://arxiv.org/abs/2310.06770

https://arxiv.org/abs/2405.15793

https://arxiv.org/abs/2410.03859

https://arxiv.org/abs/2210.03629

https://arxiv.org/abs/2310.08560

https://arxiv.org/abs/2305.16291

https://www.anthropic.com/research/building-effective-agents


代码生成

https://arxiv.org/abs/2211.15533

https://huggingface.co/datasets/bigcode/the-stack-v2

https://arxiv.org/abs/2402.19173

https://arxiv.org/abs/2401.14196

https://arxiv.org/abs/2409.12186

https://ai.meta.com/research/publications/code-llama-open-foundation-models-for-code/

https://arxiv.org/abs/2107.03374

https://arxiv.org/abs/2401.08500

https://criticgpt.org/criticgpt-openai/


视觉

https://github.com/ultralytics/ultralytics

https://arxiv.org/abs/2304.08069

https://arxiv.org/abs/2301.12597

https://arxiv.org/abs/2412.03555

https://arxiv.org/abs/2401.06209

https://arxiv.org/abs/2304.02643

https://arxiv.org/abs/2408.00714

https://github.com/IDEA-Research/GroundingDINO

https://arxiv.org/abs/2304.08485

https://huyenchip.com/2023/10/10/multimodal.html

https://arxiv.org/abs/2405.09818

https://arxiv.org/abs/2411.14402

https://arxiv.org/abs/2411.14402


语音

https://arxiv.org/abs/2212.04356

https://arxiv.org/abs/2407.21783

https://arxiv.org/abs/2205.04421

https://www.latent.space/p/realtime-api


Diffusion

1 Latent Diffusion

https://arxiv.org/abs/2112.10752

https://stability.ai/news/stable-diffusion-v2-release

https://arxiv.org/abs/2307.01952

https://arxiv.org/abs/2403.03206

https://github.com/black-forest-labs/flux

2 DALL-E

https://arxiv.org/abs/2102.12092

https://arxiv.org/abs/2204.06125

https://cdn.openai.com/papers/dall-e-3.pdf

3 ImageGen

https://arxiv.org/abs/2205.11487

https://deepmind.google/technologies/imagen-2/

https://arxiv.org/abs/2408.07009

4 Consistency Model

https://arxiv.org/abs/2303.01469

5 Sora

https://openai.com/index/sora/


微调

https://arxiv.org/abs/2106.09685

https://arxiv.org/abs/2305.14314

https://arxiv.org/abs/2305.18290

https://arxiv.org/abs/2404.03592

https://www.microsoft.com/en-us/research/blog/orca-agentinstruct-agentic-flows-can-be-effective-synthetic-data-generators/

https://www.interconnects.ai/p/openais-reinforcement-finetuning

https://arxiv.org/abs/2305.20050


思源数据科学
Towards AGI
 最新文章