
The Illustrated GPT-2 (Visualizing Transformer Language Models) | Ann's Blog, CSDN Blog
blog.csdn.net/qq_36667170/article/details/125529598?spm=1001.2014.3001.5501
Sep 8, 2023
2

Explaining the Transformer Model | Way to AGI
blog.waytoagi.com/article/transformer_explained
Sep 7, 2023
5

PyTorch 2.0
pytorch.org/get-started/pytorch-2.0/
Sep 4, 2023
2

Google Gemini Eats The World – Gemini Smashes GPT-4 By 5X, The GPU-Poors
www.semianalysis.com/p/google-gemini-eats-the-world-gemini?utm_source=substack&utm_medium=email
Aug 29, 2023
1
GPU Performance Background User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-gpu-background/index.html
Aug 28, 2023
1

Best Practices for Building and Deploying Recommender Systems - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/recsys-best-practices/index.html
Aug 24, 2023
1
Memory-Limited Layers User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-memory-limited/index.html
Aug 23, 2023
1
Recurrent Layers User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-recurrent/index.html
Aug 23, 2023
1
Convolutional Layers User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-convolutional/index.html
Aug 22, 2023
5
Linear/Fully-Connected Layers User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-fully-connected/index.html
Aug 21, 2023
3
Matrix Multiplication Background User's Guide - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-matrix-multiplication/index.html
Aug 17, 2023
1

The History of Open-Source LLMs: Better Base Models (Part Two)
cameronrwolfe.substack.com/p/the-history-of-open-source-llms-better?utm_source=substack&utm_medium=email
Aug 1, 2023
3

The History of Open-Source LLMs: Early Days (Part One)
cameronrwolfe.substack.com/p/the-history-of-open-source-llms-early
Aug 1, 2023
1

“This Is an Act of War”: Decoding the U.S. Chip Blockade Against China
cn.nytimes.com/usa/20230713/semiconductor-chips-us-china/?utm_source=news-list&utm_medium=email&utm_campaign=newsletter
Jul 28, 2023
12

NLP (18): An Overview of LLM Inference Optimization Techniques
zhuanlan.zhihu.com/p/642412124
Jul 17, 2023
1

NLP (17): From FlashAttention to PagedAttention, How to Further Optimize Attention Performance
zhuanlan.zhihu.com/p/638468472
Jul 14, 2023
3

Long Live DSA (5)
zhuanlan.zhihu.com/p/640870528
Jul 3, 2023
2
THUDM/ChatGLM2-6B: ChatGLM2-6B: An Open Bilingual Chat LLM
github.com/THUDM/ChatGLM2-6B?utm_campaign=explore-email&utm_medium=email&utm_source=newsletter&utm_term=daily
Jun 30, 2023
2

A Discussion of the FP32 Performance of Biren Technology's BR100
zhuanlan.zhihu.com/p/553502423
Jun 28, 2023
2