Linear/Fully-Connected Layers User's Guide - NVIDIA Docs --- 线性/全连接层用户指南 - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-fully-connected/index.html
Aug 21, 2023
3
Matrix Multiplication Background User's Guide - NVIDIA Docs --- 矩阵乘法背景用户指南 - NVIDIA Docs
docs.nvidia.com/deeplearning/performance/dl-performance-matrix-multiplication/index.html
Aug 17, 2023
1

The History of Open-Source LLMs: Better Base Models (Part Two)
cameronrwolfe.substack.com/p/the-history-of-open-source-llms-better?utm_source=substack&utm_medium=email
Aug 1, 2023
3

The History of Open-Source LLMs: Early Days (Part One)
cameronrwolfe.substack.com/p/the-history-of-open-source-llms-early
Aug 1, 2023
1

“这是一种战争行为”:解码美国对华芯片封锁行动
cn.nytimes.com/usa/20230713/semiconductor-chips-us-china/?utm_source=news-list&utm_medium=email&utm_campaign=newsletter
Jul 28, 2023
12

NLP(十八):LLM 的推理优化技术纵览
zhuanlan.zhihu.com/p/642412124
Jul 17, 2023
1

NLP(十七):从 FlashAttention 到 PagedAttention, 如何进一步优化 Attention 性能
zhuanlan.zhihu.com/p/638468472
Jul 14, 2023
3

Long Live DSA (5)
zhuanlan.zhihu.com/p/640870528
Jul 3, 2023
2
THUDM/ChatGLM2-6B: ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
github.com/THUDM/ChatGLM2-6B?utm_campaign=explore-email&utm_medium=email&utm_source=newsletter&utm_term=daily
Jun 30, 2023
2

对壁仞科技BR100的FP32性能的商榷
zhuanlan.zhihu.com/p/553502423
Jun 28, 2023
2