Kevin Di's Highlights on '分析transformer模型的参数量、计算量、中间激活、KV cache' | Glasp