2310.11453.pdf thumbnail
2310.11453.pdf
arxiv.org
Another strand of quantizing deep neural networks is quantization-aware training memory and computation self-attention and feed-forward networks We leave the other components high-precision, e.g., 8-bit in our experiments e quantize the activation to 8-bit and leave lower precision in future work.
2 Users
0 Comments
54 Highlights
0 Notes

Top Highlights

  • Another strand of quantizing deep neural networks is quantization-aware training
  • memory and computation
  • self-attention and feed-forward networks
  • We leave the other components high-precision, e.g., 8-bit in our experiments
  • e quantize the activation to 8-bit and leave lower precision in future work.

Domain

Ready to highlight and find good content?

Glasp is a social web highlighter that people can highlight and organize quotes and thoughts from the web, and access other like-minded people’s learning.