Prateek Joshi


28 Quotes

"1. Model Pruning"
Prateek Joshi
How to make LLMs faster
"Reduce the size of the model by eliminating parameters."
Prateek Joshi
How to make LLMs faster
"2. Quantization"
Prateek Joshi
How to make LLMs faster
"Reduce the precision of the numerical values used within the model. For example, you can switch from float32 to float16 (or even further down to int8)."
Prateek Joshi
How to make LLMs faster
"3. Model Distillation"
Prateek Joshi
How to make LLMs faster
"Train a smaller model to imitate the behavior of a larger model."
Prateek Joshi
How to make LLMs faster
"4. Parallel Processing"
Prateek Joshi
How to make LLMs faster
"5. Subword Tokenization"
Prateek Joshi
How to make LLMs faster
"Break words into smaller units (i.e. subwords). This will allow you to reduce the size of the vocabulary."
Prateek Joshi
How to make LLMs faster
"6. Optimized Libraries"
Prateek Joshi
How to make LLMs faster
"Use highly optimized libraries (like Nvidia's TensorRT) to run your AI workloads. It can significantly boost the performance."
Prateek Joshi
How to make LLMs faster
"7. Batch Inference Workloads"
Prateek Joshi
How to make LLMs faster
"A good chunk of the chip's memory bandwidth is consumed by the model parameters that you load."
Prateek Joshi
How to make LLMs faster
"You don't have to load model parameters for every input sequence. You can batch them together and load the parameters only once."
Prateek Joshi
How to make LLMs faster
"8. Adapters"
Prateek Joshi
How to make LLMs faster
"They are compact additional layers in the model (e.g. LoRa, QLoRa). These layers are tunable, which means you can train them to do what you want. You can make these layers lightweight, which helps the model to learn quickly."
Prateek Joshi
How to make LLMs faster
"These are AI-infused products that are designed to solve specific problems in a particular vertical."
Prateek Joshi
Verticalized AI
"Verticalized AI models are specific to that domain and cannot really do much outside of that domain."
Prateek Joshi
Verticalized AI
"1. Luxury of dealing only with domain-specific data"
Prateek Joshi
Verticalized AI
"2. Infusing domain knowledge into the product"
Prateek Joshi
Verticalized AI
"3. Domain expertise of the people"
Prateek Joshi
Verticalized AI
"4. Integrating with existing systems"
Prateek Joshi
Verticalized AI
"5. Customer willingness to pay is high"
Prateek Joshi
Verticalized AI
"6. Value capture is high"
Prateek Joshi
Verticalized AI
"Because it will cost way more for the customer to go out and use disjointed tools."
Prateek Joshi
Verticalized AI
"7. Intra-domain network effects"
Prateek Joshi
Verticalized AI
"8. Snowball effect of product usage data"
Prateek Joshi
Verticalized AI
"Verticalized AI is particularly well suited to the enterprise. And companies that build verticalized AI applications are poised to win big time."
Prateek Joshi
Verticalized AI

Want to Save Quotes?

Glasp is a social web highlighter that people can highlight and organize quotes and thoughts from the web, and access other like-minded people’s learning.