Defeating Nondeterminism in LLM Inference
thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/
Sep 13, 2025
11

Difference Array | Range update query in O(1) - GeeksforGeeks
www.geeksforgeeks.org/difference-array-range-update-query-o1/
Jan 6, 2025
1
Enterprise h2oGPTe
h2ogpte.genai.h2o.ai/feedback
Dec 27, 2024
Hermes 3 Technical Report - 2408.11857v1.pdf
arxiv.org/pdf/2408.11857
Dec 15, 2024
1
NousResearch/Hermes-3-Llama-3.2-3B · Hugging Face
huggingface.co/NousResearch/Hermes-3-Llama-3.2-3B
Dec 15, 2024
2410.23261v1.pdf
arxiv.org/pdf/2410.23261
Nov 24, 2024
1
2304.12206v2.pdf
arxiv.org/pdf/2304.12206
Jul 5, 2024

Mat’s Blog - CUDA MODE - Accelerate your code with massively parallel programming plus some other tricks
blog.matdmiller.com/posts/2024-02-15_custom_cuda_kernel_intro_and_benchmarks/notebook.html
Jun 20, 2024
7
Chroma - LlamaIndex
docs.llamaindex.ai/en/stable/examples/vector_stores/ChromaIndexDemo/
Jun 15, 2024

FineWeb: decanting the web for the finest text data at scale
huggingfacefw-blogpost-fineweb-v1.static.hf.space/dist/index.html
Jun 6, 2024
5

What We Learned from a Year of Building with LLMs (Part I)
www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/
May 31, 2024
4
2402.00530v1.pdf
arxiv.org/pdf/2402.00530v1
May 29, 2024
2
Your Work | Kaggle
www.kaggle.com/work/collections/14111779
May 29, 2024
1
Efficient NLP Model Finetuning via Multistage Data Filtering - 0455.pdf
www.ijcai.org/proceedings/2023/0455.pdf
May 24, 2024
16
2405.00732v1.pdf
arxiv.org/pdf/2405.00732
May 23, 2024
252
RoFormer - 2104.09864v5.pdf
arxiv.org/pdf/2104.09864
May 21, 2024
14

Transformer Architecture: The Positional Encoding - Amirhossein Kazemnejad's Blog
kazemnejad.com/blog/transformer_architecture_positional_encoding/
May 21, 2024
1

Towards 100x Speedup: Full Stack Transformer Inference Optimization (Notion)
yaofu.notion.site/Towards-100x-Speedup-Full-Stack-Transformer-Inference-Optimization-43124c3688e14cffaf2f1d6cbdf26c6c
May 19, 2024
2403.19887v1.pdf
arxiv.org/pdf/2403.19887v1
May 11, 2024
1
Musings on Building a Generative AI Product
www.linkedin.com/blog/engineering/generative-ai/musings-on-building-a-generative-ai-product
May 7, 2024
162
2404.14619v1.pdf
arxiv.org/pdf/2404.14619
Apr 28, 2024
2
2404.02258.pdf
arxiv.org/pdf/2404.02258.pdf
Apr 14, 2024

Answer.AI - A few tips for working on high-surface-area problems
www.answer.ai/posts/2024-04-12-tips.html
Apr 14, 2024
1
2203.11171.pdf
arxiv.org/pdf/2203.11171.pdf
Apr 13, 2024
311
2404.07965.pdf
arxiv.org/pdf/2404.07965.pdf
Apr 12, 2024
1
2403.07815.pdf
arxiv.org/pdf/2403.07815.pdf
Mar 31, 2024
324
2403.17297.pdf
arxiv.org/pdf/2403.17297.pdf
Mar 28, 2024
2403.08763.pdf
arxiv.org/pdf/2403.08763.pdf
Mar 28, 2024
2403.13372.pdf
arxiv.org/pdf/2403.13372.pdf
Mar 26, 2024
402

Keras documentation: Video Classification with a CNN-RNN Architecture
keras.io/examples/vision/video_classification/
Mar 26, 2024
3