📥 Last added

What's included

Domain specific architectures for AI inference

· fleetwood.dev · 25 mins

DeepSeek-V3 Explained 1: Multi-head Latent Attention

· Shirley Li · 8 mins

Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT

· Allen Philip · 5 mins

You could have designed state of the art positional encoding

· huggingface.co · 11 mins

attention is logarithmic, actually

· supaiku.com · 9 mins

AI Arrives In The Middle East: US Strikes A Deal with UAE and KSA – SemiAnalysis

· Dylan Patel · 12 mins

Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?

· Maxime Méloux, Silviu Maniu, François Portet, Maxime Peyrard · 38 mins

Are Transformers universal approximators of sequence-to-sequence functions?

· Chulhee Yun, Srinadh Bhojanapalli, Ankit Singh Rawat, Sashank J. Reddi, Sanjiv Kumar · 1 min

a Hugging Face Space by nanotron

· huggingface.co

arXiv:quant-ph/0011122v2 20 Dec 2000

· · 1 hr 19 mins

$60 Billion Dollars in losses ...

· rohit · 1 min

An elementary proof of a universal approximation theorem

· Chris Monico · 8 mins

A Group and Its Center, Intuitively

· math3ma.com · 3 mins

Understanding Entanglement With SVD

· math3ma.com · 8 mins

Training Large Language Models to Reason in a Continuous Latent Space

· arxiv.org · 39 mins

A Guide on Semiconductor Development

· Daud's Scout · 13 mins

Recommended Books and Resources

· Irrational Analysis · 5 mins

The Computer as a Communication Device

· J.C.R. Licklider and Robert W. Taylor · 27 mins

ARM's Chernobyl Moment

· Irrational Analysis · 5 mins

The Uses of Complacency

· Sarah Constantin · 10 mins

The Book of Shaders

· The Book of Shaders · 2 mins

Unstructured Thoughts on the Problems of OSS/FOSS

· gingerbill.org · 9 mins

On Bloat

· google.com

Training Large Language Models to Reason in a Continuous Latent Space

· Shibo Hao, Sainbayar Sukhbaatar, DiJia Su, Xian Li, Zhiting Hu, Jason Weston, Yuandong Tian · 2 mins

On the Biology of a Large Language Model

· transformer-circuits.pub · 2 hrs 42 mins

Do Llamas Work in English? On the Latent Language of Multilingual Transformers

· Chris Wendler, Veniamin Veselovsky, Giovanni Monea · 1 min

The Unsustainability of Moore’s Law

· Charles Rosenbauer · 12 mins

The Era of Experience Paper

· · 24 mins

"Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"

· Nathan Lambert · 3 mins

tt-metal/tech_reports/memory/allocator.md at main · tenstorrent/tt-metal

· https://github.com/tenstorrent/ · 4 mins

What Modern NVMe Storage Can Do, And How To Exploit It: High-Performance I/O for High-Performance Storage Engines

· Gabriel Haas and Viktor Leis · 42 mins

Memory on Tenstorrent

· Martin's website/blog thingy · 11 mins

Multi-layer language heads: the output latent is for text (and nothing else)

· Sebastian · 4 mins

Subnanosecond flash memory enabled by 2D-enhanced hot-carrier injection

· Yutong Xiang, Chong Wang, Chunsen Liu, Tanjun Wang, Yongbo Jiang, Yang Wang, Shuiyuan Wang, Peng Zhou · 28 mins

Andrew_S._Tanenbaum_-_Structured_Computer_Organization

· · 19 hrs 25 mins

CS336: Language Modeling from Scratch

· stanford-cs336.github.io · 4 mins

A Gentle Introduction to Lambda Calculus - Part 1: Syntax

· lucasfcosta.com · 6 mins

Getting Started

· lambda-the-ultimate.org · 16 mins

curry-howard.dvi

· JS · 5 hrs 7 mins

Intelligence as efficient model building

· Alex's blog · 35 mins

Contextualization Machines

· stochasm.blog · 14 mins

What Is ChatGPT Doing … and Why Does It Work?

· stephenwolfram.com · 1 hr 11 mins

Mission Apollo: Landing Optical Circuit Switching at Datacenter Scale

· Ryohei Urata, Hong Liu, Kevin Yasumura, Erji Mao, Jill Berger, Xiang Zhou, Cedric Lam, Roy Bannon, Darren Hutchinson, Daniel Nelson, Leon Poutievski, Arjun Singh, Joon Ong, Amin Vahdat · 34 mins

The Illustrated Transformer

· Jay Alammar · 15 mins

Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes

· Juergen Schmidhuber · 50 mins

Position: Model Collapse Does Not Mean What You Think

· Rylan Schaeffer, Joshua Kazdan, Alvan Caleb Arulandu, Sanmi Koyejo · 2 mins

paper

· · 1 hr 59 mins

88_HC2024.Tenstorrent.Jasmina.Davor.v7

· · 5 mins

RWKV Language Model

· rwkv.com · 1 min

Recent AI model progress feels mostly like

· lc · 9 mins

Device Placement Optimization with Reinforcement Learning

· Azalia Mirhoseini, Hieu Pham, Quoc V. Le, Benoit Steiner, Rasmus Larsen, Yuefeng Zhou, Naveen Kumar, Mohammad Norouzi, Samy Bengio, Jeff Dean · 24 mins

Building an Open Future

· tenstorrent.com · 9 mins

diffusion transformers

· google.com · 1 min

Faking ADTs and GADTs in Languages That Shouldn't Have Them

· Justin Le · 27 mins

Accelerate

· acceleratehs.org · 1 min

Ok Rust, You Really Have a Readability Problem

· David Lee · 1 min

Circuit Tracing: Revealing Computational Graphs in Language Models

· transformer-circuits.pub · 2 hrs 49 mins

Things that go wrong with disk IO

· eatonphil.com · 5 mins

Analyzing Modern NVIDIA GPU cores

· Rodrigo Huerta, Mojtaba Abaie Shoushtary, José-Lorenzo Cruz, Antonio González · 2 mins

tt-metal/tech_reports/AdvancedPerformanceOptimizationsForModels/AdvancedPerformanceOptimizationsForModels.md at main · tenstorrent/tt-metal · GitHub

· https://github.com/tenstorrent/ · 20 mins

paper.dvi

· · 28 mins

Kevin-and-Nick.PDF

· Administrator · 28 mins

Move Slow and Fix Things

· Matthias Endler · 6 mins

Why is Yazi fast?

· https://github.com/sxyazi · 5 mins

User Guide for NVPTX Back-end

· llvm.org · 35 mins

An AnandTech Interview with Jim Keller: 'The Laziest Person at Tesla'

· Dr. Ian Cutress · 58 mins

Notes/Primer on Clang Compiler Frontend (1) : Introduction and Architecture

· youssefaa.com · 8 mins

Implementation of simple microprocessor using verilog

· stackoverflow.com · 11 mins

learn-fpga/FemtoRV/TUTORIALS/FROM_BLINKER_TO_RISCV/README.md at master · BrunoLevy/learn-fpga · GitHub

· https://github.com/BrunoLevy/ · 1 hr 20 mins

Why async Rust?

· without.boats · 24 mins

Softmax Attention is a Fluke

· Ethan Smith · 11 mins

Transformers Laid Out

· Pramod’s Blog · 29 mins

Template Haskell

· haskell.org · 16 mins

A friendly introduction to machine learning compilers and optimizers

· Chip Huyen · 17 mins

Comments on Source

· lua-users.org · 4 mins

The_Implementation_of_Lua_5.0

· · 25 mins

Bloom’s 3 Stages of Talent Development

· Justin Skycak (@justinskycak) · 2 mins

introduction-to-algorithms-and-machine-learning

· · 2 hrs 42 mins

justinmath-linearAlgebra

· · 48 mins

Russell’s Paradox and Possible Solutions

· ups.edu · 5 mins

The Making of Python

· Bill Venners · 8 mins

Advice on Upskilling

· · 1 hr 7 mins

tt-metal/METALIUM_GUIDE.md at main · tenstorrent/tt-metal · GitHub

· https://github.com/tenstorrent/ · 9 mins

Scoping out the Tenstorrent Wormhole

· fosdem.org · 1 min

What’s the (floating) Point of all these data types? A (not so) brief overview of the history and usage of datatypes within the wide world of computation

· fosdem.org · 2 mins

Physics of language models

· allen-zhu.com · 4 mins

Tenstorrent first thoughts

· Martin's website/blog thingy · 4 mins

7 x 11.5 long title.p65

· vinodd · 9 mins

Neural Networks, Manifolds, and Topology

· colah.github.io · 12 mins

Attention from Beginners Point of View

· mrinalxdev.github.io · 3 mins

(How) Do Language Models Track State?

· Belinda Z. Li, Zifan Carl Guo, Jacob Andreas · 2 mins

Why Attention Is All You Need

· k-a.in · 8 mins

CFD Python: 12 steps to Navier-Stokes

· Spruce Interactive · 4 mins

tt-mlir documentation

· tenstorrent.com · 9 mins

Tutorials

· llvm.org · 1 min

Yizhou Shan's Home Page

· Yizhou Shan · 9 mins

Attention in SRAM on Tenstorrent Grayskull

· Moritz Thüning · 23 mins

Crossing the uncanny valley of conversational voice

· Sesame · 9 mins

Project 3: Distributed File System

· · 15 mins

By John Taylor Gatto

· John Taylor Gatto · 1 min

How to Think About TPUs

· Roy Frostig · 20 mins

Programming Really Is Simple Mathematics

· Bertrand Meyer, Reto Weber · 1 min

Tenstorrent Wormhole Series Part 1: Physicalities

· corsix.org · 5 mins

Community Highlight: Tenstorrent Wormhole Series Part 2: Which disabled rows?

· Tenstorrent · 4 mins

Execution-based Code Generation using Deep Reinforcement Learning

· Parshin Shojaee, Aneesh Jain, Sindhu Tipirneni, Chandan K. Reddy · 47 mins

The Impact of Generative AI on Critical Thinking: Self-Reported Reductions in Cognitive Effort and Confidence Effects From a Survey of Knowledge Workers

· · 1 hr 19 mins

neural video codecs: the future of video compression

· ahmad sandid · 18 mins

Lego Mindset vs. Woodworking Mindset

· Scott Stevenson · 8 mins

Gestalt Programming: A New Concept in Automatic Programming

· dl.acm.org

Mastering LLM Techniques: Evaluation

· NVIDIA Technical Blog · 9 mins

Mastering LLM Inference Techniques: Inference Optimization

· NVIDIA · 2 mins

Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling

· NVIDIA Technical Blog · 3 mins

The high-return activity of raising others’ aspirations

· Tyler Cowen · 1 min

Build Your Own Text Editor

· viewsourcecode.org · 1 min

The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

· Saurav Pawar, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Aman Chadha, Amitava Das · 1 hr 46 mins

Tilde, my LLVM alternative

· yasserarg.com · 3 mins

A WebAssembly compiler that fits in a tweet

· WebAssembly from the Ground Up · 13 mins

Transformer Memory as a Differentiable Search Index

· Yi Tay, Vinh Q. Tran, Mostafa Dehghani, Jianmo Ni, Dara Bahri, Harsh Mehta, Zhen Qin, Kai Hui, Zhe Zhao, Jai Gupta, Tal Schuster, William W. Cohen, Donald Metzler · 27 mins

history.dvi

· · 11 mins

Proof of correctness of data representation

· dl.acm.org

Unnamed Document

· dl.acm.org

Elements of Programming

· Alexander Stepanov and Paul McJones · 3 hrs 47 mins

Unveiling_DeepSeek.pdf

· Google Docs · 7 mins

Stating the problem in Lean

· Andrew Helwer · 9 mins

DeepSeek-V3 Explained: A Deep Dive into the Next-Generation AI Model

· NeuralNomad · 8 mins

Foundations of Large Language Models

· Tong Xiao, Jingbo Zhu · 1 min

A comprehensive study of Convergent and Commutative Replicated Data Types

· Marc Shapiro, Nuno Preguiça, Carlos Baquero, Marek Zawirski · 59 mins

hist

· · 27 mins

Part II - Algebraic Topology (Theorems with proof)

· Dexter Chua · 41 mins

fundamental-group

· · 52 mins

Category Theory: Lecture Notes and Online Books

· Logic Matters · 7 mins

Why Futhark?

· futhark-lang.org · 2 mins

Algorithms for Modern Hardware

· Danila Kutenin · 7 mins

DeepSeek-V3 Technical Report

· DeepSeek-AI, Aixin Liu, Bei Feng, Bing Xue, Bingxuan Wang, Bochao Wu, Chengda Lu, Chenggang Zhao, Chengqi Deng, Chenyu Zhang, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fucong Dai, Fuli Luo, Guangb... · 1 hr 25 mins

Linkers part 1

· airs.com · 3 mins

gcvsmalloc

· · 40 mins

"You and Your Research"

· virginia.edu · 55 mins

Bloom filters debunked: Dispelling 30 Years of bad math with Coq!

· Kiran Gopinathan · 9 mins

DeepSeek-V3/DeepSeek_V3.pdf at main · deepseek-ai/DeepSeek-V3

by Marcus Hutter and David Quarel and Elliot Catt

· Marcus Hutter · 4 mins

Deepseek: The Quiet Giant Leading China’s AI Race

· Jordan Schneider · 20 mins

The Double-E Infix Expression Parsing Method

· Erik Eidt · 5 mins

Demystifying Debuggers, Part 2: The Anatomy Of A Running Program

· Ryan Fleury · 23 mins

Towards a Categorical Foundation of Deep Learning: A Survey

· Francesco Riccardo Crescenzi · 2 mins

Soft question: Deep learning and higher categories

· MathOverflow · 2 mins

Algebraic Databases

· Patrick Schultz, David I. Spivak, Christina Vasilakopoulou, Ryan Wisnesky · 1 min

Categorical Databases

· Patrick Schultz, David Spivak, Ryan Wisnesky and others · 9 mins

walter

· · 20 mins

FPGAs for Software Engineers 0: The Basics

· Ross Schlaikjer · 17 mins

Data-Oriented Design

· dataorienteddesign.com · 30 mins

A note about "The Humane Representation of Thought"

· worrydream.com · 2 mins

BLT__Patches_Scale_Better_Than_Tokens

· · 48 mins

On Ousterhout’s Dichotomy Oct 6, 2024

· matklad.github.io · 3 mins

The categorical abstract machine

· G. Cousineau, P.-L. Curien, M. Mauny · 3 mins

Position: Categorical Deep Learning is an Algebraic Theory of All Architectures

· Bruno Gavranović, Paul Lessard, Andrew Dudzik, Tamara von Glehn, João G. M. Araújo, Petar Veličković · 1 hr 6 mins

Fundamental Components of Deep Learning: A category-theoretic approach

· Bruno Gavranović · 4 hrs 41 mins

Logic and linear algebra: an introduction

· Daniel Murfet · 1 min

Gemini: A Family of Highly Capable Multimodal Models

· Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin C... · 1 hr 53 mins

Flow Matching Guide and Code

· Yaron Lipman, Marton Havasi, Peter Holderrieth, Neta Shaul, Matt Le, Brian Karrer, Ricky T. Q. Chen, David Lopez-Paz, Heli Ben-Hamu, Itai Gat · 2 hrs 14 mins

Logical Complexity of Proofs

· Gödel's Lost Letter and P=NP · 3 mins

Mastering Board Games by External and Internal Planning with Language Models

· · 2 hrs 4 mins

Proofs and Types

· · 3 hrs 10 mins

Richard Hamming - Wikipedia

· wikipedia.org · 13 mins

What is the "question" that programming language theory is trying to answer?

· PhD · 2 mins

Introducing Limbo: A complete rewrite of SQLite in Rust

· turso.tech · 6 mins

TLA+ is hard to learn

· Lorin Hochstein · 5 mins

How hard is constraint programming?

· Wayne Joubert · 2 mins

Geeks, MOPs, and sociopaths in subculture evolution

· Meaningness · 9 mins

Advanced programming languages

· might.net · 11 mins

ugh.book

· simsong · 5 hrs 42 mins

Working memory - Wikipedia

· wikipedia.org · 50 mins

Working hurts less than procrastinating, we fear the twinge of starting

· Eliezer Yudkowsky · 4 mins

llama.cpp guide - Running LLMs locally, on any hardware, from scratch

· steelph0enix.dev · 53 mins

GitHub - avinassh/py-caskdb: (educational) build your own disk based KV store

· https://github.com/avinassh/ · 4 mins

Depth-First Procrastination

· — George Herbert · 2 mins

Command Line Interface Guidelines

· clig.dev · 40 mins

How Many Computers Are In Your Computer?

· Gwern Branwen · 10 mins

Category theory for scientists (Old version)

· David I. Spivak · 6 hrs 31 mins

Genie 2: A large-scale foundation world model

· Google DeepMind · 6 mins

Design Of This Website

· Gwern Branwen · 1 hr 14 mins

WilliamYi96/Awesome-Energy-Based-Models: A curated list of resources on energy-based models.

· https://github.com/WilliamYi96/ · 6 mins

"CBLL, Research Projects, Computational and Biological Learning Lab, Courant Institute, NYU"

· New York University · 5 mins

yataobian/awesome-ebm: Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)

· https://github.com/yataobian/ · 13 mins

TuringConf

· Dana Scott · 20 mins

Omens of exceptional talent

· Alexey Guzey · 3 mins

An Introduction to Current Theories of Consciousness

· lesswrong.com · 55 mins

Being the (Pareto) Best in the World

· johnswentworth · 4 mins

Greg Yang

· thegregyang.com · 6 mins

Some questions

· google.com · 1 min

A Century of Mathematics in America, Part I

· Peter Duren, Editor with assistance of Richard A. Askey and Uta C. Merzbach · 1 hr 13 mins

Fastest contributed programs, grouped by programming language implementation

· pages.debian.net · 1 min

Haskell as fast as C: working at a high altitude for low level performance

· Don Stewart · 12 mins

On Competing with C Using Haskell

· kqr · 10 mins

Performance

· haskell.org · 5 mins

TS_Tutorial

· · 1 hr 41 mins

Category Theory usage in Algebraic Topology

· Mathematics Stack Exchange · 1 min

Topos Theory in a Nutshell

· John Baez · 10 mins

context

· · 7 hrs 6 mins

Proof Explorer

· us.metamath.org · 1 hr 32 mins

An Invitation to Applied Category Theory

· Brendan Fong and David I. Spivak · 1 min

An Invitation to Applied Category Theory

· Brendan Fong · 3 mins

Introducing io_uring_spawn

· Jake Edge · 19 mins

Information Theory: A Tutorial Introduction

· James V Stone · 27 mins

Daniel Lemire's blog

· lemire.me · 40 mins

A Beginner's Guide to Vectorization By Hand: Part 3

· sbaziotis.com · 8 mins

Noghartt's garden

· Guilherme Ananias <me@noghartt.dev> · 1 min

Competitive Programming

· cpbook.net · 3 mins

My favorite books

· gabespace.xyz · 1 min

MOND←TECH MAGAZINE

· herecomesthemoon.net · 1 min

Coalescence: making LLM inference 5x faster

· dottxt, Inc. · 9 mins

Classics in the History of Psychology -- Miller (1956)

· Dan J. Denis · 33 mins

Creating enums at comptime

· openmymind.net · 5 mins

Zig's new declaration literals

· openmymind.net · 3 mins

Zig's (.{}){} syntax

· openmymind.net · 4 mins

How to get from high school math to cutting-edge ML/AI: a detailed 4-stage roadmap with links to the best learning resources that I’m aware of.

· Justin Skycak · 16 mins

How LLVM Optimizes a Function

· regehr.org · 11 mins

PS2_and_PC_BIOS_Interface_Technical_Reference_Apr87

· · 2 hrs 33 mins

How 99% of C Tutorials Get it Wrong

· sbaziotis.com · 5 mins

A Beginner's Guide to Vectorization By Hand: Part 1

· sbaziotis.com · 9 mins

Tell the Compiler What You Know

· sbaziotis.com · 6 mins

Compiler Optimization in a Language you Can Understand

· Stefanos Baziotis · 16 mins

How Target-Independent is Your IR?

· Stefanos Baziotis · 7 mins

Bibliopolis-Book-retypeset-1984

· · 1 hr 12 mins

Numerical Recipes

· numerical.recipes · 1 min

Unpacking Intuition

· Martin EP Seligman · 10 mins

For Beginners

· Julie Moronuki · 10 mins

Oasis: A Universe in a Transformer

· oasis-model.github.io · 4 mins

A Fat Pointer Library

· libcello.org · 7 mins

The Basics

· Thorsten Ball · 2 mins

TCP Server in Zig - Part 5a - Poll

· openmymind.net · 19 mins

6.824 Schedule: Spring 2022

· mit.edu · 1 min

2305.20091

· · 37 mins

Humans in 4D: Reconstructing and Tracking Humans with Transformers

· Jitendra Malik · 2 mins

slpj-book-1987.djvu

· franz · 7 hrs 20 mins

Typing the technical interview

· aphyr.com · 12 mins

Reversing the technical interview

· aphyr.com · 3 mins

Hexing the technical interview

· aphyr.com · 11 mins

Nine Rules for SIMD Acceleration of Your Rust Code (Part 1)

· Carl M. Kadie · 16 mins

Conscious exotica

· Murray Shanahan · 32 mins

B-trees and database indexes

· Benjamin Dicken · 17 mins

Safe C++

· safecpp.org · 1 hr 41 mins

Tutorial on Diffusion Models for Imaging and Vision

· Stanley H. Chan · 2 hrs 8 mins

Async Rust can be a pleasure to work with (without `Send + Sync + 'static`)

· Evan Schwartz · 20 mins

The Perfect Plan

· DeGatchi · 10 mins

The Fast Track

· Carter · 2 mins

Zig's BoundedArray

· openmymind.net · 4 mins

Linus Torvalds talks AI, Rust adoption, and why the Linux kernel is 'the only thing that matters'

· ZDNET · 5 mins

Intercepting and modifying Linux system calls with ptrace

· eatonphil.com · 11 mins

What's the big deal about Deterministic Simulation Testing?

· eatonphil.com · 11 mins

Zig and Emulators

· floooh.github.io · 15 mins

A ToC of the 20 part linker essay

· JesseW · 1 min

trading_interview_blog

· · 20 mins

`zig cc`: a Powerful Drop-In Replacement for GCC/Clang

· Andrew Kelley · 19 mins

Zig Build System

· ziglang.org · 21 mins

Resources for Amateur Compiler Writers

· c9x.me · 4 mins

MattPD/cpplinks: A categorized list of C++ resources.

· https://github.com/MattPD/ · 1 min

Putting the “You” in CPU

· Lexi Mattick · 1 min

How to Compile Your Language

· isuckatcs.github.io · 3 mins

Introduction to the Odin Programming Language

· Karl Zylinski · 46 mins

Arena allocator tips and tricks

· nullprogram.com · 10 mins

No Starch Press

· nostarch.com · 1 min

Part 2: Portable Executable Files

· x86re.com · 19 mins

bytecode interpreters for tiny computers

· dercuano.github.io · 37 mins

How I built zig-sqlite

· rischmann.fr · 8 mins

The Hunt for the Missing Data Type

· Hillel Wayne · 11 mins

Microfeatures I'd like to see in more languages

· buttondown.email · 6 mins

Google’s Fully Homomorphic Encryption Compiler — A Primer

· Math ∩ Programming · 14 mins

Will I be able to access proprietary platform APIs (e.g. Android / iOS)?

· webassembly.org · 11 mins

The future of Clang-based tooling

· Peter Goodman · 9 mins

Fast Multidimensional Matrix Multiplication on CPU from Scratch

· siboehm.com · 12 mins

Efficient n-states on x86 systems

· halobates.de · 4 mins

Program tuning as a resource allocation problem

· halobates.de · 5 mins

How web bloat impacts users with slow connections

· danluu.com · 46 mins

Files are hard

· danluu.com · 19 mins

Ringing in a new asynchronous I/O API

· Jonathan Corbet · 7 mins

applicative-mental-models

· Andi Kleen · 5 mins

Optimizing subroutines in assembly language

· Agner Fog · 4 hrs 1 min

Brian Robert Callahan

· briancallahan.net · 13 mins

QBE vs LLVM

· c9x.me · 2 mins

Recent presentations and papers

· Andi Kleen · 7 mins

brotli-2015-09-22

· · 6 mins

How long does it take to make a context switch?

· tsunanet.net · 11 mins

Ghostty Devlog 001

· Mitchell Hashimoto · 6 mins

Tiled Matrix Multiplication

· penny-xu.github.io · 5 mins

Rust Atomics and Locks

· Mara Bos · 3 mins

Compiler Backend

· c9x.me · 3 mins

Vale's Memory Safety Strategy: Generational References and Regions

· Evan Ovadia · 15 mins

Introduction

· 1024cores.net · 4 mins

Cache-Oblivious Algorithms

· 1024cores.net · 4 mins

A Memory Allocator

· oswego.edu · 10 mins

Cramming: Training a Language Model on a Single GPU in One Day

· Jonas Geiping, Tom Goldstein · 48 mins

The MiniPile Challenge for Data-Efficient Language Models

· Jean Kaddour · 15 mins

Stretching Each Dollar: Diffusion Training from Scratch on a Micro-Budget

· arXiv.org · 2 mins

1024cores

· 1024cores.net · 1 min

Implementing interactive languages

· scattered-thoughts.net · 7 mins

Pointers Are Complicated, or: What's in a Byte?

· ralfj.de · 11 mins

Three Architectures for a Responsive IDE

· rust-analyzer.github.io · 8 mins

How a Zig IDE Could Work Feb 10, 2023

· matklad.github.io · 7 mins

Properly Testing Concurrent Data Structures Jul 5, 2024

· matklad.github.io · 18 mins

Parse, don’t validate

· lexi-lambda.github.io · 16 mins

Too Fast, Too Megamorphic: what influences method call performance in Java?

· Richard Warburton · 9 mins

The Black Magic of (Java) Method Dispatch

· Let’s implement these · 1 min

Why null sucks, even if it's checked

· Maybe Waffle · 12 mins

Unnamed Document

· dl.acm.org

Resources for Building Programming Languages

· Colin Davis · 4 mins

Little 'Big Ideas' in Programming Language Design

· Colin Davis · 7 mins

Computer Networking: A Top-Down Approach

· amazon.com · 2 mins

Using Uninitialized Memory for Fun and Profit Posted on Friday, March 14, 2008.

· Russ Cox · 5 mins

Zip Files All The Way Down

· Russ Cox · 10 mins

UTF-8: Bits, Bytes, and Benefits Posted on Friday, March 5, 2010.

· Russ Cox · 4 mins

Minimal Boolean Formulas

· Russ Cox · 20 mins

Hacking the OS X Kernel for Fun and Profiles Posted on Tuesday, August 13, 2013.

· Russ Cox · 8 mins

How To Build a User-Level CPU Profiler Posted on Thursday, August 8, 2013.

· Russ Cox · 7 mins

An Encoded Tree Traversal

· Russ Cox · 5 mins

Our Software Dependency Problem

· Russ Cox · 21 mins

The Magic of Sampling, and its Limitations Posted on Saturday, February 4, 2023.

· Russ Cox · 8 mins

Running the “Reflections on Trusting Trust” Compiler Posted on Wednesday, October 25, 2023.

· Russ Cox · 19 mins

Improving the Font Pipeline

· hypersect.com · 18 mins

Easy Scalable Text Rendering on the GPU

· Evan Wallace · 6 mins

Adventures in Text Rendering: Kerning and Glyph Atlases

· David Stern · 14 mins

Exploring the Power of Negative Space Programming

· double-trouble.dev · 4 mins

CompilerTalkFinal

· · 15 mins

Graydon Hoare: 21 compilers and 3 orders of magnitude in 60 minutes

· Charles Stewart · 1 min

p75-hoare

· · 32 mins

Updating the Go Memory Model

· Russ Cox · 16 mins

Programming Language Memory Models (Memory Models, Part 2) Posted on Tuesday, July 6, 2021.

· Russ Cox · 35 mins

Hardware Memory Models (Memory Models, Part 1) Posted on Tuesday, June 29, 2021.

· Russ Cox · 22 mins

Baby Steps to a C Compiler

· Wilfred Hughes · 3 mins

Kernel Programming Guide

· apple.com · 1 min

Tiny Tapeout

· Quicker, easier and cheaper to make your own chip! · 2 mins

Why Pascal is Not My Favorite Programming Language

· Brian W. Kernighan · 24 mins

What Color is Your Function?

· stuffwithstuff.com · 13 mins

What is an Invariant? Oct 6, 2023

· matklad.github.io · 8 mins

Chess-GPT's Internal World Model

· Adam Karvonen · 14 mins

Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models

· arXiv.org · 1 min

Manipulating Chess-GPT's World Model

· Adam Karvonen · 13 mins

Crafting an Interpreter in Zig - part 1

· Zig NEWS · 3 mins

Teach Yourself Programming in Ten Years

· norvig.com · 11 mins

What Every Computer Scientist Should Know About Floating-Point Arithmetic

· oracle.com · 2 hrs 5 mins

The Development of the C Language

· bell-labs.com · 33 mins

Class Warfare

· the singularity is nearer · 5 mins

Ownership

A Note About Zig Books for the Zig Community

· kristoff.it · 7 mins

Your Starting Point!

· scratchapixel.com · 20 mins

Zig Interfaces for the Uninitiated, an update

· Zig NEWS · 5 mins

Zig Interfaces for the Uninitiated

· nmichaels.org · 7 mins

Exploring Compile-Time Interfaces in Zig

· Jerry Thomas · 6 mins

Aro - a C compiler

· vexu.eu · 1 min

Do you want to learn how databases really work?

· Peter Kraft · 1 min

Database Systems

· CMU 15-445/645 · 1 min

Discovering and exploring mmap using Go

· Bruno Calza · 6 mins

But how, exactly, databases use mmap?

· Bruno Calza · 8 mins

re: How memory mapped files, filesystems and cloud storage works

· ayende.com · 3 mins

Implementing a file pager in Zig

· ayende.com · 5 mins

Criticizing Hare language approach for generic data structures

· ayende.com · 4 mins

spikedoanz/from-bits-to-intelligence: machine learning stack in under 100,000 lines of code

· https://github.com/spikedoanz/ · 3 mins

One year of C

· floooh.github.io · 7 mins

Heap Memory and Allocators

· openmymind.net · 17 mins

Pointers

· openmymind.net · 13 mins

Learning Zig - Pointers

Emulator 101

· emulator101.com

Data Compression Explained

· Matt Mahoney · 3 hrs 5 mins

Twitter's Recommendation Algorithm

· Twitter Friday · 6 mins

Programming languages resources

· Max Bernstein · 3 mins

3D Math Primer for Graphics and Game Development

· gamemath.com · 2 mins

Welcome to OpenGL

· learnopengl.com · 3 mins

WebGPU Fundamentals

· webgpufundamentals.org · 1 min

An opinionated beginner’s guide to Haskell in mid-2019

· typesanitizer.com · 13 mins

Are tagged unions overrated?

· typesanitizer.com · 9 mins

C++ Core Guidelines

· isocpp.github.io · 7 hrs 19 mins

What every systems programmer should know about concurrency

· Matt Kline · 25 mins

compiler_construction

· · 36 mins

How do we tell truths that might hurt?

· Edsger W.Dijkstra · 3 mins

The next fifty years

· University of Texas in Austin · 8 mins

Recommender Systems: A Primer

· Pablo Castells, Dietmar Jannach · 1 hr 53 mins

http client in the standard library · Issue #2007 · ziglang/zig

· https://github.com/ziglang/ · 7 mins

Introduction to Compilers and Language Design

· Douglas Thain · 2 mins

Bare Metal Zig

· Austin Hanson · 9 mins

Comparing SIMD on x86-64 and arm64

· yiningkarlli.com · 53 mins

Compiler Optimizations Are Hard Because They Forget

· Aria Beingessner · 9 mins

C Isn't A Programming Language Anymore

· Aria Beingessner · 15 mins

Writing a C Compiler, Part 1

· norasandler.com · 15 mins

GitHub - DoctorWkt/acwj: A Compiler Writing Journey

· https://github.com/DoctorWkt/ · 3 mins

A new JIT engine for PHP-8.4/9

· externals.io · 2 mins

Unknown

· · 1 hr 9 mins

Introduction 2016 NUMA Deep Dive Series

· staroceans.org · 3 mins

von Neumann architecture - Wikipedia

· wikipedia.org · 16 mins

Compiling tree transforms to operate on packed representations

· ecoop.org · 1 min

Pipelines Support Vectorized, Point-Free, and Imperative Style

· oilshell.org · 4 mins

Entering text in the terminal is complicated

· Julia Evans · 7 mins

What happens when you start a process on Linux?

· Julia Evans · 4 mins

Debug your programs like they're closed source!

· Julia Evans · 6 mins

How I got better at debugging

· Julia Evans · 4 mins

Media Page Under Construction

· Handmade Cities · 1 min

Infographics: Operation Costs in CPU Clock Cycles

· "No Bugs" Hare · 18 mins

Handles are the better pointers

· floooh.github.io · 12 mins

You're Not Sick of Programming

· Shubham Jain's Blog · 1 min

Zig Bare Metal Programming on STM32F103 — Booting up

· Mattia Maldini · 12 mins

OWASP Top Ten

· owasp.org · 3 mins

Introduction

· owasp.org · 1 min

The Copenhagen Book

· The Copenhagen Book · 1 min

Undefined Behavior deserves a better reputation

· ralfj.de · 10 mins

KHM+15

· · 40 mins

Learning LLVM (Part-1) - Writing a simple LLVM pass

· sh4dy · 3 mins

Some Were Meant for C

· Stephen Kell · 54 mins

Xv6, a simple Unix-like teaching operating system

· mit.edu · 3 mins

C Is Not a Low-level Language

· David Chisnall · 14 mins

Should you learn C to "learn how the computer works"?

· steveklabnik.com · 10 mins

A Guide to Undefined Behavior in C and C++, Part 1

· regehr.org · 14 mins

Using neural nets to recognize handwritten digits

· neuralnetworksanddeeplearning.com · 55 mins

When Network is Faster than Cache

· Simon Hearne · 10 mins

John Carmack on Functional Programming in C++

· Go away, the cloud is full · 10 mins

Zig-style generics are not well-suited for most languages

· typesanitizer.com · 10 mins

WebGL2 vs WebGL1

· webgl2fundamentals.org · 15 mins

WebGL How It Works

· webglfundamentals.org · 8 mins

The_Night_Watch

· · 11 mins

FreeType

· GitLab · 2 mins

A Freestanding Rust Binary

· Philipp Oppermann · 14 mins

Manually linking Rust binaries to support out-of-tree LLVM passes

· Chris Chandler · 6 mins

The Rust Reference

· rust-lang.org · 6 mins

The Rust Borrow Checker - A Deep Dive - Nell Shamrell-Harrington, Microsoft

· CNCF [Cloud Native Computing Foundation] · 22:23

Rust Compiler Development Guide

· rust-lang.org · 11 mins

How to speed up the Rust compiler one last time

· mozilla.org · 8 mins

How to speed up the Rust compiler in March 2024

· Nicholas Nethercote · 4 mins

Zig Bits 0x4: Building an HTTP client/server from scratch

· orhun.dev · 16 mins

Do We Really Need A Link Step?

· Robert · 3 mins

Death Note: L, Anonymity & Eluding Entropy

· Gwern Branwen · 40 mins

jamiebuilds/the-super-tiny-compiler: ⛄ Possibly the smallest compiler ever

· https://github.com/jamiebuilds/ · 1 min

5 Days to Virtualization: A Series on Hypervisor Development

· Daax Rynd · 4 mins

In-depth analysis on Valorant’s Guarded Regions

· Xyrem Engineering · 11 mins

Exploit Development: No Code Execution? No Problem! Living The Age of VBS, HVCI, and Kernel CFG

· future builds · 1 hr 10 mins

Reader

· jina.ai · 5 mins

CheerpX versus WebContainers

· Leaning Technologies Developer · 3 mins

Creating a Rootkit to Learn C

· The Human Machine Interface · 25 mins

Picsart-AI-Research/LIVE-Layerwise-Image-Vectorization: [CVPR 2022 Oral] Towards Layer-wise Image Vectorization

· https://github.com/Picsart-AI-Research/ · 2 mins

Udacity CS344: Intro to Parallel Programming

· NVIDIA Developer · 2 mins

CS 361: Systems Programming

· uic.edu · 2 mins

Resolving Rust Symbols

· Shriram Balaji · 12 mins

When FFI Function Calls Beat Native C

· Chris Wellons · 7 mins

Cap'n Proto, FlatBuffers, and SBE

· capnproto.org · 12 mins

A Database Without Dynamic Memory Allocation

· tigerbeetle.com · 5 mins

Wizard Zines Collection!

· wizard zines · 2 mins

Aggregating Millions of Groups Fast in Apache Arrow DataFusion 28.0.0

· alamb, Dandandan, tustvold · 12 mins

Problems of C, and how Zig addresses them

· Avestura's Blog · 11 mins

How to use hash map contexts to save memory when doing a string table

· Zig NEWS · 2 mins

resume.txt

· GitHub · 8 mins

Leslie Lamport

· lamport.azurewebsites.net · 2 hrs 55 mins

Indices and tables

· compilergym.com · 1 min

448997590_1496256481254967_2304975057370160015_n

· · 46 mins

I've spent the past ~2 weeks building a GPU from...

· adammaj · 2 mins

Bare Bones

· osdev.org · 21 mins

Unnamed video

· Unknown

The Graphics Codex

· graphicscodex.com · 2 mins

[2305.13009] Textually Pretrained Speech Language Models

· Download PDF · 1 min

Notes on partial borrows

· Rust Internals · 6 mins

Dioxus Labs + “High-level Rust”

· Notion · 18 mins

Compile-Time Configuration For Zig Libraries

· openmymind.net · 3 mins

Generics

· openmymind.net · 5 mins

Zig's HashMap - Part 1

· Jan · 10 mins

Zig Parser

· Mitchell Hashimoto · 12 mins

Copying Better: How To Acquire The Tacit Knowledge of Experts

· Cedric Chin · 23 mins

Unnamed video

· Unknown

Causal ordering

· scattered-thoughts.net · 4 mins

Assorted thoughts on zig (and rust)

· scattered-thoughts.net · 19 mins

Columnar kernels in go?

· scattered-thoughts.net · 10 mins

An opinionated map of incremental and streaming systems

· scattered-thoughts.net · 6 mins

Internal consistency in streaming systems

· scattered-thoughts.net · 25 mins

Pain we forgot

· scattered-thoughts.net · 12 mins

Have you tried rubbing a database on it?

· hytradboi.com · 1 min

The shape of data

· scattered-thoughts.net · 23 mins

Reflections on a decade of coding

· scattered-thoughts.net · 2 mins

Prospecting for Hash Functions

· Chris Wellons · 9 mins

The Missing Zig Polymorphism / Runtime Dispatch Reference

· revivalizer.xyz · 6 mins

Nanosystems

· Bob Schwabach · 2 mins

How To Become A Hacker

· Eric Steven Raymond · 36 mins

the rr debugging experience

· rr-project.org · 6 mins

Text Buffer Reimplementation

· @njukidreborn · 12 mins

What Is The Minimal Set Of Optimizations Needed For Zero-Cost Abstraction?

· ocallahan.org · 5 mins

Using ASCII waveforms to test hardware designs

· Andrew Ray · 5 mins

Rust 2019 and beyond: limits to (some) growth.

· dreamwidth.org · 9 mins

Your ABI is Probably Wrong

· outerproduct.net · 4 mins

GitHub - sirupsen/napkin-math: Techniques and numbers for estimating system's performance from first-principles

· https://github.com/sirupsen/ · 7 mins

Don't write bugs

· teamten.com · 2 mins

technicalities: "not rocket science" (the story of monotone and bors)

· dreamwidth.org · 4 mins

Why is Python slow

· kmod's blog · 4 mins

Design duality and the expression problem

· tedinski.com · 11 mins

Random Thoughts On Rust: crates.io And IDEs

· ocallahan.org · 4 mins

John Carmack on Inlined Code

· John Carmack · 10 mins

📥 Last added

Saved in the past week and not yet archived (default view)

What's included

Domain specific architectures for AI inference

DeepSeek-V3 Explained 1: Multi-head Latent Attention

Optimizing Transformer-Based Diffusion Models for Video Generation with NVIDIA TensorRT

You could have designed state of the art positional encoding

attention is logarithmic, actually

AI Arrives In The Middle East: US Strikes A Deal with UAE and KSA – SemiAnalysis

Transformers Represent Belief State Geometry in their Residual Stream

Llama from scratch (or how to implement a paper without crying)

The Curse of Knowing How, or; Fixing Everything

High-Performance Domain-Specific Compilation Without Domain-Specific Compilers

The MAP-Elites Algorithm: Finding Optimality Through Diversity

How To Scale

Deep Dive into Yann LeCun’s JEPA

Everything, Everywhere, All at Once: Is Mechanistic Interpretability Identifiable?

Are Transformers universal approximators of sequence-to-sequence functions?

a Hugging Face Space by nanotron

arXiv:quant-ph/0011122v2 20 Dec 2000

$60 Billion Dollars in losses ...

An elementary proof of a universal approximation theorem

A Group and Its Center, Intuitively

Understanding Entanglement With SVD

Training Large Language Models to Reason in a Continuous Latent Space

A Guide on Semiconductor Development

Recommended Books and Resources

The Computer as a Communication Device

ARM's Chernobyl Moment

The Uses of Complacency

The Book of Shaders

Unstructured Thoughts on the Problems of OSS/FOSS

On Bloat

Training Large Language Models to Reason in a Continuous Latent Space

On the Biology of a Large Language Model

Do Llamas Work in English? On the Latent Language of Multilingual Transformers

The Unsustainability of Moore’s Law

The Era of Experience Paper

"Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"

tt-metal/tech_reports/memory/allocator.md at main · tenstorrent/tt-metal

What Modern NVMe Storage Can Do, And How To Exploit It: High-Performance I/O for High-Performance Storage Engines

Memory on Tenstorrent

Multi-layer language heads: the output latent is for text (and nothing else)

Subnanosecond flash memory enabled by 2D-enhanced hot-carrier injection

Andrew_S._Tanenbaum_-_Structured_Computer_Organization

CS336: Language Modeling from Scratch

A Gentle Introduction to Lambda Calculus - Part 1: Syntax

Getting Started

curry-howard.dvi

Intelligence as efficient model building

Contextualization Machines

What Is ChatGPT Doing … and Why Does It Work?

Mission Apollo: Landing Optical Circuit Switching at Datacenter Scale

The Illustrated Transformer

Driven by Compression Progress: A Simple Principle Explains Essential Aspects of Subjective Beauty, Novelty, Surprise, Interestingness, Attention, Curiosity, Creativity, Art, Science, Music, Jokes

Position: Model Collapse Does Not Mean What You Think

paper

88_HC2024.Tenstorrent.Jasmina.Davor.v7

RWKV Language Model

Recent AI model progress feels mostly like

Device Placement Optimization with Reinforcement Learning

Building an Open Future

diffusion transformers

diffusion transformers

Faking ADTs and GADTs in Languages That Shouldn't Have Them

Accelerate

Ok Rust, You Really Have a Readability Problem

Circuit Tracing: Revealing Computational Graphs in Language Models

Things that go wrong with disk IO

Analyzing Modern NVIDIA GPU cores

tt-metal/tech_reports/AdvancedPerformanceOptimizationsForModels/AdvancedPerformanceOptimizationsForModels.md at main · tenstorrent/tt-metal · GitHub

paper.dvi

Kevin-and-Nick.PDF

Move Slow and Fix Things

Why is Yazi fast?

User Guide for NVPTX Back-end

An AnandTech Interview with Jim Keller: 'The Laziest Person at Tesla'

Notes/Primer on Clang Compiler Frontend (1) : Introduction and Architecture

Implementation of simple microprocessor using verilog

learn-fpga/FemtoRV/TUTORIALS/FROM_BLINKER_TO_RISCV/README.md at master · BrunoLevy/learn-fpga · GitHub