jxmorris12
Wed Dec 7, 2016 4:53am PST
Karma:
1259
about
personal website: jxmo.io
submitted
Fri Feb 21, 2025 6:23pm PST
I think Yann LeCun was right about LLMs (but perhaps only by accident)
@jxmorris12
25
96
119
Fri Feb 21, 2025 3:53am PST
Please Commit More Blatant Academic Fraud (2021)
@jxmorris12
28
107
145
Thurs Feb 20, 2025 10:46pm PST
Demystifying Noise Contrastive Estimation
@jxmorris12
1
Thurs Feb 20, 2025 2:09pm PST
It's time to become an ML engineer (2022)
@jxmorris12
13
48
38
Wed Feb 19, 2025 6:44pm PST
Approximating KL Divergence (2020)
@jxmorris12
2
Wed Feb 19, 2025 6:24pm PST
The Ultra-Scale Playbook: Training LLMs on GPU Clusters
@jxmorris12
3
3
29
Wed Feb 19, 2025 2:37am PST
Implementing LLaMA3 in 100 Lines of Pure Jax
@jxmorris12
8
22
163
Tues Feb 18, 2025 8:48pm PST
Outperforming cuBLAS on H100: A Worklog
@jxmorris12
3
Tues Feb 18, 2025 3:14am PST
Gravel Map
@jxmorris12
7
17
54
Mon Feb 17, 2025 4:56am PST
Stochastic Integration for Poets
@jxmorris12
3
Mon Feb 17, 2025 3:09am PST
NASA writes space-proof code [video]
@jxmorris12
3
Sun Feb 16, 2025 3:27pm PST
Large Lambda Model
@jxmorris12
1
Sun Feb 16, 2025 7:08am PST
Softmax forever, or why I like softmax
@jxmorris12
14
110
181
Fri Feb 14, 2025 7:30pm PST
Diffusion Without Tears
@jxmorris12
8
20
62
Fri Feb 14, 2025 3:42pm PST
AGI Safety Course Workbook
@jxmorris12
1
Wed Feb 12, 2025 3:22pm PST
AI Nationalism – Ian Hogarth
@jxmorris12
3
Wed Feb 12, 2025 2:28pm PST
A Beginners' Guide to Misprints in Magic
@jxmorris12
1
1
Mon Feb 10, 2025 7:06pm PST
Flow with What You Know
@jxmorris12
1
Mon Feb 10, 2025 6:34pm PST
Going with the Flow: An Introduction to Normalizing Flows
@jxmorris12
1
Mon Feb 10, 2025 4:46pm PST
Diffusion Meets Flow Matching: Two Sides of the Same Coin
@jxmorris12
2
Mon Feb 10, 2025 4:51am PST
Muon: An optimizer for hidden layers in neural networks
@jxmorris12
3
Sun Feb 9, 2025 3:00pm PST
Honeycrisp: An Apple-First Deep Learning Framework
@jxmorris12
6
Fri Feb 7, 2025 3:39am PST
My Gear
@jxmorris12
3
Thurs Feb 6, 2025 2:00pm PST
DOGE for AI
@jxmorris12
4
Tues Feb 4, 2025 9:18pm PST
How to Backpack
@jxmorris12
2
Tues Feb 4, 2025 8:13pm PST
GRPO with Verifiable Rewards Is Contrastive Loss
@jxmorris12
2
Tues Feb 4, 2025 4:51pm PST
Bit Prediction
@jxmorris12
1
Tues Feb 4, 2025 4:35pm PST
How Could Telepathy Work?
@jxmorris12
1
1
2
Tues Feb 4, 2025 12:07am PST
How to Scale Your Model
@jxmorris12
1
1
4
Sat Feb 1, 2025 10:11pm PST
RLHF Book
@jxmorris12
10
37
479
Wed Jan 29, 2025 6:57pm PST
Reading notes: unsupervised word translation
@jxmorris12
1
Tues Jan 28, 2025 3:40pm PST
The Art of Debugging
@jxmorris12
2
Tues Jan 28, 2025 3:34pm PST
What a $500k grant proposal looks like
@jxmorris12
3
Sat Jan 25, 2025 3:16pm PST
Deep Reinforcement Learning Doesn't Work Yet
@jxmorris12
1
Fri Jan 24, 2025 3:46am PST
How far can you get in 40 minutes from each subway station in NYC?
@jxmorris12
44
202
328
Thurs Jan 23, 2025 8:39pm PST
Attention Sinks in LLMs for endless fluency
@jxmorris12
1
Tues Jan 21, 2025 5:09pm PST
Flow with What You Know: An Introduction to Flow-Based Models
@jxmorris12
1
4
2
Fri Jan 17, 2025 9:24pm PST
A History of Nvidia Stream Multiprocessor (2020)
@jxmorris12
1
2
Thurs Jan 16, 2025 2:41am PST
Gaming TruthfulQA: Simple Heuristics Exposed Dataset Weaknesses
@jxmorris12
3
Wed Jan 15, 2025 1:04am PST
Ε, a Nuisance No More
@jxmorris12
1
2
10
Mon Jan 13, 2025 2:49pm PST
Learning CUDA by Optimizing Softmax
@jxmorris12
2
Sat Jan 11, 2025 1:33am PST
History of Residuals and a Word of Caution
@jxmorris12
2
Fri Jan 3, 2025 8:03pm PST
MixBox: Practical Pigment Mixing for Digital Painting [pdf]
@jxmorris12
3
Thurs Jan 2, 2025 3:33pm PST
Detecting Tanks (2017)
@jxmorris12
1
2
Thurs Jan 2, 2025 1:49pm PST
The Bittersweet Lesson
@jxmorris12
2
Thurs Jan 2, 2025 1:48pm PST
Why transformers are obviously good models of language
@jxmorris12
6
Thurs Dec 26, 2024 11:55pm PST
What would happen if you made a planet out of fish?
@jxmorris12
2
2
22
Wed Dec 25, 2024 2:20am PST
Diffusion Meets Flow Matching: Two Sides of the Same Coin
@jxmorris12
2
Thurs Dec 19, 2024 1:02am PST
Educating Silicon
@jxmorris12
1
Wed Dec 18, 2024 2:33pm PST
Quick software tips for new ML researchers
@jxmorris12
3
Tues Dec 17, 2024 8:22pm PST
The Baked Data architectural pattern
@jxmorris12
2
Tues Dec 17, 2024 1:58am PST
What Is Entropix Doing?
@jxmorris12
1
Fri Dec 13, 2024 4:31pm PST
DeltaNet Explained (Part I)
@jxmorris12
1
Thurs Dec 12, 2024 8:15pm PST
How To Change Your Behavior
@jxmorris12
2
Thurs Dec 12, 2024 7:19pm PST
Infini-Gram: Scaling Unbounded N-Gram Language Models to a Trillion Tokens
@jxmorris12
2
Mon Dec 9, 2024 3:40am PST
Making Transformers Do Math
@jxmorris12
1
Mon Nov 18, 2024 8:14pm PST
Sunsethue – Today's Sunset Forecast
@jxmorris12
1
Mon Nov 18, 2024 7:35pm PST
An Unserious Take on Axiomatic Knowledge in the Era of Foundation Models
@jxmorris12
2
Thurs Oct 31, 2024 3:35pm PST
A Meticulous Guide to Advances in Deep Learning Efficiency over the Years
@jxmorris12
1
3
Thurs Oct 10, 2024 2:44pm PST
Clowning in Pennsylvania
@jxmorris12
1
Fri Oct 4, 2024 5:32pm PST
Contextual Document Embeddings
@jxmorris12
1
Sun Aug 11, 2024 4:05pm PST
Experiments in Self-Assembly
@jxmorris12
1
Thurs Jul 4, 2024 3:04pm PST
Matrixmultiplication.xyz
@jxmorris12
1
Mon Jul 1, 2024 10:21pm PST
Not Quite Past – Real Ceramic Tiles Designed by AI
@jxmorris12
1
Tues Jun 25, 2024 9:33pm PST
Data Compression with Arithmetic Coding
@jxmorris12
1
Thurs Jun 13, 2024 12:18am PST
Int4 Decoding GQA CUDA Optimizations for LLM Inference
@jxmorris12
1
Fri Jun 7, 2024 4:33pm PST
Einsum Is Easy and Useful
@jxmorris12
2
Tues Jun 4, 2024 12:54am PST
Optimizing Matrix Multiplication
@jxmorris12
2
Thurs May 30, 2024 1:10pm PST
Kullback-Leibler (KL) Is All You Need
@jxmorris12
2
Tues May 28, 2024 5:25pm PST
How Do Language Models Put Attention Weights over Long Context?
@jxmorris12
2
Tues May 28, 2024 2:53pm PST
Chess Engines: A Zero to One
@jxmorris12
3
Sat May 25, 2024 5:20pm PST
A Personal History of Legion, by Way of Its Papers
@jxmorris12
1
Fri May 24, 2024 1:55pm PST
Bananagrams Is NP-Complete
@jxmorris12
2
Sun May 12, 2024 12:35pm PST
A Better Lesson
@jxmorris12
1
1
Tues Apr 23, 2024 3:12pm PST
Seeking the Productive Life: Some Details of My Personal Infrastructure
@jxmorris12
1
Thurs Apr 11, 2024 3:09pm PST
A Neighborhood with Friends
@jxmorris12
2
Sun Mar 31, 2024 1:12pm PST
How to graduate your PhD when you have no hope
@jxmorris12
22
150
170
Thurs Mar 28, 2024 5:36pm PST
PyTorch Word Embeddings Tutorial
@jxmorris12
1
Sat Mar 16, 2024 7:43pm PST
Integer Tokenization Is Insane (2023)
@jxmorris12
6
8
35
Mon Mar 11, 2024 7:43pm PST
Diffusion models from scratch, from a new theoretical perspective
@jxmorris12
9
40
379
Mon Mar 4, 2024 10:15pm PST
A Refined Similarity-Based Bigram Model
@jxmorris12
1
Thurs Feb 22, 2024 8:23pm PST
Speculative Sampling
@jxmorris12
1
Thurs Feb 22, 2024 6:15pm PST
An Introduction to Optimization: Combinatorial Optimization
@jxmorris12
4
Wed Feb 21, 2024 9:36pm PST
Definite Optimism as Human Capital
@jxmorris12
2
Wed Feb 21, 2024 4:53pm PST
Singular Value Decomposition Part 1: Perspectives on Linear Algebra
@jxmorris12
2
Wed Feb 21, 2024 4:10pm PST
Singular Value Decomposition as Simply as Possible
@jxmorris12
1
Mon Feb 12, 2024 3:19pm PST
Too much efficiency makes everything worse: overfitting and Goodhart's law
@jxmorris12
2
Sat Feb 10, 2024 6:57am PST
Detecting Mismatches in Machine-Learning Systems
@jxmorris12
1