jxmorris12
Wed Dec 7, 2016 4:53am PST
Karma:
2939
about
personal website: jxmo.io
submitted
Sun Aug 24, 2025 8:09pm PST
How many paths of length K are there between A and B? (2021)
@jxmorris12
4
7
32
Sun Aug 24, 2025 2:38am PST
How A Neuron Learns
@jxmorris12
3
Fri Aug 22, 2025 2:45am PST
GPT, Fast (2023)
@jxmorris12
1
Fri Aug 22, 2025 2:43am PST
GPT-Fast
@jxmorris12
1
Thurs Aug 21, 2025 7:25pm PST
Exploring EXIF (2023)
@jxmorris12
8
11
80
Thurs Aug 21, 2025 7:10pm PST
The Practitioner's Guide to the Maximal Update Parameterization
@jxmorris12
1
Wed Aug 20, 2025 6:52pm PST
The scientific method and its application to the science of deep learning
@jxmorris12
1
Mon Aug 18, 2025 10:36pm PST
Solving Humanity's Last Exam Problems
@jxmorris12
2
Mon Aug 18, 2025 8:56pm PST
Why We Think
@jxmorris12
1
Mon Aug 18, 2025 12:03pm PST
Philosophical Thoughts on Kolmogorov-Arnold Networks (2024)
@jxmorris12
3
5
53
Sat Aug 16, 2025 3:59pm PST
Matmul() using PyTorch's MPs back end is faster than Apple's MLX
@jxmorris12
2
Thurs Aug 14, 2025 7:13pm PST
The Making of Gemini Plays Pokémon
@jxmorris12
1
1
1
Wed Aug 13, 2025 2:18pm PST
Facebook is not worth $33B (2010)
@jxmorris12
1
1
8
Mon Aug 11, 2025 4:34am PST
Comefrom
@jxmorris12
4
Sat Aug 9, 2025 2:56pm PST
Diffusion Language Models Are Super Data Learners
@jxmorris12
1
Fri Aug 8, 2025 2:45pm PST
How to build a router for MOE models
@jxmorris12
2
Thurs Aug 7, 2025 6:20pm PST
The Eponymous Principles of Management – Coase's Ceiling and Floor
@jxmorris12
2
Thurs Aug 7, 2025 2:02pm PST
No One Is Working
@jxmorris12
21
49
58
Thurs Aug 7, 2025 1:10am PST
No One Is Working
@jxmorris12
7
Wed Aug 6, 2025 9:43pm PST
SFT Is Bad RL
@jxmorris12
2
Wed Aug 6, 2025 3:17pm PST
A Simple CPU on the Game of Life (2021)
@jxmorris12
9
13
87
Mon Aug 4, 2025 6:11pm PST
Trends in LLM-Generated Citations on ArXiv
@jxmorris12
2
Sat Aug 2, 2025 5:10pm PST
'AI' just means LLMs now
@jxmorris12
1
1
2
Fri Aug 1, 2025 4:12pm PST
Ada Lovelace and the Analytical Engine
@jxmorris12
5
Thurs Jul 31, 2025 5:17pm PST
How long before superintelligence? (1997)
@jxmorris12
12
59
57
Tues Jul 29, 2025 2:57pm PST
Attention is your scarcest resource (2020)
@jxmorris12
39
189
334
Mon Jul 28, 2025 8:42pm PST
DeltaNet Explained
@jxmorris12
1
Thurs Jul 17, 2025 5:28pm PST
All AI models might be the same
@jxmorris12
33
151
311
Thurs Jul 17, 2025 2:41pm PST
Life Update – On Health
@jxmorris12
1
Wed Jul 16, 2025 6:34pm PST
Asymmetry of Verification and Verifier's Law
@jxmorris12
5
Wed Jul 16, 2025 6:05pm PST
Soviet College Admission – My Dad's Story (1970)
@jxmorris12
3
3
30
Tues Jul 15, 2025 4:16pm PST
H-Net – Inference
@jxmorris12
2
Thurs Jul 10, 2025 8:47pm PST
How to scale RL to 10^26 FLOPs
@jxmorris12
6
6
82
Thurs Jul 10, 2025 8:25pm PST
Britain is cheap, and should learn to love it
@jxmorris12
1
1
2
Wed Jul 9, 2025 10:53pm PST
Database Sharding
@jxmorris12
2
Wed Jul 9, 2025 9:52pm PST
Microdosing Willpower: My Takeaways from Microdosing Ozempic
@jxmorris12
4
Wed Jul 9, 2025 4:59pm PST
The upcoming GPT-3 moment for RL
@jxmorris12
29
97
232
Tues Jul 8, 2025 7:12pm PST
The Tradeoffs of SSMs and Transformers
@jxmorris12
2
8
69
Tues Jul 8, 2025 3:15pm PST
Things you can do –with uv
@jxmorris12
3
Mon Jul 7, 2025 3:23pm PST
The era of exploration
@jxmorris12
4
11
106
Thurs Jul 3, 2025 8:31pm PST
Just Ask for Generalization (2021)
@jxmorris12
3
4
38
Thurs Jul 3, 2025 6:00pm PST
Will Scaling Solve Robotics?
@jxmorris12
5
10
15
Wed Jul 2, 2025 5:16pm PST
VLLM: Easy, Fast, and Cheap LLM Serving with PagedAttention
@jxmorris12
3
5
20
Sun Jun 29, 2025 2:48pm PST
LLM Memory
@jxmorris12
5
6
33
Wed Jun 25, 2025 5:08pm PST
What Problems to Solve (1966)
@jxmorris12
29
61
493
Wed Jun 25, 2025 3:47pm PST
Test-Time Training
@jxmorris12
1
Wed Jun 25, 2025 12:13am PST
Thnickels
@jxmorris12
31
125
569
Tues Jun 24, 2025 4:52pm PST
SFStreets: History of San Francisco place names
@jxmorris12
8
25
56
Tues Jun 24, 2025 1:46am PST
Muon Doesn't Clearly Grok Faster
@jxmorris12
1
Mon Jun 23, 2025 1:06pm PST
René Girard and Mimetic Theory for Non-Philosophers
@jxmorris12
1
Sat Jun 21, 2025 9:14pm PST
Becoming a Better Programmer by Tightening Feedback Loops
@jxmorris12
2
Fri Jun 20, 2025 1:56pm PST
Approximating Language Model Training Data from Weights
@jxmorris12
2
Fri Jun 20, 2025 12:23pm PST
Street photos of every building in New York in 1939/1940
@jxmorris12
3
Wed Jun 18, 2025 6:59pm PST
The Launch of GPT-4
@jxmorris12
3
Wed Jun 18, 2025 2:48pm PST
Superintelligence, from First Principles
@jxmorris12
2
Tues Jun 17, 2025 2:55pm PST
How to Shuffle a Big Dataset
@jxmorris12
1
Sun Jun 15, 2025 12:56am PST
Q-learning is not yet scalable
@jxmorris12
20
48
220
Thurs Jun 12, 2025 11:02pm PST
Why are neural networks and cryptographic ciphers so similar?
@jxmorris12
3
Thurs Jun 12, 2025 9:32pm PST
So You Want to Work in Mechanistic Interpretability?
@jxmorris12
2
Wed Jun 11, 2025 1:54pm PST
Bowling Alone: America's Declining Social Capital [pdf] (1995)
@jxmorris12
3
3
13
Wed Jun 11, 2025 1:46pm PST
Introduction to Parallel Programming with CUDA
@jxmorris12
2
Tues Jun 10, 2025 2:49pm PST
Machine Learning of Sets
@jxmorris12
1
Tues Jun 10, 2025 1:52pm PST
The Different Components of Intelligence
@jxmorris12
3
Mon Jun 9, 2025 5:22pm PST
Cutlass Tutorial: Sub-Byte GEMM on Nvidia Blackwell GPUs
@jxmorris12
2
Mon Jun 9, 2025 2:59pm PST
The case for more ambition (in AI research)
@jxmorris12
2
Sat Jun 7, 2025 7:06pm PST
Why is AI hard and Physics simple?
@jxmorris12
1
Sat Jun 7, 2025 4:11pm PST
Generative Modelling in Latent Space
@jxmorris12
2
Sat Jun 7, 2025 2:18pm PST
Understanding the Neural Tangent Kernel
@jxmorris12
1
Fri Jun 6, 2025 7:39pm PST
Specification Engineering is a bet on better code gen and more complexity
@jxmorris12
4
Fri Jun 6, 2025 7:32pm PST
Efficient Streaming Language Models with Attention Sinks
@jxmorris12
5
Fri Jun 6, 2025 2:27pm PST
PhDs for Entrepreneurs
@jxmorris12
1
Fri Jun 6, 2025 2:14pm PST
The WeightWatcher tool for predicting the accuracy of Deep Neural Networks
@jxmorris12
2
Thurs Jun 5, 2025 6:20pm PST
Margins of My Dissertation: Life Lessons That My PhD Taught Me
@jxmorris12
2
Thurs Jun 5, 2025 2:54pm PST
Test-Time Training
@jxmorris12
1
Sun Jun 1, 2025 3:06pm PST
LLM Visualization
@jxmorris12
3
Sat May 31, 2025 10:07pm PST
ML models don't need that much data to be better than you
@jxmorris12
2
Fri May 30, 2025 12:09am PST
Ranking Foods by Protein Efficiency
@jxmorris12
1
3
Thurs May 29, 2025 2:10pm PST
What I learned about running a betting market game night contest
@jxmorris12
1
Thurs May 29, 2025 2:40am PST
'To Easy LoL' – New Orleans jail break may have been inside job
@jxmorris12
1
2
Wed May 28, 2025 12:50am PST
Softmax – research that inspires us
@jxmorris12
1
1
Tues May 27, 2025 7:51pm PST
Neural Network Checklist
@jxmorris12
1
Mon May 26, 2025 8:42pm PST
Highly Opinionated Advice on How to Write ML Papers
@jxmorris12
2
Sun May 25, 2025 8:58pm PST
You could have invented Transformers
@jxmorris12
34
Thurs May 22, 2025 8:40pm PST
The Annotated Kolmogorov-Arnold Network (Kan)
@jxmorris12
1
2
36
Thurs May 22, 2025 8:01pm PST
What Does Any of This Have to Do with Physics?
@jxmorris12
1
1
4
Thurs May 22, 2025 7:54pm PST
What I'm thinking about these days
@jxmorris12
1
Thurs May 22, 2025 6:25pm PST
How to cheat at settlers by loading the dice (2017)
@jxmorris12
20
116
149
Thurs May 22, 2025 2:12pm PST
Neural Vector Embeddings
@jxmorris12
1
1