che_shr_cat

Mon Mar 21, 2016 11:01am PST

Karma:

708

submitted

Sun Dec 7, 2025 8:46pm PST

Embedded Universal Predictive Intelligence: a coherent framework for multi-agent

@che_shr_cat

1

Mon Dec 1, 2025 4:11pm PST

NeurIPS 2025 Best Papers in Comics: From Artificial Hivemind to 1000-Layer RL

@che_shr_cat

3

Mon Nov 24, 2025 10:08pm PST

Visualizing Research: How I Use Gemini 3.0 to Turn Papers into Comics

@che_shr_cat

1

Sat Nov 22, 2025 1:37pm PST

Arc Is a Vision Problem

@che_shr_cat

1

Mon Nov 17, 2025 1:45am PST

AlphaResearch: Accelerating New Algorithm Discovery with Language Models

@che_shr_cat

1

Thurs Nov 13, 2025 6:03pm PST

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

@che_shr_cat

1

1

2

Mon Nov 10, 2025 4:43pm PST

Nested Learning: The Illusion of Deep Learning Architectures

@che_shr_cat

6

Thurs Nov 6, 2025 10:57pm PST

Context Engineering 2.0: The Context of Context Engineering

@che_shr_cat

2

Sun Nov 2, 2025 5:56pm PST

A Practitioner's Guide to Kolmogorov-Arnold Networks

@che_shr_cat

1

Sat Nov 1, 2025 10:53pm PST

Kimi Linear: An Expressive, Efficient Attention Architecture

@che_shr_cat

1

Sat Nov 1, 2025 5:49pm PST

The Principles of Diffusion Models (470-pages)

@che_shr_cat

2

Thurs Oct 23, 2025 6:15pm PST

CaT Replaces CoT-SC / Compute as Teacher: Turning Inference Compute Into

@che_shr_cat

2

Sun Oct 19, 2025 6:41pm PST

Tiny Recursive Model (TRM) vs. Hierarchical Reasoning Model (HRM)

@che_shr_cat

2

Wed Oct 15, 2025 1:33pm PST

Barbarians at the Gate: How AI Is Upending Systems Research

@che_shr_cat

2

Thurs Oct 9, 2025 9:53am PST

Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

@che_shr_cat

2

Sun Oct 5, 2025 9:55am PST

Autoreview: The Dragon Hatchling – The Missing Link Between the Transformer and

@che_shr_cat

2

Sat Oct 4, 2025 10:05pm PST

Stochastic Activations

@che_shr_cat

2

Mon Sep 22, 2025 10:31am PST

LLM-JEPA: Large Language Models Meet Joint Embedding Predictive Architectures

@che_shr_cat

1

Sat Sep 13, 2025 12:14pm PST

Review: SpikingBrain Technical Spiking Brain-Inspired Large Models

@che_shr_cat

2

Fri Sep 12, 2025 3:53pm PST

K2-Think: A Parameter-Efficient Reasoning System

@che_shr_cat

1

1

2

Sun Sep 7, 2025 11:35am PST

Canaries in the Coal Mine? Six Facts about the Recent Employment Effects of AI

@che_shr_cat

1

Wed Sep 3, 2025 11:13am PST

Fantastic Pretraining Optimizers and Where to Find Them

@che_shr_cat

2

Sun Aug 31, 2025 4:35pm PST

Solving the compute crisis with physics-based ASICs

@che_shr_cat

5

Wed Aug 27, 2025 9:38pm PST

Critiques of World Models

@che_shr_cat

2

Sun Aug 24, 2025 2:41pm PST

DeepConf: Scaling LLM reasoning with confidence, not just compute

@che_shr_cat

12

35

98

Fri Aug 22, 2025 2:39pm PST

V-JEPA 2: Scaling V-JEPA

@che_shr_cat

2

Sun Aug 17, 2025 1:11pm PST

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

@che_shr_cat

3

Sat Aug 16, 2025 4:59pm PST

Tversky Neural Networks

@che_shr_cat

5

12

131

Tues Aug 12, 2025 12:41am PST

Einstein Fields: A Neural Perspective to Computational General Relativity

@che_shr_cat

2

Fri Aug 8, 2025 9:50pm PST

Tversky Neural Networks: Psychologically Plausible Deep Learning With

@che_shr_cat

2

Thurs Jul 31, 2025 10:57am PST

GEPA: Reflective prompt evolution can outperform reinforcement learning

@che_shr_cat

7

25

92

Tues Jul 29, 2025 3:33pm PST

Subliminal Learning: Language models transmit behavioral traits via hidden

@che_shr_cat

2

Tues Jul 29, 2025 10:50am PST

AlphaGo Moment for Model Architecture Discovery

@che_shr_cat

1

Mon Jul 28, 2025 8:03pm PST

Paper FOMO and ICML 2025 Outstanding Papers

@che_shr_cat

1

Tues Jul 8, 2025 1:16pm PST

Early Signs of Steganographic Capabilities in Frontier LLMs

@che_shr_cat

2

Wed Jun 18, 2025 1:26pm PST

Musicality in Animals

@che_shr_cat

1

Thurs Jun 12, 2025 8:32pm PST

Text-to-LoRA Enables On-the-Fly Model Adaptation

@che_shr_cat

3

Wed Jun 4, 2025 1:00pm PST

Quantum computing and artificial intelligence: status and perspectives

@che_shr_cat

1

Mon Jun 2, 2025 12:57pm PST

The Most Misunderstood Feature of the Sound

@che_shr_cat

2

Sun Jun 1, 2025 4:02pm PST

Darwin Gödel Machine

@che_shr_cat

1

Tues May 27, 2025 12:40pm PST

Are Deeper LLMs Smarter, or Just Longer?

@che_shr_cat

3

Mon Apr 28, 2025 12:40pm PST

Muon Optimizer Accelerates Grokking

@che_shr_cat

8

Thurs Apr 24, 2025 9:43pm PST

ThoughtTerminator

@che_shr_cat

2

Tues Apr 22, 2025 5:03pm PST

Chain of Continuous Thought (Coconut)

@che_shr_cat

3

Fri Apr 4, 2025 4:53pm PST

Intuitive Physics Emergence in V-JEPA

@che_shr_cat

1

Thurs Apr 3, 2025 1:55pm PST

Sound physics And basics of sound perception

@che_shr_cat

2

Thurs Dec 26, 2024 12:18am PST

BLT: Byte Latent Transformer

@che_shr_cat

4

Fri Nov 29, 2024 3:00pm PST

A Single 'Super Weight' Can Break Your Billion-Parameter Model

@che_shr_cat

2

Tues Nov 26, 2024 2:37pm PST

Jax Things to Watch for in 2025

@che_shr_cat

1

Sat Nov 9, 2024 10:51pm PST

Diffusion models are evolutionary algorithms

@che_shr_cat

5

27

126

Wed Nov 6, 2024 11:19pm PST

Make Softmax Great Again

@che_shr_cat

2

Tues Nov 5, 2024 6:01pm PST

Deep Learning Frameworks: The Fourth Pillar of Deep Learning Revolution

@che_shr_cat

1

Wed Jun 26, 2024 8:40pm PST

TextGrad: Automatic "Differentiation" via Text

@che_shr_cat

3

Mon Jun 24, 2024 10:30pm PST

Superconducting Supercomputers

@che_shr_cat

1

Sun Jun 2, 2024 1:26pm PST

Decoder-decoder architecture is coming

@che_shr_cat

2

Sun Apr 28, 2024 6:52pm PST

Chronos: Using Pretrained LLMs for Probabilistic Time Series Forecasting

@che_shr_cat

2

Thurs Feb 29, 2024 5:59pm PST

Big Post About Big Context

@che_shr_cat

3

19

49

Mon Feb 26, 2024 12:47pm PST

Neural Network Diffusion

@che_shr_cat

1

Thurs Feb 8, 2024 10:53am PST

Thermodynamic AI is getting hotter

@che_shr_cat

2

5

51

Tues Jan 16, 2024 1:08pm PST

Training LLMs with AMD GPUs on Frontier Supercomputer

@che_shr_cat

1

Mon Jan 8, 2024 10:03pm PST

Beyond Chinchilla-Optimal Accounting for Inference in Language Model Scaling Law

@che_shr_cat

1

Sun Dec 17, 2023 7:36pm PST

Project CETI

@che_shr_cat

2

Wed Dec 13, 2023 1:56pm PST

GonzoML on Mamba and S6 (+previous post on S4)

@che_shr_cat

1

Sat Dec 9, 2023 11:29am PST

Conway's Game of Life Is Omniperiodic

@che_shr_cat

1

1

2

Thurs Dec 7, 2023 5:14pm PST

GonzoML on Gemini

@che_shr_cat

2

Fri Nov 3, 2023 10:28pm PST

Matryoshka Representation Learning

@che_shr_cat

2

Sun Oct 29, 2023 12:43pm PST

Mindstorms in Natural Language-Based Societies of Mind

@che_shr_cat

2

Fri Oct 27, 2023 7:39pm PST

The convolution empire strikes back

@che_shr_cat

6

56

132

Mon Oct 23, 2023 8:16pm PST

Sparse Universal Transformer

@che_shr_cat

3

Tues Oct 17, 2023 12:56pm PST

MemWalker: An alternative way for working with long documents using transformers

@che_shr_cat

1

Fri Oct 13, 2023 9:39pm PST

"Building Machines That Learn and Think Like People", 7 Years Later

@che_shr_cat

8

40

106

Tues Oct 10, 2023 5:42pm PST

Chain-of-Thought → Tree-of-Thought

@che_shr_cat

1

Mon Oct 9, 2023 1:25pm PST

Mortal Computers

@che_shr_cat

1

1

31

Tues Jun 20, 2023 1:40pm PST

Levanter – Legible, Scalable, Reproducible Foundation Models with Jax

@che_shr_cat

1

Fri May 12, 2023 3:57pm PST

LM-3 –- resurrecting the MIT CADR

@che_shr_cat

1

Mon Jun 13, 2022 10:27am PST

The Annotated Diffusion Model

@che_shr_cat

1

Fri May 6, 2022 7:59pm PST

Road to text-guided image generation: DALL·E, CLIP, GLIDE, DALL·E 2 (unCLIP)

@che_shr_cat

2

Sat Jan 15, 2022 1:58pm PST

Self-replicating radiation-shield for deep-space exploration: Radiotrophic fungi

@che_shr_cat

14

83

197

Sat Jan 15, 2022 1:52pm PST

Revisiting ‘Powers of Ten’ – what we’ve learned about the Universe since 1977

@che_shr_cat

3

Mon Dec 27, 2021 11:09am PST

The Future of Artificial Intelligence Is Self-Organizing and Self-Assembling

@che_shr_cat

4

Thurs Dec 9, 2021 11:51am PST

Foundation Models

@che_shr_cat

1

Sat Oct 2, 2021 5:10pm PST

Unsolved ML Safety Problems

@che_shr_cat

2

Thurs Feb 18, 2021 1:08pm PST

Nx (Numerical Elixir) is now publicly available

@che_shr_cat

5

Tues Jan 12, 2021 10:50am PST

Hardware for Deep Learning. Part 4: ASIC

@che_shr_cat

1

Mon Dec 21, 2020 9:03am PST

JAX Ecosystem

@che_shr_cat

2

Mon Dec 7, 2020 2:43pm PST

Using JAX to accelerate our research

@che_shr_cat

5

Sun Nov 22, 2020 10:12pm PST

Game Plan: What AI Can Do for Football, and What Football Can Do for AI

@che_shr_cat

1

Sat Oct 31, 2020 7:42pm PST

NYPD deploys robot dog after woman shot during Brooklyn parking dispute

@che_shr_cat

1

Thurs Oct 22, 2020 3:44pm PST

List of Animals by Number of Neurons

@che_shr_cat

2

Sat Oct 10, 2020 2:42pm PST

Natural Nuclear Fission Reactor

@che_shr_cat

5

8

49