hckrnws
back
che_shr_cat
Mon Mar 21, 2016 11:01am PST
Karma:
697
submitted
Sun Aug 24, 2025 2:41pm PST
DeepConf: Scaling LLM reasoning with confidence, not just compute
@che_shr_cat
11
34
98
Fri Aug 22, 2025 2:39pm PST
V-JEPA 2: Scaling V-JEPA
@che_shr_cat
2
Sun Aug 17, 2025 1:11pm PST
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
@che_shr_cat
3
Sat Aug 16, 2025 4:59pm PST
Tversky Neural Networks
@che_shr_cat
5
12
131
Tues Aug 12, 2025 12:41am PST
Einstein Fields: A Neural Perspective to Computational General Relativity
@che_shr_cat
2
Fri Aug 8, 2025 9:50pm PST
Tversky Neural Networks: Psychologically Plausible Deep Learning With
@che_shr_cat
2
Thurs Jul 31, 2025 10:57am PST
GEPA: Reflective prompt evolution can outperform reinforcement learning
@che_shr_cat
7
25
92
Tues Jul 29, 2025 3:33pm PST
Subliminal Learning: Language models transmit behavioral traits via hidden
@che_shr_cat
2
Tues Jul 29, 2025 10:50am PST
AlphaGo Moment for Model Architecture Discovery
@che_shr_cat
1
Mon Jul 28, 2025 8:03pm PST
Paper FOMO and ICML 2025 Outstanding Papers
@che_shr_cat
1
Tues Jul 8, 2025 1:16pm PST
Early Signs of Steganographic Capabilities in Frontier LLMs
@che_shr_cat
2
Wed Jun 18, 2025 1:26pm PST
Musicality in Animals
@che_shr_cat
1
Thurs Jun 12, 2025 8:32pm PST
Text-to-LoRA Enables On-the-Fly Model Adaptation
@che_shr_cat
3
Wed Jun 4, 2025 1:00pm PST
Quantum computing and artificial intelligence: status and perspectives
@che_shr_cat
1
Mon Jun 2, 2025 12:57pm PST
The Most Misunderstood Feature of the Sound
@che_shr_cat
2
Sun Jun 1, 2025 4:02pm PST
Darwin Gödel Machine
@che_shr_cat
1
Tues May 27, 2025 12:40pm PST
Are Deeper LLMs Smarter, or Just Longer?
@che_shr_cat
3
Mon Apr 28, 2025 12:40pm PST
Muon Optimizer Accelerates Grokking
@che_shr_cat
8
Thurs Apr 24, 2025 9:43pm PST
ThoughtTerminator
@che_shr_cat
2
Tues Apr 22, 2025 5:03pm PST
Chain of Continuous Thought (Coconut)
@che_shr_cat
3
Fri Apr 4, 2025 4:53pm PST
Intuitive Physics Emergence in V-JEPA
@che_shr_cat
1
Thurs Apr 3, 2025 1:55pm PST
Sound physics And basics of sound perception
@che_shr_cat
2
Thurs Dec 26, 2024 12:18am PST
BLT: Byte Latent Transformer
@che_shr_cat
4
Fri Nov 29, 2024 3:00pm PST
A Single 'Super Weight' Can Break Your Billion-Parameter Model
@che_shr_cat
2
Tues Nov 26, 2024 2:37pm PST
Jax Things to Watch for in 2025
@che_shr_cat
1
Sat Nov 9, 2024 10:51pm PST
Diffusion models are evolutionary algorithms
@che_shr_cat
5
27
126
Wed Nov 6, 2024 11:19pm PST
Make Softmax Great Again
@che_shr_cat
2
Tues Nov 5, 2024 6:01pm PST
Deep Learning Frameworks: The Fourth Pillar of Deep Learning Revolution
@che_shr_cat
1
Wed Jun 26, 2024 8:40pm PST
TextGrad: Automatic "Differentiation" via Text
@che_shr_cat
3
Mon Jun 24, 2024 10:30pm PST
Superconducting Supercomputers
@che_shr_cat
1
Sun Jun 2, 2024 1:26pm PST
Decoder-decoder architecture is coming
@che_shr_cat
2
Sun Apr 28, 2024 6:52pm PST
Chronos: Using Pretrained LLMs for Probabilistic Time Series Forecasting
@che_shr_cat
2
Thurs Feb 29, 2024 5:59pm PST
Big Post About Big Context
@che_shr_cat
3
19
49
Mon Feb 26, 2024 12:47pm PST
Neural Network Diffusion
@che_shr_cat
1
Thurs Feb 8, 2024 10:53am PST
Thermodynamic AI is getting hotter
@che_shr_cat
2
5
51
Tues Jan 16, 2024 1:08pm PST
Training LLMs with AMD GPUs on Frontier Supercomputer
@che_shr_cat
1
Mon Jan 8, 2024 10:03pm PST
Beyond Chinchilla-Optimal Accounting for Inference in Language Model Scaling Law
@che_shr_cat
1
Sun Dec 17, 2023 7:36pm PST
Project CETI
@che_shr_cat
2
Wed Dec 13, 2023 1:56pm PST
GonzoML on Mamba and S6 (+previous post on S4)
@che_shr_cat
1
Sat Dec 9, 2023 11:29am PST
Conway's Game of Life Is Omniperiodic
@che_shr_cat
1
1
2
Thurs Dec 7, 2023 5:14pm PST
GonzoML on Gemini
@che_shr_cat
2
Fri Nov 3, 2023 10:28pm PST
Matryoshka Representation Learning
@che_shr_cat
2
Sun Oct 29, 2023 12:43pm PST
Mindstorms in Natural Language-Based Societies of Mind
@che_shr_cat
2
Fri Oct 27, 2023 7:39pm PST
The convolution empire strikes back
@che_shr_cat
6
56
132
Mon Oct 23, 2023 8:16pm PST
Sparse Universal Transformer
@che_shr_cat
3
Tues Oct 17, 2023 12:56pm PST
MemWalker: An alternative way for working with long documents using transformers
@che_shr_cat
1
Fri Oct 13, 2023 9:39pm PST
"Building Machines That Learn and Think Like People", 7 Years Later
@che_shr_cat
8
40
106
Tues Oct 10, 2023 5:42pm PST
Chain-of-Thought → Tree-of-Thought
@che_shr_cat
1
Mon Oct 9, 2023 1:25pm PST
Mortal Computers
@che_shr_cat
1
1
31
Tues Jun 20, 2023 1:40pm PST
Levanter – Legible, Scalable, Reproducible Foundation Models with Jax
@che_shr_cat
1
Fri May 12, 2023 3:57pm PST
LM-3 –- resurrecting the MIT CADR
@che_shr_cat
1
Mon Jun 13, 2022 10:27am PST
The Annotated Diffusion Model
@che_shr_cat
1
Fri May 6, 2022 7:59pm PST
Road to text-guided image generation: DALL·E, CLIP, GLIDE, DALL·E 2 (unCLIP)
@che_shr_cat
2
Sat Jan 15, 2022 1:58pm PST
Self-replicating radiation-shield for deep-space exploration: Radiotrophic fungi
@che_shr_cat
14
83
197
Sat Jan 15, 2022 1:52pm PST
Revisiting ‘Powers of Ten’ – what we’ve learned about the Universe since 1977
@che_shr_cat
3
Mon Dec 27, 2021 11:09am PST
The Future of Artificial Intelligence Is Self-Organizing and Self-Assembling
@che_shr_cat
4
Thurs Dec 9, 2021 11:51am PST
Foundation Models
@che_shr_cat
1
Sat Oct 2, 2021 5:10pm PST
Unsolved ML Safety Problems
@che_shr_cat
2
Thurs Feb 18, 2021 1:08pm PST
Nx (Numerical Elixir) is now publicly available
@che_shr_cat
5
Tues Jan 12, 2021 10:50am PST
Hardware for Deep Learning. Part 4: ASIC
@che_shr_cat
1
Mon Dec 21, 2020 9:03am PST
JAX Ecosystem
@che_shr_cat
2
Mon Dec 7, 2020 2:43pm PST
Using JAX to accelerate our research
@che_shr_cat
5
Sun Nov 22, 2020 10:12pm PST
Game Plan: What AI Can Do for Football, and What Football Can Do for AI
@che_shr_cat
1
Sat Oct 31, 2020 7:42pm PST
NYPD deploys robot dog after woman shot during Brooklyn parking dispute
@che_shr_cat
1
Thurs Oct 22, 2020 3:44pm PST
List of Animals by Number of Neurons
@che_shr_cat
2
Sat Oct 10, 2020 2:42pm PST
Natural Nuclear Fission Reactor
@che_shr_cat
5
8
49
Sun Sep 20, 2020 4:50pm PST
Brain2Word: Decoding Brain Activity for Language Generation
@che_shr_cat
2
Sun Sep 6, 2020 1:18pm PST
Thread: Differentiable Self-organizing Systems (A living collection of articles)
@che_shr_cat
4
Sun Jul 26, 2020 6:48pm PST
How to Detect Graviton?
@che_shr_cat
2
Thurs Jul 23, 2020 10:21am PST
AlgebraNets: A new formalism for building neural architectures
@che_shr_cat
6
Mon Jul 20, 2020 7:31pm PST
Discovering Symbolic Models from Deep Learning with Inductive Biases
@che_shr_cat
4
Fri Jul 10, 2020 9:46pm PST
Logarithmic Pruning Is All You Need
@che_shr_cat
1
Tues Jul 7, 2020 6:31pm PST
Transformer Zoo (a deeper dive) [slides]
@che_shr_cat
1
1
6
Sat Jul 4, 2020 6:42pm PST
Curve Detectors [Inner working of neural networks]
@che_shr_cat
2
Mon Jun 29, 2020 8:56am PST
Transformer Zoo [slides]
@che_shr_cat
2
Sun Jun 28, 2020 12:14pm PST
Discovering Symbolic Models from Deep Learning with Inductive Biases
@che_shr_cat
2
Wed Jun 10, 2020 1:37pm PST
Deep learning of physical laws from scarce data
@che_shr_cat
2
13
36
Tues Jun 2, 2020 12:41pm PST
GPT-3: TL;DR and more
@che_shr_cat
1
Fri May 22, 2020 8:01am PST
(TLDR) Longformer: The Long-Document Transformer
@che_shr_cat
1
Tues May 19, 2020 8:03pm PST
Industry’s First 20nm Space-Grade FPGA for Satellite and Space Applications
@che_shr_cat
5
Sat May 16, 2020 10:44pm PST
FP64, FP32, FP16, BFLOAT16, TF32, and Other Members of the Zoo
@che_shr_cat
8
Thurs May 7, 2020 1:59am PST
New Surface Book 3
@che_shr_cat
1
Wed May 6, 2020 7:22pm PST
AI and Efficiency [May 2020 version]
@che_shr_cat
1
Fri May 1, 2020 2:12pm PST
Using Neural Networks to Find Answers in Tables
@che_shr_cat
1
Fri May 1, 2020 2:08pm PST
Ramanujan Machine: Automatically Generated Conjectures on Fundamental Constants
@che_shr_cat
1
Fri May 1, 2020 1:54pm PST
Explainable Deep Learning: A Field Guide for the Uninitiated
@che_shr_cat
2
Fri May 1, 2020 1:51pm PST
6G White Paper on Edge Intelligence
@che_shr_cat
2
Fri May 1, 2020 1:38pm PST
Memristors: From In-Mem Computing, DL Accel, Spiking NNs, to the Future of Neuro
@che_shr_cat
2
Fri May 1, 2020 1:34pm PST
Fact or Fiction: Verifying Scientific Claims
@che_shr_cat
1
Thurs Apr 30, 2020 10:12pm PST
Mercury language is still alive
@che_shr_cat
5
Thurs Apr 23, 2020 9:16am PST
A Scalable Approach to Reducing Gender Bias in Google Translate
@che_shr_cat
3
Thurs Apr 23, 2020 8:50am PST
Schmidhuber: Critique of Honda Prize for Dr. Hinton
@che_shr_cat
3
Tues Mar 31, 2020 5:34pm PST
Word2vec, ... , X2vec: Towards a Theory of Vector Embeddings of Struct.Data
@che_shr_cat
2
Sat Mar 21, 2020 6:47pm PST
Toward Interpretable ML: Transparent Deep Neural Networks and Beyond
@che_shr_cat
2
Mon Mar 9, 2020 3:00pm PST
Picking Winning Tickets Before Training by Preserving Gradient Flow
@che_shr_cat
2