t55
Fri Aug 18, 2023 8:35am PST
Karma:
897
about
ML researcher
submitted
Mon Jun 2, 2025 9:27am PST
ReasoningGym: Reasoning Environments for RL with Verifiable Rewards
@t55
9
28
105
Fri May 23, 2025 12:42pm PST
Show HN: Rehearsal.so, Duolingo for Public Speaking
@t55
1
1
3
Fri May 16, 2025 5:03pm PST
End-to-End Vision Tokenizer Tuning
@t55
3
Fri May 16, 2025 5:03pm PST
YC Interview Mock Practice
@t55
2
Thurs May 8, 2025 11:30pm PST
D1: Scaling Reasoning in Diffusion LLMs via Reinforcement Learning
@t55
4
Thurs May 8, 2025 11:29pm PST
Are LLMs more than autocomplete? AI Debate
@t55
1
Thurs May 8, 2025 6:18pm PST
Block Diffusion: Interpolating Autoregressive and Diffusion Language Models
@t55
4
16
72
Thurs May 8, 2025 6:15pm PST
How to stay in flow while using Cursor or Windsurf
@t55
2
Thurs May 8, 2025 5:37pm PST
Generative Modelling in Latent Space
@t55
2
Tues May 6, 2025 6:50pm PST
Show HN: Debate Uncle Bob – Is SQL Dead? (Voice RPG)
@t55
1
1
6
Wed Apr 16, 2025 5:03pm PST
OpenAI O3 and O4-Mini
@t55
1
1
Thurs Apr 10, 2025 5:27pm PST
Memory in ChatGPT
@t55
1
10
Sat Mar 8, 2025 12:57am PST
Superintelligence startup Reflection AI launches with $130M in funding
@t55
5
26
38
Thurs Mar 6, 2025 6:19pm PST
Intro to DeepSeek's open-source week and why it's a big deal
@t55
9
13
24
Thurs Feb 20, 2025 10:19pm PST
Introduction to CUDA programming for Python developers
@t55
16
95
365
Wed Feb 12, 2025 7:57pm PST
Novelty Left on the Table
@t55
2
Wed Feb 12, 2025 4:22am PST
Competitive Programming with Large Reasoning Models
@t55
1
1
16
Sun Feb 9, 2025 8:00pm PST
The Differences Between Direct Alignment Algorithms Are a Blur
@t55
8
Sat Feb 8, 2025 5:18pm PST
The Octalysis Framework for Gamification and Behavioral Design
@t55
3
Mon Feb 3, 2025 5:56pm PST
S1: Simple Test-Time Scaling
@t55
3
3
40
Mon Feb 3, 2025 2:44pm PST
A Malloc Tutorial [pdf]
@t55
1
1
Sun Feb 2, 2025 5:20pm PST
Reinforcement Learning: An Overview
@t55
7
12
82
Sun Feb 2, 2025 3:59pm PST
What automated firms will look like
@t55
1
2
Sat Feb 1, 2025 3:41pm PST
Large Language Models for Mathematicians (2023)
@t55
10
28
89
Sat Feb 1, 2025 3:40pm PST
Mathematics for Machine Learning
@t55
1
1
1
Fri Jan 31, 2025 10:48pm PST
Propositional Interpretability in Artificial Intelligence
@t55
3
Fri Jan 31, 2025 6:47pm PST
The Tensor Cookbook (2024)
@t55
9
37
199
Fri Jan 31, 2025 6:47pm PST
ArXiv LaTeX Cleaner: Clean the LaTeX code of your paper to submit to ArXiv
@t55
11
42
103
Fri Jan 31, 2025 5:02pm PST
Tesla Unveils Autonomous Cleaning Robot for Robotaxi
@t55
2
1
2
Fri Jan 31, 2025 4:56pm PST
O3-Mini vs. DeepSeek-R1: Which One Is Safer?
@t55
1
1
Thurs Jan 30, 2025 11:46pm PST
Qwen Chat – Another Chinese ChatGPT Rival
@t55
1
4
Thurs Jan 30, 2025 7:10pm PST
Systemic Existential Risks from Incremental AI Development
@t55
6
Thurs Jan 30, 2025 6:41pm PST
The risk from prompt injection attacks on AI systems
@t55
1
Thurs Jan 30, 2025 6:11pm PST
The Desire to Be Liked Is Rotting Your Brain
@t55
1
2
Thurs Jan 30, 2025 6:10pm PST
Thou Shalt Not Overfit
@t55
1
Thurs Jan 30, 2025 6:05pm PST
Obsidian's Web viewer lets you open external links within Obsidian
@t55
1
2
Thurs Jan 30, 2025 5:57pm PST
Quaternions and spherical trigonometry
@t55
7
48
124
Thurs Jan 30, 2025 5:55pm PST
Qwen2.5-VL: State-of-the-art multimodal LLM
@t55
1
3
Thurs Jan 30, 2025 5:53pm PST
Goose – an open-source, extensible AI agent that goes beyond code suggestions
@t55
2
5
40
Thurs Jan 30, 2025 5:49pm PST
I don't believe DeepSeek crashed Nvidia's stock
@t55
2
1
3
Wed Jan 29, 2025 4:38pm PST
Large Language Model Training Using FP4 Quantization
@t55
2
Wed Jan 29, 2025 4:34pm PST
Supervised Fine-Tuning Memorizes, RL Generalizes
@t55
1