t55
Fri Aug 18, 2023 8:35am PST
Karma:
730
about
ML researcher working on https://github.com/PySpur-Dev/pyspur
submitted
Thurs Feb 20, 2025 10:19pm PST
Introduction to CUDA programming for Python developers
@t55
16
90
356
Wed Feb 12, 2025 7:57pm PST
Novelty Left on the Table
@t55
2
Wed Feb 12, 2025 4:22am PST
Competitive Programming with Large Reasoning Models
@t55
1
1
16
Sun Feb 9, 2025 8:00pm PST
The Differences Between Direct Alignment Algorithms Are a Blur
@t55
8
Sat Feb 8, 2025 5:18pm PST
The Octalysis Framework for Gamification and Behavioral Design
@t55
3
Mon Feb 3, 2025 5:56pm PST
S1: Simple Test-Time Scaling
@t55
3
3
40
Mon Feb 3, 2025 2:44pm PST
A Malloc Tutorial [pdf]
@t55
1
1
Sun Feb 2, 2025 5:20pm PST
Reinforcement Learning: An Overview
@t55
7
12
82
Sun Feb 2, 2025 3:59pm PST
What automated firms will look like
@t55
1
2
Sat Feb 1, 2025 3:41pm PST
Large Language Models for Mathematicians (2023)
@t55
10
28
89
Sat Feb 1, 2025 3:40pm PST
Mathematics for Machine Learning
@t55
1
1
1
Fri Jan 31, 2025 10:48pm PST
Propositional Interpretability in Artificial Intelligence
@t55
3
Fri Jan 31, 2025 6:47pm PST
The Tensor Cookbook (2024)
@t55
9
37
199
Fri Jan 31, 2025 6:47pm PST
ArXiv LaTeX Cleaner: Clean the LaTeX code of your paper to submit to ArXiv
@t55
11
42
103
Fri Jan 31, 2025 5:02pm PST
Tesla Unveils Autonomous Cleaning Robot for Robotaxi
@t55
2
1
2
Fri Jan 31, 2025 4:56pm PST
O3-Mini vs. DeepSeek-R1: Which One Is Safer?
@t55
1
1
Thurs Jan 30, 2025 11:46pm PST
Qwen Chat – Another Chinese ChatGPT Rival
@t55
1
4
Thurs Jan 30, 2025 7:10pm PST
Systemic Existential Risks from Incremental AI Development
@t55
6
Thurs Jan 30, 2025 6:41pm PST
The risk from prompt injection attacks on AI systems
@t55
1
Thurs Jan 30, 2025 6:11pm PST
The Desire to Be Liked Is Rotting Your Brain
@t55
1
2
Thurs Jan 30, 2025 6:10pm PST
Thou Shalt Not Overfit
@t55
1
Thurs Jan 30, 2025 6:05pm PST
Obsidian's Web viewer lets you open external links within Obsidian
@t55
1
2
Thurs Jan 30, 2025 5:57pm PST
Quaternions and spherical trigonometry
@t55
7
48
124
Thurs Jan 30, 2025 5:55pm PST
Qwen2.5-VL: State-of-the-art multimodal LLM
@t55
1
3
Thurs Jan 30, 2025 5:53pm PST
Goose – an open-source, extensible AI agent that goes beyond code suggestions
@t55
2
5
40
Thurs Jan 30, 2025 5:49pm PST
I don't believe DeepSeek crashed Nvidia's stock
@t55
2
1
3
Wed Jan 29, 2025 4:38pm PST
Large Language Model Training Using FP4 Quantization
@t55
2
Wed Jan 29, 2025 4:34pm PST
Supervised Fine-Tuning Memorizes, RL Generalizes
@t55
1
Tues Jan 28, 2025 10:11pm PST
DeepSeek's multi-head latent attention and other KV cache tricks
@t55
15
72
292
Mon Jan 13, 2025 3:27pm PST
VideoRAG: Retrieval-Augmented Generation over Video Corpus
@t55
4
Fri Jan 3, 2025 6:52pm PST
Cryptoscammers Impersonated and Hacked Us – Now What?
@t55
2
2
7
Sun Dec 22, 2024 5:30pm PST
Comparing Llama 3.2 vs. Gemma 2 vs. Mistral on philosophical questions
@t55
1
1
Mon Dec 16, 2024 6:20pm PST
Show HN: Graph-Based Editor for LLM Workflows
@t55
3
5
8
Thurs Dec 12, 2024 6:04pm PST
ChatGPT's Advanced Voice Mode adds Santa Mode, Live Video, Screensharing
@t55
7
7
43
Thurs Dec 12, 2024 5:11pm PST
Visual Autoregressive Modeling: Image Generation via Next-Scale Prediction
@t55
2