3 months ago

Fri Nov 14, 2025 11:11pm PST

Show HN: CodeMode – First library for tool calls via code execution

I’ve been testing a simple idea inspired by Apple/Cloudflare/Anthropic:

LLMs are far better at writing a small program than coordinating multiple tool calls.

So instead of giving the model 10+ tools, I exposed one: a TypeScript sandbox with access to the same interfaces.

The model writes a script → it runs once → done.

What changed - +68% reduction in token use - No multi-step drift or retries - Local models (Llama 3.1 8B / Phi-3) became much more reliable

Curious whether others have seen the same thing: Is code execution a better abstraction for agents than tool calls?

comments:

loading comments...