Editing Files at 1000 Tokens per Second
A new model and inference method for high-accuracy full-file edits at 1000 tokens/s.
Editing Files at 1000 Tokens per Second
A new model and inference method for high-accuracy full-file edits at 1000 tokens/s.
Our problems
A list of problems we are excited to solve for Cursor.
Inference characteristics of Llama
A primer on inference math and an examination of the surprising costs of Llama.
Prompt design
Prompting is like web design. Let’s call it prompt design, and build better tools for it.
Editing Files at 1000 Tokens per Second
A new model and inference method for high-accuracy full-file edits at 1000 tokens/s.
Our problems
A list of problems we are excited to solve for Cursor.
Inference characteristics of Llama
A primer on inference math and an examination of the surprising costs of Llama.
Prompt design
Prompting is like web design. Let’s call it prompt design, and build better tools for it.