Antoine Déchappe
About me
Latest research review
Layer Normalization
Jul 20, 2025
Recent writing
Mocking UUIDs in Python Tests with a Generator
Jul 28, 2025
Beware of poetry package named differently from project structure
Mar 27, 2025
Pre-commit options in pyproject.toml should be committed first
Mar 12, 2025
Antoine Déchappe
Blog
Research review
Projects
Bookmarks
Home
❯
Research review
❯
LLMs
❯
Benchmarks
Search
Search
Research review
LLMs
Benchmarks
τ -bench Benchmark for Tool-Agent-User Interaction in Real-World Domains