DEV Community
Building an AI agent that works in production Series' Articles
Why finding where a product is made is an AI problem
ThomasP
Mar 17
#ai #machinelearning #webdev #beginners
9 min read
The prompt engineering that didn't work (and what did)
ThomasP
Mar 23
#ai #llm #promptengineering #machinelearning
9 min read
Why your LLM agent needs a benchmark before it needs a prompt
ThomasP
Mar 27
#ai #llm #agents #testing
8 min read
GPT-5.1 scored 26%. Gemini 3 Flash scored 74%. Same prompt, same tools.
ThomasP
Mar 28
#ai #llm #benchmark #agents
8 min read
LLM-as-Judge: using Claude to review a Gemini agent
ThomasP
Apr 8
#ai #llm #agents #evaluation
7 min read