DEV Community

BAOFUFAN profile picture

BAOFUFAN

404 bio not found

Joined Joined on 
How I Slashed Context Loss from 30% to 0% with Automated LangChain Memory Tests

How I Slashed Context Loss from 30% to 0% with Automated LangChain Memory Tests

1
Comments
3 min read
From 100 Logins to 1: Cut E2E Test Time by 78%

From 100 Logins to 1: Cut E2E Test Time by 78%

Comments
4 min read
Pitfalls of Testing LLM Long-Term Memory: A 3‑Day Debugging Saga

Pitfalls of Testing LLM Long-Term Memory: A 3‑Day Debugging Saga

Comments
4 min read
From JSON to Pinecone: 90% Accuracy Boost for AI Long-Conversation Memory

From JSON to Pinecone: 90% Accuracy Boost for AI Long-Conversation Memory

Comments
3 min read
Bringing LLM Memory Regression Tests from 30 Minutes Down to 90 Seconds with pytest + Redis

Bringing LLM Memory Regression Tests from 30 Minutes Down to 90 Seconds with pytest + Redis

Comments
4 min read
Automating Agent Memory Regression with pytest & Vector DB: 5x Defect Discovery Speedup

Automating Agent Memory Regression with pytest & Vector DB: 5x Defect Discovery Speedup

1
Comments
3 min read
LangChain Memory Pitfall: A Concurrency Race Condition That Cost Me 6 Hours

LangChain Memory Pitfall: A Concurrency Race Condition That Cost Me 6 Hours

Comments
3 min read
We Automated LLM Memory Tests and Got 8x Efficiency

We Automated LLM Memory Tests and Got 8x Efficiency

Comments
3 min read
Pytest + Redis: My 3‑Hour Battle with a Data Inconsistency Bug

Pytest + Redis: My 3‑Hour Battle with a Data Inconsistency Bug

Comments
4 min read
LangChain + Chroma: Multi-turn RAG Memory and Automated Testing That Turned 2-Hour Bugs Into 5-Minute Fixes

LangChain + Chroma: Multi-turn RAG Memory and Automated Testing That Turned 2-Hour Bugs Into 5-Minute Fixes

Comments
4 min read
How Automating RAG Memory Tests with ChromaDB Quadrupled Our Bug Discovery Rate

How Automating RAG Memory Tests with ChromaDB Quadrupled Our Bug Discovery Rate

Comments
3 min read
From Mock to Real Redis: Cutting Agent Memory Test Leakage from 30% to 0

From Mock to Real Redis: Cutting Agent Memory Test Leakage from 30% to 0

Comments
7 min read
From Database Locks to Redis+Lua: 100x QPS Boost in Coupon Flash Sales

From Database Locks to Redis+Lua: 100x QPS Boost in Coupon Flash Sales

Comments
3 min read
A Single `&&` Cost Me 4 Hours — and Dropped Frontend Incidents to Zero

A Single `&&` Cost Me 4 Hours — and Dropped Frontend Incidents to Zero

1
Comments
4 min read
I Spent 48 Hours Debugging Multi-Turn LLM Memory Loss—Then Playwright + Pytest Locked It Down

I Spent 48 Hours Debugging Multi-Turn LLM Memory Loss—Then Playwright + Pytest Locked It Down

Comments
3 min read
Redis Persistence Pitfall: How RDB+AOF Hybrid Persistence Silently Lost Data — I Reproduced 30 Failure Scenarios with pytest + Docker

Redis Persistence Pitfall: How RDB+AOF Hybrid Persistence Silently Lost Data — I Reproduced 30 Failure Scenarios with pytest + Docker

Comments
4 min read
LangChain Memory Pitfalls: How Broken Context Cost Me 6 Hours at 1 AM

LangChain Memory Pitfalls: How Broken Context Cost Me 6 Hours at 1 AM

Comments
3 min read
From Manual Logging to Pytest+Mem0: Slash AI Memory Bugs by 90%

From Manual Logging to Pytest+Mem0: Slash AI Memory Bugs by 90%

Comments
4 min read
Moving Conversation Memory from RAM to Redis: Boosting Long-Term Accuracy from 63% to 98%

Moving Conversation Memory from RAM to Redis: Boosting Long-Term Accuracy from 63% to 98%

Comments
5 min read
From Manual Checks to Pytest + Vector DB: 10x Faster AI Agent Memory Testing

From Manual Checks to Pytest + Vector DB: 10x Faster AI Agent Memory Testing

Comments
6 min read
50 pytest Tests That Caught Redis Cache Bugs Before Production — 90% Fewer Incidents

50 pytest Tests That Caught Redis Cache Bugs Before Production — 90% Fewer Incidents

Comments
4 min read
How I Built a Private Knowledge Base with LangChain + FastAPI — and the 3 Pitfalls That Cost Me 8 Hours

How I Built a Private Knowledge Base with LangChain + FastAPI — and the 3 Pitfalls That Cost Me 8 Hours

Comments 1
3 min read
AI Chat Memory Pitfalls: 30% of Conversations Lost on Refresh

AI Chat Memory Pitfalls: 30% of Conversations Lost on Refresh

Comments
3 min read
How Moving Rate Limiting to Redis+Go 8x'd Our API Gateway Throughput (And Cost Us 3 Days of Debugging)

How Moving Rate Limiting to Redis+Go 8x'd Our API Gateway Throughput (And Cost Us 3 Days of Debugging)

Comments
3 min read
Stop Guessing Memory: How to Automate LangChain Memory Testing and Catch 80% of Multi-Turn Failures

Stop Guessing Memory: How to Automate LangChain Memory Testing and Catch 80% of Multi-Turn Failures

Comments
5 min read
IndexedDB Automation Testing Pitfalls: 3 Hidden Bugs & 30 Wasted Hours

IndexedDB Automation Testing Pitfalls: 3 Hidden Bugs & 30 Wasted Hours

Comments
3 min read
Debugging AI Agent Memory Loss: A 3-Day Investigation

Debugging AI Agent Memory Loss: A 3-Day Investigation

Comments
3 min read
How I Slashed Our LLM API Token Costs by 90% — From 1M to 100K Daily

How I Slashed Our LLM API Token Costs by 90% — From 1M to 100K Daily

Comments
4 min read
Validating AI Agent Memory with ChromaDB: How a Misaligned Similarity Threshold Cost Me 3 Hours

Validating AI Agent Memory with ChromaDB: How a Misaligned Similarity Threshold Cost Me 3 Hours

Comments
4 min read
From 2 Hours to 3 Minutes: Eliminating Missed Tests in AI Memory Consistency Testing

From 2 Hours to 3 Minutes: Eliminating Missed Tests in AI Memory Consistency Testing

Comments
3 min read
Playwright Multi‑Tab IndexedDB Sync: The Browser Context Isolation Trap (6 Hours of Debugging)

Playwright Multi‑Tab IndexedDB Sync: The Browser Context Isolation Trap (6 Hours of Debugging)

Comments
3 min read
From 2-Hour Manual Regression to 4-Minute Playwright Automation for RAG Memory Tests—and 80% Fewer Misses

From 2-Hour Manual Regression to 4-Minute Playwright Automation for RAG Memory Tests—and 80% Fewer Misses

Comments
5 min read
Moving DeepSeek-R1 from Transformers to vLLM: A 14x Throughput Boost

Moving DeepSeek-R1 from Transformers to vLLM: A 14x Throughput Boost

Comments
4 min read
Pytest + Docker: 3 Bugs That Broke My AI Agent's Memory (and Cost Me 8 Hours)

Pytest + Docker: 3 Bugs That Broke My AI Agent's Memory (and Cost Me 8 Hours)

Comments
4 min read
Building RAG with LangChain & Chroma: Two Hidden Pitfalls That Cost Me 6 Hours

Building RAG with LangChain & Chroma: Two Hidden Pitfalls That Cost Me 6 Hours

Comments
4 min read
How I Cut LLM Memory Bug Diagnosis from 2 Hours to 5 Minutes with Playwright & Allure

How I Cut LLM Memory Bug Diagnosis from 2 Hours to 5 Minutes with Playwright & Allure

Comments
3 min read
A 2 AM Serialization Bug in LangChain Memory — And How pytest Stopped It Forever

A 2 AM Serialization Bug in LangChain Memory — And How pytest Stopped It Forever

Comments
4 min read
"How a Refresh Wiped Out 237 Drafts — and How We Used Playwright to Stop It Forever"

"How a Refresh Wiped Out 237 Drafts — and How We Used Playwright to Stop It Forever"

1
Comments
3 min read
I Spent 6 Hours Fixing LangChain's ConversationBufferMemory — Here's the Automated Test You Need

I Spent 6 Hours Fixing LangChain's ConversationBufferMemory — Here's the Automated Test You Need

Comments
3 min read
Scaling Rate Limiting from Single‑Node to a Distributed Go+Redis Token Bucket — 10x Throughput Under Load (with Degradation Strategy)

Scaling Rate Limiting from Single‑Node to a Distributed Go+Redis Token Bucket — 10x Throughput Under Load (with Degradation Strategy)

Comments
4 min read
Slash Multi-Level Cache Debugging Time by 90% with Pytest Parametrization

Slash Multi-Level Cache Debugging Time by 90% with Pytest Parametrization

Comments
3 min read
From 800 Lines of Shell to 30 Lines of Pytest: 10x Redis Persistence Testing Efficiency

From 800 Lines of Shell to 30 Lines of Pytest: 10x Redis Persistence Testing Efficiency

Comments
4 min read
Uncovering 8% IndexedDB Data Loss After Browser Crashes with Playwright

Uncovering 8% IndexedDB Data Loss After Browser Crashes with Playwright

Comments
3 min read
3 Asyncio Pitfalls That Took Me 3 Hours to Debug and Almost Crashed Production

3 Asyncio Pitfalls That Took Me 3 Hours to Debug and Almost Crashed Production

Comments
4 min read
I Rewrote Our Crawler with asyncio and Got a 15x Performance Boost

I Rewrote Our Crawler with asyncio and Got a 15x Performance Boost

Comments
3 min read
asyncio Pitfalls: The Mistake That Cost Me 3 Hours

asyncio Pitfalls: The Mistake That Cost Me 3 Hours

Comments
3 min read
The asyncio Mistake That Cost Me 3 Hours

The asyncio Mistake That Cost Me 3 Hours

Comments
3 min read
3 Hours Wasted on asyncio Pitfalls That Almost Took Down Production

3 Hours Wasted on asyncio Pitfalls That Almost Took Down Production

Comments
4 min read
Is asyncio Really Better Than Multithreading? I Tested 100 Concurrent Requests, and the Difference Is Huge

Is asyncio Really Better Than Multithreading? I Tested 100 Concurrent Requests, and the Difference Is Huge

Comments
3 min read
3-Hour Debugging: How `time.sleep` in Async Functions Killed Our asyncio Concurrency

3-Hour Debugging: How `time.sleep` in Async Functions Killed Our asyncio Concurrency

Comments
4 min read
The asyncio Mistake That Cost Me 3 Hours

The asyncio Mistake That Cost Me 3 Hours

Comments
3 min read
I Rewrote Our Scraper with asyncio. My CTO Thought I Added Servers.

I Rewrote Our Scraper with asyncio. My CTO Thought I Added Servers.

Comments
3 min read
I Spent 3 Hours on This Asyncio Bug — Here’s How to Avoid It

I Spent 3 Hours on This Asyncio Bug — Here’s How to Avoid It

Comments
3 min read
How asyncio Slashed Our Data Aggregation Latency from 30s to 2s

How asyncio Slashed Our Data Aggregation Latency from 30s to 2s

Comments
3 min read
asyncio Pitfalls: 3 Hours Debugging a Concurrent Crawler

asyncio Pitfalls: 3 Hours Debugging a Concurrent Crawler

Comments
3 min read
How I Boosted Concurrency 40x with asyncio — and Ops Thought We Were DDoSed

How I Boosted Concurrency 40x with asyncio — and Ops Thought We Were DDoSed

Comments
3 min read
asyncio Pitfalls: The 3-Hour Bug

asyncio Pitfalls: The 3-Hour Bug

Comments
3 min read
From 20 Minutes to 90 Seconds: Refactoring a 100-Endpoint Crawler with asyncio

From 20 Minutes to 90 Seconds: Refactoring a 100-Endpoint Crawler with asyncio

Comments
4 min read
The 3-Hour Async Bug: Why My 1000 Concurrent Requests Ran One by One

The 3-Hour Async Bug: Why My 1000 Concurrent Requests Ran One by One

Comments
4 min read
Python asyncio Pitfalls: The Bug That Cost Me 3 Hours

Python asyncio Pitfalls: The Bug That Cost Me 3 Hours

Comments
3 min read
loading...