How We Pick What to Review: The Content Calendar Behind Pickuma

#meta #blogging #webdev

The most common question I get from readers is not about a specific tool. It is, consistently: what are you reviewing next, and when?

When you are evaluating tools for a time-sensitive decision, a review that arrives in two months is a review that arrives too late. If you do not know whether it is coming at all, you cannot plan around it. This article is the answer: a transparent walkthrough of the content calendar that drives Pickuma — how I maintain it, how I score candidates, how I decide what gets published when, and which topics I kill before they reach a draft.

The Content Calendar, Explained

I maintain a single Google Sheet that functions as the editorial calendar. It is not a project management tool or a publishing platform. It is a spreadsheet with columns, and the columns are: tool name, category, demand score, testability score, editorial fit score, composite priority, estimated publish window, current status, and notes.

The sheet contains roughly sixty candidates at any given time. About twelve are active — they have cleared the selection gates and are somewhere between initial research and final draft. About twenty are scored and queued but not yet started. The remaining thirty are flagged for re-evaluation before they enter the queue.

Scores are recalculated once a month, on the first Monday. Scoring more frequently would produce noise — minor search volume fluctuations or a single Reddit thread would bounce priorities around. A monthly cadence lets signals accumulate enough to be meaningful and keeps the queue stable enough that I can commit to publish windows.

The single most surprising thing I have learned from two years of tracking this data: community discussion volume is a better predictor of long-term readership than search volume. A tool that generates sustained conversation on Hacker News and Reddit over six months will attract more readers than a tool with higher search volume but zero community discussion. Search tells you what people are typing into Google. Community tells you what people care about enough to argue over. The second signal ages better.

How We Score and Prioritize

The scoring system is deliberately simple. Each candidate gets three scores on a 1-10 scale, and the composite priority is a weighted average. Three dimensions is where additional dimensions stop adding signal and start adding noise.

Demand score (weight: 40%). Combines search volume for comparison and evaluation queries, community discussion persistence, and direct reader requests. Comparison queries ("X vs Y") carry more weight than brand searches. A reader request describing a specific evaluation problem carries more weight than a generic "can you review X." Five or more unique requests for the same tool force the demand score to at least 7, regardless of what search and community signals say.

Testability score (weight: 40%). Can I actually evaluate this tool in a way that produces useful observations? A tool with a generous free tier, clean documentation, and sub-30-minute setup scores a 9 or 10. A tool requiring a multi-node cluster, opaque pricing hidden behind a sales call, and a week of configuration before meaningful results scores a 2 or 3. Tools in the 2-3 range are effectively excluded — I review them only when demand is overwhelming and I can justify the infrastructure cost.

Editorial fit score (weight: 20%). The subjective dimension. Do I understand the category well enough to evaluate it properly? Would the review add something that does not already exist? Is the tool genuinely interesting rather than incrementally better? A tool that changes how developers think about a category scores high. A tool that is 15% cheaper with no other differentiation scores low.

The composite determines queue position, but I maintain a separate "publish window" column that spaces out same-category reviews. Publishing two database reviews back-to-back is less useful than alternating — database, then CI/CD, then monitoring, then returning to databases.

Trending vs. Evergreen: The Balance Problem

This is the hardest editorial decision I make, and I do not have a formula for it. The tension is this: trending topics generate traffic now, but evergreen content generates traffic forever. A review of a tool that launched last week and is dominating Hacker News will bring readers immediately, but the same review will be stale in six months when the tool has shipped three major versions. A comparison guide covering a settled category generates modest but consistent traffic for years, but it does not feel urgent to publish.

I aim for roughly 70% evergreen and 30% trending, but this is aspirational, not a quota. Some months the candidate pool tilts toward comparisons and settled categories; other months a wave of new tools shifts the ratio toward trending reviews. I do not force a ratio. I watch it.

The discipline I enforce: I do not rush a review to catch a trend. Most reviews take two to four weeks of testing, writing, and revision, plus another week for the editorial pipeline. If a tool launches and the hype cycle will be over before I can publish a quality review, I let it go. A rushed review that is wrong damages the site more than a review that is late.

This is also why I prioritize evergreen content so heavily. Trending reviews have a shelf life. Evergreen content accumulates. The comparison guides I published a year ago are still generating traffic today. A review of a tool that was hot for two weeks in March would be generating nothing by May.

I track something I call the regret metric: for every tool I have reviewed in the past two years, I note whether I would still publish the same review today with the same level of effort. If the answer is no, I examine why. The most common reason is that I reviewed a tool because it was trending rather than because it was genuinely important. This metric has steadily pushed the calendar toward evergreen content and away from trend-chasing. It is not a perfect system — I still make mistakes — but it has meaningfully reduced the number of reviews I look back on and wish I had not written.

What Gets Rejected and Why

Beyond the scoring system, there are categories of candidates that I kill outright — not deprioritize, not move to a "maybe later" column, but remove from the sheet entirely.

Pre-launch tools. If a tool exists only as a GitHub README, a waitlist, or a demo video, it is not a candidate. I made this mistake once — I evaluated a tool based on documentation and a private demo, wrote a positive review, then watched the public launch ship something materially different from what I had tested. The review was wrong. I pulled it.

Tools in categories I do not understand. I review developer tools, infrastructure, SaaS productivity tools, and select finance tools where I have domain expertise. If a reader requests a review of a video editing suite or a CRM platform, I decline. Publishing a review without the context to distinguish real innovation from feature parity produces exactly the shallow, summary-of-summaries content that I built Pickuma to replace.

Tools with predatory pricing. If pricing is hidden behind a sales call, or if the pricing model penalizes existing customers without corresponding value improvements, I exclude the tool from the candidate list regardless of demand. This is not a gate that most review sites apply, and I understand why — it eliminates high-revenue affiliate categories. But I cannot in good conscience recommend a tool where the cost is unknowable at evaluation time.

Review requests from vendors. These get filed separately and carry zero weight. A vendor email that says "we would love for you to review our product" does not influence the scoring system. If the tool is worth reviewing, developers are already asking about it.

The calendar is a living document. It is wrong more often than it is right — demand shifts, testing reveals surprises, tools I expected to recommend turn out to be poor fits, and tools I expected to dismiss turn out to be excellent. The calendar's job is not to be correct as a prediction. Its job is to be honest as a process.

If there is a tool you are evaluating and the answer to "is Pickuma reviewing it" would change your decision, send the request through the contact form. I read every one. I cannot promise a review, but I can promise that the request will be logged, scored, and considered alongside every other signal in the sheet. That is more than most review sites will tell you, and I think that matters.

Originally published at pickuma.com. Subscribe to the RSS or follow @pickuma.bsky.social for new reviews.