DEV Community

Creeta profile picture

Creeta

404 bio not found

Joined Joined on 
Claude Code v2.1.156: Opus 4.8 Thinking Block Hotfix Explained

Claude Code v2.1.156: Opus 4.8 Thinking Block Hotfix Explained

Comments
8 min read
langchain-fireworks 1.4.2: Annotated + ChatFireworks Quickstart

langchain-fireworks 1.4.2: Annotated + ChatFireworks Quickstart

Comments
7 min read
openai-codex v0.1.0b1: First Beta Install and Thread Walkthrough

openai-codex v0.1.0b1: First Beta Install and Thread Walkthrough

Comments
7 min read
'Gemini Omni 3.5' doesn't exist. Here's the real split.

'Gemini Omni 3.5' doesn't exist. Here's the real split.

Comments
7 min read
You don't pick the RL algorithm — SIA's Feedback loop does

You don't pick the RL algorithm — SIA's Feedback loop does

Comments
8 min read
Qwen3.6-35B NVFP4 runs on one H100 — A100 owners are out

Qwen3.6-35B NVFP4 runs on one H100 — A100 owners are out

Comments
8 min read
Step 3.7 Flash is a drop-in — except for one endpoint detail

Step 3.7 Flash is a drop-in — except for one endpoint detail

1
Comments
9 min read
llama-bench skipped FA on capable GPUs — b9437 corrects it

llama-bench skipped FA on capable GPUs — b9437 corrects it

Comments
7 min read
Opus 4.8 kills budget_tokens — here's what else moved

Opus 4.8 kills budget_tokens — here's what else moved

Comments
7 min read
Composer 2.5 hits near-frontier at 60 lower spend

Composer 2.5 hits near-frontier at 60 lower spend

Comments
7 min read
Nemotron 3 Ultra went live June 4. Here's the call that works.

Nemotron 3 Ultra went live June 4. Here's the call that works.

Comments
7 min read
Windsurf is Devin Desktop now. Cascade has 27 days left.

Windsurf is Devin Desktop now. Cascade has 27 days left.

Comments
7 min read
Is Omni's conversational video editor as good as the demos?

Is Omni's conversational video editor as good as the demos?

1
Comments
7 min read
NeMo out, GGUF in: how parakeet.cpp ports NVIDIA ASR to C++

NeMo out, GGUF in: how parakeet.cpp ports NVIDIA ASR to C++

Comments
6 min read
Qwen3 in the browser, zero keys — WebLLM 0.2.83 hands-on

Qwen3 in the browser, zero keys — WebLLM 0.2.83 hands-on

Comments
7 min read
NVIDIA's 550B finally lands: free to use, expensive to host

NVIDIA's 550B finally lands: free to use, expensive to host

Comments
6 min read
MiniMax M3 benchmarks at $0.30/M: verified vs. vendor-only

MiniMax M3 benchmarks at $0.30/M: verified vs. vendor-only

Comments
7 min read
Meta Business AI went global — gated rollout, paid plans TBD

Meta Business AI went global — gated rollout, paid plans TBD

Comments
6 min read
Gemma 4 12B skips the audio encoder. Is 16 GB enough?

Gemma 4 12B skips the audio encoder. Is 16 GB enough?

Comments
7 min read
K2.7 Code is 30% lighter — but chain-of-thought is locked on

K2.7 Code is 30% lighter — but chain-of-thought is locked on

Comments
7 min read
A 10-second Grok Imagine 1.5 clip at 720p runs $1.41

A 10-second Grok Imagine 1.5 clip at 720p runs $1.41

5
Comments
7 min read
Ideogram 4 was trained on JSON — plain prompts are second-class

Ideogram 4 was trained on JSON — plain prompts are second-class

Comments
7 min read
macOS 27 skips Intel — Siri AI is queued, Liquid Glass is not

macOS 27 skips Intel — Siri AI is queued, Liquid Glass is not

Comments
6 min read
Hand off any session to the GUI — Codex CLI 0.138.0

Hand off any session to the GUI — Codex CLI 0.138.0

Comments
5 min read
Kog hits 3K t/s on MI300X, no kernel switches — test it now

Kog hits 3K t/s on MI300X, no kernel switches — test it now

1
Comments
8 min read
Fable 5 refusals are 200 OK — your error handler misses them

Fable 5 refusals are 200 OK — your error handler misses them

1
Comments
7 min read
loading...