DEV Community

SomeOddCodeGuy profile picture

SomeOddCodeGuy

Just a boring dev/manager who picked up LLMs in early 2023, and been engrossed ever since. This is a cross-post blog. The main blog is at https://someoddcodeguy.dev

Location Florida, USA Joined Joined on  Personal website https://github.com/someoddcodeguy
My Next Steps with Wilmer

My Next Steps with Wilmer

Comments
4 min read
Microsoft's New User Role Model

Microsoft's New User Role Model

Comments
1 min read
Agentic Coding Pt 2...

Agentic Coding Pt 2...

Comments
2 min read
Reddit woes; but a light at the end of the tunnel

Reddit woes; but a light at the end of the tunnel

Comments
1 min read
GLM 4.6 MXFP4 vs q8_0 gguf speeds on Mac M3 Ultra

GLM 4.6 MXFP4 vs q8_0 gguf speeds on Mac M3 Ultra

Comments
2 min read
I'm Not Sure If This Counts As "Vibe Coding"?

I'm Not Sure If This Counts As "Vibe Coding"?

Comments
2 min read
Recursive Workflows- Offline Wikipedia Search in WilmerAI

Recursive Workflows- Offline Wikipedia Search in WilmerAI

Comments
3 min read
Stanford Lectures...

Stanford Lectures...

Comments
1 min read
Heh... Oops. All Hail the Unit Tests.

Heh... Oops. All Hail the Unit Tests.

Comments
1 min read
My Unorthodox Homelab Setup: Updated

My Unorthodox Homelab Setup: Updated

Comments
5 min read
Latency while using RDP

Latency while using RDP

Comments
1 min read
Time For Another Studio? Wallet Beware...

Time For Another Studio? Wallet Beware...

Comments
1 min read
A Quick List of LLM Benchmarks

A Quick List of LLM Benchmarks

Comments
1 min read
RAG Really Is More of a Software Problem Than An AI Problem

RAG Really Is More of a Software Problem Than An AI Problem

Comments
2 min read
That LinkedIn '95% of AI Ventures Fail' Stat That's Going Around...

That LinkedIn '95% of AI Ventures Fail' Stat That's Going Around...

Comments
2 min read
Mac Studio M3 Ultra Speeds for Qwen3 235b, GPT-OSS-120b, GLM 4.5, and Deepseek V3.1

Mac Studio M3 Ultra Speeds for Qwen3 235b, GPT-OSS-120b, GLM 4.5, and Deepseek V3.1

Comments
2 min read
Reddit Shadowbans- A Deep Dive Into What Little I Could Find

Reddit Shadowbans- A Deep Dive Into What Little I Could Find

Comments
4 min read
I'll Always Have A Softspot for the Old Text-Generation-WebUI Chat Bubbles

I'll Always Have A Softspot for the Old Text-Generation-WebUI Chat Bubbles

Comments
1 min read
Running Deepseek R1 0528 q4_K_M and mlx 4-bit on a Mac Studio M3

Running Deepseek R1 0528 q4_K_M and mlx 4-bit on a Mac Studio M3

Comments
2 min read
M3 Ultra Mac Studio 512GB prompt and write speeds for Deepseek V3 0 671b gguf q4_K_M, for those curious

M3 Ultra Mac Studio 512GB prompt and write speeds for Deepseek V3 0 671b gguf q4_K_M, for those curious

Comments
2 min read
Running Llama 3.1 405b q6 and Command-A 111b Q8 on M3 Ultra Mac Studio

Running Llama 3.1 405b q6 and Command-A 111b Q8 on M3 Ultra Mac Studio

Comments
1 min read
Mac Speed Comparison: M2 Ultra vs M3 Ultra using KoboldCpp

Mac Speed Comparison: M2 Ultra vs M3 Ultra using KoboldCpp

Comments
3 min read
Low Context Speed Comparison: Macbook, Mac Studios, and RTX 4090

Low Context Speed Comparison: Macbook, Mac Studios, and RTX 4090

Comments
3 min read
My Personal Guide for Developing Software with AI Assistance: Part 2

My Personal Guide for Developing Software with AI Assistance: Part 2

Comments
6 min read
MMLU-Pro Combined Results - Model Quantization Comparison

MMLU-Pro Combined Results - Model Quantization Comparison

Comments
13 min read
Meet WilmerAI- my open source project to maximize the potential of Local LLMs via prompt routing and multi-model workflows

Meet WilmerAI- my open source project to maximize the potential of Local LLMs via prompt routing and multi-model workflows

Comments
9 min read
Offline Wikipedia API- An easy to use offline API that serves up full text Wikipedia articles.

Offline Wikipedia API- An easy to use offline API that serves up full text Wikipedia articles.

Comments
3 min read
My Personal Guide for Developing Software with AI Assistance

My Personal Guide for Developing Software with AI Assistance

Comments
7 min read
Almost a year later, I can finally do this. A small teaser of a project I'm working on

Almost a year later, I can finally do this. A small teaser of a project I'm working on

Comments
2 min read
Frankenmerges are actually kind of great...

Frankenmerges are actually kind of great...

Comments
2 min read
Ok, I admit- SillyTavern is a great way to test models after all

Ok, I admit- SillyTavern is a great way to test models after all

Comments
3 min read
Real World Speeds on the Mac: Koboldcpp Context Shift Edition!

Real World Speeds on the Mac: Koboldcpp Context Shift Edition!

Comments
9 min read
Here Are Some Real World Speeds For the Mac M2 Ultra, In Case You Were Curious

Here Are Some Real World Speeds For the Mac M2 Ultra, In Case You Were Curious

Comments
6 min read
Deepseek 67b is amazing, and in at least 1 usecase it seems better than ChatGPT 4

Deepseek 67b is amazing, and in at least 1 usecase it seems better than ChatGPT 4

Comments
2 min read
Quick Start Guide To Converting Your Own GGUFs (including fp16)

Quick Start Guide To Converting Your Own GGUFs (including fp16)

Comments
6 min read
Running old gglmv3 models in gpt4all

Running old gglmv3 models in gpt4all

Comments
1 min read
Clearing up confusion: GPT 3.5-Turbo may not be 20b after all

Clearing up confusion: GPT 3.5-Turbo may not be 20b after all

Comments
2 min read
4090 Ahoy...

4090 Ahoy...

Comments
1 min read
NTK Scaling and Llama 2

NTK Scaling and Llama 2

Comments
1 min read
A Little About Local Models

A Little About Local Models

Comments
2 min read
Redownloading All The Old Models

Redownloading All The Old Models

Comments
1 min read
If You Buy A Mac for LLMs, Don't Skimp on RAM

If You Buy A Mac for LLMs, Don't Skimp on RAM

Comments
1 min read
loading...