DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

1
Comments 1
4 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

Comments
4 min read
PaliGemma: A versatile 3B VLM for transfer

Comments
4 min read
There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

Comments
4 min read
FACTS About Building Retrieval Augmented Generation-based Chatbots

Comments
4 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Comments
4 min read
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Comments
4 min read
What's the Magic Word? A Control Theory of LLM Prompting

Comments
4 min read
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Comments
4 min read
Exploring the Latest LLMs for Leaderboard Extraction

Comments
4 min read
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Comments
4 min read
Prompt Engineering a Prompt Engineer

Comments
4 min read
AI Agents That Matter

Comments
3 min read
A Multivariate Unimodality Test Harnessing the Dip Statistic of Mahalanobis Distances Over Random Projections

Comments
3 min read
The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

Comments
4 min read