DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

1
Comments 1
4 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

Comments
4 min read
Reasoning in Large Language Models: A Geometric Perspective

Reasoning in Large Language Models: A Geometric Perspective

Comments
5 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

1
Comments
3 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Comments
3 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Comments
3 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

2
Comments
3 min read
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Comments
5 min read
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Comments
4 min read
Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization

Comments
4 min read
Simulacra as Conscious Exotica

Simulacra as Conscious Exotica

1
Comments 1
4 min read
Exploring the Latest LLMs for Leaderboard Extraction

Exploring the Latest LLMs for Leaderboard Extraction

Comments
4 min read
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

Comments
4 min read
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Comments
4 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.