DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Comments
4 min read
Training Language Models to Generate Text with Citations via Fine-grained Rewards

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Comments
3 min read
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

Comments
4 min read
InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

Comments
4 min read
Why are Sensitive Functions Hard for Transformers?

Why are Sensitive Functions Hard for Transformers?

Comments
4 min read
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Comments
3 min read
Transformers Can Do Arithmetic with the Right Embeddings

Transformers Can Do Arithmetic with the Right Embeddings

Comments
4 min read
Thermodynamic Natural Gradient Descent

Thermodynamic Natural Gradient Descent

Comments
5 min read
Representation noising effectively prevents harmful fine-tuning on LLMs

Representation noising effectively prevents harmful fine-tuning on LLMs

Comments
5 min read
A Declarative System for Optimizing AI Workloads

A Declarative System for Optimizing AI Workloads

Comments
4 min read
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

Comments
4 min read
Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Comments
4 min read
The CAP Principle for LLM Serving

The CAP Principle for LLM Serving

Comments
4 min read
ColorFoil: Investigating Color Blindness in Large Vision and Language Models

ColorFoil: Investigating Color Blindness in Large Vision and Language Models

Comments
4 min read
Self-playing Adversarial Language Game Enhances LLM Reasoning

Self-playing Adversarial Language Game Enhances LLM Reasoning

Comments
4 min read
Attention as an RNN

Attention as an RNN

Comments
4 min read
Pareto Optimal Learning for Estimating Large Language Model Errors

Pareto Optimal Learning for Estimating Large Language Model Errors

Comments
4 min read
Track Anything Rapter(TAR)

Track Anything Rapter(TAR)

Comments
3 min read
Neuromorphic dreaming: A pathway to efficient learning in artificial agents

Neuromorphic dreaming: A pathway to efficient learning in artificial agents

Comments
3 min read
Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

1
Comments
4 min read
UFO: A UI-Focused Agent for Windows OS Interaction

UFO: A UI-Focused Agent for Windows OS Interaction

Comments
4 min read
Fractal Patterns May Illuminate the Success of Next-Token Prediction

Fractal Patterns May Illuminate the Success of Next-Token Prediction

Comments
5 min read
As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

Comments
4 min read
TimeGPT-1

TimeGPT-1

Comments
4 min read
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Comments
5 min read
loading...