DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Robust Classification via a Single Diffusion Model

Robust Classification via a Single Diffusion Model

Comments
4 min read
Lessons from the Trenches on Reproducible Evaluation of Language Models

Lessons from the Trenches on Reproducible Evaluation of Language Models

Comments
5 min read
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks

1
Comments
3 min read
Ephemeral Rollups are All you Need

Ephemeral Rollups are All you Need

Comments
3 min read
Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Comments
4 min read
Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling

Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling

Comments
4 min read
Chain-of-Thought Reasoning Without Prompting

Chain-of-Thought Reasoning Without Prompting

Comments
4 min read
Extracting Prompts by Inverting LLM Outputs

Extracting Prompts by Inverting LLM Outputs

Comments
4 min read
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition

Comments
4 min read
DarkDNS: Revisiting the Value of Rapid Zone Update

DarkDNS: Revisiting the Value of Rapid Zone Update

Comments
4 min read
LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

LoRA Fine-tuning Efficiently Undoes Safety Training in Llama 2-Chat 70B

Comments
4 min read
Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer

Direct3D: Scalable Image-to-3D Generation via 3D Latent Diffusion Transformer

Comments
4 min read
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA

Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA

Comments
4 min read
Training Language Models to Generate Text with Citations via Fine-grained Rewards

Training Language Models to Generate Text with Citations via Fine-grained Rewards

Comments
3 min read
InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

InvertAvatar: Incremental GAN Inversion for Generalized Head Avatars

Comments
4 min read
The CAP Principle for LLM Serving

The CAP Principle for LLM Serving

Comments
4 min read
Self-playing Adversarial Language Game Enhances LLM Reasoning

Self-playing Adversarial Language Game Enhances LLM Reasoning

Comments
4 min read
A Declarative System for Optimizing AI Workloads

A Declarative System for Optimizing AI Workloads

Comments
4 min read
Why are Sensitive Functions Hard for Transformers?

Why are Sensitive Functions Hard for Transformers?

Comments
4 min read
ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

ARAIDA: Analogical Reasoning-Augmented Interactive Data Annotation

Comments
4 min read
Representation noising effectively prevents harmful fine-tuning on LLMs

Representation noising effectively prevents harmful fine-tuning on LLMs

Comments
5 min read
Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!

Increasing the LLM Accuracy for Question Answering: Ontologies to the Rescue!

Comments
5 min read
Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Demo Paper: A Game Agents Battle Driven by Free-Form Text Commands Using Code-Generation LLM

Comments
4 min read
Thermodynamic Natural Gradient Descent

Thermodynamic Natural Gradient Descent

Comments
5 min read
BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once

Comments
4 min read
Attention as an RNN

Attention as an RNN

Comments
4 min read
Transformers Can Do Arithmetic with the Right Embeddings

Transformers Can Do Arithmetic with the Right Embeddings

Comments
4 min read
ColorFoil: Investigating Color Blindness in Large Vision and Language Models

ColorFoil: Investigating Color Blindness in Large Vision and Language Models

Comments
4 min read
Pareto Optimal Learning for Estimating Large Language Model Errors

Pareto Optimal Learning for Estimating Large Language Model Errors

Comments
4 min read
Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Unsupervised Evaluation of Code LLMs with Round-Trip Correctness

Comments
4 min read
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving

Comments
3 min read
Fractal Patterns May Illuminate the Success of Next-Token Prediction

Fractal Patterns May Illuminate the Success of Next-Token Prediction

Comments
5 min read
Neuromorphic dreaming: A pathway to efficient learning in artificial agents

Neuromorphic dreaming: A pathway to efficient learning in artificial agents

Comments
3 min read
As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

As an AI Language Model, Yes I Would Recommend Calling the Police'': Norm Inconsistency in LLM Decision-Making

Comments
4 min read
BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry

BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry

Comments
4 min read
Track Anything Rapter(TAR)

Track Anything Rapter(TAR)

Comments
3 min read
VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search

VerMCTS: Synthesizing Multi-Step Programs using a Verifier, a Large Language Model, and Tree Search

Comments
4 min read
TimeGPT-1

TimeGPT-1

Comments
4 min read
Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Power Hungry Processing: Watts Driving the Cost of AI Deployment?

Comments
4 min read
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models

Comments
5 min read
UFO: A UI-Focused Agent for Windows OS Interaction

UFO: A UI-Focused Agent for Windows OS Interaction

Comments
4 min read
ML Intro : IRIS DATASET

ML Intro : IRIS DATASET

4
Comments
4 min read
¿Qué esta pasando con Gemini AI de Google?

¿Qué esta pasando con Gemini AI de Google?

13
Comments
2 min read
Join our ML Reading Group on May 14th !

Join our ML Reading Group on May 14th !

Comments
1 min read
MT-Bench: Comparing different LLM Judges

MT-Bench: Comparing different LLM Judges

13
Comments 1
4 min read
Managing Machine Learning Projects

Managing Machine Learning Projects

10
Comments
10 min read
Learning Python

Learning Python

1
Comments
1 min read
LISA adapted to SamGIS

LISA adapted to SamGIS

1
Comments
2 min read
Can LLMs Truly Understand Text-based Emotion ?

Can LLMs Truly Understand Text-based Emotion ?

Comments
20 min read
What I learnt from development on LISA with SamGIS (So far)

What I learnt from development on LISA with SamGIS (So far)

2
Comments
2 min read
LISA integrato in SamGIS

LISA integrato in SamGIS

Comments
2 min read
Cosa ho imparato durante lo sviluppo di SamGIS con LISA (finora)

Cosa ho imparato durante lo sviluppo di SamGIS con LISA (finora)

Comments
2 min read
Amazon Macie to detect sensitive data from your S3 Buckets

Amazon Macie to detect sensitive data from your S3 Buckets

13
Comments
4 min read
Discover Wit.ai: Create Your Own Intelligent Bots for Free 🚀🤖

Discover Wit.ai: Create Your Own Intelligent Bots for Free 🚀🤖

13
Comments 2
2 min read
Machine Learning 101: What You Need to Know

Machine Learning 101: What You Need to Know

7
Comments
3 min read
Unleashing the Value of Data: A Journey into Data Monetization

Unleashing the Value of Data: A Journey into Data Monetization

1
Comments
2 min read
How to learn machine learning

How to learn machine learning

Comments
2 min read
Appunti sul machine learning

Appunti sul machine learning

Comments
3 min read
Generative AI Revolutionizes Quantum Computer Programming

Generative AI Revolutionizes Quantum Computer Programming

6
Comments
2 min read
Case Study - Optimizing Linear Layer

Case Study - Optimizing Linear Layer

Comments
7 min read
loading...