DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

2
Comments 1
4 min read
Evaluating the Performance of ChatGPT for Spam Email Detection

Evaluating the Performance of ChatGPT for Spam Email Detection

1
Comments
3 min read
Exploitation Business: Leveraging Information Asymmetry

Exploitation Business: Leveraging Information Asymmetry

Comments
3 min read
Large Language Models for Data Annotation: A Survey

Large Language Models for Data Annotation: A Survey

Comments
4 min read
4090 - ECC ON vs ECC OFF

4090 - ECC ON vs ECC OFF

10
Comments
1 min read
Step-by-Step with Pandas: Basic Operations to Intermediate Mastery 🐍🐼

Step-by-Step with Pandas: Basic Operations to Intermediate Mastery 🐍🐼

6
Comments
2 min read
LMDX: Language Model-based Document Information Extraction and Localization

LMDX: Language Model-based Document Information Extraction and Localization

2
Comments
4 min read
How Susceptible are Large Language Models to Ideological Manipulation?

How Susceptible are Large Language Models to Ideological Manipulation?

Comments
3 min read
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

5
Comments
4 min read
Transcendence: Generative Models Can Outperform The Experts That Train Them

Transcendence: Generative Models Can Outperform The Experts That Train Them

Comments
4 min read
The Impact of Reasoning Step Length on Large Language Models

The Impact of Reasoning Step Length on Large Language Models

2
Comments
4 min read
An Image is Worth 32 Tokens for Reconstruction and Generation

An Image is Worth 32 Tokens for Reconstruction and Generation

Comments
4 min read
VideoPrism: A Foundational Visual Encoder for Video Understanding

VideoPrism: A Foundational Visual Encoder for Video Understanding

Comments 1
4 min read
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Comments
4 min read
Are LLMs Naturally Good at Synthetic Tabular Data Generation?

Are LLMs Naturally Good at Synthetic Tabular Data Generation?

1
Comments
4 min read
Transparent Image Layer Diffusion using Latent Transparency

Transparent Image Layer Diffusion using Latent Transparency

Comments
3 min read
Foundation Models for Time Series Analysis: A Tutorial and Survey

Foundation Models for Time Series Analysis: A Tutorial and Survey

2
Comments
4 min read
StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

Comments
4 min read
An Interactive Agent Foundation Model

An Interactive Agent Foundation Model

Comments
3 min read
Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Comments
4 min read
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Comments
3 min read
Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Comments
4 min read
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Comments
4 min read
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

Comments
4 min read
Chain-of-Thought Unfaithfulness as Disguised Accuracy

Chain-of-Thought Unfaithfulness as Disguised Accuracy

1
Comments
4 min read
Large language models surpass human experts in predicting neuroscience results

Large language models surpass human experts in predicting neuroscience results

1
Comments
4 min read
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Comments
4 min read
Refusal in Language Models Is Mediated by a Single Direction

Refusal in Language Models Is Mediated by a Single Direction

1
Comments
3 min read
Large Language Models Are Zero-Shot Time Series Forecasters

Large Language Models Are Zero-Shot Time Series Forecasters

2
Comments
4 min read
Jellyfish: A Large Language Model for Data Preprocessing

Jellyfish: A Large Language Model for Data Preprocessing

2
Comments
4 min read
Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Comments
4 min read
garak: A Framework for Security Probing Large Language Models

garak: A Framework for Security Probing Large Language Models

1
Comments
4 min read
LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

1
Comments
4 min read
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

Comments
6 min read
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Comments
3 min read
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Comments
4 min read
A Survey on In-context Learning

A Survey on In-context Learning

1
Comments
4 min read
Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Comments
3 min read
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Comments
4 min read
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Comments
4 min read
How Do Humans Write Code? Large Models Do It the Same Way Too

How Do Humans Write Code? Large Models Do It the Same Way Too

Comments
4 min read
DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

Comments
4 min read
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Comments
4 min read
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Comments
4 min read
Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Comments
3 min read
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Comments
5 min read
Depth Anything V2

Depth Anything V2

1
Comments
4 min read
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Comments
4 min read
DataComp-LM: In search of the next generation of training sets for language models

DataComp-LM: In search of the next generation of training sets for language models

Comments
3 min read
A Survey on Large Language Models for Recommendation

A Survey on Large Language Models for Recommendation

Comments
4 min read
Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Comments
4 min read
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Comments
4 min read
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Comments
3 min read
Everything You Need to Know About Microsoft Azure Face Recognition Technology

Everything You Need to Know About Microsoft Azure Face Recognition Technology

2
Comments
1 min read
Behind the Scenes of AI: How Language Models Like ChatGPT Work

Behind the Scenes of AI: How Language Models Like ChatGPT Work

Comments 1
3 min read
The Magical World of Machine Learning at Hogwarts (Part #2)

The Magical World of Machine Learning at Hogwarts (Part #2)

6
Comments
7 min read
A forensic analysis of the Claude Sonnet 3.5 system prompt leak

A forensic analysis of the Claude Sonnet 3.5 system prompt leak

4
Comments
7 min read
Text-based language processing enhanced with AI/ML

Text-based language processing enhanced with AI/ML

4
Comments
22 min read
The Magical World of Machine Learning at Hogwarts (Part #1)

The Magical World of Machine Learning at Hogwarts (Part #1)

6
Comments 1
8 min read
Introduction to Pandas

Introduction to Pandas

9
Comments 4
37 min read
loading...