DEV Community

Artificial Intelligence

Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities found in humans and in nature.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

Comments
5 min read
ToonCrafter: Generative Cartoon Interpolation

ToonCrafter: Generative Cartoon Interpolation

3
Comments
4 min read
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

1
Comments
4 min read
Large Language Models Can Self-Improve At Web Agent Tasks

Large Language Models Can Self-Improve At Web Agent Tasks

Comments
4 min read
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

Comments
4 min read
There and Back Again: The AI Alignment Paradox

There and Back Again: The AI Alignment Paradox

Comments
4 min read
LLMs achieve adult human performance on higher-order theory of mind tasks

LLMs achieve adult human performance on higher-order theory of mind tasks

Comments
4 min read
Privacy-Aware Visual Language Models

Privacy-Aware Visual Language Models

Comments
4 min read
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Comments
4 min read
Easy Problems That LLMs Get Wrong

Easy Problems That LLMs Get Wrong

6
Comments
4 min read
Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Comments
4 min read
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Comments
4 min read
The rising costs of training frontier AI models

The rising costs of training frontier AI models

Comments
5 min read
MoEUT: Mixture-of-Experts Universal Transformers

MoEUT: Mixture-of-Experts Universal Transformers

Comments
3 min read
You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

Comments
4 min read
Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust

Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust

1
Comments
4 min read
Simplifying Transformer Blocks

Simplifying Transformer Blocks

Comments
4 min read
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Comments
4 min read
The Road Less Scheduled

The Road Less Scheduled

Comments
4 min read
gzip Predicts Data-dependent Scaling Laws

gzip Predicts Data-dependent Scaling Laws

Comments
5 min read
Look Once to Hear: Target Speech Hearing with Noisy Examples

Look Once to Hear: Target Speech Hearing with Noisy Examples

Comments
5 min read
Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

1
Comments
5 min read
Arrows of Time for Large Language Models

Arrows of Time for Large Language Models

Comments
5 min read
NPGA: Neural Parametric Gaussian Avatars

NPGA: Neural Parametric Gaussian Avatars

2
Comments
3 min read
An Introduction to Vision-Language Modeling

An Introduction to Vision-Language Modeling

1
Comments 1
4 min read
loading...