DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

Comments
4 min read
Iterative Reasoning Preference Optimization

Iterative Reasoning Preference Optimization

Comments
4 min read
Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis

Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis

Comments
4 min read
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

Comments
3 min read
Make Your LLM Fully Utilize the Context

Make Your LLM Fully Utilize the Context

Comments
4 min read
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Comments
4 min read
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

Comments
4 min read
DoRA: Weight-Decomposed Low-Rank Adaptation

DoRA: Weight-Decomposed Low-Rank Adaptation

Comments
4 min read
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Comments
4 min read
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

Comments
4 min read
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Comments
4 min read
Building a Large Japanese Web Corpus for Large Language Models

Building a Large Japanese Web Corpus for Large Language Models

Comments
3 min read
InstructEdit: Instruction-based Knowledge Editing for Large Language Models

InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Comments
4 min read
Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Comments
5 min read
Extending Llama-3's Context Ten-Fold Overnight

Extending Llama-3's Context Ten-Fold Overnight

Comments
4 min read
Relay Mining: Incentivizing Full Non-Validating Nodes Servicing All RPC Types

Relay Mining: Incentivizing Full Non-Validating Nodes Servicing All RPC Types

Comments
4 min read
Step Differences in Instructional Video

Step Differences in Instructional Video

Comments
4 min read
Neural-Symbolic Recursive Machine for Systematic Generalization

Neural-Symbolic Recursive Machine for Systematic Generalization

Comments
4 min read
Benchmarking Benchmark Leakage in Large Language Models

Benchmarking Benchmark Leakage in Large Language Models

Comments
4 min read
Relational Graph Convolutional Networks for Sentiment Analysis

Relational Graph Convolutional Networks for Sentiment Analysis

Comments
5 min read
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

Comments
4 min read
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Comments
4 min read
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

Comments
5 min read
Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Comments
4 min read
Benchmarking Mobile Device Control Agents across Diverse Configurations

Benchmarking Mobile Device Control Agents across Diverse Configurations

1
Comments
4 min read
loading...