DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

DeepRacer-for-Cloud v5.2.2 now available with new real-time training metrics

1
Comments
4 min read
Demystifying Heuristic Search Algorithms

15
Comments
5 min read
Generative AI: Novice Guide to Transformers

Comments
4 min read
How paranoid is it to not use facial recognition on an Iphone?

1
Comments 1
2 min read
Computer Vision Meetup: Who needs RLHF When You Have SFT?

3
Comments
1 min read
Building a Trader Bot with Sentiment Analysis: A Step-by-Step Guide

11
Comments
3 min read
Computer Vision Meetup: Making LLMs Safe & Reliable

Comments
1 min read
5 Open Source Large Language Models APIs for Developers

20
Comments 3
10 min read
GenCast: Diffusion-based ensemble forecasting for medium-range weather

2
Comments
3 min read
KAN: Kolmogorov-Arnold Networks

2
Comments
4 min read
Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input

1
Comments
4 min read
Training-free Graph Neural Networks and the Power of Labels as Features

1
Comments
4 min read
Eureka: Human-Level Reward Design via Coding Large Language Models

Comments
4 min read
Thousands of AI Authors on the Future of AI

Comments
3 min read
Better & Faster Large Language Models via Multi-token Prediction

Comments
4 min read
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Comments
3 min read
Predicting SSH keys in Open SSH Memory dumps

Comments
4 min read
Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification

Comments
3 min read
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models

1
Comments
4 min read
Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps

63
Comments 4
3 min read
Deepfake detection by FacePlugin-Safeguarding Remote Onboarding

20
Comments
1 min read
Single Page Applications (SPAs) vs. Multi-Page Applications (MPAs). Navigating the Web App Landscape

Comments
1 min read
How to use LLMs: Summarize long documents

22
Comments
5 min read
Cross-validation in Machine Learning

Comments
4 min read
May 8, 2024 AI, Machine Learning and Computer Vision Meetup

1
Comments
3 min read
A complete beginner's guide to using the Face-To-Many model on Replicate

Comments
3 min read
A complete beginner's guide to using the Real-Esrgan model on Replicate

Comments
2 min read
A beginner's guide to using Face-To-Many on Replicate

1
Comments
3 min read
Iterative Reasoning Preference Optimization

Comments
4 min read
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Comments
4 min read
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos

Comments
3 min read
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Comments
4 min read
DoRA: Weight-Decomposed Low-Rank Adaptation

Comments
4 min read
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

Comments
3 min read
InstructEdit: Instruction-based Knowledge Editing for Large Language Models

Comments
4 min read
RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation

Comments
4 min read
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Comments
4 min read
Step Differences in Instructional Video

Comments
4 min read
Make Your LLM Fully Utilize the Context

Comments
4 min read
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Comments
4 min read
Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis

Comments
4 min read
Kosmos-G: Generating Images in Context with Multimodal Large Language Models

Comments
5 min read
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models

Comments
4 min read
Neural-Symbolic Recursive Machine for Systematic Generalization

Comments
4 min read
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning

Comments
5 min read
Building a Large Japanese Web Corpus for Large Language Models

Comments
3 min read
Relational Graph Convolutional Networks for Sentiment Analysis

Comments
5 min read
Benchmarking Benchmark Leakage in Large Language Models

Comments
4 min read
Relay Mining: Incentivizing Full Non-Validating Nodes Servicing All RPC Types

Comments
4 min read
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models

Comments
4 min read
Extending Llama-3's Context Ten-Fold Overnight

Comments
4 min read
LDB: A Large Language Model Debugger via Verifying Runtime Execution Step-by-step

Comments
4 min read
Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Comments
4 min read
Benchmarking Mobile Device Control Agents across Diverse Configurations

1
Comments
4 min read
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

Comments
3 min read
Learning Performance-Improving Code Edits

Comments
4 min read
Breaking News: AWS Bedrock Lands in Sydney

3
Comments
2 min read
Vector Databases Are the Base of RAG Retrieval

15
Comments
6 min read
Exploring BGE-M3 and Splade: Two Machine Learning Models for Generating Sparse Embeddings

17
Comments
8 min read
How to use Retrieval Augmented Generation (RAG) for Go applications

10
Comments 1
12 min read