DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that uses data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

History of Math & Machine Learning
8 min read
The 6 Best Machine Learning APIs 2024
7 min read
Capabilities of Gemini Models in Medicine
4 min read
A Careful Examination of Large Language Model Performance on Grade School Arithmetic
5 min read
Uncovering the Metaverse within Everyday Environments: a Coarse-to-Fine Approach
3 min read
Scalable network reconstruction in subquadratic time
3 min read
Build your own AI ChatBot on your machine
4 min read
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks
3 min read
Fewer Truncations Improve Language Modeling
4 min read
Mixture-of-Linear-Experts for Long-term Time Series Forecasting
4 min read
Retrieval-Augmented Score Distillation for Text-to-3D Generation
4 min read
A Watermark for Large Language Models
4 min read
The Shape of Money Laundering: Subgraph Representation Learning on the Blockchain with the Elliptic2 Dataset
4 min read
Alice's Adventures in a Differentiable Wonderland -- Volume I, A Tour of the Land
4 min read
Chronos: Learning the Language of Time Series
3 min read
DeepRacer-for-Cloud v5.2.2 now available with new real-time training metrics
4 min read
Demystifying Heuristic Search Algorithms
5 min read
Generative AI: Novice Guide to Transformers
4 min read
How paranoid is it to not use facial recognition on an Iphone?
2 min read
Computer Vision Meetup: Who needs RLHF When You Have SFT?
1 min read
Building a Trader Bot with Sentiment Analysis: A Step-by-Step Guide
3 min read
Computer Vision Meetup: Making LLMs Safe & Reliable
1 min read
5 Open Source Large Language Models APIs for Developers
10 min read
GenCast: Diffusion-based ensemble forecasting for medium-range weather
3 min read
KAN: Kolmogorov-Arnold Networks
4 min read
Training-free Graph Neural Networks and the Power of Labels as Features
4 min read
Where on Earth Do Users Say They Are?: Geo-Entity Linking for Noisy Multilingual User Input
4 min read
Better & Faster Large Language Models via Multi-token Prediction
4 min read
Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification
3 min read
Predicting SSH keys in Open SSH Memory dumps
4 min read
Thousands of AI Authors on the Future of AI
3 min read
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
3 min read
Eureka: Human-Level Reward Design via Coding Large Language Models
4 min read
BlenderAlchemy: Editing 3D Graphics with Vision-Language Models
4 min read
Fine-tune your first large language model (LLM) with LoRA, llama.cpp, and KitOps in 5 easy steps
3 min read
Single Page Applications (SPAs) vs. Multi-Page Applications (MPAs). Navigating the Web App Landscape
1 min read
How to use LLMs: Summarize long documents
5 min read
Cross-validation in Machine Learning
4 min read
May 8, 2024 AI, Machine Learning and Computer Vision Meetup
3 min read
A complete beginner's guide to using the Face-To-Many model on Replicate
3 min read
A complete beginner's guide to using the Real-Esrgan model on Replicate
2 min read
A beginner's guide to using Face-To-Many on Replicate
3 min read
Iterative Reasoning Preference Optimization
4 min read
Benchmarking Benchmark Leakage in Large Language Models
4 min read
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
4 min read
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
4 min read
DoRA: Weight-Decomposed Low-Rank Adaptation
4 min read
InstructEdit: Instruction-based Knowledge Editing for Large Language Models
4 min read
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
4 min read
Building a Large Japanese Web Corpus for Large Language Models
3 min read
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
3 min read
Tunnel Try-on: Excavating Spatial-temporal Tunnels for High-quality Virtual Try-on in Videos
3 min read
Layered and Staged Monte Carlo Tree Search for SMT Strategy Synthesis
4 min read
Neural-Symbolic Recursive Machine for Systematic Generalization
4 min read
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
4 min read
CharacterFactory: Sampling Consistent Characters with GANs for Diffusion Models
4 min read
EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
5 min read
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models
4 min read
Make Your LLM Fully Utilize the Context
4 min read
Relay Mining: Incentivizing Full Non-Validating Nodes Servicing All RPC Types
4 min read