DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
AutoCodeRover: Autonomous Program Improvement

AutoCodeRover: Autonomous Program Improvement

1
Comments
3 min read
Aprenda a falar inglês

Aprenda a falar inglês

Comments 1
2 min read
The Illusion of State in State-Space Models

The Illusion of State in State-Space Models

Comments
4 min read
Zero-shot Building Age Classification from Facade Image Using GPT-4

Zero-shot Building Age Classification from Facade Image Using GPT-4

Comments
4 min read
Google Play Biometrics Verification Method: Should You Turn It On?

Google Play Biometrics Verification Method: Should You Turn It On?

1
Comments
2 min read
AI enthusiasm #5 - Calculate your carbon footprint🌱

AI enthusiasm #5 - Calculate your carbon footprint🌱

1
Comments
3 min read
DEVin | The Scary Future For Developers

DEVin | The Scary Future For Developers

1
Comments
4 min read
Data preprocessing for extractive QA

Data preprocessing for extractive QA

Comments
1 min read
Tools Every Data Scientist Should Know

Tools Every Data Scientist Should Know

Comments
2 min read
Large Language Models as Optimizers

Large Language Models as Optimizers

1
Comments
4 min read
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
Recommender Systems in the Era of Large Language Models (LLMs)

Recommender Systems in the Era of Large Language Models (LLMs)

1
Comments
4 min read
H2O-Danube-1.8B Technical Report

H2O-Danube-1.8B Technical Report

Comments
4 min read
Dataset Reset Policy Optimization for RLHF

Dataset Reset Policy Optimization for RLHF

Comments
4 min read
Manipulating Large Language Models to Increase Product Visibility

Manipulating Large Language Models to Increase Product Visibility

Comments
3 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
TransformerFAM: Feedback attention is working memory

TransformerFAM: Feedback attention is working memory

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
loading...