DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Recommender Systems in the Era of Large Language Models (LLMs)

Recommender Systems in the Era of Large Language Models (LLMs)

1
Comments
4 min read
Large Language Models as Optimizers

Large Language Models as Optimizers

1
Comments
4 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Tools Every Data Scientist Should Know

Tools Every Data Scientist Should Know

Comments
2 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
TransformerFAM: Feedback attention is working memory

TransformerFAM: Feedback attention is working memory

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

2
Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.