DEV Community

Machine Learning

Machine learning is a branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
TransformerFAM: Feedback attention is working memory

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

2
Comments
3 min read
Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

1
Comments
3 min read
Meeting Minutes Made Easy: Summarize Key Points and Send Emails using Lyzr-Automata

Comments
3 min read
Zero-Shot Prediction Plugin for FiftyOne

Comments
8 min read
Can Facial Recognition Be Trusted for Immigration Control?

1
Comments
1 min read
Cognita: An Open-Source Framework for Enhanced RAG Applications

2
Comments
3 min read
RAG Redefined : Ready-to-Deploy RAG for Organizations at Scale.

1
Comments 2
1 min read
Day 1 of 30 : Machine Learning

11
Comments 6
2 min read
AI is Not Going to Steal Your Keyboard (Unless You Let Them Write the Code)

1
Comments
2 min read
Get Hired Faster: How to use Lyzr-Automata to draft personalised cold emails

1
Comments
4 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

5
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read