DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that uses data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

4 min read
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

4 min read
LLMs cannot find reasoning errors, but can correct them given the error location

5 min read
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

5 min read
Harvard Undergraduate Survey on Generative AI

2 min read
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

4 min read
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

4 min read
S-LoRA: Serving Thousands of Concurrent LoRA Adapters

4 min read
The Geometry of Categorical and Hierarchical Concepts in Large Language Models

4 min read
Will we run out of data? Limits of LLM scaling based on human-generated data

4 min read
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings

4 min read
To Believe or Not to Believe Your LLM

4 min read
REBUS: A Robust Evaluation Benchmark of Understanding Symbols

4 min read
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

3 min read
Do Llamas Work in English? On the Latent Language of Multilingual Transformers

4 min read
Empirical influence functions to understand the logic of fine-tuning

4 min read
ChatDev: Communicative Agents for Software Development

3 min read
SqueezeLLM: Dense-and-Sparse Quantization

4 min read
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks

4 min read
LLark: A Multimodal Instruction-Following Language Model for Music

3 min read
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

5 min read
Evaluating Quantized Large Language Models

5 min read
Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning

4 min read
Virtual avatar generation models as world navigators

4 min read
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

3 min read
RAFT: Adapting Language Model to Domain Specific RAG

4 min read
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

3 min read
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

4 min read
Large Language Models for Generative Information Extraction: A Survey

5 min read
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism

4 min read
Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length

4 min read
Position: Categorical Deep Learning is an Algebraic Theory of All Architectures

1 min read
Are We Living in a Nightmare or a Paradise Together with Machines?

2 min read
How We Generated a 10K Dataset Using LLM to Fine-Tune Another LLM

4 min read
Machine Learning Developmental Life Cycle

1 min read
A step-by-step guide to building an MLOps pipeline

13 min read
Transform Your Video Transcripts: From Raw to Readable Text

6 min read
Three Levels of Scrapping Data: From Basic to Advanced to Pro

3 min read
ValueError: A given column is not a column of the dataframe

1 min read
Machine Learning Models: Linear Regression

8 min read
Heart Disease Prediction System

2 min read
Machine Learning : Things you don't know yet !!!

2 min read
How to Perform Semantic Search using ChromaDB in JavaScript

8 min read
Evaluation Metrics: Machine Learning Models 🤖🐍

3 min read
A question about LLP,NLP or whatever

1 min read
What are Neural Networks? Deep Learning Explained for Beginners

3 min read
Algorithmic Detection of TVEC Content

1 min read
Streamlining Image Annotation with Annotate-Lab

4 min read
Ace Your Exams: Automated Question Generation for the Diligent Student

10 min read
OCR with tesseract, python and pytesseract

4 min read
Nvidia's 1000x Performance Boost Claim Verified

2 min read
AI indoor navigation service

3 min read
Elementary Logic And Proof Techniques

7 min read
Machine Learning: Day 2

2 min read
Fundamentals Of Set Theory

2 min read
Recapping the AI, Machine Learning and Data Science Meetup - May 30, 2024

6 min read
On the Brittle Foundations of ReAct Prompting for Agentic Large Language Models

4 min read
An Introduction to Vision-Language Modeling

4 min read
YouTube Video Transcripts Using LangChain

2 min read
Context Injection Attacks on Large Language Models

4 min read