DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

4 min read
Gated Linear Attention Transformers with Hardware-Efficient Training

4 min read
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

3 min read
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

5 min read
To Believe or Not to Believe Your LLM

4 min read
LLMs cannot find reasoning errors, but can correct them given the error location

5 min read
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

4 min read
Deep Learning for Camera Calibration and Beyond: A Survey

4 min read
The Geometry of Categorical and Hierarchical Concepts in Large Language Models

4 min read
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

3 min read
Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

5 min read
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?

4 min read
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

4 min read
REBUS: A Robust Evaluation Benchmark of Understanding Symbols

4 min read
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings

4 min read