DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
S-LoRA: Serving Thousands of Concurrent LoRA Adapters

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Comments
4 min read
The Geometry of Categorical and Hierarchical Concepts in Large Language Models

The Geometry of Categorical and Hierarchical Concepts in Large Language Models

1
Comments
4 min read
Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

Comments
4 min read
Will we run out of data? Limits of LLM scaling based on human-generated data

Will we run out of data? Limits of LLM scaling based on human-generated data

1
Comments
4 min read
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

1
Comments
4 min read
REBUS: A Robust Evaluation Benchmark of Understanding Symbols

REBUS: A Robust Evaluation Benchmark of Understanding Symbols

1
Comments
4 min read
CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

CompanyKG: A Large-Scale Heterogeneous Graph for Company Similarity Quantification

Comments
3 min read
To Believe or Not to Believe Your LLM

To Believe or Not to Believe Your LLM

1
Comments
4 min read
Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings

Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings

Comments
4 min read
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

Comments
5 min read
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks

Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks

Comments
4 min read
Evaluating Quantized Large Language Models

Evaluating Quantized Large Language Models

Comments
5 min read
Do Llamas Work in English? On the Latent Language of Multilingual Transformers

Do Llamas Work in English? On the Latent Language of Multilingual Transformers

Comments
4 min read
LLark: A Multimodal Instruction-Following Language Model for Music

LLark: A Multimodal Instruction-Following Language Model for Music

Comments
3 min read
Empirical influence functions to understand the logic of fine-tuning

Empirical influence functions to understand the logic of fine-tuning

Comments
4 min read
ChatDev: Communicative Agents for Software Development

ChatDev: Communicative Agents for Software Development

Comments
3 min read
SqueezeLLM: Dense-and-Sparse Quantization

SqueezeLLM: Dense-and-Sparse Quantization

1
Comments
4 min read
RAFT: Adapting Language Model to Domain Specific RAG

RAFT: Adapting Language Model to Domain Specific RAG

2
Comments 1
4 min read
Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning

Comparing Inferential Strategies of Humans and Large Language Models in Deductive Reasoning

Comments
4 min read
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs

Comments
3 min read
Virtual avatar generation models as world navigators

Virtual avatar generation models as world navigators

Comments
4 min read
SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales

1
Comments
4 min read
Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Comments
3 min read
Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length

Bridging the Empirical-Theoretical Gap in Neural Network Formal Language Learning Using Minimum Description Length

Comments
4 min read
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism

A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism

Comments
4 min read
loading...