DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
SqueezeSAM: User friendly mobile interactive segmentation

SqueezeSAM: User friendly mobile interactive segmentation

Comments
3 min read
Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Special Characters Attack: Toward Scalable Training Data Extraction From Large Language Models

Comments
4 min read
Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis

Distinguishing Tor From Other Encrypted Network Traffic Through Character Analysis

Comments
3 min read
Chameleon: Mixed-Modal Early-Fusion Foundation Models

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Comments
4 min read
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models

Comments
4 min read
A Spectral Condition for Feature Learning

A Spectral Condition for Feature Learning

Comments
4 min read
VILA: On Pre-training for Visual Language Models

VILA: On Pre-training for Visual Language Models

Comments
4 min read
Observational Scaling Laws and the Predictability of Language Model Performance

Observational Scaling Laws and the Predictability of Language Model Performance

1
Comments
4 min read
Thinking Tokens for Language Modeling

Thinking Tokens for Language Modeling

Comments
3 min read
On the Security Vulnerabilities of Text-to-SQL Models

On the Security Vulnerabilities of Text-to-SQL Models

Comments
3 min read
Multimodal Chain-of-Thought Reasoning in Language Models

Multimodal Chain-of-Thought Reasoning in Language Models

Comments
4 min read
An Analysis of Quantile Temporal-Difference Learning

An Analysis of Quantile Temporal-Difference Learning

1
Comments
4 min read
GPT-4 passes most of the 297 written Polish Board Certification Examinations

GPT-4 passes most of the 297 written Polish Board Certification Examinations

Comments
3 min read
Zero-Shot Tokenizer Transfer

Zero-Shot Tokenizer Transfer

Comments
4 min read
Hydragen: High-Throughput LLM Inference with Shared Prefixes

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Comments
4 min read
Training-Free Consistent Text-to-Image Generation

Training-Free Consistent Text-to-Image Generation

Comments
4 min read
Can Large Language Models Write Parallel Code?

Can Large Language Models Write Parallel Code?

Comments
3 min read
Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Comments
4 min read
FOLIO: Natural Language Reasoning with First-Order Logic

FOLIO: Natural Language Reasoning with First-Order Logic

Comments
4 min read
Biomedical knowledge graph-optimized prompt generation for large language models

Biomedical knowledge graph-optimized prompt generation for large language models

Comments
3 min read
Layer-Condensed KV Cache for Efficient Inference of Large Language Models

Layer-Condensed KV Cache for Efficient Inference of Large Language Models

Comments
5 min read
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

1
Comments
4 min read
Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

Arctic-Embed: Scalable, Efficient, and Accurate Text Embedding Models

Comments
5 min read
Simultaneous Many-Row Activation in Off-the-Shelf DRAM Chips: Experimental Characterization and Analysis

Simultaneous Many-Row Activation in Off-the-Shelf DRAM Chips: Experimental Characterization and Analysis

Comments
4 min read
The Platonic Representation Hypothesis

The Platonic Representation Hypothesis

Comments
3 min read
loading...