DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

4 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

4 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

3 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

3 min read
ColPali: Efficient Document Retrieval with Vision Language Models

4 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

4 min read
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

4 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

4 min read
PaliGemma: A versatile 3B VLM for transfer

4 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

4 min read
When LLMs Play the Telephone Game: Cumulative Changes and Attractors in Iterated Cultural Transmissions

4 min read
Volumetric Rendering with Baked Quadrature Fields

3 min read
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

4 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

3 min read
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

4 min read
Reasoning in Large Language Models: A Geometric Perspective

4 min read
Memory, Consciousness and Large Language Model

4 min read
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

4 min read
SpikeGPT: Generative Pre-trained Language Model with Spiking Neural Networks

4 min read
What's the Magic Word? A Control Theory of LLM Prompting

4 min read
Toto: Time Series Optimized Transformer for Observability

5 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

4 min read
FACTS About Building Retrieval Augmented Generation-based Chatbots

4 min read
Exploring the Latest LLMs for Leaderboard Extraction

4 min read
There Has To Be a Lot That We're Missing: Moderating AI-Generated Content on Reddit

4 min read