DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

4 min read
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

4 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

3 min read
Mixture of A Million Experts

3 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

4 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

3 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

4 min read
Which algorithm to select in sports timetabling?

4 min read
Reasoning in Large Language Models: A Geometric Perspective

5 min read
Volumetric Rendering with Baked Quadrature Fields

3 min read
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

4 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

4 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

4 min read
FACTS About Building Retrieval Augmented Generation-based Chatbots

4 min read
Testing AI on language comprehension tasks reveals insensitivity to underlying meaning

4 min read