DEV Community

Artificial Intelligence

Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities found in humans and in nature.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Vulnerability Detection with Code Language Models: How Far Are We?

Vulnerability Detection with Code Language Models: How Far Are We?

Comments
5 min read
Simulacra as Conscious Exotica

Simulacra as Conscious Exotica

Comments
4 min read
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models

1
Comments
5 min read
Why I made TabbyAPI

Why I made TabbyAPI

1
Comments
7 min read
SmartChoices: Augmenting Software with Learned Implementations

SmartChoices: Augmenting Software with Learned Implementations

Comments
4 min read
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Comments
4 min read
Summarize articles with Cloudflare Workers AI LoRAs

Summarize articles with Cloudflare Workers AI LoRAs

6
Comments
6 min read
PaliGemma: A versatile 3B VLM for transfer

PaliGemma: A versatile 3B VLM for transfer

Comments
4 min read
A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task

Comments
4 min read
Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

Mooncake: Kimi's KVCache-centric Architecture for LLM Serving

1
Comments 1
4 min read
X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

Comments
4 min read
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

Comments
4 min read
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Comments
3 min read
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

1
Comments
3 min read
Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Abide by the Law and Follow the Flow: Conservation Laws for Gradient Flows

Comments
5 min read
LoRA+: Efficient Low Rank Adaptation of Large Models

LoRA+: Efficient Low Rank Adaptation of Large Models

Comments
3 min read
Mixture of A Million Experts

Mixture of A Million Experts

2
Comments
3 min read
Which algorithm to select in sports timetabling?

Which algorithm to select in sports timetabling?

Comments
4 min read
Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Achieving Energetic Superiority Through System-Level Quantum Circuit Simulation

Comments
4 min read
Volumetric Rendering with Baked Quadrature Fields

Volumetric Rendering with Baked Quadrature Fields

Comments
3 min read
How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions

Comments
4 min read
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Learning to (Learn at Test Time): RNNs with Expressive Hidden States

2
Comments
3 min read
ColPali: Efficient Document Retrieval with Vision Language Models

ColPali: Efficient Document Retrieval with Vision Language Models

3
Comments
4 min read
Reasoning in Large Language Models: A Geometric Perspective

Reasoning in Large Language Models: A Geometric Perspective

Comments
5 min read
When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Comments
4 min read
loading...