DEV Community

Beginners

"A journey of a thousand miles begins with a single step." -Chinese Proverb

Posts

The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
Vision Transformers Need Registers

5
Comments
4 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

5
Comments
3 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
Show Your Work with Confidence: Confidence Bands for Tuning Curves

6
Comments
4 min read
Why Isn't AI As Smart As We Always Expect?

8
Comments 15
2 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read
The Expressive Power of Transformers with Chain of Thought

5
Comments
5 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

6
Comments
4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
Unleash Your Coding Potential: Top 5 Websites to Kickstart Your Learning

Comments
2 min read
Understanding SSR and SSG in Modern Web Development

6
Comments
5 min read
Litestyle CSS: A Lightweight Alternative to Full-Fledged Frameworks

7
Comments
2 min read