DEV Community

Beginners

"A journey of a thousand miles begins with a single step." -Chinese Proverb

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

Comments
5 min read
The Road Less Scheduled

The Road Less Scheduled

Comments
4 min read
AnyLoss: Transforming Classification Metrics into Loss Functions

AnyLoss: Transforming Classification Metrics into Loss Functions

Comments
4 min read
NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

NV-Embed: Improved Techniques for Training LLMs as Generalist Embedding Models

Comments
3 min read
MoEUT: Mixture-of-Experts Universal Transformers

MoEUT: Mixture-of-Experts Universal Transformers

Comments
3 min read
NPGA: Neural Parametric Gaussian Avatars

NPGA: Neural Parametric Gaussian Avatars

Comments
3 min read
Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Human vs. Machine: Behavioral Differences Between Expert Humans and Language Models in Wargame Simulations

Comments
4 min read
The rising costs of training frontier AI models

The rising costs of training frontier AI models

Comments
5 min read
Look Once to Hear: Target Speech Hearing with Noisy Examples

Look Once to Hear: Target Speech Hearing with Noisy Examples

Comments
5 min read
Diffusion On Syntax Trees For Program Synthesis

Diffusion On Syntax Trees For Program Synthesis

2
Comments
4 min read
Neural Network Parameter Diffusion

Neural Network Parameter Diffusion

Comments
4 min read
Towards Lightweight Super-Resolution with Dual Regression Learning

Towards Lightweight Super-Resolution with Dual Regression Learning

Comments
3 min read
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Comments
4 min read
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities

Comments
4 min read
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities

Comments
4 min read
Formalizing and Benchmarking Prompt Injection Attacks and Defenses

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

1
Comments
4 min read
PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion

PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion

Comments
3 min read
Learning to Model the World with Language

Learning to Model the World with Language

Comments
4 min read
Is In-Context Learning Sufficient for Instruction Following in LLMs?

Is In-Context Learning Sufficient for Instruction Following in LLMs?

Comments
4 min read
Text clustering with LLM embeddings

Text clustering with LLM embeddings

1
Comments
4 min read
There and Back Again: The AI Alignment Paradox

There and Back Again: The AI Alignment Paradox

Comments
4 min read
ToonCrafter: Generative Cartoon Interpolation

ToonCrafter: Generative Cartoon Interpolation

3
Comments
4 min read
Assessing Large Language Models on Climate Information

Assessing Large Language Models on Climate Information

Comments
3 min read
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

1
Comments
4 min read
LLMs achieve adult human performance on higher-order theory of mind tasks

LLMs achieve adult human performance on higher-order theory of mind tasks

Comments
4 min read
Large Language Models Can Self-Improve At Web Agent Tasks

Large Language Models Can Self-Improve At Web Agent Tasks

Comments
3 min read
Privacy-Aware Visual Language Models

Privacy-Aware Visual Language Models

Comments
4 min read
Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Comments
4 min read
LLaMA Pro: Progressive LLaMA with Block Expansion

LLaMA Pro: Progressive LLaMA with Block Expansion

Comments
5 min read
Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Comments
5 min read
Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Comments
4 min read
Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust

Evaluating AI-generated code for C++, Fortran, Go, Java, Julia, Matlab, Python, R, and Rust

1
Comments
4 min read
Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models

Comments
4 min read
gzip Predicts Data-dependent Scaling Laws

gzip Predicts Data-dependent Scaling Laws

Comments
4 min read
Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

Generate and Pray: Using SALLMS to Evaluate the Security of LLM Generated Code

Comments
4 min read
You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

You Need to Pay Better Attention: Rethinking the Mathematics of Attention Mechanism

Comments
4 min read
Arrows of Time for Large Language Models

Arrows of Time for Large Language Models

Comments
5 min read
Simplifying Transformer Blocks

Simplifying Transformer Blocks

Comments
3 min read
Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

3
Comments 1
1 min read
Single Page Applications(SPA) Vs Multi Page Applications(MPA)

Single Page Applications(SPA) Vs Multi Page Applications(MPA)

2
Comments
3 min read
Unleashing the Power đź’Ş of JavaScript: Embrace "use strict" đźš«

Unleashing the Power đź’Ş of JavaScript: Embrace "use strict" đźš«

5
Comments
3 min read
Ibuprofeno.pyđź’Š| #118: Explica este cĂłdigo Python

Ibuprofeno.pyđź’Š| #118: Explica este cĂłdigo Python

3
Comments
1 min read
🚀 Unveiling JavaScript AsyncFunction and AsyncFunction() Constructor: A Deep Dive

🚀 Unveiling JavaScript AsyncFunction and AsyncFunction() Constructor: A Deep Dive

17
Comments
3 min read
The Rise of Web3: Transforming the Digital Landscape

The Rise of Web3: Transforming the Digital Landscape

6
Comments 1
2 min read
Next-auth App Router Credentials - An Annotated Guide

Next-auth App Router Credentials - An Annotated Guide

Comments
5 min read
Speed Up Your Site with 3 Simple JavaScript Performance Optimization Tips

Speed Up Your Site with 3 Simple JavaScript Performance Optimization Tips

106
Comments 8
3 min read
Building a CRUD Application with Node.js, Express, and MongoDB

Building a CRUD Application with Node.js, Express, and MongoDB

3
Comments
3 min read
Embarking on My Coding Journey

Embarking on My Coding Journey

1
Comments 1
2 min read
Generics in Rust: murky waters of implementing foreign traits on foreign types

Generics in Rust: murky waters of implementing foreign traits on foreign types

3
Comments
4 min read
Intro to React Native

Intro to React Native

Comments
5 min read
Code Smell 253 - Silent Truncation

Code Smell 253 - Silent Truncation

2
Comments
4 min read
Crafting Better Software

Crafting Better Software

1
Comments
3 min read
What Lies Ahead for Flutter: Advancements, Innovations, & Beyond

What Lies Ahead for Flutter: Advancements, Innovations, & Beyond

2
Comments
6 min read
How to Manage Hierarchical Data in MongoDB With GraphLookup?

How to Manage Hierarchical Data in MongoDB With GraphLookup?

2
Comments
1 min read
Issue Report: Dialogs Dismissed Prematurely with ensureSemantics

Issue Report: Dialogs Dismissed Prematurely with ensureSemantics

1
Comments
4 min read
What's with the Weird Elixir Function Names

What's with the Weird Elixir Function Names

2
Comments
1 min read
Entendendo e Utilizando Tipos Condicionais (TypeScript)

Entendendo e Utilizando Tipos Condicionais (TypeScript)

1
Comments
4 min read
Basic types in Elixir

Basic types in Elixir

10
Comments 1
6 min read
Memahami CQRS (Command Query Responsibility Segregation) Kenapa dan Bagaimana Menggunakannya

Memahami CQRS (Command Query Responsibility Segregation) Kenapa dan Bagaimana Menggunakannya

2
Comments 2
3 min read
Simplifying Authentication with JWT, TypeScript and Fastify

Simplifying Authentication with JWT, TypeScript and Fastify

Comments
3 min read
loading...