DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

Rotational Equilibrium: How Weight Decay Balances Learning Across Neural Networks

Comments
5 min read
Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Metaheuristics and Large Language Models Join Forces: Towards an Integrated Optimization Approach

Comments
5 min read
Easy Problems That LLMs Get Wrong

Easy Problems That LLMs Get Wrong

6
Comments
4 min read
Grokfast: Accelerated Grokking by Amplifying Slow Gradients

Grokfast: Accelerated Grokking by Amplifying Slow Gradients

1
Comments
4 min read
There and Back Again: The AI Alignment Paradox

There and Back Again: The AI Alignment Paradox

Comments
4 min read
BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

BadLlama: cheaply removing safety fine-tuning from Llama 2-Chat 13B

Comments
4 min read
Text clustering with LLM embeddings

Text clustering with LLM embeddings

24
Comments
4 min read
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Comments
4 min read
Executable Code Actions Elicit Better LLM Agents

Executable Code Actions Elicit Better LLM Agents

Comments
4 min read
Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

Compressed-Language Models for Understanding Compressed File Formats: a JPEG Exploration

Comments
5 min read
Large Language Models Can Self-Improve At Web Agent Tasks

Large Language Models Can Self-Improve At Web Agent Tasks

Comments
4 min read
Training-Free Long-Context Scaling of Large Language Models

Training-Free Long-Context Scaling of Large Language Models

Comments
4 min read
Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Sparse maximal update parameterization: A holistic approach to sparse training dynamics

Comments
5 min read
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

1
Comments
4 min read
LLaMA Pro: Progressive LLaMA with Block Expansion

LLaMA Pro: Progressive LLaMA with Block Expansion

Comments
5 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.