DEV Community

Artificial Intelligence

Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities found in humans and in nature.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
Recommender Systems in the Era of Large Language Models (LLMs)

Recommender Systems in the Era of Large Language Models (LLMs)

1
Comments
4 min read
Manipulating Large Language Models to Increase Product Visibility

Manipulating Large Language Models to Increase Product Visibility

Comments
4 min read
Dataset Reset Policy Optimization for RLHF

Dataset Reset Policy Optimization for RLHF

Comments
4 min read
Large Language Models as Optimizers

Large Language Models as Optimizers

1
Comments
4 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
TransformerFAM: Feedback attention is working memory

TransformerFAM: Feedback attention is working memory

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
Recipe Generator

Recipe Generator

31
Comments 7
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.