DEV Community

Artificial Intelligence

Artificial intelligence leverages computers and machines to mimic the problem-solving and decision-making capabilities found in humans and in nature.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Efficient LLM inference solution on Intel GPU

Efficient LLM inference solution on Intel GPU

2
Comments
4 min read
TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners

TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners

2
Comments
4 min read
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models

2
Comments 1
4 min read
Is the System Message Really Important to Jailbreaks in Large Language Models?

Is the System Message Really Important to Jailbreaks in Large Language Models?

1
Comments
4 min read
Transformers are Multi-State RNNs

Transformers are Multi-State RNNs

1
Comments
3 min read
Evaluating the Performance of ChatGPT for Spam Email Detection

Evaluating the Performance of ChatGPT for Spam Email Detection

1
Comments
3 min read
Exploitation Business: Leveraging Information Asymmetry

Exploitation Business: Leveraging Information Asymmetry

Comments
3 min read
Large Language Models for Data Annotation: A Survey

Large Language Models for Data Annotation: A Survey

Comments
4 min read
4090 - ECC ON vs ECC OFF

4090 - ECC ON vs ECC OFF

9
Comments
1 min read
Building Your Own SpicyChat AI: A Developer's Guide

Building Your Own SpicyChat AI: A Developer's Guide

20
Comments
7 min read
The Impact of Reasoning Step Length on Large Language Models

The Impact of Reasoning Step Length on Large Language Models

2
Comments
4 min read
LMDX: Language Model-based Document Information Extraction and Localization

LMDX: Language Model-based Document Information Extraction and Localization

2
Comments
4 min read
Transcendence: Generative Models Can Outperform The Experts That Train Them

Transcendence: Generative Models Can Outperform The Experts That Train Them

Comments
4 min read
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

5
Comments
4 min read
How Susceptible are Large Language Models to Ideological Manipulation?

How Susceptible are Large Language Models to Ideological Manipulation?

Comments
3 min read
An Interactive Agent Foundation Model

An Interactive Agent Foundation Model

Comments
3 min read
Foundation Models for Time Series Analysis: A Tutorial and Survey

Foundation Models for Time Series Analysis: A Tutorial and Survey

2
Comments
4 min read
StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

Comments
4 min read
Transparent Image Layer Diffusion using Latent Transparency

Transparent Image Layer Diffusion using Latent Transparency

Comments
3 min read
Are LLMs Naturally Good at Synthetic Tabular Data Generation?

Are LLMs Naturally Good at Synthetic Tabular Data Generation?

1
Comments
4 min read
An Image is Worth 32 Tokens for Reconstruction and Generation

An Image is Worth 32 Tokens for Reconstruction and Generation

Comments
4 min read
VideoPrism: A Foundational Visual Encoder for Video Understanding

VideoPrism: A Foundational Visual Encoder for Video Understanding

Comments 1
4 min read
Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

Comments
4 min read
Refusal in Language Models Is Mediated by a Single Direction

Refusal in Language Models Is Mediated by a Single Direction

1
Comments
3 min read
Chain-of-Thought Unfaithfulness as Disguised Accuracy

Chain-of-Thought Unfaithfulness as Disguised Accuracy

1
Comments
4 min read
Large language models surpass human experts in predicting neuroscience results

Large language models surpass human experts in predicting neuroscience results

1
Comments
4 min read
Jellyfish: A Large Language Model for Data Preprocessing

Jellyfish: A Large Language Model for Data Preprocessing

2
Comments
4 min read
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Comments
4 min read
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference

Comments
4 min read
Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Neural Thermodynamic Integration: Free Energies from Energy-based Diffusion Models

Comments
4 min read
Large Language Models Are Zero-Shot Time Series Forecasters

Large Language Models Are Zero-Shot Time Series Forecasters

2
Comments
4 min read
State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

State-Compute Replication: Parallelizing High-Speed Stateful Packet Processing

Comments
4 min read
Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Social Norms in Cinema: A Cross-Cultural Analysis of Shame, Pride and Prejudice

Comments
4 min read
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

Comments
3 min read
LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

LLAMAFUZZ: Large Language Model Enhanced Greybox Fuzzing

1
Comments
4 min read
Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Comments
4 min read
garak: A Framework for Security Probing Large Language Models

garak: A Framework for Security Probing Large Language Models

1
Comments
4 min read
A Survey on In-context Learning

A Survey on In-context Learning

1
Comments
4 min read
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Comments
3 min read
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Comments
4 min read
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Comments
4 min read
A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

A Survey of Large Language Models for Financial Applications: Progress, Prospects and Challenges

Comments
6 min read
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation

Comments
4 min read
Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Comments
3 min read
Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Optimized Feature Generation for Tabular Data via LLMs with Decision Tree Reasoning

Comments
4 min read
How Do Humans Write Code? Large Models Do It the Same Way Too

How Do Humans Write Code? Large Models Do It the Same Way Too

Comments
4 min read
Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

Comments
5 min read
Depth Anything V2

Depth Anything V2

1
Comments
4 min read
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning

Comments
4 min read
A Survey on Large Language Models for Recommendation

A Survey on Large Language Models for Recommendation

Comments
4 min read
DataComp-LM: In search of the next generation of training sets for language models

DataComp-LM: In search of the next generation of training sets for language models

Comments
3 min read
Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Monitoring AI-Modified Content at Scale: A Case Study on the Impact of ChatGPT on AI Conference Peer Reviews

Comments
4 min read
DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

Comments
4 min read
Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Scrolly2Reel: Retargeting Graphics for Social Media Using Narrative Beats

Comments
3 min read
CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Comments
3 min read
Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Directly Fine-Tuning Diffusion Models on Differentiable Rewards

Comments
4 min read
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Comments
4 min read
Xpenser - AI Expense Manger using Twilio & Gemini AI with WhatsApp Integration

Xpenser - AI Expense Manger using Twilio & Gemini AI with WhatsApp Integration

21
Comments 2
4 min read
Personalization at Scale: How AI Enhances Customer Engagement

Personalization at Scale: How AI Enhances Customer Engagement

2
Comments
2 min read
Introduction TO Word Embeddings

Introduction TO Word Embeddings

Comments
3 min read
loading...