DEV Community

Beginners

"A journey of a thousand miles begins with a single step." -Chinese Proverb

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

Comments
4 min read
Transcendence: Generative Models Can Outperform The Experts That Train Them

Transcendence: Generative Models Can Outperform The Experts That Train Them

Comments
4 min read
Interpreting Benchmarks and Evaluations in LLMs

Interpreting Benchmarks and Evaluations in LLMs

Comments
2 min read
Efficient LLM inference solution on Intel GPU

Efficient LLM inference solution on Intel GPU

2
Comments
4 min read
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

5
Comments
5 min read
Chain-of-Thought Unfaithfulness as Disguised Accuracy

Chain-of-Thought Unfaithfulness as Disguised Accuracy

1
Comments
4 min read
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Comments
4 min read
How Susceptible are Large Language Models to Ideological Manipulation?

How Susceptible are Large Language Models to Ideological Manipulation?

Comments
3 min read
Large language models surpass human experts in predicting neuroscience results

Large language models surpass human experts in predicting neuroscience results

1
Comments
4 min read
Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Where there's a will there's a way: ChatGPT is used more for science in countries where it is prohibited

Comments
3 min read
In Excel, Search A Target Value And Hide Columns To Its Right

In Excel, Search A Target Value And Hide Columns To Its Right

7
Comments 2
1 min read
Evaluating the Performance of ChatGPT for Spam Email Detection

Evaluating the Performance of ChatGPT for Spam Email Detection

1
Comments
3 min read
TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners

TRIP-PAL: Travel Planning with Guarantees by Combining Large Language Models and Automated Planners

2
Comments
4 min read
An Image is Worth 32 Tokens for Reconstruction and Generation

An Image is Worth 32 Tokens for Reconstruction and Generation

Comments
4 min read
Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Should AI Optimize Your Code? A Comparative Study of Current Large Language Models Versus Classical Optimizing Compilers

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.