DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Giskard: LLM-Assisted Automated Red Teaming

1
Comments
7 min read
Python antivirus or security operation solution and potentially machine learning.

1
Comments
1 min read
Unlock Efficiency with ID Document Recognition: 8 Hassle-Free Validation Techniques

23
Comments
1 min read
AutoCodeRover: Autonomous Program Improvement

1
Comments
3 min read
Learn to Speak English

Comments 1
2 min read
The Illusion of State in State-Space Models

Comments
4 min read
Zero-shot Building Age Classification from Facade Image Using GPT-4

Comments
4 min read
Google Play Biometrics Verification Method: Should You Turn It On?

1
Comments
2 min read
AI enthusiasm #5 - Calculate your carbon footprint🌱

1
Comments
3 min read
DEVin | The Scary Future For Developers

1
Comments
4 min read
Data preprocessing for extractive QA

Comments
1 min read
LLM Security: Using Automated Tools for Vulnerability Scans

1
Comments
8 min read
Tools Every Data Scientist Should Know

Comments
2 min read
Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck

1
Comments
4 min read
Large Language Models as Optimizers

1
Comments
4 min read
Recommender Systems in the Era of Large Language Models (LLMs)

1
Comments
4 min read
Manipulating Large Language Models to Increase Product Visibility

Comments
3 min read
H2O-Danube-1.8B Technical Report

Comments
4 min read
Dataset Reset Policy Optimization for RLHF

Comments
4 min read
Tiny Titans: Can Smaller Large Language Models Punch Above Their Weight in the Real World for Meeting Summarization?

Comments
4 min read
CHOPS: CHat with custOmer Profile Systems for Customer Service with LLMs

Comments
3 min read
The Curse of Recursion: Training on Generated Data Makes Models Forget

Comments
4 min read
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Comments
4 min read
TransformerFAM: Feedback attention is working memory

Comments
4 min read
BooookScore: A systematic exploration of book-length summarization in the era of LLMs

Comments
4 min read
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Comments
4 min read
Beginner's Guide to Math's for Machine Learning

2
Comments 1
2 min read
Tied-Lora: Enhancing parameter efficiency of LoRA with weight tying

Comments
4 min read
SCott: Accelerating Diffusion Models with Stochastic Consistency Distillation

Comments
4 min read
Insights from the Leading EDJE Round Table Discussion on AI: Bridging the Gap Between Virtual and Physical Spaces

2
Comments
3 min read
Assumption of Homoscedasticity : A Guide to verifying the Assumption of Constant Variance of Residuals

1
Comments
3 min read
Meeting Minutes Made Easy: Summarize Key Points and Send Emails using Lyzr-Automata

Comments
3 min read
Zero-Shot Prediction Plugin for FiftyOne

Comments
8 min read
Future trends and emerging technologies in AI

Comments
16 min read
Can Facial Recognition Be Trusted for Immigration Control?

1
Comments
1 min read
Cognita: An Open-Source Framework for Enhanced RAG Applications

2
Comments
3 min read
RAG Redefined : Ready-to-Deploy RAG for Organizations at Scale.

1
Comments 2
1 min read
CodeLlama: The Next-Gen Coding Assistant

8
Comments 10
8 min read
Day 1 of 30 : Machine Learning

11
Comments 6
2 min read
AI is Not Going to Steal Your Keyboard (Unless You Let Them Write the Code)

1
Comments
2 min read
Get Hired Faster: How to use Lyzr-Automata to draft personalised cold emails

1
Comments
4 min read
Vision Transformers Need Registers

5
Comments
4 min read
Algorithmic Collective Action in Recommender Systems: Promoting Songs by Reordering Playlists

6
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
The Expressive Power of Transformers with Chain of Thought

5
Comments
4 min read
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
The Impact of Depth on Compositional Generalization in Transformer Language Models

5
Comments
4 min read
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

5
Comments
3 min read
ChatGPT Can Predict the Future when it Tells Stories Set in the Future About the Past

5
Comments
4 min read
Chapter: Vulnerability of Quantum Information Systems to Collective Manipulation

5
Comments
4 min read
GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications

6
Comments
4 min read
Show Your Work with Confidence: Confidence Bands for Tuning Curves

6
Comments
4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read
Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

5
Comments
3 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
Exploring LLM RAG Application Vulnerabilities

1
Comments
11 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read