DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Defending LLMs against Jailbreaking Attacks via Backtranslation

Defending LLMs against Jailbreaking Attacks via Backtranslation

1
Comments
3 min read
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

2
Comments
3 min read
Documenting my pin collection with Segment Anything: Part 2

Documenting my pin collection with Segment Anything: Part 2

Comments
3 min read
Creativity Has Left the Chat: The Price of Debiasing Language Models

Creativity Has Left the Chat: The Price of Debiasing Language Models

1
Comments
3 min read
The Bayesian Learning Rule

The Bayesian Learning Rule

1
Comments
5 min read
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

1
Comments
4 min read
Thermodynamic Linear Algebra

Thermodynamic Linear Algebra

1
Comments
3 min read
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair

RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair

1
Comments
4 min read
Guardrail Baselines for Unlearning in LLMs

Guardrail Baselines for Unlearning in LLMs

Comments
4 min read
Magicoder: Empowering Code Generation with OSS-Instruct

Magicoder: Empowering Code Generation with OSS-Instruct

1
Comments
4 min read
Harnessing the Power of Generative AI for Practical Business Solutions

Harnessing the Power of Generative AI for Practical Business Solutions

1
Comments
3 min read
Advanced Time-Series & Python Libraries

Advanced Time-Series & Python Libraries

7
Comments
2 min read
Confusion Matrix: A Clear Guide to Understanding It

Confusion Matrix: A Clear Guide to Understanding It

Comments
5 min read
Best way to start learning Machine Learning

Best way to start learning Machine Learning

Comments
3 min read
Insights on the future of AI/ML field for BTech AI/ML Engineering graduates

Insights on the future of AI/ML field for BTech AI/ML Engineering graduates

Comments
7 min read
Use Hugging Face's ControlNet

Use Hugging Face's ControlNet

Comments
2 min read
Day 6 of Machine Learning||Supervised ML Algorithms

Day 6 of Machine Learning||Supervised ML Algorithms

8
Comments
2 min read
5 Developer Techniques to Enhance LLMs Performance!

5 Developer Techniques to Enhance LLMs Performance!

23
Comments
8 min read
Fix SHAP Multiclass Summary Plot - Downgrade to v0.44.1 from 0.45.0

Fix SHAP Multiclass Summary Plot - Downgrade to v0.44.1 from 0.45.0

1
Comments
2 min read
A single API for all your conversational generative AI applications

A single API for all your conversational generative AI applications

11
Comments
5 min read
CVPR Edition: Voxel51 Filtered Views Newsletter - June 21, 2024

CVPR Edition: Voxel51 Filtered Views Newsletter - June 21, 2024

1
Comments 1
5 min read
How I discovered Named Entity Recognition while trying to remove gibberish from a string.

How I discovered Named Entity Recognition while trying to remove gibberish from a string.

Comments
3 min read
Ethical AI: Balancing Innovation with Responsibility

Ethical AI: Balancing Innovation with Responsibility

Comments
2 min read
YOLOv10 on Custom Dataset

YOLOv10 on Custom Dataset

35
Comments
2 min read
Navigating the Testing Challenges in Machine Learning Systems

Navigating the Testing Challenges in Machine Learning Systems

1
Comments
3 min read
TinyLlama: An Open-Source Small Language Model

TinyLlama: An Open-Source Small Language Model

11
Comments 2
4 min read
Vision-LSTM: xLSTM as Generic Vision Backbone

Vision-LSTM: xLSTM as Generic Vision Backbone

2
Comments
4 min read
Bootstrap3D: Improving 3D Content Creation with Synthetic Data

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

1
Comments
4 min read
Common Pitfalls in Machine Learning Model Inference for Beginners and How to Solve Them

Common Pitfalls in Machine Learning Model Inference for Beginners and How to Solve Them

3
Comments
2 min read
Open-Endedness is Essential for Artificial Superhuman Intelligence

Open-Endedness is Essential for Artificial Superhuman Intelligence

Comments
4 min read
Improving Text Embeddings with Large Language Models

Improving Text Embeddings with Large Language Models

1
Comments
3 min read
Know Your Neighborhood: General and Zero-Shot Capable Binary Function Search Powered by Call Graphlets

Know Your Neighborhood: General and Zero-Shot Capable Binary Function Search Powered by Call Graphlets

Comments
4 min read
Scalable MatMul-free Language Modeling

Scalable MatMul-free Language Modeling

7
Comments
4 min read
QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks

QuIP#: Even Better LLM Quantization with Hadamard Incoherence and Lattice Codebooks

Comments
3 min read
Graph Convolutional Branch and Bound

Graph Convolutional Branch and Bound

Comments
3 min read
Ask LLMs Directly, What shapes your bias?: Measuring Social Bias in Large Language Models

Ask LLMs Directly, What shapes your bias?: Measuring Social Bias in Large Language Models

Comments
1 min read
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Comments
5 min read
CityDreamer: Compositional Generative Model of Unbounded 3D Cities

CityDreamer: Compositional Generative Model of Unbounded 3D Cities

Comments
3 min read
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

1
Comments
3 min read
Improving Alignment and Robustness with Short Circuiting

Improving Alignment and Robustness with Short Circuiting

Comments
4 min read
Scalable Detection of Salient Entities in News Articles

Scalable Detection of Salient Entities in News Articles

Comments
3 min read
Approximate Nearest Neighbor Search with Window Filters

Approximate Nearest Neighbor Search with Window Filters

Comments
3 min read
Statistics for Data Science and Machine Learning

Statistics for Data Science and Machine Learning

3
Comments
9 min read
Building your first machine learning model in Python

Building your first machine learning model in Python

2
Comments
6 min read
Understanding Generative AI and LLMs

Understanding Generative AI and LLMs

1
Comments
3 min read
What other skills does industry need in ML?

What other skills does industry need in ML?

Comments
1 min read
Search Engines 2.0: Powered by LLMs and Multilingual Voice Search

Search Engines 2.0: Powered by LLMs and Multilingual Voice Search

Comments
8 min read
Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents 31:03

Computer Vision Meetup: Lessons Learned fine-tuning Llama2 for Autonomous Agents

3
Comments
1 min read
China's new Sora rival is here

China's new Sora rival is here

Comments
1 min read
From sticks and levers to worlds and chasms

From sticks and levers to worlds and chasms

1
Comments
2 min read
WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning

WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning

1
Comments
3 min read
Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne 09:25

Computer Vision Meetup: Combining Hugging Face Transformer Models and Image Data with FiftyOne

1
Comments
1 min read
LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code

1
Comments
4 min read
ReGAL: Refactoring Programs to Discover Generalizable Abstractions

ReGAL: Refactoring Programs to Discover Generalizable Abstractions

Comments
4 min read
InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

InfoLossQA: Characterizing and Recovering Information Loss in Text Simplification

Comments
3 min read
Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Comments
4 min read
Knockout: A simple way to handle missing inputs

Knockout: A simple way to handle missing inputs

1
Comments
4 min read
Gated Linear Attention Transformers with Hardware-Efficient Training

Gated Linear Attention Transformers with Hardware-Efficient Training

Comments
4 min read
Deep Learning for Camera Calibration and Beyond: A Survey

Deep Learning for Camera Calibration and Beyond: A Survey

Comments
4 min read
LLMs cannot find reasoning errors, but can correct them given the error location

LLMs cannot find reasoning errors, but can correct them given the error location

Comments
5 min read
loading...