DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

5
Comments
3 min read
Rho-1: Not All Tokens Are What You Need

Rho-1: Not All Tokens Are What You Need

5
Comments
4 min read
Generalization in diffusion models arises from geometry-adaptive harmonic representations

Generalization in diffusion models arises from geometry-adaptive harmonic representations

5
Comments
4 min read
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

5
Comments
4 min read
Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

5
Comments
4 min read
JetMoE: Reaching Llama2 Performance with 0.1M Dollars

JetMoE: Reaching Llama2 Performance with 0.1M Dollars

4
Comments
4 min read
CodecLM: Aligning Language Models with Tailored Synthetic Data

CodecLM: Aligning Language Models with Tailored Synthetic Data

6
Comments
4 min read
Exploring LLM RAG Application Vulnerabilities

Exploring LLM RAG Application Vulnerabilities

1
Comments
11 min read
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

5
Comments
4 min read
Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Announcing FiftyOne 0.23.7 and FiftyOne Teams 1.5.8

Comments
6 min read
Getting Started with Gemma Models

Getting Started with Gemma Models

21
Comments
5 min read
The death of creativity

The death of creativity

47
Comments 15
6 min read
Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

Validating Linear Regression Assumptions: A Comprehensive Approach to Multivariate Normality

1
Comments
4 min read
CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

CVPR 2024 Survival Guide: Five Vision-Language Papers You Don’t Want to Miss

1
Comments
9 min read
Exploring the Top AI Blogs of 2024: Illuminating Insights into Artificial Intelligence

Exploring the Top AI Blogs of 2024: Illuminating Insights into Artificial Intelligence

Comments 1
3 min read
Developer’s Guide : Modular, Flexible, Scalable Prod ready RAG

Developer’s Guide : Modular, Flexible, Scalable Prod ready RAG

Comments 1
2 min read
Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

Exploring Multicollinearity: Strategies for Detecting and Managing Correlated Predictors in Regression Analysis

4
Comments
3 min read
การแสดง Multiple Linear Regression เป็นกราฟ 3 มิติ โดยใช้ Python

การแสดง Multiple Linear Regression เป็นกราฟ 3 มิติ โดยใช้ Python

Comments
2 min read
survey: Risk Prediction of Digital Transformation of Manufacturing Supply Chain Based on PCA and BPNN

survey: Risk Prediction of Digital Transformation of Manufacturing Supply Chain Based on PCA and BPNN

Comments
1 min read
Unravelling Linearity: My Journey in Regression Modeling

Unravelling Linearity: My Journey in Regression Modeling

1
Comments
3 min read
Safeguarding AI with Llama Guard: Ethical AI Development

Safeguarding AI with Llama Guard: Ethical AI Development

Comments
8 min read
Migrating from AWS SageMaker to GCP Vertex AI: A Training Environment Transition

Migrating from AWS SageMaker to GCP Vertex AI: A Training Environment Transition

5
Comments 2
4 min read
DanceTime is back!

DanceTime is back!

Comments
6 min read
Independence of Errors: A Guide to Validating Linear Regression Assumptions

Independence of Errors: A Guide to Validating Linear Regression Assumptions

6
Comments
3 min read
LLMs are secretly good at regression calculations

LLMs are secretly good at regression calculations

4
Comments
9 min read
Lambda Function(Python)

Lambda Function(Python)

3
Comments
1 min read
Let's make money with Amazon Mechanical Turk!

Let's make money with Amazon Mechanical Turk!

1
Comments
2 min read
The Optimal Choice of Hypothesis Is the Weakest, Not the Shortest

The Optimal Choice of Hypothesis Is the Weakest, Not the Shortest

Comments
4 min read
Increased LLM Vulnerabilities from Fine-tuning and Quantization

Increased LLM Vulnerabilities from Fine-tuning and Quantization

Comments
4 min read
Impact of Extensions on Browser Performance: An Empirical Study on Google Chrome

Impact of Extensions on Browser Performance: An Empirical Study on Google Chrome

Comments
3 min read
The Topos of Transformer Networks

The Topos of Transformer Networks

Comments
4 min read
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples

Comments
4 min read
From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

From Model-centered to Human-Centered: Revision Distance as a Metric for Text Evaluation in LLMs-based Applications

Comments
4 min read
Accelerating Recommender Model Training by Dynamically Skipping Stale Embeddings

Accelerating Recommender Model Training by Dynamically Skipping Stale Embeddings

Comments
4 min read
Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

Comments
4 min read
The Use of Generative Search Engines for Knowledge Work and Complex Tasks

The Use of Generative Search Engines for Knowledge Work and Complex Tasks

Comments
3 min read
94% on CIFAR-10 in 3.29 Seconds on a Single GPU

94% on CIFAR-10 in 3.29 Seconds on a Single GPU

Comments
3 min read
Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Comments
4 min read
New EC2 Instance Type - Designed For AI / ML Workloads

New EC2 Instance Type - Designed For AI / ML Workloads

Comments
1 min read
FiftyOne Computer Vision Tips and Tricks - April 12, 2024

FiftyOne Computer Vision Tips and Tricks - April 12, 2024

1
Comments
4 min read
Optical Character Recognition with PyTesseract

Optical Character Recognition with PyTesseract

Comments
9 min read
PIGEON: Predicting Image Geolocations

PIGEON: Predicting Image Geolocations

11
Comments
4 min read
Unleashing the basics of Exploratory Data Analysis||EDA

Unleashing the basics of Exploratory Data Analysis||EDA

Comments
4 min read
Efficient Quantum Circuit Design with a Standard Cell Approach, with an Application to Neutral Atom Quantum Computers

Efficient Quantum Circuit Design with a Standard Cell Approach, with an Application to Neutral Atom Quantum Computers

5
Comments
4 min read
Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing

Impossible Distillation: from Low-Quality Model to High-Quality Dataset & Model for Summarization and Paraphrasing

5
Comments
3 min read
ShapeFusion: A 3D diffusion model for localized shape editing

ShapeFusion: A 3D diffusion model for localized shape editing

5
Comments
6 min read
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

5
Comments
4 min read
Characterization of Large Language Model Development in the Datacenter

Characterization of Large Language Model Development in the Datacenter

5
Comments
4 min read
SonicVisionLM: Playing Sound with Vision Language Models

SonicVisionLM: Playing Sound with Vision Language Models

5
Comments
4 min read
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

5
Comments
4 min read
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

5
Comments
4 min read
AIJack: Let's Hijack AI! Security and Privacy Risk Simulator for Machine Learning

AIJack: Let's Hijack AI! Security and Privacy Risk Simulator for Machine Learning

5
Comments
4 min read
Advancing LLM Reasoning Generalists with Preference Trees

Advancing LLM Reasoning Generalists with Preference Trees

5
Comments
4 min read
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

5
Comments
3 min read
PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

PersonaLLM: Investigating the Ability of Large Language Models to Express Personality Traits

5
Comments
5 min read
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

5
Comments
4 min read
GenN2N: Generative NeRF2NeRF Translation

GenN2N: Generative NeRF2NeRF Translation

5
Comments
3 min read
Congrats to the Coze AI Bot Challenge Winners!

Congrats to the Coze AI Bot Challenge Winners!

47
Comments 20
2 min read
Towards a Brazilian History Knowledge Graph

Towards a Brazilian History Knowledge Graph

Comments
5 min read
DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation

Comments
2 min read
loading...