DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations?

1
Comments
5 min read
From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers

From Interpolation to Extrapolation: Complete Length Generalization for Arithmetic Transformers

1
Comments
4 min read
Evaluating Large Language Models for Material Selection

Evaluating Large Language Models for Material Selection

1
Comments
4 min read
A Survey on the Real Power of ChatGPT

A Survey on the Real Power of ChatGPT

1
Comments
4 min read
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

1
Comments
3 min read
CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture

CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture

1
Comments
4 min read
You Only Cache Once: Decoder-Decoder Architectures for Language Models

You Only Cache Once: Decoder-Decoder Architectures for Language Models

1
Comments
4 min read
Motorway: Seamless high speed BFT

Motorway: Seamless high speed BFT

1
Comments
3 min read
Language Modeling Using Tensor Trains

Language Modeling Using Tensor Trains

1
Comments
4 min read
Linearizing Large Language Models

Linearizing Large Language Models

1
Comments
4 min read
Integrating a custom AI copilot into a new

Integrating a custom AI copilot into a new

Comments
1 min read
The Role of Data Integration in Healthcare Research and Precision Medicine

The Role of Data Integration in Healthcare Research and Precision Medicine

Comments
4 min read
"Day 58 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Probability - 4)

"Day 58 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Probability - 4)

1
Comments
1 min read
"Day 60 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Probability - 6)

"Day 60 of My Learning Journey: Setting Sail into Data Excellence! Today's Focus: Maths for Data Analysis (Probability - 6)

1
Comments
1 min read
DSA for Data Scientists

DSA for Data Scientists

Comments
36 min read
is Hadoop Dead?

is Hadoop Dead?

12
Comments
3 min read
LLMs Can Patch Up Missing Relevance Judgments in Evaluation

LLMs Can Patch Up Missing Relevance Judgments in Evaluation

1
Comments
4 min read
Chain of Thoughtlessness: An Analysis of CoT in Planning

Chain of Thoughtlessness: An Analysis of CoT in Planning

1
Comments
4 min read
The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

The AI Review Lottery: Widespread AI-Assisted Peer Reviews Boost Paper Scores and Acceptance Rates

1
Comments
4 min read
The Power of Training: How Different Neural Network Setups Influence the Energy Demand

The Power of Training: How Different Neural Network Setups Influence the Energy Demand

1
Comments
4 min read
Neural Networks Make Approximately Independent Errors Over Repeated Training

Neural Networks Make Approximately Independent Errors Over Repeated Training

1
Comments
4 min read
Voxel51 Filtered Views Newsletter - May 10, 2024

Voxel51 Filtered Views Newsletter - May 10, 2024

1
Comments
11 min read
OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration

OptPDE: Discovering Novel Integrable Systems via AI-Human Collaboration

Comments
4 min read
TIM: An Efficient Temporal Interaction Module for Spiking Transformer

TIM: An Efficient Temporal Interaction Module for Spiking Transformer

Comments
5 min read
Large Language Models can Strategically Deceive their Users when Put Under Pressure

Large Language Models can Strategically Deceive their Users when Put Under Pressure

Comments
4 min read
Creating a Stacked Mountain Chart (JS)

Creating a Stacked Mountain Chart (JS)

Comments
6 min read
Recapping the AI, Machine Learning and Data Science Meetup — May 8, 2024

Recapping the AI, Machine Learning and Data Science Meetup — May 8, 2024

1
Comments
9 min read
Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

Creating AI Apps Using RAG & LangChain: A Step-by-Step Developer Guide!

20
Comments
7 min read
Basic Terms in Machine Learning (Model Training)

Basic Terms in Machine Learning (Model Training)

4
Comments
4 min read
ID Document Recognition SDK by FacePlugin

ID Document Recognition SDK by FacePlugin

20
Comments
1 min read
Mitigating LLM Hallucinations via Conformal Abstention

Mitigating LLM Hallucinations via Conformal Abstention

Comments
4 min read
Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models

Can't say cant? Measuring and Reasoning of Dark Jargons in Large Language Models

Comments
3 min read
xLSTM: Extended Long Short-Term Memory

xLSTM: Extended Long Short-Term Memory

Comments
3 min read
Assemblage: Automatic Binary Dataset Construction for Machine Learning

Assemblage: Automatic Binary Dataset Construction for Machine Learning

Comments
4 min read
HCC Is All You Need: Alignment-The Sensible Kind Anyway-Is Just Human-Centered Computing

HCC Is All You Need: Alignment-The Sensible Kind Anyway-Is Just Human-Centered Computing

Comments
4 min read
Generative Multimodal Models are In-Context Learners

Generative Multimodal Models are In-Context Learners

Comments
4 min read
From ETL to Modern Integration Platforms

From ETL to Modern Integration Platforms

Comments
4 min read
The Psychosocial Impacts of Generative AI Harms

The Psychosocial Impacts of Generative AI Harms

Comments
4 min read
Popular data science libraries

Popular data science libraries

Comments
9 min read
How to Visualize LiDAR Data

How to Visualize LiDAR Data

2
Comments
1 min read
TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

Comments
3 min read
Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

Agent Hospital: A Simulacrum of Hospital with Evolvable Medical Agents

Comments
4 min read
CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

Comments
3 min read
AlphaMath Almost Zero: process Supervision without process

AlphaMath Almost Zero: process Supervision without process

Comments
4 min read
Automating Data Processes for Efficiency and Accuracy

Automating Data Processes for Efficiency and Accuracy

Comments
5 min read
What to use parquet or CSV?

What to use parquet or CSV?

17
Comments
3 min read
🗺️ Neo4J #GraphSummits as data!

🗺️ Neo4J #GraphSummits as data!

Comments 4
1 min read
Accelerating ETL Processes for Timely Business Intelligence

Accelerating ETL Processes for Timely Business Intelligence

Comments
4 min read
Prompt Design and Engineering: Introduction and Advanced Methods

Prompt Design and Engineering: Introduction and Advanced Methods

2
Comments
4 min read
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking

1
Comments
4 min read
Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity

1
Comments
4 min read
A Simple and Effective Pruning Approach for Large Language Models

A Simple and Effective Pruning Approach for Large Language Models

1
Comments
3 min read
Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

1
Comments
3 min read
SAR image matching algorithm based on multi-class features

SAR image matching algorithm based on multi-class features

Comments
4 min read
Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Scalable Bayesian Inference in the Era of Deep Learning: From Gaussian Processes to Deep Neural Networks

Comments
4 min read
Are aligned neural networks adversarially aligned?

Are aligned neural networks adversarially aligned?

Comments
4 min read
PopulAtion Parameter Averaging (PAPA)

PopulAtion Parameter Averaging (PAPA)

Comments
3 min read
Poisoning Web-Scale Training Datasets is Practical

Poisoning Web-Scale Training Datasets is Practical

Comments
3 min read
Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Beyond Memorization: Violating Privacy Via Inference with Large Language Models

Comments
3 min read
Network reconstruction via the minimum description length principle

Network reconstruction via the minimum description length principle

Comments
3 min read
loading...