DEV Community

Mike Young profile picture

Mike Young

Building indie hacker stuff in my free time, focusing on AI. Launching https://aimodels.fyi - find the right AI model for your project!

Location Washington, DC Joined Joined on  Personal website https://aimodels.fyi twitter website

Education

Purdue

Work

Indie hacking stuff!

Proofread: Fixes All Errors with One Tap

Proofread: Fixes All Errors with One Tap

Comments
4 min read

Want to connect with Mike Young?

Create an account to connect with Mike Young. You can also sign in below to proceed if you already have an account.

Already have an account? Sign in
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Comments
4 min read
TextGrad: Automatic Differentiation via Text

TextGrad: Automatic Differentiation via Text

Comments
4 min read
Can Language Models Serve as Text-Based World Simulators?

Can Language Models Serve as Text-Based World Simulators?

Comments
4 min read
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Comments
4 min read
Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

Publicly Shareable Clinical Large Language Model Built on Synthetic Clinical Notes

Comments
4 min read
Can Language Models Use Forecasting Strategies?

Can Language Models Use Forecasting Strategies?

Comments
4 min read
An Empirical Study of Mamba-based Language Models

An Empirical Study of Mamba-based Language Models

Comments
4 min read
Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch

Comments
4 min read
HyperFields: Towards Zero-Shot Generation of NeRFs from Text

HyperFields: Towards Zero-Shot Generation of NeRFs from Text

Comments
4 min read
Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

Raccoon: Prompt Extraction Benchmark of LLM-Integrated Applications

Comments
4 min read
Understanding Hallucinations in Diffusion Models through Mode Interpolation

Understanding Hallucinations in Diffusion Models through Mode Interpolation

1
Comments
4 min read
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Comments
4 min read
Progress Towards Decoding Visual Imagery via fNIRS

Progress Towards Decoding Visual Imagery via fNIRS

Comments
4 min read
Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies

Stencil Computations on AMD and Nvidia Graphics Processors: Performance and Tuning Strategies

Comments
4 min read
Step-by-Step Diffusion: An Elementary Tutorial

Step-by-Step Diffusion: An Elementary Tutorial

Comments
3 min read
Rough Set improved Therapy-Based Metaverse Assisting System

Rough Set improved Therapy-Based Metaverse Assisting System

Comments
5 min read
Open Problems in DAOs

Open Problems in DAOs

Comments
4 min read
Unlearning Traces the Influential Training Data of Language Models

Unlearning Traces the Influential Training Data of Language Models

1
Comments
5 min read
The Prompt Report: A Systematic Survey of Prompting Techniques

The Prompt Report: A Systematic Survey of Prompting Techniques

3
Comments
3 min read
Large Language Models for Automated Open-domain Scientific Hypotheses Discovery

Large Language Models for Automated Open-domain Scientific Hypotheses Discovery

Comments
4 min read
What If We Recaption Billions of Web Images with LLaMA-3?

What If We Recaption Billions of Web Images with LLaMA-3?

Comments
5 min read
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Comments
4 min read
Discovering Preference Optimization Algorithms with and for Large Language Models

Discovering Preference Optimization Algorithms with and for Large Language Models

Comments
4 min read
AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation

AttnDreamBooth: Towards Text-Aligned Personalized Text-to-Image Generation

Comments
3 min read
Instant 3D Human Avatar Generation using Image Diffusion Models

Instant 3D Human Avatar Generation using Image Diffusion Models

Comments
4 min read
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM

Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM

Comments
4 min read
Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Improve Mathematical Reasoning in Language Models by Automated Process Supervision

Comments
4 min read
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters

Comments
4 min read
Toward Autonomous Driving by Musculoskeletal Humanoids: A Study of Developed Hardware and Learning-Based Software

Toward Autonomous Driving by Musculoskeletal Humanoids: A Study of Developed Hardware and Learning-Based Software

Comments
4 min read
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Comments
4 min read
PowerInfer-2: Fast Large Language Model Inference on a Smartphone

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

1
Comments
4 min read
Towards a Personal Health Large Language Model

Towards a Personal Health Large Language Model

Comments
5 min read
Clifford-Steerable Convolutional Neural Networks

Clifford-Steerable Convolutional Neural Networks

1
Comments
4 min read
Zero-shot Image Editing with Reference Imitation

Zero-shot Image Editing with Reference Imitation

2
Comments
4 min read
The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability

The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability

Comments
5 min read
RAG Does Not Work for Enterprises

RAG Does Not Work for Enterprises

2
Comments
4 min read
Separating the Chirp from the Chat: Self-supervised Visual Grounding of Sound and Language

Separating the Chirp from the Chat: Self-supervised Visual Grounding of Sound and Language

1
Comments
4 min read
GenAI Arena: An Open Evaluation Platform for Generative Models

GenAI Arena: An Open Evaluation Platform for Generative Models

1
Comments
3 min read
Creativity Has Left the Chat: The Price of Debiasing Language Models

Creativity Has Left the Chat: The Price of Debiasing Language Models

3
Comments
3 min read
The Bayesian Learning Rule

The Bayesian Learning Rule

5
Comments
5 min read
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?

3
Comments
4 min read
Thermodynamic Linear Algebra

Thermodynamic Linear Algebra

5
Comments
3 min read
A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents

3
Comments
5 min read
Magicoder: Empowering Code Generation with OSS-Instruct

Magicoder: Empowering Code Generation with OSS-Instruct

1
Comments
4 min read
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

Teams of LLM Agents can Exploit Zero-Day Vulnerabilities

3
Comments
3 min read
Golden Ratio Yoshimura for Meta-Stable and Massively Reconfigurable Deployment

Golden Ratio Yoshimura for Meta-Stable and Massively Reconfigurable Deployment

3
Comments
4 min read
Defending LLMs against Jailbreaking Attacks via Backtranslation

Defending LLMs against Jailbreaking Attacks via Backtranslation

5
Comments
3 min read
Explaining Explanations in Probabilistic Logic Programming

Explaining Explanations in Probabilistic Logic Programming

5
Comments
4 min read
Conformal Prediction Sets Improve Human Decision Making

Conformal Prediction Sets Improve Human Decision Making

3
Comments 1
4 min read
Learning to Infer Generative Template Programs for Visual Concepts

Learning to Infer Generative Template Programs for Visual Concepts

5
Comments
4 min read
Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey

Large Language Models(LLMs) on Tabular Data: Prediction, Generation, and Understanding -- A Survey

4
Comments
5 min read
RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair

RepairLLaMA: Efficient Representations and Fine-Tuned Adapters for Program Repair

3
Comments
4 min read
CompeteAI: Understanding the Competition Dynamics in Large Language Model-based Agents

CompeteAI: Understanding the Competition Dynamics in Large Language Model-based Agents

4
Comments
4 min read
Semantically Diverse Language Generation for Uncertainty Estimation in Language Models

Semantically Diverse Language Generation for Uncertainty Estimation in Language Models

6
Comments
4 min read
Guardrail Baselines for Unlearning in LLMs

Guardrail Baselines for Unlearning in LLMs

4
Comments
4 min read
Know Your Neighborhood: General and Zero-Shot Capable Binary Function Search Powered by Call Graphlets

Know Your Neighborhood: General and Zero-Shot Capable Binary Function Search Powered by Call Graphlets

2
Comments
4 min read
Scalable MatMul-free Language Modeling

Scalable MatMul-free Language Modeling

7
Comments
4 min read
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning

1
Comments
3 min read
Bootstrap3D: Improving 3D Content Creation with Synthetic Data

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

4
Comments
4 min read
loading...