DEV Community

Julien Simon
Julien Simon

Posted on • Originally published at julsimon.Medium on

Video — Deep dive: model merging

Video — Deep dive: model merging

Model merging is an increasingly popular technique that makes it possible to add or remove capabilities to transformer models, without the need for any additional training.

In this video, we first introduce what model merging is. Then, we discuss different merging algorithms implemented in the mergekit library: model soups, SLERP, Task Arithmetic, TIES, DARE, and Franken-merging.

#opensource #ai

Top comments (0)