麦丽素

My life is getting better

Dreambooth

July 8, 2023 less than 1 minute read

Introduction

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

It’s like a photo booth, but once the subject is captured, it can be synthesized wherever your dreams take you.

20230708190125

在微调层面，我们重新使用图片进行微调训练会导致几个问题
- 语言漂移【学了新的忘了旧的】
  - 在大模型上微调之后，逐渐失去语言的句法和语义知识。即失去对一般知识的理解
- 过度拟合
  - 如图所示，在输入狗的照片后，过拟合会导致狗的姿势一直趴在沙滩上，而非其他姿势
Dreambooth的优势
- 使用了一个新的罕见词来代表图片的含义，保证新加入的图片对应的词在模型中没有其他太多含义
- 区别于Textual inversion方法，Dreambooth使用罕见次，而textual inversion是新词。Dreambooth会对整个模型进行微调，而textual inversion只对embedding部分调整

Reference

Twitter Facebook LinkedIn

You May Also Enjoy

FruitNerf

September 28, 2024 2 minute read

FruitNeRF: A Unified Neural Radiance Field based Fruit Counting Framework

Nerf and 3DGS

August 10, 2024 2 minute read

AnySyn3D Paper sets

June 30, 2024 less than 1 minute read

Unsigned Orthogonal Distance Fields: An Accurate Neural Implicit Representation for Diverse 3D Shapes

May 13, 2024 less than 1 minute read