Small bug fixes for running models without tokenizers #168

coryMosaicML · 2024-08-28T16:18:58Z

Two small tweaks:

In the image logger, change tokenizer.model.max_length to tokenizer.model_max_length
In train.py, explicitly check whether the model has a tokenizer, and if it does not, set the tokenizer to None. Previously we assumed that every diffusion model came with a tokenizer for running generation, but this is no longer the case.
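
For illustration, the two tweaks can be sketched roughly as follows. This is a minimal sketch, not the code from this PR: the helper names `get_tokenizer_or_none` and `max_token_length` are hypothetical, and `SimpleNamespace` objects stand in for a real model and tokenizer. The only detail taken from the PR is that the length attribute is spelled `model_max_length` and that a model may have no tokenizer.

```python
from types import SimpleNamespace
from typing import Any, Optional


def get_tokenizer_or_none(model: Any) -> Optional[Any]:
    """Return the model's tokenizer, or None if the model does not have one.

    Hypothetical helper: it avoids assuming every model exposes `.tokenizer`.
    """
    return getattr(model, "tokenizer", None)


def max_token_length(tokenizer: Optional[Any]) -> Optional[int]:
    """Return the tokenizer's maximum length, or None when there is no tokenizer.

    Note the attribute is `model_max_length`, not `model.max_length`.
    """
    if tokenizer is None:
        return None
    return tokenizer.model_max_length


# Stand-in objects (assumptions for the example, not real model classes).
model_with_tokenizer = SimpleNamespace(
    tokenizer=SimpleNamespace(model_max_length=77)
)
model_without_tokenizer = SimpleNamespace()

tokenizer_a = get_tokenizer_or_none(model_with_tokenizer)
tokenizer_b = get_tokenizer_or_none(model_without_tokenizer)
```

Downstream code such as a logger can then branch on whether the tokenizer is None instead of failing with an AttributeError.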

Landanjs and others added 5 commits August 27, 2024 05:18

First pass on v1 model

43258fe

Update train

0e0fe80

model_max_length

e03fe0c

use right tokenizer

7f97fd6

no model class yet

8eff9a2

gupta-abhay approved these changes Aug 28, 2024

coryMosaicML merged commit c1f953f into mosaicml:main Aug 28, 2024
5 checks passed
