Implement `pipeline.to(device)` #195

pcuenca · 2022-08-17T08:13:26Z

Currently, pipeline modules are moved to the preferred compute device during __call__. This is reasonable, as they stay there as long as the user keeps passing the same torch_device across calls.

However, in multi-GPU model-serving scenarios, it could be useful to move each pipeline to a dedicated device during or immediately after instantiation. This would make it possible to create, say, 8 different pipelines and move each one to a different GPU. Doing it this way could potentially save CPU memory while preparing the service.

Currently, the workaround to achieve the same would be to perform a call with fake data immediately after the instantiation.

Describe the solution you'd like
Ideally, the following should work:

pipe = StableDiffusionPipeline.from_pretrained(model_id).to("cuda:1")

Describe alternatives you've considered
Current workaround:

pipe = StableDiffusionPipeline.from_pretrained(model_id)
_ = pipe(["cat"], num_inference_steps=1, torch_device="cuda:1")

Another alternative would be to pass the device to the initializer. This could be done in addition to adding a to method, but I believe it's not necessary as to is familiar enough to PyTorch users.

Additional context
See discussion in this Slack thread.

The text was updated successfully, but these errors were encountered:

patrickvonplaten · 2022-08-17T08:21:56Z

Sounds good to me! +1 to add such a .to(...) function to the DiffusionPipeline class.

What do you think @anton-l @patil-suraj ?

anton-l · 2022-08-17T08:23:50Z

Works for me, nice and intuitive :)

patil-suraj · 2022-08-17T08:42:38Z

+1, let's go for it!

patil-suraj · 2022-08-17T09:20:05Z

@pcuenca do you wanna take a stab at it ? Otherwise happy to work on it, if you are busy :)

pcuenca · 2022-08-17T11:10:56Z

@patil-suraj happy to take it! I'll do it after making some progress on the backend, unless it's urgent. I think I'd be ready to work on this later today or tomorrow, would that be ok?

pcuenca self-assigned this Aug 17, 2022

pcuenca mentioned this issue Aug 18, 2022

Pipeline to device #210

Merged

anton-l closed this as completed in #210 Aug 19, 2022

Implement `pipeline.to(device)` #195

Implement `pipeline.to(device)` #195

pcuenca commented Aug 17, 2022

patrickvonplaten commented Aug 17, 2022

anton-l commented Aug 17, 2022

patil-suraj commented Aug 17, 2022

patil-suraj commented Aug 17, 2022

pcuenca commented Aug 17, 2022

Implement pipeline.to(device) #195

Implement pipeline.to(device) #195

Comments

pcuenca commented Aug 17, 2022

patrickvonplaten commented Aug 17, 2022

anton-l commented Aug 17, 2022

patil-suraj commented Aug 17, 2022

patil-suraj commented Aug 17, 2022

pcuenca commented Aug 17, 2022

Implement `pipeline.to(device)` #195

Implement `pipeline.to(device)` #195