Use torch.device instead of current device index for BnB quantizer #10069

a-r-r-o-w · 2024-12-01T15:13:14Z

HuggingFaceDocBuilderDev · 2024-12-01T15:19:32Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

sayakpaul

Could you run some slow tests and maybe the

diffusers/tests/quantization/bnb/test_4bit.py

Line 489 in c96bfa5

class SlowBnb4BitFluxTests(Base4bitTests):

suite?

If not, it will need to wait till tomorrow when I can find time to run it. LMK.

sayakpaul · 2024-12-01T15:19:39Z

src/diffusers/models/modeling_utils.py

@@ -836,7 +836,7 @@ def from_pretrained(cls, pretrained_model_name_or_path: Optional[Union[str, os.P
                        param_device = "cpu"
                    # TODO (sayakpaul,  SunMarc): remove this after model loading refactor
                    elif is_quant_method_bnb:
-                        param_device = torch.cuda.current_device()
+                        param_device = torch.device(torch.cuda.current_device())


This looks good to me!

@yiyixuxu WDYT?

Cc: @SunMarc as well.

let's throw an error in load_model_dict_into_meta when device is passed as index??

Throws a value error now. @yiyixuxu

@sayakpaul, the integration tests pass:

(nightly-venv) (nightly-venv) aryan@hf-dgx-01:~/work/diffusers$ RUN_SLOW=1 CUDA_VISIBLE_DEVICES="3" pytest -s tests/quantization/bnb/test_4bit.py::SlowBnb4BitFluxTests ========================================================================================================================================= test session starts ========================================================================================================================================== platform linux -- Python 3.10.14, pytest-8.3.2, pluggy-1.5.0 rootdir: /home/aryan/work/diffusers configfile: pyproject.toml plugins: timeout-2.3.1, requests-mock-1.10.0, xdist-3.6.1, anyio-4.6.2.post1 collected 1 item tests/quantization/bnb/test_4bit.py Unused kwargs: ['_load_in_4bit', '_load_in_8bit', 'quant_method']. These kwargs are not used in <class 'transformers.utils.quantization_config.BitsAndBytesConfig'>. `low_cpu_mem_usage` was None, now default to True since model is quantized. Loading pipeline components...: 14%|████████████████████████████████▋ | 1/7 [00:00<00:00, 9.00it/s]You set `add_prefix_space`. The tokenizer needs to be converted from the slow tokenizers Loading pipeline components...: 100%|█████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 7/7 [00:00<00:00, 9.26it/s] 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 10/10 [00:14<00:00, 1.50s/it] . ========================================================================================================================================== 1 passed in 52.27s ===================================================================

yiyixuxu

thanks!

sayakpaul

Merge away! Thank you very much.

Maybe we could also update the type hint of device in load_model_dict_into_meta()?

a-r-r-o-w · 2024-12-05T06:03:36Z

Maybe we could also update the type hint of device in load_model_dict_into_meta()?

I think it's already correctly set to str | torch.device

…10069) * update * apply review suggestion --------- Co-authored-by: Sayak Paul <[email protected]>

update

aeeadc7

a-r-r-o-w requested a review from sayakpaul December 1, 2024 15:13

a-r-r-o-w changed the title ~~Use torch.dtype instead of current device index for BnB quantizer~~ Use torch.device instead of current device index for BnB quantizer Dec 1, 2024

sayakpaul reviewed Dec 1, 2024

View reviewed changes

sayakpaul requested a review from yiyixuxu December 1, 2024 15:21

sayakpaul mentioned this pull request Dec 2, 2024

[core] TorchAO Quantizer #10009

Merged

9 tasks

yiyixuxu added the close-to-merge label Dec 3, 2024

sayakpaul mentioned this pull request Dec 4, 2024

[Single File] Add GGUF support #9964

Merged

6 tasks

DN6 added the roadmap Add to current release roadmap label Dec 4, 2024

a-r-r-o-w added 2 commits December 4, 2024 22:24

Merge branch 'main' into param-device-bnb

7261559

apply review suggestion

126d84e

a-r-r-o-w requested a review from sayakpaul December 4, 2024 21:33

yiyixuxu approved these changes Dec 4, 2024

View reviewed changes

Merge branch 'main' into param-device-bnb

1a171c3

sayakpaul approved these changes Dec 5, 2024

View reviewed changes

sayakpaul merged commit 98d0cd5 into main Dec 5, 2024
18 checks passed

sayakpaul deleted the param-device-bnb branch December 5, 2024 02:35

sayakpaul added a commit that referenced this pull request Dec 23, 2024

Use torch.device instead of current device index for BnB quantizer (#…

6bf8b2b

…10069) * update * apply review suggestion --------- Co-authored-by: Sayak Paul <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use torch.device instead of current device index for BnB quantizer #10069

Use torch.device instead of current device index for BnB quantizer #10069

Uh oh!

a-r-r-o-w commented Dec 1, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Dec 1, 2024

Uh oh!

sayakpaul left a comment

Uh oh!

sayakpaul Dec 1, 2024

Uh oh!

yiyixuxu Dec 1, 2024

Uh oh!

a-r-r-o-w Dec 4, 2024

Uh oh!

yiyixuxu left a comment

Uh oh!

sayakpaul left a comment

Uh oh!

Uh oh!

a-r-r-o-w commented Dec 5, 2024

Uh oh!

Uh oh!

Use torch.device instead of current device index for BnB quantizer #10069

Use torch.device instead of current device index for BnB quantizer #10069

Uh oh!

Conversation

a-r-r-o-w commented Dec 1, 2024

Uh oh!

HuggingFaceDocBuilderDev commented Dec 1, 2024

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul Dec 1, 2024

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Dec 1, 2024

Choose a reason for hiding this comment

Uh oh!

a-r-r-o-w Dec 4, 2024

Choose a reason for hiding this comment

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

sayakpaul left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

a-r-r-o-w commented Dec 5, 2024

Uh oh!

Uh oh!