Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

daniel-bogdoll · 2024-11-22T16:32:08Z

Option to set 'non_blocking' for to(device) operation in BatchEncoding for performance improvements. Defaults to 'false', thus no behavioral changes.

What does this PR do?

This minor PR adds the non_blocking option to the to() function.

Previous: def to(self, device: Union[str, "torch.device"]) -> "BatchEncoding":
New: def to(self, device: Union[str, "torch.device"], non_blocking: bool = False) -> "BatchEncoding":

Since non_blocking defaults to 'False', this PR does not introduce behavioral changes.

I realized, when utilizing Zero Shot Object Detection models, that it was not possible to set this option, leading to sub-optimal performance during inference.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

vision models: @amyeroberts, @qubvel

… improvements. Defaults to 'false', thus no behavioral changes.

qubvel

Hi @daniel-bogdoll, thanks for adding this! It looks great to me. Do you think it might be worth extending the same option to BatchFeature to ensure consistent capabilities?

daniel-bogdoll · 2024-11-22T18:20:21Z

Thanks @qubvel, sure thing! Which tests would I need to run to make sure modifications in the to() function of BatchFeature get tested?

Just to make sure, I assume you refer to

transformers/src/transformers/feature_extraction_utils.py

Line 206 in 54be2d7

def to(self, *args, **kwargs) -> "BatchFeature":

?

qubvel · 2024-11-22T18:42:40Z

Yes, I refer to this one, but not sure it's properly tested anywhere, I was able to find only SequenceFeatureExtractionTestMixin

qubvel · 2024-11-22T18:44:35Z

Maybe we can do it as simple as

non_blocking = kwargs.get("non_blocking", False)
...
elif isinstance(v, torch.Tensor) and device is not None:
      new_data[k] = v.to(device=device, non_blocking=non_blocking)
...

daniel-bogdoll · 2024-11-22T18:48:06Z

That's how I would have tried it as well. But what about this block?

# Check if the args are a device or a dtype
        if device is None and len(args) > 0:
            # device should be always the first argument
            arg = args[0]
            if is_torch_dtype(arg):
                # The first argument is a dtype
                pass
            elif isinstance(arg, str) or is_torch_device(arg) or isinstance(arg, int):
                device = arg
            else:
                # it's something else
                raise ValueError(f"Attempting to cast a BatchFeature to type {str(arg)}. This is not supported.")

Here device is derived from args rather than kwargs. Should this be extended in some way to also consider deriving non_blocking? Not sure where or how this is used.

qubvel · 2024-11-22T18:57:15Z

Here device is derived from args rather than kwargs. Should this be extended in some way to also consider deriving non_blocking? Not sure where or how this is used.

I don't think so, maybe at some moment, it is worth refactoring this method for more explicit args and kwargs. For now, we can add a note in docstring that non_blocking should be passed as a keyword argument.

daniel-bogdoll · 2024-11-22T19:01:11Z

@qubvel Done! Thanks for the super-fast replies, was a pleasure! Tests fail now, though:

For the first one, as you stated here (#34826 (comment)), it does not seem to be related.

https://app.circleci.com/pipelines/github/huggingface/transformers/111324/workflows/3351b194-4b9e-4a17-876b-85360fc7ff01/jobs/1482124?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-checks-link&utm_content=summary

FAILED
tests/models/xlm_roberta_xl/test_modeling_xlm_roberta_xl.py::XLMRobertaXLModelTest::test_assisted_decoding_matches_greedy_search_1_same 
- AssertionError: False is not true

As the second one is a timeout issue, it also seems unrelated:

https://app.circleci.com/pipelines/github/huggingface/transformers/111324/workflows/3351b194-4b9e-4a17-876b-85360fc7ff01/jobs/1482127?utm_campaign=vcs-integration-link&utm_medium=referral&utm_source=github-checks-link&utm_content=summary

FAILED
tests/models/convbert/test_modeling_convbert.py::ConvBertModelTest::test_pipeline_fill_mask -
requests.exceptions.ReadTimeout: (ReadTimeoutError("HTTPSConnectionPool(host='huggingface.co', port=443):
Read timed out. (read timeout=10)"), '(Request ID: 04e3d1b8-11fc-4791-ba74-3d7d67a5f3f2)')

Option to set 'non_blocking' for to(device) operation for performance…

88eddf8

… improvements. Defaults to 'false', thus no behavioral changes.

qubvel reviewed Nov 22, 2024

View reviewed changes

Enabling non_blocking in to() operation of BatchFeature.

08f5d4b

daniel-bogdoll changed the title ~~Option to set 'non_blocking' for to(device) operation in BatchEncoding~~ Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature Nov 22, 2024

Improved docstring on utilization of non_blocking

9f465d7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

daniel-bogdoll commented Nov 22, 2024

qubvel left a comment

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

qubvel commented Nov 22, 2024

qubvel commented Nov 22, 2024 •

edited

Loading

daniel-bogdoll commented Nov 22, 2024

qubvel commented Nov 22, 2024

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Are you sure you want to change the base?

Option to set 'non_blocking' for to(device) in BatchEncoding and BatchFeature #34883

Conversation

daniel-bogdoll commented Nov 22, 2024

What does this PR do?

Before submitting

Who can review?

qubvel left a comment

Choose a reason for hiding this comment

daniel-bogdoll commented Nov 22, 2024 • edited Loading

qubvel commented Nov 22, 2024

qubvel commented Nov 22, 2024 • edited Loading

daniel-bogdoll commented Nov 22, 2024

qubvel commented Nov 22, 2024

daniel-bogdoll commented Nov 22, 2024 • edited Loading

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading

qubvel commented Nov 22, 2024 •

edited

Loading

daniel-bogdoll commented Nov 22, 2024 •

edited

Loading