Fix return_dict in encodec #31646

jla524 · 2024-06-26T18:49:53Z

What does this PR do?

Fixes #31642 (issue)

With this PR, return_dict=False returns a tuple, and the unit test compares tuple vs dict values.

% pytest tests/models/encodec/test_modeling_encodec.py -k test_model_outputs_equivalence
=========================================================== test session starts ============================================================
platform darwin -- Python 3.12.3, pytest-7.4.4, pluggy-1.4.0
rootdir: /Users/jacky/repos/transformers
configfile: pyproject.toml
plugins: xdist-3.5.0, timeout-2.3.1, rich-0.1.1
collected 116 items / 115 deselected / 1 selected                                                                                          

tests/models/encodec/test_modeling_encodec.py .                                                                                      [100%]

<warnings redacted>
============================================== 1 passed, 115 deselected, 9 warnings in 1.56s ===============================================

Who can review?

@kamilakesbi

kamilakesbi

Thanks @jla524 for iterating on this!

I left a comment for small suggested changes :)

kamilakesbi · 2024-06-27T08:06:13Z

src/transformers/models/encodec/modeling_encodec.py

+        if return_dict is None:
+            return_dict = self.config.return_dict


Here we could do instead:

Suggested change

if return_dict is None:

return_dict = self.config.return_dict

return_dict = return_dict if return_dict is not None else self.config.return_dict

so that return_dict is still set to self.config.return_dict by default.

It would be also nice to do the same modification in the decode method (at this line).

kamilakesbi · 2024-06-27T09:16:12Z

Thanks for iterating on this @jla524!

@amyeroberts this should be ready for final review/merge :)

HuggingFaceDocBuilderDev · 2024-06-27T09:35:16Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts

Thanks for fixing!

Just a q on the change to the recursive check

amyeroberts · 2024-06-27T09:52:43Z

tests/models/encodec/test_modeling_encodec.py

-                    if isinstance(tuple_object, (List, Tuple)):
-                        for tuple_iterable_value, dict_iterable_value in zip(tuple_object, dict_object):
-                            recursive_check(tuple_iterable_value, dict_iterable_value)
-                    elif isinstance(tuple_object, Dict):


Even if we assert that tuple_object is a tuple on L377, as this is a recursive function, isn't is still possible that the values in tuple_object i.e. tuple_iterable_value are a dict or None?

The values in the tuple_object should be tensor.Tensor only.

To check the types, I added a print statement in the recursive function:

def recursive_check(tuple_object, dict_object): print(f"[DEBUG]: {type(tuple_object)=}, {type(dict_object)=}") ...

and I got this:

[DEBUG]: type(tuple_object)=<class 'tuple'>, type(dict_object)=<class 'transformers.models.encodec.modeling_encodec.EncodecOutput'> [DEBUG]: type(tuple_object)=<class 'torch.Tensor'>, type(dict_object)=<class 'torch.Tensor'> [DEBUG]: type(tuple_object)=<class 'torch.Tensor'>, type(dict_object)=<class 'torch.Tensor'>

edit: it's probably more intuitive to just iterate over the items and compare it

amyeroberts

Thanks for fixing ad iterating on this!

jla524 added 3 commits June 26, 2024 11:18

fix: use return_dict parameter

18c699f

fix: type checks

9251b82

fix: unused imports

f183580

kamilakesbi reviewed Jun 27, 2024

View reviewed changes

update: one-line if else

55556e9

amyeroberts reviewed Jun 27, 2024

View reviewed changes

remove: recursive check

1c3be34

amyeroberts approved these changes Jun 28, 2024

View reviewed changes

amyeroberts merged commit 82a1fc7 into huggingface:main Jun 28, 2024
18 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix return_dict in encodec #31646

Fix return_dict in encodec #31646

jla524 commented Jun 26, 2024

kamilakesbi left a comment •

edited

Loading

kamilakesbi Jun 27, 2024

kamilakesbi Jun 27, 2024

kamilakesbi commented Jun 27, 2024

HuggingFaceDocBuilderDev commented Jun 27, 2024

amyeroberts left a comment

amyeroberts Jun 27, 2024

jla524 Jun 27, 2024 •

edited

Loading

amyeroberts left a comment

		if return_dict is None:
		return_dict = self.config.return_dict

	if return_dict is None:
	return_dict = self.config.return_dict
	return_dict = return_dict if return_dict is not None else self.config.return_dict

Fix return_dict in encodec #31646

Fix return_dict in encodec #31646

Conversation

jla524 commented Jun 26, 2024

What does this PR do?

Who can review?

kamilakesbi left a comment • edited Loading

Choose a reason for hiding this comment

kamilakesbi Jun 27, 2024

Choose a reason for hiding this comment

kamilakesbi Jun 27, 2024

Choose a reason for hiding this comment

kamilakesbi commented Jun 27, 2024

HuggingFaceDocBuilderDev commented Jun 27, 2024

amyeroberts left a comment

Choose a reason for hiding this comment

amyeroberts Jun 27, 2024

Choose a reason for hiding this comment

jla524 Jun 27, 2024 • edited Loading

Choose a reason for hiding this comment

amyeroberts left a comment

Choose a reason for hiding this comment

kamilakesbi left a comment •

edited

Loading

jla524 Jun 27, 2024 •

edited

Loading