[Feat] Enable dynamic filter #879

petar-qb · 2024-11-15T10:29:53Z

Description

This PR introduces dynamic filters for the following selectors (so, all Vizro selectors except vm.DatePicker):
vm.Dropdown, vm.RadioItems, vm.Checklist, vm.Slider and vm.RangeSlider

You can test the feature by altering values in the scratch_dev/data.yaml (in the way that's described in the comment) and running the scratch_dev/app.py.

TODOs:

This PR TODOs:

- Improve the UI of the dynamic placeholder
- Unit tests
- Documentation -> [Docs] Dynamic filters #891

This or next PR TODOs (we should decide it):

- Support dynamic for the vm.DatePicker. (Maybe it's better to wait for the dash persistence bugfix as persistence doesn't work for the vm.DatePicker even on the main branch)

Next PRs:

- Enhance the default selector.value handling for new users. There are a few cases marked with 🟠 in this Issue -> https://github.com/McK-Internal/vizro-internal/issues/1356. There are two inconsistencies listed at the bottom of the Issue description. This TODO points to the second inconsistency.
- Introduce "Universal Vizro placeholder component" -> https://github.com/McK-Internal/vizro-internal/issues/1307
- Fix the multi=False vm.Dropdown dynamic selector bug when the value is cleared. This will be solved when Vizro universal placeholder component become introduced.
- Enable dynamic filter to work with the empty data_frame.
- Propagate data_frame Parameter default values from the model_manager into the DM._multi_load() that's called from the vm.Filter.pre_build(). -> PoC can be found in the comment.
- Implement "Select ALL" for the multi=True categorical selectors.
- Implement "Select the entire range" for the multi=True numerical selectors.

References:

Miro board link
poc branch for testing dash persistence mechanism -> https://github.com/mckinsey/vizro/tree/poc/dynamic-filter
poc branch where the default selector value for a the new user is aligned with new dynamic min/max https://github.com/mckinsey/vizro/tree/feat/dynamic-filter-slider-support-for-new-user - This improvement can wait until dash persistence bug is fixed.

Open questions:

How to enable dynamic Parameters? Should we allow, for example, the "options" property of a categorical parameter selector to be a function that dynamically calculates and returns new options?
How to enable a dynamic filter to be targeted by any predefined action? For example the parameter_action target. Or like this: update_figures(targets=[”filter_1_id”])
What exactly is a new way of dash persistence handling we expect (this fix will be implemented by us)?
If we introduce the data_frame property for the vm.Filter component, and if the data_frame Parameters change the targeting form to data_manager_key.function_argument, does is mean that our dynamic Filters could be handled in a same way as any other dynamic figure components?

Notice

I acknowledge and agree that, by checking this box and clicking "Submit Pull Request":
- I submit this contribution under the Apache 2.0 license and represent that I am entitled to do so on behalf of myself, my employer, or relevant third parties, as applicable.
- I certify that (a) this contribution is my original creation and / or (b) to the extent it is not my original creation, I am authorized to submit this contribution on behalf of the original creator(s) or their licensees.
- I certify that the use of this contribution as authorized by the Apache 2.0 license does not violate the intellectual property rights of anyone else.
- I have not referenced individuals, products or companies in any commits, directly or indirectly.
- I have not added data or restricted code in any commits, directly or indirectly.

for more information, see https://pre-commit.ci

…h minimal loading - tests pass

…ests pass

…ter.pre_build - tests pass

for more information, see https://pre-commit.ci

# Conflicts: # vizro-core/src/vizro/models/_controls/filter.py

…_manager to the Filter.pre_build data_manager _multi_load

…ti_load

…namic-filter

for more information, see https://pre-commit.ci

petar-qb · 2024-11-15T13:53:52Z

vizro-core/examples/scratch_dev/data.yaml

+# Choose from 0-50
+setosa: 5
+versicolor: 10
+virginica: 15
+
+# Choose from: 4.8 to 7.4
+min: 5
+max: 7
+
+# Choose from:
+#   2020-01-01 to 2020-05-29
+date_min: 2024-01-01
+date_max: 2024-05-29


Adjusting these values, the input data is changed for the scratch/app.py example.

By adjusting setosa, versicolor and virginica (from 0 to 50), example graphs that use the load_from_file_species will be affected (page_1, page_2, page_5, page_6).

By adjusting min, max (from 4.8 to 7.4), example graphs that use the load_from_file_sepal_length will be affected (page_3).

TODO: The following still doesn't work so enable page_4 and vm.DatePicker to support dynamic mode.
By adjusting date_min, date_max (from 2020-01-01 to 2020-05-29), example graphs that use the load_from_file_date_column will be affected (page_4).

Love this method of testing, it's much better than just selecting random points and refreshing the page lots of times 💯

vizro-core/src/vizro/actions/_actions_utils.py

vizro-core/src/vizro/actions/_on_page_load_action.py

… into feat/dynamic-filter

petar-qb · 2024-11-15T15:54:55Z

vizro-core/src/vizro/models/_components/form/_form_utils.py

@@ -54,7 +54,7 @@ def validate_value(cls, value, values):
        [entry["value"] for entry in values["options"]] if isinstance(values["options"][0], dict) else values["options"]
    )

-    if value and not is_value_contained(value, possible_values):
+    if value and ALL_OPTION not in value and not is_value_contained(value, possible_values):


The possible_values here is always a self.options prop that does not contain "ALL", and this validation function is invoked from the _build_dynamic_placeholder method. Previously self.value was not set anywhere from code, so this didn't fail.

vizro-core/src/vizro/models/_controls/filter.py

vizro-core/src/vizro/actions/_actions_utils.py

petar-qb · 2024-11-18T10:16:54Z

vizro-core/src/vizro/models/_controls/filter.py

+        # TODO: Align inner and outer ids to be handled in the same way as for other figure components.
+        selector_build_obj = self.selector.build()
+        return dcc.Loading(id=self.id, children=selector_build_obj) if self._dynamic else selector_build_obj


Is the inner/outer id mechanism here aligned with how it's done for other Vizro figure components?

petar-qb · 2024-11-18T10:18:08Z

vizro-core/src/vizro/models/_controls/filter.py

+        # TODO: Align inner and outer ids to be handled in the same way as for other figure components.
+        selector_build_obj = self.selector.build()
+        return dcc.Loading(id=self.id, children=selector_build_obj) if self._dynamic else selector_build_obj


Are there any UI improvements we should introduce until the "Universal Vizro placeholder component" comes?

petar-qb · 2024-11-18T13:42:53Z

vizro-core/src/vizro/models/_controls/filter.py

        multi_data_source_name_load_kwargs: list[tuple[DataSourceName, dict[str, Any]]] = [
            (model_manager[target]["data_frame"], {}) for target in proposed_targets
        ]


It's possible to send default DFP values from the model_manager into the data_manager._multi_load() with the following code:
(Disclaimer: The code is just a PoC and could be optimised a lot)

from vizro.models._controls import Parameter multi_data_source_name_load_kwargs: list[tuple[DataSourceName, dict[str, Any]]] = [] # One tuple per filter.target # [ # ('data_1', {'arg_1': 1, 'arg_2': 2,}), # ('data_2', {'X': "ASD"}), # ('data_2', {'X': "qwe"}), # ] # TODO-NEXT: The code below is just a PoC and could be improved a lot. page_obj = model_manager[model_manager._get_model_page_id(model_id=ModelID(str(self.id)))] for target in proposed_targets: data_source_name = model_manager[target]["data_frame"] load_kwargs = {} for page_parameter in page_obj.controls: if isinstance(page_parameter, Parameter): for parameter_targets in page_parameter.targets: if parameter_targets.startswith(f'{target}.data_frame'): argument = parameter_targets.split('.')[2] # argument is explicitly defined if parameter_value := getattr(page_parameter.selector, 'value', None): load_kwargs[argument] = parameter_value # find default value else: parameter_selector = page_parameter.selector default_parameter_value = None if isinstance(parameter_selector, Dropdown): default_parameter_value = parameter_selector.options if parameter_selector.multi else parameter_selector.options[0] elif isinstance(parameter_selector, Checklist): default_parameter_value = parameter_selector.options elif isinstance(parameter_selector, RadioItems): default_parameter_value = parameter_selector.options[0] elif isinstance(parameter_selector, Slider): default_parameter_value = parameter_selector.min elif isinstance(parameter_selector, RangeSlider): default_parameter_value = [parameter_selector.min, parameter_selector.max] load_kwargs[argument] = default_parameter_value multi_data_source_name_load_kwargs.append((data_source_name, load_kwargs))

Thanks for posting this here! Just to add my latest thoughts and FYI @maxschulz-COL.

Currently dynamic data functions require a default value for every argument. Even when there is a dataframe parameter, the default value is used when pre-build the filter e.g. to find the targets, column type (and hence selector) and initial values.

There are three possible solutions here:

leave this "default value must be supplied" restriction so arguments passed to dynamic data function (through _multi_load) are just {}

lift the restriction and take values used from the model_manager to pass through to _multi_load. This should always work because a parameter always has a default value even if it's not explicitly specified by the user. It introduces more coupling with the model manager though.

lift the restriction and just don't prebuild anything much for a dynamic filter. This is simplest/best but probably doesn't work at the moment because you need to know targets upfront to build actions etc. And you can't obtain targets without loading data, which requires some values for the parametrised dynamic data loading functions.

For now we're going for option 1 but may be revisited in future, especially after we've changed how model_manager works.

@petar-qb please could you put a TODO (no NEXT needed) in here just summarising and referring to this github comment? It can replace the existing TODO NEXT comment. Make sure that the reminder about static data needing {} remains though 🙏

for more information, see https://pre-commit.ci

antonymilne

This is awesome work! 💯 As I said yesterday, the code changes here don't do credit to the many hours of understanding required to get this all working.

I've left quite a few initial comments and will then do another review. I think all is on track for approval though 👍 Let me know if you want to discuss any of the comments in a call or over slack and we can do so before Thursday.

Let me look through your questions from the PR description and answer them separately.

vizro-core/examples/scratch_dev/app.py

antonymilne · 2024-11-19T10:38:03Z

vizro-core/examples/scratch_dev/data.yaml

+# Choose from 0-50
+setosa: 5
+versicolor: 10
+virginica: 15
+
+# Choose from: 4.8 to 7.4
+min: 5
+max: 7
+
+# Choose from:
+#   2020-01-01 to 2020-05-29
+date_min: 2024-01-01
+date_max: 2024-05-29


Love this method of testing, it's much better than just selecting random points and refreshing the page lots of times 💯

antonymilne · 2024-11-19T12:04:01Z

vizro-core/src/vizro/models/_components/form/checklist.py

+    def _build_static(self, new_options=None, **kwargs):
+        options = new_options if new_options else self.options


Unless there's good reason to not do it this way let's make these arguments non-optional and just supply them all the time. Hiding the defaulting behaviour inside the _build_static function is more confusing I think.

Suggested change

def _build_static(self, new_options=None, **kwargs):

options = new_options if new_options else self.options

def _build_static(self, options):

Similar comment across all selectors.

This is definitely a better approach. I did it for the options, min and max.

However, I'm still propagating the current_value to the numerical selectors only to support dcc.Input and dcc.Store components to work properly with the persistence in numerical selectors when they work in the dynamic mode. The current_value propagation can be removed when the dash persistence bug is fixed. (I added this as a comment in the filter.py where the current_value is propagated to the selector.__call__())

vizro-core/src/vizro/models/_components/form/checklist.py

antonymilne · 2024-11-19T12:05:49Z

vizro-core/src/vizro/models/_components/form/checklist.py

+
+    def _build_static(self, new_options=None, **kwargs):
+        options = new_options if new_options else self.options
+        full_options, default_value = get_options_and_default(options=options, multi=True)


Is it right that we might change the value supplied here? Is this the case where it only works because we're lucky and we have the ALL option?

Sorry, but I don't get you..
The get_options_and_default retrieves options without the "ALL" and returns the options with the "ALL" option (if the selector is multi=True). It also returns "ALL" as the default_value for the multi=True, and options[0] for the multi=False selectors.

antonymilne · 2024-11-19T12:06:34Z

vizro-core/src/vizro/models/_components/form/checklist.py

@@ -62,3 +67,15 @@ def build(self):
                ),
            ]
        )
+
+    def _build_dynamic_placeholder(self):


Note to myself to read through this again to understand it.

antonymilne · 2024-11-19T12:08:52Z

vizro-core/src/vizro/models/_components/form/slider.py

+    def _build_static(self, current_value=None, new_min=None, new_max=None, **kwargs):
+        _min = new_min if new_min else self.min
+        _max = new_max if new_max else self.max
+        init_value = current_value or self.value or _min


Can you remind me what's happening here? Why do we need current_value here and not just the min and max?

Also, following comment elsewhere let's do something like this unless there's good reason to not do so:

Suggested change

def _build_static(self, current_value=None, new_min=None, new_max=None, **kwargs):

_min = new_min if new_min else self.min

_max = new_max if new_max else self.max

init_value = current_value or self.value or _min

def _build_static(self, value, min, max, **kwargs):

init_value = value or min

As it's described in this thread, the current_value is here to enable persistence to work properly with the slider and range_slider.

These two selectors have a specific way of handling the persistence due to "two values bound problem". The current_value propagation can be removed when the dash bug is fixed.
I'll also write down all the potential improvements that will be possible after the dash persistence bugfix 😃

antonymilne · 2024-11-19T14:28:08Z

This or next PR TODOs (we should decide it):

- Support dynamic for the vm.DatePicker. (Maybe it's better to wait for the dash persistence bugfix as persistence doesn't work for the vm.DatePicker even on the main branch)

Definitely not urgent. Let's wait for this and fix plotly/dash#2678 first.

Next PRs:

- Enhance the default selector.value handling for new users. There are a few cases marked with 🟠 in this Issue -> Fix default value for dynamic controls McK-Internal/vizro-internal#1356. There are two inconsistencies listed at the bottom of the Issue description. This TODO points to the second inconsistency.

- Introduce "Universal Vizro placeholder component" -> Add universal Vizro placeholder component to all dynamic components McK-Internal/vizro-internal#1307

- Fix the multi=False vm.Dropdown dynamic selector bug when the value is cleared. This will be solved when Vizro universal placeholder component become introduced.

How many of these will still be problems once we fix plotly/dash#2678?

- Enable dynamic filter to work with the empty data_frame.

Good idea - let's discuss how this works now and how it should work.

- Propagate data_frame Parameter default values from the model_manager into the DM._multi_load() that's called from the vm.Filter.pre_build(). -> PoC can be found in the comment.

Added to the comment there but basically let's leave as a TODO and open a ticket. Not urgent.

- Implement "Select ALL" for the multi=True categorical selectors.

Would love to do this soon (like by end of year if possible).

- Implement "Select the entire range" for the multi=True numerical selectors.

Not as high priority - let's make a ticket and forget for now.

Open questions:

How to enable dynamic Parameters? Should we allow, for example, the "options" property of a categorical parameter selector to be a function that dynamically calculates and returns new options?

Good question but I think not super high priority unless we have an immediate use for it.

How to enable a dynamic filter to be targeted by any predefined action? For example the parameter_action target. Or like this: update_figures(targets=[”filter_1_id”])

Very good question and presumably required for us to get DFPs to update filters without reloading the page? So this one needs to be worked out soon.

What exactly is a new way of dash persistence handling we expect (this fix will be implemented by us)?

Let's discuss but I think basically the "common sense" way that we initially expected things to work.

If we introduce the data_frame property for the vm.Filter component, and if the data_frame Parameters change the targeting form to data_manager_key.function_argument, does is mean that our dynamic Filters could be handled in a same way as any other dynamic figure components?

Not sure I understand this, please could you explain?

Basically I think roughly ordered priorities are:

Finish this PR
Fix Component properties set through dynamic callbacks cannot be persisted plotly/dash#2678
Tidy our code, see what remaining cases remain to fix
DFP to update filters without reloading page
@antonymilne completes some current work coming from [Tidy] Convert actions to classes #363 on class-based action and refactoring
Implement "Select ALL" for the multi=True categorical selectors. https://github.com/McK-Internal/vizro-internal/issues/1342#issue-2618580976

wdyt?

antonymilne and others added 29 commits November 4, 2024 10:52

Reduce to just one load function in filter.py

9449c73

Refactor set methods and add in __call__

b37f38f

Fix tests

3d6f570

[pre-commit.ci] auto fixes from pre-commit.com hooks

68f6f61

for more information, see https://pre-commit.ci

Add numpy lower bound

9e6f62e

Fix tests

574169b

[pre-commit.ci] auto fixes from pre-commit.com hooks

568dc58

for more information, see https://pre-commit.ci

Fix tests

b13e574

Add new tests and lint

01307bd

Final tidy

2ac895b

[pre-commit.ci] auto fixes from pre-commit.com hooks

2e4e0e2

for more information, see https://pre-commit.ci

Fix min/max=0 bug

983940b

Move _get_targets_data_and_config into _get_modified_page_figures wit…

13f022e

…h minimal loading - tests pass

Dynamic before filter tidy changes

a07fda4

Turn _get_targets_data_and_config into _get_targets_data - tests pass

f2bd362

Turn _create_target_arg_mapping into _filter_dot_separated_string - t…

1dbacb0

…ests pass

categorical selectors after the tidy/dynamic-filter

f93d780

Split up filtered and unfiltered data, create _multi_load, rework Fil…

4d0fb0e

…ter.pre_build - tests pass

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a859ff

for more information, see https://pre-commit.ci

Support for Slider and RangeSlider to work as RadioItems

2b3b0ae

Merge remote-tracking branch 'origin/main' into tidy/dynamic-filter-2

30aa52a

# Conflicts: # vizro-core/src/vizro/models/_controls/filter.py

Lint and small fixes

d097733

pull changes from tidy/dynamic-filter-2

0f5b096

dynamic filters implemented as on_page_load targets

d3c677e

Propagating data_frame parameter values as load–kwargs from the model…

1d602ea

…_manager to the Filter.pre_build data_manager _multi_load

Merge main into the feature branch

2a94f6c

More improvements

e066fdb

Reverting: Sending DFP values from the MM to Filter.pre_build DM._mul…

9e5a239

…ti_load

Merge branch 'main' of https://github.com/mckinsey/vizro into feat/dy…

5f161aa

…namic-filter

github-actions bot added the Vizro-AI 🤖 Issue/PR that addresses Vizro-AI package label Nov 15, 2024

pre-commit-ci bot and others added 6 commits November 15, 2024 10:32

[pre-commit.ci] auto fixes from pre-commit.com hooks

cfdf4fc

for more information, see https://pre-commit.ci

Minor code cleaning

8ca5382

Solving conflicts

472f7a8

[pre-commit.ci] auto fixes from pre-commit.com hooks

1e3ba7f

for more information, see https://pre-commit.ci

Minor refactoring

41adc9e

[pre-commit.ci] auto fixes from pre-commit.com hooks

b912bb0

for more information, see https://pre-commit.ci

petar-qb commented Nov 15, 2024

View reviewed changes

vizro-core/src/vizro/actions/_actions_utils.py Outdated Show resolved Hide resolved

petar-qb commented Nov 15, 2024

View reviewed changes

vizro-core/src/vizro/actions/_on_page_load_action.py Outdated Show resolved Hide resolved

petar-qb added 2 commits November 15, 2024 16:48

Minor refactoring

1d2a9c5

Merge branch 'feat/dynamic-filter' of https://github.com/mckinsey/vizro…

d1bd246

… into feat/dynamic-filter

petar-qb commented Nov 15, 2024

View reviewed changes

vizro-core/src/vizro/models/_controls/filter.py Outdated Show resolved Hide resolved

Minor comment change

a89b8bf

petar-qb commented Nov 18, 2024

View reviewed changes

vizro-core/src/vizro/actions/_actions_utils.py Outdated Show resolved Hide resolved

petar-qb commented Nov 18, 2024

View reviewed changes

More refactoring

48e6716

petar-qb changed the title ~~Feat/dynamic filter~~ [Feat] Enable dynamic filter Nov 18, 2024

petar-qb commented Nov 18, 2024

View reviewed changes

petar-qb and others added 2 commits November 19, 2024 08:18

Remove fetching unfiltered data for filter.targets

161a1e9

[pre-commit.ci] auto fixes from pre-commit.com hooks

e1db7ec

for more information, see https://pre-commit.ci

antonymilne reviewed Nov 19, 2024

View reviewed changes

Merge main with the feature branch

f13d131

This was referenced Nov 19, 2024

[Tidy] Proof of concept replacing ctd for filter and parameter #880

Closed

[Docs] Dynamic filters #891

Open

petar-qb added 3 commits November 21, 2024 12:37

Addressing PR comments

b450404

Lint

82838f1

Merge main with the feature branch

53d73e5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feat] Enable dynamic filter #879

[Feat] Enable dynamic filter #879

petar-qb commented Nov 15, 2024 •

edited

Loading

petar-qb Nov 15, 2024

petar-qb Nov 18, 2024 •

edited

Loading

antonymilne Nov 19, 2024

petar-qb Nov 15, 2024 •

edited

Loading

petar-qb Nov 18, 2024

petar-qb Nov 18, 2024

petar-qb Nov 18, 2024

antonymilne Nov 19, 2024

antonymilne left a comment

antonymilne Nov 19, 2024

antonymilne Nov 19, 2024

petar-qb Nov 21, 2024

antonymilne Nov 19, 2024

petar-qb Nov 21, 2024

antonymilne Nov 19, 2024

antonymilne Nov 19, 2024

petar-qb Nov 21, 2024

antonymilne commented Nov 19, 2024 •

edited

Loading

This or next PR TODOs (we should decide it):

Next PRs:

Open questions:

		def _build_static(self, new_options=None, **kwargs):
		options = new_options if new_options else self.options

	def _build_static(self, new_options=None, **kwargs):
	options = new_options if new_options else self.options
	def _build_static(self, options):

[Feat] Enable dynamic filter #879

Are you sure you want to change the base?

[Feat] Enable dynamic filter #879

Conversation

petar-qb commented Nov 15, 2024 • edited Loading

Description

TODOs:

This PR TODOs:

This or next PR TODOs (we should decide it):

Next PRs:

References:

Open questions:

Notice

Choose a reason for hiding this comment

petar-qb Nov 18, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

petar-qb Nov 15, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

antonymilne left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

antonymilne commented Nov 19, 2024 • edited Loading

This or next PR TODOs (we should decide it):

Next PRs:

Open questions:

petar-qb commented Nov 15, 2024 •

edited

Loading

petar-qb Nov 18, 2024 •

edited

Loading

petar-qb Nov 15, 2024 •

edited

Loading

antonymilne commented Nov 19, 2024 •

edited

Loading