pfBlockerNG: extend support for AdBlock-style lists #1303

andrebrait · 2023-10-06T09:19:33Z

This supersedes #1302 and closes #14838.

Extend support for AdBlock-style lists:

Unbound mode:
- Support whitelist entries, including wildcards
Python mode:
- Support whitelist entries, including wildcards
- Expand blacklist entry support to also handle wildcards

In Python mode, most processing was moved to Python, with the surrounding code merely assembling lists that get consumed by it.

User-defined, TOP1M and Whitelist entries from AdBlock-style lists get all joined in a single file (PHP) that gets loaded and parsed later (Python).

Whenever possible, simple domain matches are kept as-is. The presence of any wildcards triggers the conversion of that specific entry into a Regular Expression in the Python format.

In Unbound mode, everything is kept the same, except that whitelisting is now done in 3 distinct steps (User-defined and TOP1M using fixed string matching with ggrep, and Whitelist entries from AdBlock-style lists using extended regular expressions with ggrep), in a postprocessing step separate from the deduplication, etc.

In both cases, the # White count in the logs now refers to how many whitelist entries were found, rather than how many domains were removed from the Blacklist.

This still needs testing and input on where to log things in a more adequate fashion.

* Use less memory, write whitelist to disk right away * Fix regular expression conversion to disallow matching limiter chars * Do not recreate whitelist file for each alias

* Use a dedicated log file for whitelisting results * Restrict detailed view on whitelisted domains to log file * Make the new file visible via the Logs page

* Missing: * TOP1M support * Whitelist support * Testing it

* This is the intended behavior for them * Enabled resolving multiple wildcards for blacklists

andrebrait · 2023-10-07T20:01:59Z

Idea: move the .whitelist files under the Permit files category in the Logs tab.

andrebrait · 2024-02-08T11:14:50Z

Superseded by #1343

andrebrait added 5 commits October 4, 2023 22:04

Support AdBlock Whitelists with wildcards

642c9bd

Fix Python mode, general improvements

8042e88

* Use less memory, write whitelist to disk right away * Fix regular expression conversion to disallow matching limiter chars * Do not recreate whitelist file for each alias

Add log file for whitelist results

5f69f6e

* Use a dedicated log file for whitelisting results * Restrict detailed view on whitelisted domains to log file * Make the new file visible via the Logs page

Initial work on Python regexes

4f07e20

* Missing: * TOP1M support * Whitelist support * Testing it

Support for Whitelists with wildcards in Python

3901825

andrebrait force-pushed the pffblockerng_devel_whitelist_regex branch from 60ecbea to b26ee61 Compare October 6, 2023 16:37

Make AdBlock-style lists wildcard by default

2e8a18a

* This is the intended behavior for them * Enabled resolving multiple wildcards for blacklists

andrebrait force-pushed the pffblockerng_devel_whitelist_regex branch from b26ee61 to b9a6874 Compare October 7, 2023 19:57

Cache CNAME original name when whitelisted

c7e351a

andrebrait force-pushed the pffblockerng_devel_whitelist_regex branch from b9a6874 to c7e351a Compare October 8, 2023 16:52

babilon mentioned this pull request Oct 10, 2023

HaGeZi offers lists containing Adblock Plus syntax #1309

Closed

andrebrait added 2 commits October 10, 2023 12:38

Disambiguate wildcard and regular entries

dd694fc

Fix enablement of whitelists

2995ad5

andrebrait force-pushed the pffblockerng_devel_whitelist_regex branch 2 times, most recently from 77537b7 to f926b71 Compare October 12, 2023 22:45

netgate-git-updates force-pushed the devel branch from 15a8e61 to a8234ec Compare October 12, 2023 22:45

Add debug log file, refactor DNSBL dicts

40d5d9f

andrebrait force-pushed the pffblockerng_devel_whitelist_regex branch from f926b71 to 40d5d9f Compare October 12, 2023 22:48

Add remaining debug log functionality, polish logs

49d2fde

andrebrait mentioned this pull request Feb 8, 2024

net/pfSense-pkg-pfBlockerNG-devel: EasyList and Python mode improvements #1343

Open

andrebrait closed this Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pfBlockerNG: extend support for AdBlock-style lists #1303

pfBlockerNG: extend support for AdBlock-style lists #1303

andrebrait commented Oct 6, 2023 •

edited

Loading

andrebrait commented Oct 7, 2023

andrebrait commented Feb 8, 2024 •

edited

Loading

pfBlockerNG: extend support for AdBlock-style lists #1303

pfBlockerNG: extend support for AdBlock-style lists #1303

Conversation

andrebrait commented Oct 6, 2023 • edited Loading

andrebrait commented Oct 7, 2023

andrebrait commented Feb 8, 2024 • edited Loading

andrebrait commented Oct 6, 2023 •

edited

Loading

andrebrait commented Feb 8, 2024 •

edited

Loading