Skip to content

Commit

Permalink
perf: Don't check input format before parsing for Pandoc
Browse files Browse the repository at this point in the history
  • Loading branch information
clemlesne committed Oct 3, 2024
1 parent 8aa1568 commit f308e4f
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions app/scrape.py
Original file line number Diff line number Diff line change
Expand Up @@ -960,6 +960,7 @@ def _network_used_callback(size_bytes: int) -> None:
sandbox=True, # Enable sandbox mode, we don't know what we are scraping
source=full_html_minus_resources,
to="markdown-fenced_divs-native_divs-raw_html-bracketed_spans-native_spans-link_attributes-header_attributes-inline_code_attributes",
verify_format=False, # We know the format, don't verify it
extra_args=[
"--embed-resources=false",
"--wrap=none",
Expand Down

0 comments on commit f308e4f

Please sign in to comment.