This repository has been archived by the owner on May 10, 2024. It is now read-only.
HTML tags should be removed when classifying a page for Brave ads #1918
Is stripping only the HTML tags enough, or do we need to do more, like extracting only |
We're going to just use |
I can confirm that tests @kylehickinson showed me using |
kylehickinson added a commit that referenced this issue on Nov 11, 2019:
Instead of sending plain HTML, we now send the document's `innerText`, which doesn't include any HTML tags, images, etc. This allows the ads classifier to classify pages to their correct type.
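To illustrate why markup skews the result, here is a minimal sketch in Python. It is not the code from the commit: the app reads `innerText` from the browser, whereas this example approximates it with the standard library's `html.parser`. The names `VisibleTextExtractor`, `approximate_inner_text`, and the sample page are hypothetical. Real `innerText` also takes CSS visibility into account, which this sketch does not attempt.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0  # > 0 while inside a skipped element
        self._chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside <script>/<style>.
        if self._skip_depth == 0 and data.strip():
            self._chunks.append(data.strip())

    def text(self):
        return " ".join(self._chunks)


def approximate_inner_text(html):
    """Rough stand-in for innerText: text content without tags or scripts."""
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.text()


# Hypothetical page: the visible content is about law, but the markup and
# script are full of software-flavoured tokens.
sample_html = (
    '<html><head><style>.a { color: red; }</style>'
    '<script>var tracker = "software";</script></head>'
    '<body><div class="article"><h1>Court ruling</h1>'
    '<p>The judge dismissed the appeal.</p></div></body></html>'
)

plain_text = approximate_inner_text(sample_html)
print(plain_text)  # Court ruling The judge dismissed the appeal.
```

Fed the raw string, a word-based classifier would also see tokens such as `div`, `class`, `script`, and `var`, which plausibly explains a software-related label; the extracted text contains only the article's own words.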
kylehickinson added five more commits that referenced this issue, all with the same message: two on Nov 11, two on Nov 12, and one on Nov 13, 2019.
This was referenced Nov 21, 2019
Verification passed on iPhone XR with iOS 13.2 running 1.14(19.11.22.15)
Verification passed on iPhone 7+ with iOS 13.2 running 1.14(19.11.22.15)
Verification passed on iPhone 6 with iOS 12.4 running 1.14(19.11.22.15)
Verification passed on iPad Pro with iOS 13.2 running 1.14(19.11.22.15)
Verification passed on iPad Pro with iOS 12.4 running 1.14(19.11.22.15)
Description:
HTML tags should be removed when classifying a page for Brave ads; otherwise the classifier will likely return "Technology & Computing-Software".
Steps to Reproduce
Actual result:
Page is classified as "technology & computing-software"
Expected result:
Page should be classified as "law-law"
Reproduces how often: [Easily reproduced, Intermittent Issue]
Easily reproduced
Brave Version:
Device details:
Website problems only:
Additional Information
Search for "Successfully classified page at" in the logs to see the page classification