Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dismiss cookie permission, GDPR, and subscription popups before archiving #175

Open
pirate opened this issue Mar 14, 2019 · 3 comments
Open
Labels
size: hard status: idea-phase Work is tentatively approved and is being planned / laid out, but is not ready to be implemented yet touches: data/schema/architecture

Comments

@pirate
Copy link
Member

pirate commented Mar 14, 2019

Might be able to graft some code from: https://www.i-dont-care-about-cookies.eu for hiding modals and overlays.

Also:
https://greasyfork.org/de/scripts/34815-anti-adblock-killer-reek-

@pirate pirate added size: hard status: idea-phase Work is tentatively approved and is being planned / laid out, but is not ready to be implemented yet touches: data/schema/architecture labels Mar 14, 2019
@pirate
Copy link
Member Author

pirate commented Mar 15, 2019

This is blocked by:

@pirate
Copy link
Member Author

pirate commented Nov 18, 2022

@datoslabs
Copy link

datoslabs commented Jan 7, 2025

Hi,

I am curious if there's any interest in the community to incorporate AI/LLM to help solve this issue.

Recently I was working on a simple web scraper / retriever for a LLM project and came across the need to dismiss not only cookie consent but also newsletter subscription pop-ups. After some research on web UI element detection computer vision models and failed POCs, I decided to give LLM a try. I tested in Google AI studio using Gemini 2.0 Flash model; to my surprise, it appears Gemini 2.0 not only is able to detect if a pop-up (or cookie consent banner) exists on the screen, it is also able to correctly identify the XPath to close the pop-up!

The test I choose was artnet.com, it consisted of a newsletter subscription pop-up with a light gray "X" button to dismiss. Due to the light color, most web UI element CV detection models either ignored the button (to close the pop-up) completely or gave it a very low score of 30% confidence level; however, Gemini 2.0 had no problem identifying the button based on the same screen shot. Moreover when I provided it with the corresponding HTML, it was able to correctly identify the XPath which we can use in Playwright/Puppeteer/Selenium to send the click event. Following are the screenshots from one of my tests in Google AI Studio:

Image

Image

In my limited tests, it took 50-70 seconds to "think" (I ran it a couple times across 2 different days and both tests took >>50 seconds to process). I am sure more testing and optimization can be done to improve on this if there's enough interests to incorporate this feature into the project. Also for reference, https://github.com/browser-use/browser-use have incorporated similar technique to close pop-up dialogs; though, it appears that they highlighted each UI controls by adding colored labels and borders before taking a screen shot as an input for the LLM instead relying on LLM to identify the XPath to a specific control.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
size: hard status: idea-phase Work is tentatively approved and is being planned / laid out, but is not ready to be implemented yet touches: data/schema/architecture
Projects
None yet
Development

No branches or pull requests

2 participants