[Feature]: Add a way to block all cookies and site data in headless mode puppeteer/chromium #3212

@PapiJalopy

Description

Proposed solution

I can consistently avoid getting a 403 from Newegg as long as I block all cookies and delete site data in Chromium when I first start it.

This is a theory, so take it with a grain of salt, but I've noticed that Newegg lets you visit their website once from any IP. As long as you block all cookies and store nothing, it seems to treat every visit as your first and won't give you a captcha or a 403 on your next scrape or refresh.

The issue is that this requires headless=false, and the settings are not persistent. If you could add, or point me to, a way to make Chromium user settings persistent and run in headless mode, that would be perfect.
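For illustration, here is a rough sketch of one way this might be approximated in headless mode. Instead of changing Chromium's cookie settings, it clears cookies and site data through the DevTools protocol before each visit. It does not actually block cookies, and I have not verified that it avoids the 403. `CdpLike`, `siteResetCommands`, and `resetSiteState` are hypothetical names made up for this example. The two protocol methods, `Network.clearBrowserCookies` and `Storage.clearDataForOrigin`, are real, and the session is assumed to come from Puppeteer's `page.createCDPSession()` (older versions use `page.target().createCDPSession()`).

```typescript
// Minimal shape of a DevTools protocol session. Puppeteer's CDPSession is
// assumed to be compatible (a cast may be needed depending on the version).
interface CdpLike {
  send(method: string, params?: Record<string, unknown>): Promise<unknown>;
}

type CdpCommand = { method: string; params?: Record<string, unknown> };

// Hypothetical helper: the protocol commands that clear every browser cookie
// and then wipe all stored site data for the given origin.
function siteResetCommands(origin: string): CdpCommand[] {
  return [
    { method: "Network.clearBrowserCookies" },
    {
      method: "Storage.clearDataForOrigin",
      params: { origin, storageTypes: "all" },
    },
  ];
}

// Hypothetical helper: send the commands in order, e.g. before each scrape.
async function resetSiteState(session: CdpLike, origin: string): Promise<void> {
  for (const { method, params } of siteResetCommands(origin)) {
    await session.send(method, params);
  }
}

// Assumed wiring with Puppeteer (not executed here):
//   const browser = await puppeteer.launch({ headless: true });
//   const page = await browser.newPage();
//   const session = await page.createCDPSession();
//   await resetSiteState(session, "https://www.newegg.com");
//   await page.goto("https://www.newegg.com");
```

Because the reset runs at the start of every session, nothing has to persist in the Chromium profile, which sidesteps the persistence problem rather than solving it. If persistent settings are still wanted, Puppeteer's `userDataDir` launch option may be worth trying, though I don't know whether the cookie-blocking setting is honored in headless mode.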

Objective

N/A

Goals

N/A

Non-goals

N/A

Anti-goals

N/A
