I checked my program (PlaywrightCrawler) against this thing:
https://amiunique.org/fingerprintUsed US residential proxy, did 3 screenshots, see below
It seems - there are some areas where Crawlee could do better (be less unique, less detectable)!
Here the list (these things are red on the screenshots):
- User Agent (I used fingerprint generator for this!)
- Canvas
- Navigator properties
- List of fonts
- List of plugins
- Permissions
Some settings in my PlaywrightCrawler:
useFingerprints: true
,
useFingerprintCache: false
,
launcher: firefox
Regarding list of plugins: I use some JS code (
pluginContent
string) taken from here:
https://discord.com/channels/801163717915574323/1059483872271798333and inject it into page this way:
preNavigationHooks: [
async ({ page, request }) => {
await page.addInitScript({ content: pluginContent });
},
Well, this code/hack... it simulates presence of some PDF plugins... but I have an impression there are better solutions for plugins/fonts/permissions...