PyData Amsterdam 2024

Master Advanced Web Scraping Techniques in Python
09-18, 13:30–15:00 (Europe/Amsterdam), Amstel Room - OBA Oosterdok

Join me for an incredible workshop to unlock the full potential of Anti-Ban & Web Scraping in Python! From novice to virtuoso, you’ll learn advanced techniques for collecting crucial datasets to train AI models.


🔍 Highlights 🔍

Protection Disclosed

🚀 Overcome fingerprint challenges and anti-bot measures.
🔍 Reverse engineering protection to understand signals tracking

Proxy and Browser Farms Adventure

🌊 Discover Scrapoxy, the free and open-source proxies waterfall tailored for Web Scraping
🎯 Become an expert in browser farms with Playwright

This workshop will immerse you in the secret world of anti-bot protection.

It is tailored for intermediate developers seeking to deepen their understanding of Web Scraping techniques and how to overcome protective measures. Basic knowledge of Python and JavaScript is recommended, but don't worry if you're new to it - I'll be here to help you every step of the way.

🛠️ Preflight Checklist 🛠️

To simplify the installation process, I've pre-configured an Ubuntu virtual machine for you with Chrome, VSCode, Python, Node.js, Playwright, and all the necessary dependencies for this workshop.

You can download it from this link.

The virtual machine is in OVA format and can be easily imported into VirtualBox or VMware.

I have also included the installation binaries for Ubuntu Linux, MacOS, and Windows.

Don't miss the unique opportunity to master these essential skills!

Fabien Vauchelles is an Anti-Ban Expert. With over a decade of experience in Web Scraping, Fabien's passion for code and technology helps him to bypass protections. He is the creator of Scrapoxy, a mature free and open-source proxy waterfall tailored for the Web Scraping industry.

He had the opportunity of sharing his insights at many events including Devoxx conferences, Voxxed Days, API Days, PyCon, PyData and others.