Scraping one million keywords is not a easy task. There are proxy problems, big data problems and reliability issues. In this blog post, the most valuable insights are shared.
Continue reading
Posted on September 17, 2019 in Scraping • Tagged with puppeteer, web scraping, headless chrome, 1 million, queue, architecture • 5 min read
Scraping one million keywords is not a easy task. There are proxy problems, big data problems and reliability issues. In this blog post, the most valuable insights are shared.
Posted on August 31, 2019 in Scraping • Tagged with puppeteer, web scraping, AWS lambda, headless chrome • 4 min read
In this blog post, we demonstrate how a web scraping function is deployed to the AWS cloud with puppeteer and headless chrome.
Posted on July 18, 2019 in Security • Tagged with wpa3, SAE, Dragonfly, fuzzing • 8 min read
Implementing possible fuzzing strategies with boofuzz against the WPA3 SAE Dragonfly handshake. Dragonfly is the main ingredient of WPA3 certified routers and 802.11 devices.
Posted on July 15, 2019 in Scraping • Tagged with puppeteer, web scraping, CSS selectors, XPath queries • 7 min read
I will shop an alternative approach to web scraping without using css selectors and XPath queries. We make use of the fact that most web pages visually render the information of interest in a coherent, structured way. This technique requires a remotely controllable web browser such as puppeteer, that is capable of rendering web pages visually.
Posted on March 01, 2019 in Scraping • Tagged with puppeteer, recatpcha, scraping • 5 min read
A captcha is a mechanism to distinguish human users from automated programs (bot). There are many service providers in the Internet that have a major incentive to prevent bots from (ab)using their systems.
Posted on February 22, 2019 in WPA3 • Tagged with WPA3, 802.11, hostapd, wpa_supplicant • 10 min read
Tutorial that shows how to run an WPA3 access point with hostapd 2.7 and SAE Dragonfly Handshake.