This page is about my business and personal projects.
I know a little bit of webdesign too, but tend to avoid it in favour of CSS frameworks such as bulma.io or bootstrap. I'd love to learn a modern web dev framework like Reactjs, but I simply don't have the time.
Currently, I am interested in creating reliable, distributed and queue-based scraping infrastructures, because I need them for scrapeulous.
My most recent projects are:
- struktur.js A way to extract structured information from any visually rendered HTML page. This project aims to deprecate the scraping of websites with CSS selectors and Xpath queries. I don't have enough time to push it forward, but I like the idea a lot and I think it has a tremendous amount of potential.
- scrapeulous A scraping platform, aiming to solve many annoying tasks when developing scrapers/crawlers. Currently (August 2019), scrapeulous focuses on search engine scraping. In the near future, scraping of any website will be possible.
- se-scraper The successor of GoogleScraper that builds on top of puppeteer, written in JS.
A new project of mine (November and December 2018) is a introduction into machine learning. The following blog posts cover the topic:
Some old projects of mine: