SERPent / docs /docs.md
Game4all's picture
Implement arXiv backend
21275ec

SERPent

SERP results scrapping

SERPent exposes an unified API to query SERP (Search Engine Result Pages) for a few common search engines, namely:

  • DuckDuckGo
  • Brave
  • Bing
  • Google Patents
  • arXiv
  • Google

The application uses the playwright library to control a headless web browser, to simulate normal user activity, to fool the anti-bot measures often present on those sites. See the /serp/ endpoints for search results scrapping.

Website sources scrapping

SERPent also exposes a few endpoints to scrap the contents of certain sources (patents, scholar). See the /scrap/ endpoints for supported website sources scrapping.