This is a
playground to test code. It runs a full
Node.js environment and already has all of
npm’s 400,000 packages pre-installed, including
a-extractor with all
npm packages installed. Try it out:
This service is provided by RunKit and is not affiliated with npm, Inc or the package authors.
Database of expressions used for extracting content from blogs and articles.
The extraction expressions are Cheerio, similar with jQuery.
The targeted information is:
This project is designed to be used with Clean-Mark, but you can use it however you want.
Clean-Mark already has algorithms to extract most of the info, if the website is SEO friendly, eg: it respects schema.org/Article, or Microformats, or the Open Graph protocol.
But it's not a perfect tool 🤖 and it needs help from us humans 🙄
We ❤️ contributions !!!
Want to report a bug, request a feature, or contribute? Things can only be contributed via the A-Extractor GitHub repository.
The "fork-and-pull" Git workflow:
MIT © Cristi Constantin.