Pablo Hoffman is announcing that Scrapy with version number 0.24.6 / 1.0.0rc3 is already available to download.
What is Scrapy ?
Scrapy is A light Web crawling framework written in Python to help with screen-scraping procedures and the extraction of data from the Web .
Scrappy is written 100% in Python and can be utilized for simple data mining, to page monitoring, Web search engines and even for code testing.Scrapy is not a search engine in the true meaning of the word, but it acts like one (without the indexing part). Nevertheless Scrapy can be a great tool to build your search engine logic on.The true power of this framework relies in its core’s versatility, Scrapy being a system on which to build generic or dedicated search spiders (crawlers) on.While this might sound very complicated to non-technical users, with a quick look over the documentation and available tutorials, it’s pretty simple to see how Scrapy has managed to take out all the hard-work out of this and reduce the entire process to just a few lines of code (for easier, smaller crawlers).
This is changelog for Scrapy version 0.24.6 / 1.0.0rc3 :
- Add UTF8 encoding header to templates
- Telnet console now binds to 127.0.0.1 by default
- Update debian/ubuntu install instructions
- Disable smart strings in lxml XPath evaluations
You can read the complete changelog and also download this latest version on their homepage: scrapy.org