Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / scrapy/slybot issues and pull requests

#86 - Deprecate spiderlets

Pull Request - State: closed - Opened by rdowinton almost 10 years ago - 14 comments

#85 - [WIP] Scrapy 0.25+ support

Pull Request - State: closed - Opened by kmike over 10 years ago

#81 - added atom and sitemap link extractors, added attribute for xpath link

Pull Request - State: closed - Opened by kalessin about 11 years ago

#80 - build project item classes separated from templates iteration, in order ...

Pull Request - State: closed - Opened by kalessin about 11 years ago

#79 - allow to override some spider specs using spider arguments

Pull Request - State: closed - Opened by kalessin about 11 years ago - 5 comments

#74 - Update index.rst

Pull Request - State: closed - Opened by rvogel over 11 years ago

#72 - Rename sample project 'Plants' to 'SampleProject'

Pull Request - State: closed - Opened by kalessin over 11 years ago

#71 - Show error details when failing to parse project json file

Pull Request - State: closed - Opened by andresp99999 over 11 years ago

#66 - added spiderlets code and docs

Pull Request - State: closed - Opened by kalessin almost 12 years ago

#63 - Fix small typo

Pull Request - State: closed - Opened by amferraz almost 12 years ago

#62 - Fix exception when Content-Type is empty and response type is not Html

Pull Request - State: closed - Opened by andresp99999 almost 12 years ago

#61 - avoid exception with responses which has headers with no Content-Type

Pull Request - State: closed - Opened by kalessin almost 12 years ago

#59 - Add allowed_domains variable to spider definition

Pull Request - State: closed - Opened by andresp99999 almost 12 years ago - 2 comments

#58 - allow to specify extra meta data on spiders specs, and settings (global

Pull Request - State: closed - Opened by kalessin about 12 years ago - 1 comment

#56 - Use scrapy._get_inputs to get the form inputs

Pull Request - State: closed - Opened by andresp99999 about 12 years ago

#55 - added missing project.json file

Pull Request - State: closed - Opened by kalessin about 12 years ago

#54 - make spidermanager to load the project specs from the specialized

Pull Request - State: closed - Opened by kalessin about 12 years ago

#53 - variants type must not be changed (_process_extracted_data changed it to

Pull Request - State: closed - Opened by kalessin about 12 years ago

#52 - add support for link extraction from rss files

Pull Request - State: closed - Opened by kalessin about 12 years ago

#50 - added project specs schema validation, using jsonschema library

Pull Request - State: closed - Opened by kalessin about 12 years ago

#49 - a recent fix in scrapely makes a test in slybot to match a template when

Pull Request - State: closed - Opened by kalessin about 12 years ago

#48 - add dont_filter flags in login/generic form associated requests, in

Pull Request - State: closed - Opened by kalessin about 12 years ago

#47 - added project specs schema validation

Pull Request - State: closed - Opened by kalessin about 12 years ago

#46 - Add support for 'file' type in generic_form fields

Pull Request - State: closed - Opened by andresp99999 about 12 years ago - 5 comments

#45 - update extractors code to new specs

Pull Request - State: closed - Opened by kalessin about 12 years ago

#43 - Add support for named parameters in generic forms.

Pull Request - State: closed - Opened by andresp99999 about 12 years ago

#42 - migrate spider to definitive specifications of items.

Pull Request - State: closed - Opened by kalessin about 12 years ago

#41 - fixed spec examples, removed unneeded keys in root object

Pull Request - State: closed - Opened by kalessin about 12 years ago

#40 - remove old forms code from spider, as it is coupled to an specific UI

Pull Request - State: closed - Opened by kalessin about 12 years ago

#39 - Generic form implementation

Pull Request - State: closed - Opened by andresp99999 about 12 years ago - 10 comments

#38 - add generic form to project specification.

Pull Request - State: closed - Opened by andresp99999 about 12 years ago

#37 - Add project definition for generic form request.

Pull Request - State: closed - Opened by andresp99999 about 12 years ago

#36 - allow spidermanager to instantiate a custom spider class

Pull Request - State: closed - Opened by kalessin about 12 years ago

#35 - added zipfile spider manager, added spider manager test, and move a

Pull Request - State: closed - Opened by kalessin about 12 years ago

#34 - include login form urls in allowed domains, as sometimes the login page

Pull Request - State: closed - Opened by kalessin about 12 years ago

#33 - added support for login forms in spider, added tests

Pull Request - State: closed - Opened by kalessin over 12 years ago

#32 - fix listing of spider names with a period character

Pull Request - State: closed - Opened by kalessin over 12 years ago

#31 - present sample project spec attributes in separate lines in order

Pull Request - State: closed - Opened by kalessin over 12 years ago

#30 - use generators instead of lists in spider, in order to reduce memory

Pull Request - State: closed - Opened by kalessin over 12 years ago

#29 - added basic usage in README.rst

Pull Request - State: closed - Opened by kalessin over 12 years ago - 3 comments

#27 - use hostname instead of netloc for extracting domain from url, in order

Pull Request - State: closed - Opened by kalessin over 12 years ago

#26 - improved project specs, and added documentation.

Pull Request - State: closed - Opened by kalessin over 12 years ago

#25 - extract links from link tags

Pull Request - State: closed - Opened by kalessin over 12 years ago

#24 - allow to import a project from a zip file, improved/simplified some

Pull Request - State: closed - Opened by kalessin over 12 years ago

#23 - use hostname instead of netloc for extracting domain from url, in order

Pull Request - State: closed - Opened by kalessin over 12 years ago

#22 - add name to Pipeline extractor and regex extractor names

Pull Request - State: closed - Opened by kalessin over 12 years ago

#21 - fix replacement of empty region, added test

Pull Request - State: closed - Opened by kalessin over 12 years ago

#19 - added start_url meta to start requests.

Pull Request - State: closed - Opened by kalessin over 12 years ago

#18 - exception fixes in extractors

Pull Request - State: closed - Opened by kalessin over 12 years ago

#16 - separated duplicates detector code in an item pipeline

Pull Request - State: closed - Opened by kalessin over 12 years ago

#13 - allow start_urls to be overriden by spider args

Pull Request - State: closed - Opened by kalessin over 12 years ago

#12 - avoid link extractor to return unicode urls

Pull Request - State: closed - Opened by kalessin over 12 years ago

#11 - compare sets instead of list when testing spidermanager list

Pull Request - State: closed - Opened by kalessin over 12 years ago

#10 - added spider tests and some fixes

Pull Request - State: closed - Opened by kalessin over 12 years ago

#9 - added new test case in link extractor and fix handling

Pull Request - State: closed - Opened by kalessin over 12 years ago

#8 - Support link extraction from standard onclick inline js

Pull Request - State: closed - Opened by kalessin over 12 years ago

#6 - for safe html extractor, extract only if region has no empty html code

Pull Request - State: closed - Opened by kalessin almost 13 years ago

#3 - use htmlregion text_content in number, price and text extractors

Pull Request - State: closed - Opened by kalessin almost 13 years ago - 1 comment

#1 - changes needed for correct working of scrapinghub

Pull Request - State: closed - Opened by kalessin about 13 years ago