Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / scrapy/slybot issues and pull requests
#86 - Deprecate spiderlets
Pull Request -
State: closed - Opened by rdowinton almost 10 years ago
- 14 comments
#85 - [WIP] Scrapy 0.25+ support
Pull Request -
State: closed - Opened by kmike over 10 years ago
#81 - added atom and sitemap link extractors, added attribute for xpath link
Pull Request -
State: closed - Opened by kalessin about 11 years ago
#80 - build project item classes separated from templates iteration, in order ...
Pull Request -
State: closed - Opened by kalessin about 11 years ago
#79 - allow to override some spider specs using spider arguments
Pull Request -
State: closed - Opened by kalessin about 11 years ago
- 5 comments
#77 - Remove rendering code from field types, and removed deprecated fieldtype synonyms
Pull Request -
State: closed - Opened by kalessin over 11 years ago
#74 - Update index.rst
Pull Request -
State: closed - Opened by rvogel over 11 years ago
#73 - Generalization of link extractors (added classes and allow to define through specs) and added support for feed start urls
Pull Request -
State: closed - Opened by kalessin over 11 years ago
- 7 comments
#72 - Rename sample project 'Plants' to 'SampleProject'
Pull Request -
State: closed - Opened by kalessin over 11 years ago
#71 - Show error details when failing to parse project json file
Pull Request -
State: closed - Opened by andresp99999 over 11 years ago
#66 - added spiderlets code and docs
Pull Request -
State: closed - Opened by kalessin almost 12 years ago
#64 - pass explicit encoding in url processors. Added testcase which motivated the fix.
Pull Request -
State: closed - Opened by kalessin almost 12 years ago
#63 - Fix small typo
Pull Request -
State: closed - Opened by amferraz almost 12 years ago
#62 - Fix exception when Content-Type is empty and response type is not Html
Pull Request -
State: closed - Opened by andresp99999 almost 12 years ago
#61 - avoid exception with responses which has headers with no Content-Type
Pull Request -
State: closed - Opened by kalessin almost 12 years ago
#60 - Fix exception when response is not a HTMLResponse and the response Conte...
Pull Request -
State: closed - Opened by andresp99999 almost 12 years ago
#59 - Add allowed_domains variable to spider definition
Pull Request -
State: closed - Opened by andresp99999 almost 12 years ago
- 2 comments
#58 - allow to specify extra meta data on spiders specs, and settings (global
Pull Request -
State: closed - Opened by kalessin about 12 years ago
- 1 comment
#57 - updated schema validation code in order to work properly with last jsonschema stable version
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#56 - Use scrapy._get_inputs to get the form inputs
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
#55 - added missing project.json file
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#54 - make spidermanager to load the project specs from the specialized
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#53 - variants type must not be changed (_process_extracted_data changed it to
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#52 - add support for link extraction from rss files
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#50 - added project specs schema validation, using jsonschema library
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#49 - a recent fix in scrapely makes a test in slybot to match a template when
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#48 - add dont_filter flags in login/generic form associated requests, in
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#48 - add dont_filter flags in login/generic form associated requests, in
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#47 - added project specs schema validation
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#47 - added project specs schema validation
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#46 - Add support for 'file' type in generic_form fields
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
- 5 comments
#45 - update extractors code to new specs
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#44 - remove legacy unused code. 'scrapes' property is not part of spider spec, only templates.
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#43 - Add support for named parameters in generic forms.
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
#42 - migrate spider to definitive specifications of items.
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#41 - fixed spec examples, removed unneeded keys in root object
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#40 - remove old forms code from spider, as it is coupled to an specific UI
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#39 - Generic form implementation
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
- 10 comments
#38 - add generic form to project specification.
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
#37 - Add project definition for generic form request.
Pull Request -
State: closed - Opened by andresp99999 about 12 years ago
#36 - allow spidermanager to instantiate a custom spider class
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#35 - added zipfile spider manager, added spider manager test, and move a
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#34 - include login form urls in allowed domains, as sometimes the login page
Pull Request -
State: closed - Opened by kalessin about 12 years ago
#33 - added support for login forms in spider, added tests
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#32 - fix listing of spider names with a period character
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#31 - present sample project spec attributes in separate lines in order
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#30 - use generators instead of lists in spider, in order to reduce memory
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#29 - added basic usage in README.rst
Pull Request -
State: closed - Opened by kalessin over 12 years ago
- 3 comments
#28 - Refactor specs, add project object, modify item, items and field. Add examples
Pull Request -
State: closed - Opened by andresp99999 over 12 years ago
#27 - use hostname instead of netloc for extracting domain from url, in order
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#26 - improved project specs, and added documentation.
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#25 - extract links from link tags
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#24 - allow to import a project from a zip file, improved/simplified some
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#23 - use hostname instead of netloc for extracting domain from url, in order
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#22 - add name to Pipeline extractor and regex extractor names
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#21 - fix replacement of empty region, added test
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#19 - added start_url meta to start requests.\
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#18 - exception fixes in extractors
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#17 - refactor extractors application, ensuring type consistency (apply regex ...
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#16 - separated duplicates detector code in an item pipeline
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#15 - Compose type extractor and regex extractor instead of replacing the first by the second
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#14 - allow to specify start_urls as a string, in order to support passing of start urls with scrapinghub api.
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#13 - allow start_urls to be overriden by spider args
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#12 - avoid link extractor to return unicode urls
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#11 - compare sets instead of list when testing spidermanager list
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#10 - added spider tests and some fixes
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#9 - added new test case in link extractor and fix handling
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#8 - Support link extraction from standard onclick inline js
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#7 - include id of template used in extraction as item metadata instead of return appart
Pull Request -
State: closed - Opened by kalessin over 12 years ago
#6 - for safe html extractor, extract only if region has no empty html code
Pull Request -
State: closed - Opened by kalessin almost 13 years ago
#5 - separate code that makes the url filter, and allow to specify not to follow links
Pull Request -
State: closed - Opened by kalessin almost 13 years ago
#4 - fix text extraction to avoid to match whitespace chars (previously wrongly done in adapt method, fixes last commit)
Pull Request -
State: closed - Opened by kalessin almost 13 years ago
#3 - use htmlregion text_content in number, price and text extractors
Pull Request -
State: closed - Opened by kalessin almost 13 years ago
- 1 comment
#2 - fixed ibl spider to work with last changes for supporting link areas extraction
Pull Request -
State: closed - Opened by kalessin about 13 years ago
#1 - changes needed for correct working of scrapinghub
Pull Request -
State: closed - Opened by kalessin about 13 years ago