Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / turicas/crau issues and pull requests

#21 - Update DOWNLOAD_TIMEOUT to 15s

Pull Request - State: closed - Opened by rhenanbartels about 2 years ago

#20 - Remove custom_settings from spider

Issue - State: open - Opened by turicas about 2 years ago

#19 - Add option to autothrottle scrapy

Pull Request - State: closed - Opened by rhenanbartels over 2 years ago

#18 - UnicodeDecodeError: 'ascii' codec can't decode byte

Issue - State: open - Opened by rhenanbartels over 2 years ago

#17 - Feature/same domain option

Pull Request - State: closed - Opened by rhenanbartels over 2 years ago

#16 - Headers not preserved correctly

Issue - State: open - Opened by JustAnotherArchivist about 5 years ago - 2 comments

#15 - Transfer encoding is not preserved

Issue - State: open - Opened by JustAnotherArchivist about 5 years ago

#14 - Change default settings to optimize broad crawls

Issue - State: open - Opened by turicas about 5 years ago

#13 - Invalid Syntax

Issue - State: closed - Opened by Solemnly about 5 years ago - 1 comment

#12 - Black error on Ubuntu 18.04.03

Issue - State: closed - Opened by Solemnly about 5 years ago - 4 comments

#11 - Check possibility of migrating to scrapy.spiders.CrawlSpider

Issue - State: open - Opened by turicas about 5 years ago

#10 - Change User-Agent

Issue - State: closed - Opened by turicas about 5 years ago

#9 - Remove URL fragment before saving

Issue - State: closed - Opened by turicas about 5 years ago

#8 - Add option to restrict domains

Issue - State: open - Opened by turicas about 5 years ago

#7 - Expose spider configurations to `crau archive` and close #4

Pull Request - State: closed - Opened by victor-torres about 5 years ago - 6 comments

#6 - Create a scrapy backed cache based on WARC

Issue - State: open - Opened by turicas about 5 years ago

#5 - Create `search` command

Issue - State: open - Opened by turicas about 5 years ago

#4 - Expose spider configurations to `crau archive`

Issue - State: closed - Opened by turicas about 5 years ago

#3 - Implement a browser-based spider

Issue - State: open - Opened by turicas about 5 years ago

#2 - Capture any HTTP code

Issue - State: closed - Opened by turicas about 5 years ago

#1 - Check if redirects are being written to WARC file

Issue - State: closed - Opened by turicas about 5 years ago - 2 comments