Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / howie6879/ruia issues and pull requests
#159 - Feat: process_start_urls in parallel
Pull Request -
State: open - Opened by aircloud over 1 year ago
- 2 comments
#158 - How can I tell whether a redirect occurred?
Issue -
State: open - Opened by ray0728 over 1 year ago
- 1 comment
#157 - Logs
Issue -
State: closed - Opened by dickermoshe over 1 year ago
- 1 comment
#156 - It would be great if distributed crawling were supported
Issue -
State: open - Opened by xmydjx about 2 years ago
#155 - docs.python-ruia.org is not available
Issue -
State: open - Opened by asluchevskiy about 2 years ago
- 4 comments
#154 - How do I close the session after adding a socks5 proxy via middleware?
Issue -
State: closed - Opened by killbus over 2 years ago
#153 - Please add more features, more examples, and more documentation, and keep the project maintained long-term
Issue -
State: open - Opened by apollo9527a over 2 years ago
#152 - Fixed: RegexField default
Pull Request -
State: closed - Opened by 123seven over 2 years ago
#151 - ruia raises an error when encoding an XML document with lxml
Issue -
State: closed - Opened by showthesunli over 2 years ago
- 1 comment
#150 - What is an appropriate value for worker_numbers?
Issue -
State: closed - Opened by atzouhua almost 3 years ago
- 1 comment
#149 - Example code raises an error when run
Issue -
State: closed - Opened by mogeko about 3 years ago
- 3 comments
#148 - How should I pass start_urls to a Spider?
Issue -
State: closed - Opened by mogeko about 3 years ago
- 1 comment
#147 - Add type annotations to spider
Pull Request -
State: closed - Opened by Vastxiao over 3 years ago
#146 - Replace aiohttp with httpx to support HTTP/2
Issue -
State: open - Opened by Vastxiao over 3 years ago
- 1 comment
#145 - refactor: remove redundant param "is_async_start" in Spider
Pull Request -
State: closed - Opened by laggardkernel over 3 years ago
#144 - refactor: remove redundant param "_signal" in Spider.stop()
Pull Request -
State: closed - Opened by laggardkernel over 3 years ago
#143 - fix: limit filter in _parse_html() to skip only field "target_item"
Pull Request -
State: closed - Opened by laggardkernel over 3 years ago
#142 - Request sent via POST, but the body in the request is not received
Issue -
State: closed - Opened by superniao666 over 3 years ago
- 2 comments
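#141 - With concurrency 5, crawling 1000 pages in a loop, the CPU is exhausted (down to 0) but memory is not used up; could someone check what is wrong with the code?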
Issue -
State: closed - Opened by superniao666 over 3 years ago
- 3 comments
#140 - Could the pattern-matching tool pampy be used to support JSON parsing?
Issue -
State: closed - Opened by jiangfubang over 3 years ago
- 1 comment
Labels: question
#139 - Question about using proxies
Issue -
State: closed - Opened by atzouhua over 3 years ago
- 1 comment
#138 - Running the example code raises an error
Issue -
State: closed - Opened by qgyhd1234 over 3 years ago
- 10 comments
#137 - [suggestion] Retry logic could add or switch the proxy IP
Issue -
State: closed - Opened by michael-liumh over 3 years ago
- 8 comments
#136 - python3.9 remove asyncio.Task.all_tasks()
Issue -
State: closed - Opened by happyli0826 over 3 years ago
- 3 comments
#135 - Trouble scraping deck.tk/deckstats.net
Issue -
State: closed - Opened by Triquetra over 3 years ago
- 7 comments
Labels: bug, enhancement
#134 - Would be nice to be able to pass in "start_urls"
Issue -
State: closed - Opened by JacobJustice over 3 years ago
- 7 comments
#133 - Improve Chinese documentation
Issue -
State: open - Opened by howie6879 over 3 years ago
Labels: enhancement
#132 - Update SOCKS5 proxy example
Pull Request -
State: closed - Opened by Leezj9671 over 3 years ago
- 1 comment
#131 - Is it possible to use SOCKS5 proxy?
Issue -
State: closed - Opened by Leezj9671 over 3 years ago
- 9 comments
#130 - AttributeError: 'Response' object has no attribute 'html'
Issue -
State: closed - Opened by xiaoniaoyouhuajiang almost 4 years ago
- 6 comments
#129 - Supported Python3.9
Issue -
State: closed - Opened by howie6879 almost 4 years ago
#128 - How do I add a proxy and headers, and send a POST?
Issue -
State: closed - Opened by yangtengtx almost 4 years ago
- 5 comments
#127 - A Ruia plugin for building Distributed crawling/scraping
Issue -
State: open - Opened by howie6879 almost 4 years ago
- 3 comments
Labels: enhancement, Plugin
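#126 - I'd like to compare a distributed function scheduling framework against yours, to see which needs less code and gives more freedom to crawl any website. Discussion welcome.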
Issue -
State: closed - Opened by ydf0509 almost 4 years ago
- 9 comments
#125 - Framework usage question: in an image download scenario, why does the CPU max out while network I/O is nearly zero?
Issue -
State: closed - Opened by lonsty almost 4 years ago
- 3 comments
Labels: bug, enhancement
#124 - Item: target_item is expected, more info: https://docs.python-ruia.org/en/apis/item.html
Issue -
State: closed - Opened by Gaylone almost 4 years ago
#123 - Item: target_item is expected, more info: https://docs.python-ruia.org/en/apis/item.html,
Issue -
State: closed - Opened by Gaylone almost 4 years ago
#122 - There is an error in the win10 platform:raise RuntimeError('Event loop stopped before Future completed.')
Issue -
State: closed - Opened by zeinzbern about 4 years ago
- 5 comments
#121 - Extension for batch POST requests with different payload data
Issue -
State: closed - Opened by peiyaoli about 4 years ago
- 4 comments
#120 - XML parse issue
Issue -
State: closed - Opened by peiyaoli about 4 years ago
- 3 comments
#119 - Add trackback to logger.error() for spider
Issue -
State: closed - Opened by ts709 about 4 years ago
- 5 comments
#118 - How do I get the content of a Meta field?
Issue -
State: closed - Opened by jackkam85 about 4 years ago
- 1 comment
#117 - How can ruia use a spoofed IP or an IP pool?
Issue -
State: closed - Opened by shuqian2017 over 4 years ago
- 1 comment
#116 - Bug when installing with pip
Issue -
State: closed - Opened by heyaug over 4 years ago
- 3 comments
#115 - Could the documentation be more detailed?
Issue -
State: closed - Opened by lyg4795 over 4 years ago
- 5 comments
#114 - Memory Leak on big number of pages
Issue -
State: closed - Opened by vkudyushev over 4 years ago
- 7 comments
#113 - Can beautifulsoup be used in place of the default parser?
Issue -
State: closed - Opened by synodriver over 4 years ago
- 3 comments
#112 - Add support for parsing JSON on item definition?
Issue -
State: closed - Opened by owen800q over 4 years ago
- 2 comments
#111 - RecursionError: maximum recursion depth exceeded while calling a Python object
Issue -
State: closed - Opened by Vastxiao over 4 years ago
- 1 comment
#110 - fix Chinese character gash problem in RegexField.extract
Pull Request -
State: closed - Opened by fengdongfa1995 over 4 years ago
#109 - RegexField.extract() garbles Chinese characters
Issue -
State: closed - Opened by fengdongfa1995 over 4 years ago
- 3 comments
#108 - could ruia fetch images? (Can it download images?)
Issue -
State: closed - Opened by scil over 4 years ago
- 1 comment
#107 - data cleaning with staticmethod
Issue -
State: closed - Opened by alenucci over 4 years ago
- 1 comment
#106 - Extracting JSON data structures with defined Field classes
Issue -
State: closed - Opened by YellowDong over 4 years ago
- 7 comments
#105 - Custom keyword arguments passed to start() cannot be accessed in instance methods
Issue -
State: closed - Opened by YellowDong over 4 years ago
- 4 comments
#104 - Piping multiple scrapers
Issue -
State: closed - Opened by lormayna over 4 years ago
- 5 comments
#103 - Fix typo
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#102 - Add plugin documentation and examples
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 7 comments
#101 - Calling `self.start` as an instance method for a `Spider`
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 11 comments
#100 - Log crucial information regardless of log-level
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 13 comments
#99 - How to pass the metadata in a response to an item
Issue -
State: closed - Opened by YISION almost 5 years ago
- 6 comments
#98 - Errors start from the very first example... and when I followed it to crawl other websites, I got errors too.
Issue -
State: closed - Opened by YuanQingLe almost 5 years ago
- 3 comments
#97 - Fix asyncio error
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#96 - Fix #94
Pull Request -
State: closed - Opened by panhaoyu almost 5 years ago
- 1 comment
#95 - Characters not supported
Issue -
State: closed - Opened by panhaoyu almost 5 years ago
- 1 comment
#94 - Documentation about IgnoreThisItem
Issue -
State: closed - Opened by panhaoyu almost 5 years ago
- 3 comments
#93 - Default (when many=True) shouldn't be enclosed in list
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#92 - Default shoudn't be enclosed in list
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 6 comments
#91 - asyncio `RuntimeError`
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 11 comments
#90 - Show URL in Error for easier debugging
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 11 comments
#89 - Add ElementField
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 2 comments
#88 - No field for capturing raw LXML elements
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 5 comments
#87 - Fix indentation
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#86 - Fix and test for selectors with text() failing
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#85 - Remove .strip() from TextField parsing
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 1 comment
#84 - `TextField` strips strings which may not be desirable
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 9 comments
#83 - `text()` in xpath selector causes an error
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 7 comments
#82 - Don't delay retries
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 6 comments
#81 - Add RETRY_DELAY option
Pull Request -
State: closed - Opened by abmyii almost 5 years ago
- 19 comments
#80 - `DELAY` attribute specifically for retries
Issue -
State: closed - Opened by abmyii almost 5 years ago
- 13 comments
#79 - target_item is expected error
Issue -
State: closed - Opened by r3v1 almost 5 years ago
- 2 comments
#78 - Rate Limiting?
Issue -
State: closed - Opened by FaizShah about 5 years ago
- 5 comments
#77 - Could you briefly explain why the Spider class does not need to be instantiated and can run just by calling start directly?
Issue -
State: closed - Opened by ElderWanng about 5 years ago
- 3 comments
#76 - question: Is there any option for continuous scraping
Issue -
State: closed - Opened by iAnanich about 5 years ago
- 6 comments
#75 - process_item is only call when the callback_result is an Item
Issue -
State: closed - Opened by hiancdtrsnm over 5 years ago
- 1 comment
#74 - Update Documentation of use of spider.request method
Issue -
State: closed - Opened by hiancdtrsnm over 5 years ago
- 2 comments
#73 - spider.request is not awaitable
Issue -
State: closed - Opened by hiancdtrsnm over 5 years ago
- 1 comment
#72 - Pass kwargs to init and return Spider instance
Pull Request -
State: closed - Opened by maxzheng over 5 years ago
- 1 comment
#71 - Add re_flags to pass thru flags keyword to re.compile
Pull Request -
State: closed - Opened by maxzheng over 5 years ago
- 1 comment
#70 - How can new target URL pages be added during crawling?
Issue -
State: closed - Opened by gxtrobot over 5 years ago
- 4 comments
#69 - Start multiple spiders at the same time for truly asynchronous execution
Issue -
State: closed - Opened by ctaoist over 5 years ago
- 6 comments
Labels: bug, enhancement
#68 - ruia does not seem to use a queue to maintain all tasks; if the crawler stops unexpectedly, the next run has to start over from scratch
Issue -
State: closed - Opened by DeemoASCII over 5 years ago
- 7 comments
Labels: enhancement, Plugin
#67 - Suggestions and questions
Issue -
State: closed - Opened by Developer27149 over 5 years ago
- 2 comments
#66 - The middleware in the tutorial takes only 1 parameter but should take 2; the error message should also be improved
Issue -
State: closed - Opened by ofooo over 5 years ago
- 1 comment
#65 - correction "parst" to "parse"
Pull Request -
State: closed - Opened by duolaAOA over 5 years ago
- 1 comment
#56 - A Ruia plugin that uses the motor to store data
Issue -
State: closed - Opened by howie6879 almost 6 years ago
Labels: enhancement, Plugin
#32 - A Ruia plugin for debugging
Issue -
State: closed - Opened by howie6879 almost 6 years ago
- 2 comments
Labels: enhancement, Plugin
#28 - write a middleware to filter repeat request
Issue -
State: closed - Opened by panhaoyu almost 6 years ago
- 2 comments
#21 - Write a website for Ruia
Issue -
State: closed - Opened by howie6879 almost 6 years ago
- 1 comment
Labels: enhancement
#17 - HtmlField?
Issue -
State: closed - Opened by panhaoyu almost 6 years ago
- 6 comments
Labels: enhancement, Plugin