Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / xuxueli/xxl-crawler issues and pull requests
#35 - Is authentication-based crawling supported?
Issue -
State: closed - Opened by xcmonline almost 2 years ago
- 1 comment
#34 - Bump jsoup from 1.11.2 to 1.15.3
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
Labels: dependencies
#33 - Bump jsoup from 1.11.2 to 1.14.2
Pull Request -
State: closed - Opened by dependabot[bot] over 3 years ago
- 1 comment
Labels: dependencies
#32 - Is this project still being maintained and updated?
Issue -
State: closed - Opened by mackyuqimack over 3 years ago
- 1 comment
#31 - Connection does not call requestBody in the loadPageSource() method of the JsoupUtil utility class
Issue -
State: closed - Opened by AlexWang1988 over 3 years ago
- 1 comment
#30 - Bump junit from 4.11 to 4.13.1
Pull Request -
State: closed - Opened by dependabot[bot] over 4 years ago
Labels: dependencies
#29 - Support custom retrieval of page URLs
Pull Request -
State: closed - Opened by igoso over 4 years ago
- 1 comment
#28 - Bump htmlunit from 2.24 to 2.37.0
Pull Request -
State: closed - Opened by dependabot[bot] over 4 years ago
Labels: dependencies
#27 - After using SeleniumPhantomjsPageLoader, baseUri is empty in the document object parsed by jsoup
Issue -
State: closed - Opened by VincentHQL about 5 years ago
- 1 comment
#26 - Handling connect timeouts
Issue -
State: closed - Opened by ghost about 5 years ago
- 1 comment
#25 - Problem with com.xuxueli.crawler.thread.CrawlerThread#processPage
Issue -
State: closed - Opened by landy8530 about 5 years ago
- 1 comment
#24 - Is there a feature for crawling content after logging in?
Issue -
State: closed - Opened by landy8530 about 5 years ago
- 1 comment
#23 - 400 returned when sending a POST request
Issue -
State: closed - Opened by 2637977081 about 5 years ago
- 2 comments
#22 - Regex arguments passed to setWhiteUrlRegexs have no effect
Issue -
State: closed - Opened by zhangnd about 5 years ago
- 1 comment
#21 - [issue] Under multithreading, tryFinish() has a small probability of misjudging the current running state
Issue -
State: open - Opened by 1988tianyuan over 5 years ago
- 1 comment
#20 - Whole-site spreading feature behaves abnormally.
Issue -
State: open - Opened by lihuiby over 5 years ago
- 1 comment
#19 - Thread safety issue
Issue -
State: closed - Opened by lihuiby almost 6 years ago
- 1 comment
#18 - Cannot get the current page URL for pages loaded with HtmlUnitPageLoader
Pull Request -
State: closed - Opened by minggen almost 6 years ago
- 1 comment
#17 - [Feature request] Nested VOs
Issue -
State: open - Opened by yuki-xin almost 6 years ago
- 1 comment
#16 - Test 07 fails after importing version 1.2.2 via Maven
Issue -
State: closed - Opened by 437865981 almost 6 years ago
- 1 comment
#15 - [New feature request] Crawling pages where POST requests to the same URL return different results depending on the parameters
Issue -
State: open - Opened by zhaoxin1124 about 6 years ago
- 1 comment
#14 - Crawling AJAX requests
Issue -
State: closed - Opened by windhc over 6 years ago
#13 - Suggest using JDK 1.8
Issue -
State: closed - Opened by windhc over 6 years ago
- 1 comment
#12 - Faulty logic in CrawlerThread's process method for checking whether the current link is whitelisted
Issue -
State: closed - Opened by lomoye over 6 years ago
- 1 comment
#11 - Change the default page limit from 1M to ∞
Pull Request -
State: closed - Opened by wysnxzm almost 7 years ago
#10 - Crawled pages may be "truncated" ----- 网瘾少年徐志摩
Issue -
State: closed - Opened by wysnxzm almost 7 years ago
#9 - Can pages be fetched after JS execution?
Issue -
State: closed - Opened by hgx about 7 years ago
- 2 comments
#8 - selectType.VAL seems to have a problem
Issue -
State: closed - Opened by wysnxzm about 7 years ago
- 1 comment
#7 - Use get/set for pageVo object injection
Issue -
State: closed - Opened by wysnxzm about 7 years ago
- 1 comment
#6 - Optimize setPageParser to avoid anonymous functions
Issue -
State: closed - Opened by wysnxzm about 7 years ago
- 2 comments
#5 - Too few selectors are currently available
Issue -
State: closed - Opened by wysnxzm about 7 years ago
- 2 comments
#4 - Add a validateTLSCertificates configuration option, since jsoup validates certificates by default when crawling HTTPS
Pull Request -
State: closed - Opened by jnan88 about 7 years ago
#2 - Use a URL regex to check whether URLs are well-formed
Pull Request -
State: closed - Opened by Ngone51 about 7 years ago
#1 - Companies using xxl-crawler, please leave your "company name + official website URL". Thank you.
Issue -
State: open - Opened by xuxueli about 7 years ago
- 4 comments