Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / simgroep/concurrent-spider-bundle issues and pull requests

#115 - DEVOPS-13 strict composer validation

Pull Request - State: closed - Opened by othillo over 7 years ago - 3 comments

#114 - SEARCH-805 filter content from style tags

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 1 comment

#113 - SEARCH-805 filter content from style tags

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 1 comment

#112 - SEARCH-801 redis moved outsited uris foreach

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 3 comments

#111 - added error 400 to exception with reject

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 3 comments

#110 - SEARCH-772 added content parsing with preg

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 2 comments

#109 - Cookie require pages

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 2 comments

#108 - added rabbitmq param vhost

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 2 comments

#107 - phpword unrecognized content fix

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 3 comments

#106 - SEARCH-774 low numDocs issue

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 4 comments

#105 - SEARCH-769 added predis bundle

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 4 comments

#104 - Rabbit queue filter

Pull Request - State: closed - Opened by e0jopeka over 7 years ago - 5 comments

#103 - SEARCH-742 Introduce a keyword which can be used to exclude pages fro…

Pull Request - State: closed - Opened by websid over 7 years ago - 2 comments

#102 - Commit in this repository that I did not make?

Issue - State: closed - Opened by asm89 over 7 years ago - 3 comments

#101 - Bugfix/search 724 x

Pull Request - State: closed - Opened by websid over 7 years ago - 2 comments

#100 - Bugfix/search 724

Pull Request - State: closed - Opened by websid over 7 years ago - 1 comment

#99 - SEARCH-677 302 redirect leading to 404, page still found in search re…

Pull Request - State: closed - Opened by websid over 7 years ago - 2 comments

#98 - search-652 separate revisit messages and put it in other queue, fixed…

Pull Request - State: closed - Opened by lkalinka over 7 years ago - 5 comments

#97 - Bugfix/search 678

Pull Request - State: closed - Opened by websid over 7 years ago - 3 comments

#96 - Hackaton

Pull Request - State: closed - Opened by websid over 7 years ago - 4 comments

#95 - add separate queue for crawling pdf documents

Pull Request - State: closed - Opened by lkalinka over 7 years ago - 2 comments

#94 - Search 686

Pull Request - State: closed - Opened by lkalinka over 7 years ago - 1 comment

#93 - search-668 added removing message from queue to unstuck crawler and g…

Pull Request - State: closed - Opened by lkalinka almost 8 years ago - 1 comment

#92 - search-668 add new cases for phpunittest

Pull Request - State: closed - Opened by lkalinka almost 8 years ago - 1 comment

#91 - search-668 added additional check for queuing of new documents

Pull Request - State: closed - Opened by lkalinka almost 8 years ago - 1 comment

#90 - SEARCH-665 Added support for http status code 301

Pull Request - State: closed - Opened by smolowik almost 8 years ago - 1 comment

#89 - search-668 fixing phpoffice dependencies to not shown error with miss…

Pull Request - State: closed - Opened by lkalinka almost 8 years ago - 1 comment

#88 - search-668 replace php library for reading pdf by bash command, fixed…

Pull Request - State: closed - Opened by lkalinka almost 8 years ago - 2 comments

#87 - vdb/php-spider composer dependies problem

Issue - State: closed - Opened by RomainMarecat about 8 years ago - 1 comment

#86 - SEARCH-639 Disable commit to solar when deleting document

Pull Request - State: closed - Opened by smolowik about 8 years ago - 1 comment

#85 - Update to newest version of smalot/pdfparser library

Pull Request - State: closed - Opened by smolowik over 8 years ago - 1 comment

#84 - Spider now looking for url's in 'loc' element

Pull Request - State: closed - Opened by smolowik over 8 years ago

#83 - Skipping url with rel=nofollow. Catching error when loading documents

Pull Request - State: closed - Opened by smolowik over 8 years ago - 4 comments

#82 - Moved isUrlBlacklisted, isUrlWhitelisted to separate class

Pull Request - State: closed - Opened by smolowik over 8 years ago - 2 comments

#81 - Fixed recrawl. Urls now are savet without last '/'

Pull Request - State: closed - Opened by smolowik over 8 years ago

#80 - Regexp for blacklist is now case insensetive. ID generated from url i…

Pull Request - State: closed - Opened by smolowik over 8 years ago

#79 - Added recrawl command, for now only delete documents that are on blac…

Pull Request - State: closed - Opened by smolowik over 8 years ago

#78 - Added gc_collect_cycles

Pull Request - State: closed - Opened by smolowik over 8 years ago

#77 - SEARCH-469 Replace space to %20. Add Unit tests

Pull Request - State: closed - Opened by smolowik over 8 years ago

#76 - evert php 5.4 travis tests

Pull Request - State: closed - Opened by lkalinka over 8 years ago

#75 - move repository to general simgroep namespace, update solarium, updat…

Pull Request - State: closed - Opened by lkalinka over 8 years ago

#74 - fixed phpunit tests

Pull Request - State: closed - Opened by lkalinka almost 9 years ago

#73 - remove Html exception when minimum chars are not reached to prevent f…

Pull Request - State: closed - Opened by lkalinka almost 9 years ago

#72 - Do not autocommit anymore and change whitelist behavior

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#71 - Deal with scenarios where date/time cannot be parsed.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#70 - Several fixes and more explainable logging

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#69 - Prevent not allowed urls to be queued.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#68 - First crawl new found urls and then persist.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#67 - added feature to empty an index/core

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#66 - Added function to find the amount of documents in a core.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#65 - Make the crawler smart when pages are revisited.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#64 - Be able to save documents from multiple cores.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#63 - Deal with multiple endpoints

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#62 - replaced packages from Kees github to lkalinka packages, update depen…

Pull Request - State: closed - Opened by lkalinka almost 9 years ago

#61 - Add custom NelmioSolariumBundle repository

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#60 - Usage of NelmioSolariumBundle and enabled load balancing.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#59 - Increase amount of rows to be found.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#58 - Small fix to make range queries more safe and precisly.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#57 - Service misconfiguration fix.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#56 - Don't queue URL if it's not whitelisted or within the same domain

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#55 - Increased timeout

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#54 - Fixes PDF documents without content-disposition header.

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#53 - add whitelist to crawler

Pull Request - State: closed - Opened by lkalinka almost 9 years ago

#52 - Be able to find expired urls

Pull Request - State: closed - Opened by keesschepers almost 9 years ago

#51 - Support for keeping documents up to date.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#50 - Support for keeping documents up to date.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#49 - Upgrade vendor and changes for php-amqplib

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#48 - fixed typo

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#47 - fixed types to not return empty values, fixed phpunit tests

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#46 - pass metadata to event

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#45 - Removed SIM related content extraction.

Pull Request - State: closed - Opened by keesschepers about 9 years ago - 1 comment

#43 - Improved filename guessing.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#42 - File size limit + ignore empty messages.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#41 - remove documents from solr when they are not found, remove unused par…

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#40 - Search-212 Add document resolver

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#39 - Temporary use our own solarium client in review of the pull-request

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#38 - Proxy should be mandatory.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#37 - Make configuration less strict.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#35 - Content blacklisting.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#34 - update dependencies with solarium in version 3

Pull Request - State: closed - Opened by lkalinka about 9 years ago - 1 comment

#33 - Relocated blacklist

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#32 - Relocated blacklist

Pull Request - State: closed - Opened by lkalinka about 9 years ago

#31 - Refactored code

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#29 - Better handling of invalid content / Deal with shebang URL's

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#28 - Better handling of invalid content.

Pull Request - State: closed - Opened by keesschepers about 9 years ago

#27 - Relocated blacklist

Pull Request - State: closed - Opened by nickvkaam over 9 years ago

#26 - SEARCH-272 SEARCH-191

Pull Request - State: closed - Opened by lkalinka over 9 years ago - 1 comment

#25 - add core name and passing it to rabbitMq, fixed phpunit tests

Pull Request - State: closed - Opened by lkalinka over 9 years ago

#24 - Add missed solr fields

Pull Request - State: closed - Opened by lkalinka over 9 years ago

#23 - Service extraction

Pull Request - State: closed - Opened by Breuls over 9 years ago

#22 - More tests & coverage.

Pull Request - State: closed - Opened by Breuls over 9 years ago

#21 - Added test for StartCrawlerCommand.

Pull Request - State: closed - Opened by Breuls over 9 years ago

#20 - Expansion of tests.

Pull Request - State: closed - Opened by Breuls over 9 years ago

#19 - Search-226

Pull Request - State: closed - Opened by lkalinka over 9 years ago

#18 - Fixed typo's and grammar.

Pull Request - State: closed - Opened by Breuls over 9 years ago

#17 - Made mapping configureable, added tests and fixed configuration bug.

Pull Request - State: closed - Opened by keesschepers over 9 years ago

#4 - Update Solarium client to 3.x

Issue - State: closed - Opened by keesschepers over 9 years ago