Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / codelibs/elasticsearch-river-web issues and pull requests

#138 - Improve logger.debug()

Pull Request - State: open - Opened by deka0106 over 4 years ago

#137 - Windows Installer

Issue - State: open - Opened by conradbm over 4 years ago - 1 comment
Labels: question

#135 - How to save an image from page

Issue - State: open - Opened by moorthi07 over 6 years ago

#134 - unexpected behavior of robots_txt option

Issue - State: open - Opened by viktor-svirsky over 6 years ago - 1 comment

#133 - riverweb: command not found

Issue - State: open - Opened by aratrika1 over 6 years ago

#132 - lastModified header

Issue - State: open - Opened by viktor-svirsky almost 7 years ago

#131 - RiverWeb-2.4.0-snapshot connectivity issue

Issue - State: open - Opened by mykola-shulba almost 7 years ago - 1 comment
Labels: question

#130 - include_urls doesn't work

Issue - State: open - Opened by viktor-svirsky almost 7 years ago - 3 comments
Labels: question

#129 - Nothing happens when I run Riverweb

Issue - State: closed - Opened by bducharme about 7 years ago

#128 - Crawler is connecting then disconnecting??

Issue - State: open - Opened by osmanra2 about 7 years ago - 1 comment
Labels: question

#127 - Index objects on page instead of entire page

Issue - State: open - Opened by dutchiexl over 7 years ago

#126 - IsArray does not seem to work

Issue - State: open - Opened by dutchiexl over 7 years ago

#125 - How to enable retry on crawler?

Issue - State: open - Opened by lmatt-bit over 7 years ago

#124 - Can use river web with ES 5.0.0 ?

Issue - State: open - Opened by iDongkil over 7 years ago - 4 comments

#123 - Website indexation on AWS Elasticsearch service

Issue - State: open - Opened by femat almost 8 years ago - 2 comments
Labels: question

#122 - Website Indexation

Issue - State: open - Opened by hkhail almost 8 years ago - 1 comment
Labels: question

#121 - River-web and Cluster Elasticsearch

Issue - State: open - Opened by hkhail almost 8 years ago - 1 comment
Labels: question

#120 - Error when i run riverweb

Issue - State: open - Opened by youcefboukersi almost 8 years ago - 3 comments
Labels: question

#119 - Disconnected - Connection manager is shutting down

Issue - State: closed - Opened by hkhail about 8 years ago - 5 comments
Labels: question

#118 - Problem with news.yahoo.com

Issue - State: closed - Opened by rdrgporto about 8 years ago - 1 comment

#117 - None of the configured nodes are available

Issue - State: open - Opened by devmiyax about 8 years ago - 1 comment

#116 - Help with URL patterns

Issue - State: closed - Opened by beefwellington13 over 8 years ago - 2 comments

#115 - Error message when running river-web

Issue - State: open - Opened by marcshep-scribe over 8 years ago - 4 comments
Labels: question

#114 - Failure starting riverweb under Windows Server 2012

Issue - State: open - Opened by ndrwchn over 8 years ago - 1 comment
Labels: question

#113 - Riverweb stops before crawling

Issue - State: open - Opened by SarahBaeriswyl over 8 years ago - 1 comment
Labels: question

#112 - update ES index when the website has been changing

Issue - State: open - Opened by hanasian over 8 years ago - 1 comment

#111 - ES Version And elasticsearch-river-web Version

Issue - State: open - Opened by hanasian over 8 years ago - 2 comments

#110 - Property 'url' changed to 'urls' in version 2.0

Issue - State: open - Opened by neilneyman over 8 years ago - 1 comment

#109 - None of the configured nodes are available

Issue - State: open - Opened by neilneyman over 8 years ago - 3 comments

#108 - The max file size (1804200/1000000 is exceeded

Issue - State: closed - Opened by beefwellington13 over 8 years ago - 1 comment

#105 - Error: Could not find or load main class org.codelibs.elasticsearch.web.RiverWeb (PC)

Issue - State: open - Opened by osmanra2 over 8 years ago - 5 comments
Labels: question

#104 - Could not find or load main class org.codelibs.elasticsearch.web.RiverWeb

Issue - State: closed - Opened by aroonseenamurthy almost 9 years ago - 1 comment

#103 - Crawl page immediately when page is updated

Issue - State: open - Opened by audunru almost 9 years ago

#102 - Ignoring already stored URL's

Pull Request - State: open - Opened by Choumy almost 9 years ago - 1 comment

#101 - File System Crawling

Issue - State: open - Opened by ln-lv almost 9 years ago - 7 comments
Labels: question

#100 - Schema.org facetting

Issue - State: open - Opened by marvink almost 9 years ago

#99 - URL with Parameters

Issue - State: closed - Opened by marvink about 9 years ago - 1 comment

#98 - Retrieve elasticsearch info from properties file

Issue - State: closed - Opened by marevol about 9 years ago
Labels: enhancement

#97 - JDK-7 ? or must use 8?

Issue - State: open - Opened by jbardu about 9 years ago - 1 comment
Labels: question

#96 - ExcludeFilters are sometimes ignored

Issue - State: open - Opened by LeNightHawk about 9 years ago - 4 comments
Labels: question

#95 - Improve a log message for skipping scraping

Issue - State: closed - Opened by marevol about 9 years ago
Labels: task

#94 - Riverweb stops after indexing < 200 pages

Issue - State: closed - Opened by neilneyman about 9 years ago - 6 comments
Labels: question

#93 - Need to poll data from crawlingUrlQueue if over maxCrawlingQueueSize

Issue - State: closed - Opened by marevol about 9 years ago
Labels: bug

#92 - "target.pattern" does not work

Issue - State: closed - Opened by marevol about 9 years ago
Labels: bug

#91 - Use version info from pom.xml

Issue - State: closed - Opened by marevol about 9 years ago
Labels: task

#90 - Use javascript as a default lang

Issue - State: closed - Opened by marevol about 9 years ago
Labels: task

#89 - Refactoring for overwrite/incremental options

Issue - State: closed - Opened by marevol about 9 years ago
Labels: task

#88 - Create mappings for S2Robot if they does not exist

Issue - State: closed - Opened by marevol about 9 years ago
Labels: enhancement

#87 - Change to Java Application

Issue - State: closed - Opened by marevol about 9 years ago
Labels: enhancement

#86 - River is deprecated in ES 1.5

Issue - State: open - Opened by wenchiching about 9 years ago - 4 comments
Labels: question

#84 - always only 10 out of 50 start urls are visited

Issue - State: open - Opened by abou over 9 years ago - 1 comment

#83 - Not able to start elasticsearch 1.4.3 with river-web 1.4.0

Issue - State: open - Opened by szelee over 9 years ago

#82 - Combining completion suggester

Issue - State: open - Opened by orenorgad over 9 years ago - 1 comment

#81 - use of "isChildUrl"

Issue - State: open - Opened by zhacli over 9 years ago

#80 - Remove Java 7 and Seasar2 support

Issue - State: closed - Opened by marevol over 9 years ago - 1 comment
Labels: enhancement

#79 - indexing pdf content

Issue - State: open - Opened by jirkaMat over 9 years ago - 2 comments
Labels: question

#78 - Incremental crawl error

Issue - State: open - Opened by oneshot-nc over 9 years ago - 9 comments
Labels: question

#77 - Proposed improvements

Issue - State: open - Opened by oneshot-nc over 9 years ago - 1 comment
Labels: question

#76 - Create robot index asynchronously

Issue - State: closed - Opened by marevol over 9 years ago
Labels: enhancement

#75 - River Web does not support configuration reloading?!

Issue - State: open - Opened by dweidenfeld over 9 years ago - 1 comment
Labels: question

#74 - robotsTxt Parameter Not Working

Issue - State: closed - Opened by necouchman over 9 years ago - 1 comment

#73 - [ERROR][org.codelibs.robot.helper.impl.LogHelperImpl] Crawling Exception

Issue - State: open - Opened by abanger over 9 years ago - 1 comment
Labels: question

#72 - Failed to load class with value [web]

Issue - State: open - Opened by vishal-grazitti over 9 years ago - 3 comments
Labels: question

#71 - Update for Elasticsearch 1.4

Issue - State: closed - Opened by marevol over 9 years ago
Labels: enhancement

#70 - Nothing is crawled while using ExcludeFilter

Issue - State: open - Opened by Choumy over 9 years ago - 2 comments
Labels: question

#69 - Update to S2Robot 0.8.0

Issue - State: closed - Opened by marevol over 9 years ago
Labels: enhancement

#68 - Incompatibility with river-imap

Issue - State: open - Opened by finalspy almost 10 years ago - 6 comments
Labels: question

#67 - Invalid indexing of links on the same page.

Issue - State: closed - Opened by SuperTank almost 10 years ago - 2 comments
Labels: bug

#65 - Overwrite and version of doc

Issue - State: open - Opened by Choumy almost 10 years ago

#64 - Crawling Activity Stopped

Issue - State: closed - Opened by Choumy almost 10 years ago - 2 comments
Labels: question

#63 - Documentation

Issue - State: open - Opened by ghost almost 10 years ago - 1 comment
Labels: question

#62 - EAS returns jiberish

Issue - State: open - Opened by ilazaridis almost 10 years ago - 3 comments
Labels: question

#61 - Getting error right after installing quartz and river-web

Issue - State: open - Opened by iouri-kostine almost 10 years ago - 3 comments
Labels: question

#60 - Multiple Meta tag crawling issue

Issue - State: open - Opened by selvas4u almost 10 years ago - 5 comments
Labels: question

#59 - Allow crawling HTTPS with self-signed certificates

Issue - State: open - Opened by Fapiko almost 10 years ago - 1 comment

#58 - Crawler ignoring .htm files

Issue - State: closed - Opened by sjrand almost 10 years ago - 2 comments

#57 - River deleted without crawling, property<requestListener> not found

Issue - State: closed - Opened by sjrand almost 10 years ago - 3 comments

#56 - Failed to register one time crawling after ES startup

Issue - State: closed - Opened by marevol almost 10 years ago
Labels: bug

#55 - Replace MVEL with Elasticsearch's ScriptService

Issue - State: closed - Opened by marevol almost 10 years ago
Labels: enhancement

#54 - Double Quotes

Issue - State: closed - Opened by ilazaridis almost 10 years ago - 2 comments
Labels: question

#53 - Is it possible to index rest web services..?

Issue - State: open - Opened by srinivasv2 almost 10 years ago - 1 comment
Labels: question

#52 - Error during elastic search startup

Issue - State: open - Opened by selvas4u almost 10 years ago - 15 comments
Labels: question

#51 - The check method of status

Issue - State: closed - Opened by miyuki25 almost 10 years ago - 2 comments
Labels: question

#50 - 動的にクロールURLを設定するには

Issue - State: closed - Opened by johna1203 about 10 years ago - 2 comments
Labels: question

#49 - Update dependencies

Issue - State: closed - Opened by marevol about 10 years ago
Labels: enhancement

#48 - How can I get attribute src of the image tag?

Issue - State: closed - Opened by johna1203 about 10 years ago - 2 comments
Labels: question

#47 - question about support of robots.txt

Issue - State: closed - Opened by miyuki25 about 10 years ago - 2 comments
Labels: question

#46 - How to delete the lost web page?

Issue - State: closed - Opened by miyuki25 about 10 years ago - 4 comments
Labels: question

#45 - Duplicated contents of different URLs

Issue - State: closed - Opened by miyuki25 about 10 years ago - 2 comments
Labels: question

#44 - How to index secured page(via Forms authentication) using Elastic Search service

Issue - State: open - Opened by srinivasv2 about 10 years ago - 2 comments
Labels: question

#35 - Duplicated URLs

Issue - State: open - Opened by mezuqu about 10 years ago - 13 comments
Labels: question

#28 - question - is there a way to specify a file containing URL's to crawl......?

Issue - State: open - Opened by Chadwiki over 10 years ago - 2 comments
Labels: question

#21 - Not all pages being crawled

Issue - State: open - Opened by timcreatewell over 10 years ago - 3 comments
Labels: question

#18 - NoClassSettingsException[Failed to load class with value [web]]

Issue - State: closed - Opened by timcreatewell over 10 years ago - 18 comments
Labels: question

#6 - Change the target version of Elasticsearch to 1.0.0.RC1

Pull Request - State: closed - Opened by johtani over 10 years ago

#5 - Cannot create index?

Issue - State: closed - Opened by timcreatewell over 10 years ago - 12 comments