Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / iipc/webarchive-commons issues and pull requests

#96 - Drop dependency on log4j 1

Pull Request - State: open - Opened by kris-sigur 11 months ago

#95 - Apache httpclient 3.1 sonatype

Issue - State: open - Opened by DEBARYYA over 1 year ago

#94 - Consider syncing up from the Common Crawl fork

Issue - State: closed - Opened by anjackson over 2 years ago - 1 comment

#93 - Compressed WARC InputStream is closed by record iterator.

Issue - State: open - Opened by tlipkis over 3 years ago

#92 - Bump commons-io from 2.4 to 2.7

Pull Request - State: open - Opened by dependabot[bot] over 3 years ago
Labels: dependencies

#90 - Bump junit from 3.8.1 to 4.13.1

Pull Request - State: closed - Opened by dependabot[bot] almost 4 years ago
Labels: dependencies

#89 - WAT extractor: do not fail on missing WARC-Filename in warcinfo record

Pull Request - State: closed - Opened by sebastian-nagel over 4 years ago - 1 comment

#87 - Prevent from stackoverflow by limiting length of matched pattern

Pull Request - State: open - Opened by sebastian-nagel almost 5 years ago - 1 comment

#86 - ExtractingParseObserver: extract rel, hreflang and type attributes

Pull Request - State: closed - Opened by sebastian-nagel almost 5 years ago

#85 - ExtractingParseObserver: extract links from onClick attributes

Pull Request - State: closed - Opened by sebastian-nagel almost 5 years ago - 1 comment

#84 - Replace the org.json dependency by openjson library

Pull Request - State: open - Opened by sebastian-nagel almost 5 years ago - 4 comments

#83 - Update TravisCI config; resolves #82.

Pull Request - State: closed - Opened by ruebot over 5 years ago - 4 comments

#82 - Update TravisCI config

Issue - State: closed - Opened by ruebot over 5 years ago

#81 - CompressedWARCReader does not work for Common Crawl WARC files.

Issue - State: closed - Opened by YossiTamari almost 6 years ago - 3 comments

#80 - Fixing bad dates in WARC file

Issue - State: closed - Opened by cjer over 6 years ago - 6 comments

#79 - Update API documentation to reflect current behaviour:

Issue - State: open - Opened by anjackson almost 7 years ago - 1 comment
Labels: bug

#78 - commons-httpclient-3.1 vulnerability

Issue - State: open - Opened by ldko about 7 years ago - 1 comment

#77 - use commons-collections v3.2.2 to avoid v3.2.1 vulnerability

Pull Request - State: closed - Opened by ndushay about 7 years ago - 2 comments

#76 - upgrade to commons-collections.jar 3.2.2

Issue - State: closed - Opened by ndushay about 7 years ago

#75 - Extract also `property` attributes of HTML meta elements

Pull Request - State: closed - Opened by sebastian-nagel over 7 years ago - 1 comment

#74 - Do not add value of preceding HTTP header field if there is no value

Pull Request - State: closed - Opened by sebastian-nagel over 7 years ago - 3 comments

#73 - Move missing unit tests over from Heritrix3

Pull Request - State: closed - Opened by MohammedElsayyed over 7 years ago - 4 comments

#72 - Improve HTML link extraction

Pull Request - State: closed - Opened by sebastian-nagel over 7 years ago

#71 - Logging changes for next release.

Pull Request - State: closed - Opened by ldko over 7 years ago

#70 - Whatwg conformant uri

Pull Request - State: closed - Opened by johnerikhalse over 7 years ago

#70 - Whatwg conformant uri

Pull Request - State: closed - Opened by johnerikhalse over 7 years ago

#69 - URLParser to strip empty port

Pull Request - State: closed - Opened by sebastian-nagel almost 8 years ago - 1 comment
Labels: accepted

#68 - Use CharsetDetector to guess encoding of HTML documents

Pull Request - State: closed - Opened by sebastian-nagel almost 8 years ago
Labels: accepted

#68 - Use CharsetDetector to guess encoding of HTML documents

Pull Request - State: closed - Opened by sebastian-nagel almost 8 years ago
Labels: accepted

#67 - Add attribute "property" of HTML meta elements to WAT HTML-Metadata

Issue - State: closed - Opened by sebastian-nagel almost 8 years ago - 1 comment

#66 - support WET files

Issue - State: closed - Opened by dportabella about 8 years ago - 3 comments

#66 - support WET files

Issue - State: closed - Opened by dportabella about 8 years ago - 3 comments

#65 - fix: last header was lost if LF LF (intead of CRLF CRLF)

Pull Request - State: closed - Opened by dportabella about 8 years ago - 4 comments
Labels: accepted

#65 - fix: last header was lost if LF LF (intead of CRLF CRLF)

Pull Request - State: closed - Opened by dportabella about 8 years ago - 4 comments
Labels: accepted

#64 - HTTPS via a Proxy

Issue - State: open - Opened by PsypherPunk about 8 years ago - 1 comment

#64 - HTTPS via a Proxy

Issue - State: open - Opened by PsypherPunk about 8 years ago - 1 comment

#63 - Make regular expression to extract URLs from CSS more restrictive

Pull Request - State: closed - Opened by sebastian-nagel about 8 years ago - 7 comments
Labels: accepted

#62 - Remove invalid constant

Pull Request - State: closed - Opened by kris-sigur about 8 years ago
Labels: bug, accepted

#62 - Remove invalid constant

Pull Request - State: closed - Opened by kris-sigur about 8 years ago
Labels: bug, accepted

#61 - empty header fields populated from previous value

Issue - State: open - Opened by ghost about 8 years ago

#61 - empty header fields populated from previous value

Issue - State: open - Opened by ghost about 8 years ago

#60 - Non-ascii mimetypes

Issue - State: open - Opened by ghost about 8 years ago

#60 - Non-ascii mimetypes

Issue - State: open - Opened by ghost about 8 years ago

#59 - dns records in ARCs

Issue - State: open - Opened by ghost about 8 years ago

#59 - dns records in ARCs

Issue - State: open - Opened by ghost about 8 years ago

#58 - urls with spaces unescaped

Issue - State: open - Opened by ghost about 8 years ago - 1 comment

#58 - urls with spaces unescaped

Issue - State: open - Opened by ghost about 8 years ago - 1 comment

#57 - StringIndexOutOfBoundsException in patternCSSExtract

Pull Request - State: closed - Opened by sebastian-nagel about 8 years ago - 2 comments

#56 - Require Java 8

Issue - State: open - Opened by johnerikhalse over 8 years ago
Labels: enhancement

#56 - Require Java 8

Issue - State: open - Opened by johnerikhalse over 8 years ago
Labels: enhancement

#55 - Reorganize into mother and child pom

Issue - State: open - Opened by johnerikhalse over 8 years ago - 2 comments
Labels: enhancement

#55 - Reorganize into mother and child pom

Issue - State: open - Opened by johnerikhalse over 8 years ago - 2 comments
Labels: enhancement

#54 - Make canonicalizer be able to strip session id params even if they ar…

Pull Request - State: closed - Opened by vonrosen over 8 years ago - 4 comments

#54 - Make canonicalizer be able to strip session id params even if they ar…

Pull Request - State: closed - Opened by vonrosen over 8 years ago - 4 comments

#53 - Allow chars in querystring before params to strip

Pull Request - State: closed - Opened by vonrosen over 8 years ago

#53 - Allow chars in querystring before params to strip

Pull Request - State: closed - Opened by vonrosen over 8 years ago

#52 - Store origin-code in ARCRecord header

Pull Request - State: closed - Opened by jrwiebe over 8 years ago - 7 comments
Labels: enhancement

#52 - Store origin-code in ARCRecord header

Pull Request - State: closed - Opened by jrwiebe over 8 years ago - 7 comments
Labels: enhancement

#51 - flush output etc before tallying stats to fix sizeOnDisk calculation

Pull Request - State: closed - Opened by nlevitt almost 9 years ago

#51 - flush output etc before tallying stats to fix sizeOnDisk calculation

Pull Request - State: closed - Opened by nlevitt almost 9 years ago

#50 - fix for HER-2089 -

Pull Request - State: closed - Opened by nlevitt almost 9 years ago

#50 - fix for HER-2089 -

Pull Request - State: closed - Opened by nlevitt almost 9 years ago

#47 - WAT extractor: adding information in WAT's warcinfo

Issue - State: closed - Opened by scheylord over 9 years ago

#47 - WAT extractor: adding information in WAT's warcinfo

Issue - State: closed - Opened by scheylord over 9 years ago

#46 - Fix issues #42 #43 #44 #45 and #47

Pull Request - State: closed - Opened by scheylord over 9 years ago - 7 comments

#45 - WAT extractor: missing WARC format version

Issue - State: closed - Opened by saraaubry over 9 years ago

#45 - WAT extractor: missing WARC format version

Issue - State: closed - Opened by saraaubry over 9 years ago

#44 - WAT extractor: envelope structure does not conform to the WAT specification

Issue - State: closed - Opened by saraaubry over 9 years ago - 1 comment

#44 - WAT extractor: envelope structure does not conform to the WAT specification

Issue - State: closed - Opened by saraaubry over 9 years ago - 1 comment

#41 - Error-prone HTTP-header parsing in ARCRecord

Issue - State: open - Opened by tokee over 9 years ago - 4 comments

#41 - Error-prone HTTP-header parsing in ARCRecord

Issue - State: open - Opened by tokee over 9 years ago - 4 comments

#40 - ARCRecord entered inconsistent state for some ARC files

Pull Request - State: open - Opened by tokee over 9 years ago - 6 comments

#40 - ARCRecord entered inconsistent state for some ARC files

Pull Request - State: open - Opened by tokee over 9 years ago - 6 comments

#38 - RecordingOutputStream can affect tcp packets sent in an undesirable way

Issue - State: closed - Opened by nlevitt over 9 years ago - 3 comments

#38 - RecordingOutputStream can affect tcp packets sent in an undesirable way

Issue - State: closed - Opened by nlevitt over 9 years ago - 3 comments

#36 - Escape redirect URLs in RealCDXExtractorOutput

Pull Request - State: closed - Opened by gerhardgossen almost 10 years ago - 3 comments

#36 - Escape redirect URLs in RealCDXExtractorOutput

Pull Request - State: closed - Opened by gerhardgossen almost 10 years ago - 3 comments

#35 - Convert UsableUri to get IDN in non-puny form

Pull Request - State: closed - Opened by johnerikhalse almost 10 years ago - 2 comments

#35 - Convert UsableUri to get IDN in non-puny form

Pull Request - State: closed - Opened by johnerikhalse almost 10 years ago - 2 comments

#34 - Issue #4 Guava for public suffix

Pull Request - State: open - Opened by johnerikhalse almost 10 years ago - 1 comment

#34 - Issue #4 Guava for public suffix

Pull Request - State: open - Opened by johnerikhalse almost 10 years ago - 1 comment

#33 - Change test value to get around Java 8 bug

Pull Request - State: closed - Opened by kris-sigur about 10 years ago

#33 - Change test value to get around Java 8 bug

Pull Request - State: closed - Opened by kris-sigur about 10 years ago

#32 - Java 6 compatibility

Pull Request - State: closed - Opened by kris-sigur about 10 years ago - 1 comment

#32 - Java 6 compatibility

Pull Request - State: closed - Opened by kris-sigur about 10 years ago - 1 comment

#31 - ArchiveUtils.doubleToString() rounding fails unit test under Java 8

Issue - State: closed - Opened by kris-sigur about 10 years ago - 6 comments

#31 - ArchiveUtils.doubleToString() rounding fails unit test under Java 8

Issue - State: closed - Opened by kris-sigur about 10 years ago - 6 comments

#30 - Issue #3 Require the oldest recommended version of Maven 3

Pull Request - State: closed - Opened by johnerikhalse about 10 years ago - 2 comments