Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / alan-turing-institute/ReadabiliPy issues and pull requests

#111 - Error in Readability.js ?

Issue - State: open - Opened by fernand0 2 months ago - 2 comments

#110 - fix issue-109 (use unique temp files for input/output of ExtractArticle.js)

Pull Request - State: open - Opened by erpic 3 months ago - 1 comment

#108 - node.js javascript runtime

Issue - State: open - Opened by cmdcam 6 months ago - 1 comment

#104 - Set up linting on GHA and fix existing linter issues

Pull Request - State: closed - Opened by nelson-liu almost 2 years ago

#103 - Move away from deprecated `setup.py install`, setup GHA

Pull Request - State: closed - Opened by nelson-liu almost 2 years ago - 9 comments

#102 - Purpose of Node.js

Issue - State: open - Opened by swetepete over 2 years ago

#101 - How to update newest Readability.js of Mozilla?

Issue - State: closed - Opened by ducnguyenphanhoai almost 3 years ago - 1 comment

#99 - Quiet execution of ExtractArticle.js

Pull Request - State: closed - Opened by lodrantl almost 3 years ago - 1 comment

#98 - thread & process safe text extraction from html string

Pull Request - State: closed - Opened by InzamamAnwar over 3 years ago

#97 - ReadabiliPy from multiple threads

Issue - State: closed - Opened by econaxis over 3 years ago - 3 comments

#96 - Extra entries with full text in plain_text list

Issue - State: open - Opened by malicialab over 3 years ago - 1 comment

#95 - Feature: Import readability.js from npm

Pull Request - State: closed - Opened by GjjvdBurg over 3 years ago - 1 comment

#94 - Solves bug regarding change of working directory

Pull Request - State: closed - Opened by giovannigarifo over 3 years ago - 1 comment

#93 - How to allow extracting YouTube videos or <iframe> tags?

Issue - State: open - Opened by cayolblake almost 4 years ago - 5 comments
Labels: future

#92 - Bug in extracted images sources returning a base64

Issue - State: open - Opened by cayolblake almost 4 years ago - 13 comments
Labels: bug

#91 - Improve the check for Node

Pull Request - State: closed - Opened by GjjvdBurg about 4 years ago - 8 comments

#90 - Fix 0% output from coveralls

Pull Request - State: closed - Opened by jemrobinson about 4 years ago - 1 comment

#89 - Fix coverage test

Pull Request - State: closed - Opened by jemrobinson about 4 years ago - 1 comment

#88 - Improve/generate documentation

Issue - State: open - Opened by jemrobinson about 4 years ago

#87 - Bump lodash from 4.17.15 to 4.17.19

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago
Labels: dependencies

#86 - Packaging

Pull Request - State: closed - Opened by GjjvdBurg over 4 years ago - 2 comments

#85 - Bump minimist from 1.2.0 to 1.2.3

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago
Labels: dependencies

#84 - Bump acorn from 6.0.4 to 6.4.1

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago
Labels: dependencies

#83 - Nested blocks break parser

Issue - State: closed - Opened by jemrobinson over 5 years ago
Labels: bug

#82 - Deal with nested blocks

Pull Request - State: closed - Opened by jemrobinson over 5 years ago - 2 comments

#81 - ReadabiliPy vs Readability.js

Issue - State: open - Opened by kjoshi over 5 years ago - 2 comments
Labels: future

#80 - Fix element replacement issue.

Pull Request - State: closed - Opened by jemrobinson over 5 years ago - 1 comment

#79 - Cannot replace an element with its contents

Issue - State: closed - Opened by jemrobinson over 5 years ago

#78 - Fix breitbart issue

Pull Request - State: closed - Opened by jemrobinson over 5 years ago - 2 comments

#77 - Crash when interpreting article from breitbart

Issue - State: closed - Opened by jemrobinson over 5 years ago
Labels: bug

#76 - Go for the next highest scoring date when the first is not isoformat

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 1 comment

#75 - add 2 extra date xpaths

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 1 comment

#74 - add extra supported iso date format

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 1 comment

#73 - Simplify benchmarking with containers folder

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 1 comment

#72 - Fix empty pages

Pull Request - State: closed - Opened by jemrobinson over 5 years ago - 1 comment

#71 - Fix return value for empty pages

Issue - State: closed - Opened by jemrobinson over 5 years ago
Labels: bug

#70 - Add support for isoformat dates with microseconds

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago

#69 - Trigger coveralls upload

Pull Request - State: closed - Opened by jemrobinson over 5 years ago

#68 - Updated date extraction logic

Pull Request - State: closed - Opened by jemrobinson over 5 years ago

#67 - Fix potential issue in date extraction

Issue - State: closed - Opened by jemrobinson over 5 years ago
Labels: bug

#66 - Added coveralls support

Pull Request - State: closed - Opened by jemrobinson over 5 years ago

#65 - Add test coverage badge

Issue - State: closed - Opened by jemrobinson over 5 years ago
Labels: feature

#64 - Date extraction fix(es)

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 1 comment

#63 - Make date extraction more robust

Issue - State: closed - Opened by jemrobinson over 5 years ago - 1 comment
Labels: bug

#62 - Add benchmarking

Issue - State: closed - Opened by edwardchalstrey1 over 5 years ago
Labels: feature

#61 - Add benchmarking

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago

#60 - Update publication date extraction

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 2 comments

#59 - Title updates

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago

#58 - Date extraction

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 17 comments

#57 - Title extraction

Pull Request - State: closed - Opened by edwardchalstrey1 over 5 years ago - 3 comments

#56 - extruct for structured data

Issue - State: open - Opened by westurner over 5 years ago - 1 comment
Labels: feature, future

#55 - Unnecessary <div> elements

Issue - State: open - Opened by jemrobinson almost 6 years ago - 1 comment
Labels: bug, future

#54 - Update BeautifulSoup version

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#53 - Use correct name for beautifulsoup

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#52 - BeautifulSoup hanging on find_all

Issue - State: closed - Opened by jemrobinson almost 6 years ago
Labels: bug

#51 - Replace single <br> with space

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#50 - Clarify rule for single <br>

Issue - State: closed - Opened by jemrobinson almost 6 years ago
Labels: bug

#49 - Fix CData behaviour and improve test coverage

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#48 - Update README and restructure

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago - 1 comment

#47 - New method of whitespace joining

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago - 2 comments

#46 - Dealing with white space

Issue - State: closed - Opened by jemrobinson almost 6 years ago
Labels: feature

#45 - Deal with tags inside words

Issue - State: closed - Opened by jemrobinson almost 6 years ago
Labels: feature

#44 - Added additional comment type

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#43 - Add use Readability option to commandline tool and README

Issue - State: closed - Opened by martintoreilly almost 6 years ago - 4 comments
Labels: feature

#42 - Fix comments inside tags

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#41 - Comments inside tags

Issue - State: closed - Opened by jemrobinson almost 6 years ago - 1 comment
Labels: bug

#40 - Fix erroneous whitespace

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago - 1 comment

#39 - Erroneous whitespace

Issue - State: closed - Opened by jemrobinson almost 6 years ago
Labels: bug

#38 - Fix rogue unescaped span

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago

#37 - ReadabiliPy has not removed a span element from plain content and plain text

Issue - State: closed - Opened by sgibson91 almost 6 years ago
Labels: bug

#36 - ImportError: No module named 'ReadabiliPy'

Issue - State: closed - Opened by kochkinaelena almost 6 years ago - 8 comments
Labels: not a bug

#35 - Fix extra div element wrapping

Pull Request - State: closed - Opened by jemrobinson almost 6 years ago
Labels: bug

#34 - Extra div element wrapping

Issue - State: closed - Opened by sgibson91 almost 6 years ago

#33 - Define explicit handling rules for HTML 4 elements

Issue - State: open - Opened by martintoreilly almost 6 years ago
Labels: future

#32 - How should CDATA be dealt with?

Issue - State: closed - Opened by jemrobinson almost 6 years ago - 2 comments

#31 - Define handling rules for <iframe>

Issue - State: open - Opened by jemrobinson almost 6 years ago - 2 comments
Labels: future

#30 - Non-HTML5 element

Issue - State: closed - Opened by jemrobinson almost 6 years ago

#29 - FileNotFoundError: [WinError 2] at parse

Issue - State: closed - Opened by orange391224 almost 6 years ago - 3 comments

#28 - Replaced readability with pure-python parser

Pull Request - State: closed - Opened by jemrobinson about 6 years ago - 1 comment

#27 - Add travis support

Pull Request - State: closed - Opened by jemrobinson about 6 years ago - 2 comments

#26 - [ABANDONED] Reverted history rewrite

Pull Request - State: closed - Opened by jemrobinson about 6 years ago - 2 comments

#25 - Add Travis support

Issue - State: closed - Opened by jemrobinson about 6 years ago

#24 - Updated Readability.js

Pull Request - State: closed - Opened by jemrobinson about 6 years ago

#22 - Add unit tests for HTML elements

Pull Request - State: closed - Opened by jemrobinson about 6 years ago - 1 comment

#21 - Make plain-content generation more robust

Issue - State: closed - Opened by jemrobinson about 6 years ago

#20 - Update README to correct errors

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#19 - Add node index to plain_text output when generated for plain_content

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#18 - Add option to tag plain_content with node indexes

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#16 - Ensure plain_text field always returned

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#15 - Revise plain content extraction to handle lists

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#14 - Revise plain content extraction to handle lists

Issue - State: closed - Opened by martintoreilly about 6 years ago - 2 comments

#13 - Add command line script

Pull Request - State: closed - Opened by martintoreilly about 6 years ago

#12 - Add python command line script

Issue - State: closed - Opened by martintoreilly about 6 years ago - 1 comment