Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / andythefactory/newspaper4k issues and pull requests

#547 - lxml.etree Import Error on M1 mac

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 4 comments
Labels: bug

#543 - SSLError Certificate Verify Failed

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug, refactoring

#543 - SSLError Certificate Verify Failed

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug, refactoring

#535 - `parse` hangs on some files

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 9 comments
Labels: bug

#535 - `parse` hangs on some files

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 9 comments
Labels: bug

#531 - Error converting html to string.

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 7 comments
Labels: enhancement, help wanted, sites not working

#531 - Error converting html to string.

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 7 comments
Labels: enhancement, help wanted, sites not working

#530 - How to get the list of all websites that are available for scraping?

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation

#530 - How to get the list of all websites that are available for scraping?

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation

#529 - Not able to crawl Javascript-disabled webpages

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: sites not working

#529 - Not able to crawl Javascript-disabled webpages

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: sites not working

#515 - Include all nodes with text

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: PR-verify, sites not working

#515 - Include all nodes with text

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: PR-verify, sites not working

#473 - what are the mechnisms of "keywords" and "summary"? any documents about them?

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation

#473 - what are the mechnisms of "keywords" and "summary"? any documents about them?

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation

#435 - Integration of YAKE!

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: PR-verify

#435 - Integration of YAKE!

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: PR-verify

#425 - Author not extracted correctly

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: sites not working

#425 - Author not extracted correctly

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: sites not working

#404 - Categories filters don't work as expected

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug

#404 - Categories filters don't work as expected

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug

#401 - How to extract article urls just from the main page?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement

#401 - How to extract article urls just from the main page?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement

#394 - Not working on New York Times

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug, sites not working

#394 - Not working on New York Times

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: bug, sites not working

#384 - Duplicate content on certain site

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 8 comments
Labels: sites not working

#384 - Duplicate content on certain site

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 8 comments
Labels: sites not working

#333 - Using build() in monkeypatched script sometimes produces infinite loop

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, help wanted

#333 - Using build() in monkeypatched script sometimes produces infinite loop

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, help wanted

#277 - extract text without image caption

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: bug

#277 - extract text without image caption

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: bug

#271 - install

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: documentation, enhancement

#271 - install

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: documentation, enhancement

#264 - How does MIN_WORD_COUNT and MIN_SENT_COUNT work with Article?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug, documentation

#264 - How does MIN_WORD_COUNT and MIN_SENT_COUNT work with Article?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug, documentation

#261 - Patch 1

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, undecided yet

#261 - Patch 1

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, undecided yet

#257 - Not scraping all articles

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: sites not working

#257 - Not scraping all articles

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: sites not working

#233 - Sites protected by CloudFlare

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, sites not working

#233 - Sites protected by CloudFlare

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement, sites not working

#232 - Iterating over multiple runs - no new articles in spite of memoize=False

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 9 comments
Labels: bug, security

#226 - title-body mismatches

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 4 comments
Labels: sites not working

#226 - title-body mismatches

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 4 comments
Labels: sites not working

#220 - Website login functionality

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 9 comments
Labels: documentation, help wanted

#219 - Added auto language detection when language isn't in meta data

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, wontfix

#219 - Added auto language detection when language isn't in meta data

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, wontfix

#213 - [extractors] get_title() filters Cyrillic and other but CJK Unified Ideographs

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement

#213 - [extractors] get_title() filters Cyrillic and other but CJK Unified Ideographs

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement

#202 - many changes

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, PR-verify

#202 - many changes

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, PR-verify

#196 - Fixed raise statement

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug, enhancement

#196 - Fixed raise statement

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug, enhancement

#191 - Why .ico and other types of icon are downloaded as main image?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement

#191 - Why .ico and other types of icon are downloaded as main image?

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement

#188 - You must `download()` an article first!

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 7 comments
Labels: bug

#188 - You must `download()` an article first!

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 7 comments
Labels: bug

#186 - Unable to get all the articles.

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#186 - Unable to get all the articles.

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#160 - extend the "*" into the contents of working directory

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#160 - extend the "*" into the contents of working directory

Issue - State: closed - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#132 - News Homepage article links alone

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, help wanted

#132 - News Homepage article links alone

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, help wanted

#129 - ENH: parse schema.org/NewsArticle RDFa, Microdata, or JSONLD

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: enhancement

#129 - ENH: parse schema.org/NewsArticle RDFa, Microdata, or JSONLD

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 3 comments
Labels: enhancement

#127 - Chinese language for content

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 6 comments
Labels: bug

#127 - Chinese language for content

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 6 comments
Labels: bug

#126 - Too many authors on Techcrunch...

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug

#126 - Too many authors on Techcrunch...

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug

#120 - Article Multi-threading downloading

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation, help wanted

#120 - Article Multi-threading downloading

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: documentation, help wanted

#114 - Scrape og:image:secure_url og:image:url

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: enhancement, PR-verify

#114 - Scrape og:image:secure_url og:image:url

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: enhancement, PR-verify

#112 - change content tag name for datePublished

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: enhancement, PR-verify

#110 - using stopwords-iso JSON to cover more languages

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement

#110 - using stopwords-iso JSON to cover more languages

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement

#109 - Article.text doesn't provide full article for some URLs

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: sites not working

#109 - Article.text doesn't provide full article for some URLs

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: sites not working

#107 - Extract not full text but only a few paragraphs

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#107 - Extract not full text but only a few paragraphs

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#105 - Unique Articles urls

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: enhancement

#105 - Unique Articles urls

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: enhancement

#101 - Different result of usual usage VS set_html(html)

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#101 - Different result of usual usage VS set_html(html)

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#100 - Wrong URL in newspaper.popular_urls()

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: bug

#99 - Add JSON-LD support by using extruct

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 6 comments
Labels: enhancement

#98 - Deployment issues with TOP_DIRECTORY

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: enhancement

#97 - Silent failing on medium article

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: sites not working

#96 - Parsing fails silently on OSX + Python 3 when source text has certain characters

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: bug, undecided yet

#95 - SSL Certification error

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: help wanted

#94 - Does not build local news source

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#94 - Does not build local news source

Issue - State: open - Opened by AndyTheFactory over 1 year ago
Labels: sites not working

#93 - summary

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment
Labels: help wanted

#92 - Doesn't scrap any article on http://www.nzherald.co.nz/road-accidents/news/archive.cfm?c_id=663

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 2 comments
Labels: sites not working

#91 - Cannot parse urls from some sites

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 6 comments
Labels: enhancement, wontfix, undecided yet

#90 - Not woking on "nytimes.com"

Issue - State: open - Opened by AndyTheFactory over 1 year ago - 12 comments
Labels: sites not working

#89 - Trailing slash for url date regex

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 3 comments

#88 - Article.text not working

Issue - State: closed - Opened by AndyTheFactory over 1 year ago - 1 comment