Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / rajatomar788/pywebcopy issues and pull requests

#129 - Bot being detected

Issue - State: open - Opened by wnsite about 2 months ago - 1 comment

#128 - Fatal error when running the demo script

Issue - State: open - Opened by MarcelBigger 2 months ago - 1 comment

#127 - Incomplete Read

Issue - State: open - Opened by I-dontcode 3 months ago - 13 comments

#126 - How to set cookies on a url

Issue - State: closed - Opened by tadam98s 3 months ago - 3 comments

#124 - Login before save

Issue - State: closed - Opened by tadam98s 3 months ago - 14 comments

#123 - Unreliable iterator based incremental parsing

Issue - State: open - Opened by TLCFEM 3 months ago - 6 comments

#122 - Pywebcopy save

Issue - State: closed - Opened by Riveraagard11 5 months ago - 1 comment

#121 - [WinError 3] The system cannot find the path specified: 'E:\\'

Issue - State: closed - Opened by DEEPANJANSAHA 6 months ago - 2 comments

#120 - UTF-8 encoding issues

Issue - State: open - Opened by claell 9 months ago

#119 - Problem with paths with _ on them

Issue - State: closed - Opened by jose-pr 9 months ago - 1 comment

#118 - "Blocked resource" error

Issue - State: closed - Opened by danyaljj 12 months ago - 1 comment

#115 - Only Downloads HTML and Nothing Else

Issue - State: open - Opened by jet082 about 1 year ago - 4 comments

#114 - Testing current full web crawling functionality

Issue - State: open - Opened by BrandonKMLee about 1 year ago

#113 - Relative Path Support

Issue - State: open - Opened by BrandonKMLee about 1 year ago

#111 - Svg compression issue fixed

Pull Request - State: closed - Opened by aodmrz over 1 year ago

#110 - How to change session header settings when getting a 403 Forbiden error

Issue - State: closed - Opened by Alipser over 1 year ago - 1 comment

#109 - Cannot download a website if it has invalid SSL certificate

Issue - State: closed - Opened by fumbles over 1 year ago - 3 comments

#108 - Fixed failing on tel links

Pull Request - State: closed - Opened by rajatomar788 over 1 year ago - 1 comment

#107 - Fails on tel links

Issue - State: closed - Opened by totalhack over 1 year ago - 6 comments

#105 - Fix missing links due to delayed parser events

Pull Request - State: closed - Opened by monim67 almost 2 years ago - 5 comments

#104 - how to pass cookies with V7.0

Issue - State: closed - Opened by a2689378 almost 2 years ago - 5 comments

#103 - catch exceptions in threaded retrieval..

Pull Request - State: closed - Opened by gallavee about 2 years ago - 2 comments
Labels: enhancement

#102 - pywebcopy is not found

Issue - State: closed - Opened by X-Gorn about 2 years ago - 2 comments

#101 - Project_folder path doesn't seem to be a valid path.

Issue - State: closed - Opened by OlMi1 about 2 years ago

#100 - Fix for handling href tel: element in parse_url() urls.py

Pull Request - State: closed - Opened by serbathome about 2 years ago - 5 comments
Labels: invalid

#99 - Exception when processing href tel:

Issue - State: closed - Opened by serbathome about 2 years ago - 4 comments

#97 - python 3.10.4 : TypeError: multiple bases have instance lay-out conflict

Issue - State: open - Opened by anshi43 about 2 years ago - 11 comments

#96 - Module not found 'CacheControl'

Issue - State: closed - Opened by sudocpMATHdotPY over 2 years ago - 6 comments

#95 - multiple bases have instance lay-out conflict

Issue - State: closed - Opened by david0091 over 2 years ago - 1 comment

#94 - Skip crawling and replacement of other domains

Issue - State: open - Opened by macwinnie over 2 years ago - 5 comments

#93 - Encoding issue

Issue - State: closed - Opened by pbtsrc over 2 years ago - 7 comments

#92 - New release work flow: migration plan and GitHub release

Issue - State: open - Opened by NickVeld over 2 years ago - 2 comments

#91 - AttributeError: 'tuple' object attribute '__doc__' is read-only

Issue - State: open - Opened by atmaniak over 2 years ago - 6 comments

#90 - [Python 3.10] TypeError: multiple bases have instance lay-out conflict

Issue - State: closed - Opened by jaraco over 2 years ago - 4 comments

#89 - URL changed when i set url property of the WebPage's get method.

Issue - State: closed - Opened by zengyinggang over 2 years ago - 1 comment

#88 - Logging Error - TypeError: %d format: a number is required, not NoneType

Issue - State: open - Opened by mar2194 almost 3 years ago - 1 comment

#87 - urls: fix comment in get_fileext_and_pos

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#86 - urls: put base_* assignment before usage

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 2 comments

#85 - urls: _id instead of one more _hex()

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 1 comment

#84 - parsers: use url from utx in parse as default base_url

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#83 - webpage: support external requests.Response

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 2 comments

#82 - Parser.ensure_parse_is_completed and ensure_root_is_generated added

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#81 - elements: force AnchorTag generate new names like in Webpage._new_utx

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 3 comments

#80 - urls: remove blocker of post-init fn generation by property

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 2 comments

#79 - parsers.py: let users know type of html

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#78 - adds ability to toggle multithreading

Pull Request - State: closed - Opened by gravelcycles almost 3 years ago - 5 comments

#77 - configs.py: call of allowed_file_ext fixed

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#76 - parsers.py: links_to_pages property added

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#75 - Non web page files support in attrs of <a>

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 1 comment

#74 - same domain pdf links wrapped by anchor tag is skipped

Issue - State: closed - Opened by NickVeld almost 3 years ago - 3 comments

#73 - elements.py: stash quotes/apostrophes for enquoted urls

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 6 comments

#72 - Fix allowed_file_exts and add it and http_headers into setup_config

Pull Request - State: closed - Opened by NickVeld almost 3 years ago

#71 - configs.py: fix log_file

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 2 comments

#70 - configs.py: fix service strings in setup_paths

Pull Request - State: closed - Opened by NickVeld almost 3 years ago - 1 comment

#69 - Processing of a NodeBB Forum post does not terminate

Issue - State: closed - Opened by bm765 about 3 years ago - 1 comment

#68 - Bump urllib3 from 1.24.2 to 1.26.5

Pull Request - State: closed - Opened by dependabot[bot] about 3 years ago - 1 comment
Labels: dependencies

#67 - Bump py from 1.6.0 to 1.10.0

Pull Request - State: closed - Opened by dependabot[bot] about 3 years ago
Labels: dependencies

#66 - Not downloading any website

Issue - State: closed - Opened by santoshbs over 3 years ago - 1 comment

#65 - Problem with Flask

Issue - State: closed - Opened by FFally over 3 years ago - 1 comment

#63 - How to clone linked pages?

Issue - State: open - Opened by rstmsn over 3 years ago - 15 comments

#62 - How to limit the crawling depth?

Issue - State: open - Opened by cfytrok over 3 years ago - 2 comments

#61 - TypeError

Issue - State: closed - Opened by athesh-pargau7 over 3 years ago - 6 comments

#60 - Download only of package not possible

Issue - State: open - Opened by cmusik over 3 years ago - 1 comment

#59 - Documented command-line interface example fails

Issue - State: closed - Opened by metaperl over 3 years ago - 1 comment

#58 - Path is on mount S: start on mount C:

Issue - State: closed - Opened by bobsburgers almost 4 years ago - 4 comments

#57 - Script not completed

Issue - State: closed - Opened by bralbral almost 4 years ago - 2 comments

#56 - ValueError: path is on mount 'S:', start on mount 'C:'

Issue - State: open - Opened by kevtv almost 4 years ago - 5 comments

#55 - Question: can it continue a suspended job?

Issue - State: open - Opened by User670 almost 4 years ago - 3 comments

#54 - Hangs in involuntary places. Using a basic example.

Issue - State: closed - Opened by Salim9304 almost 4 years ago - 1 comment

#53 - readme: remove extra underscore

Pull Request - State: closed - Opened by KuceraMartin about 4 years ago

#52 - Relative links

Issue - State: closed - Opened by danielfaulknor about 4 years ago - 2 comments

#51 - save_webpage() never exits

Issue - State: closed - Opened by deleuzer about 4 years ago - 2 comments

#50 - Cannot import name 'save_webpage' from 'pywebcopy'

Issue - State: closed - Opened by slavakurilyak about 4 years ago - 2 comments

#49 - save "complete webpage" page.html and /page

Issue - State: closed - Opened by MajdMustapha about 4 years ago - 6 comments

#48 - inconsistent handling of filetypes

Issue - State: closed - Opened by youngblood about 4 years ago - 4 comments

#47 - load_css/images/javascript arguments not working

Issue - State: closed - Opened by youngblood about 4 years ago - 4 comments

#46 - program hangs and does not exit

Issue - State: open - Opened by youngblood about 4 years ago - 28 comments

#45 - setup_config() got an unexpected keyword argument 'url'

Issue - State: closed - Opened by kennym about 4 years ago - 7 comments

#44 - Update README.md

Pull Request - State: closed - Opened by kennym about 4 years ago - 1 comment

#43 - Crawler.crawl() only saves first page

Issue - State: closed - Opened by KuroiKuro about 4 years ago - 8 comments

#42 - AssertionError: A file like object with read method is required!

Issue - State: closed - Opened by renMarkHan over 4 years ago - 5 comments

#41 - Overwrite only if file changed mode

Issue - State: open - Opened by afonari over 4 years ago - 2 comments

#40 - ImportError: cannot import name UserDict

Issue - State: closed - Opened by Z-Zen over 4 years ago - 2 comments

#39 - Fixed should continue on error

Pull Request - State: closed - Opened by rajatomar788 over 4 years ago

#38 - improved github workflows jobs.

Pull Request - State: closed - Opened by rajatomar788 over 4 years ago

#37 - removed file logger and stopped dir change on setup

Pull Request - State: closed - Opened by rajatomar788 over 4 years ago

#36 - BUG REPORT: Log file never flushes causing drive to run out of space

Issue - State: closed - Opened by kaavik over 4 years ago - 2 comments

#35 - Scraping process stucks

Issue - State: closed - Opened by TonySchneider over 4 years ago - 7 comments

#34 - ModuleNotFound error

Issue - State: closed - Opened by kaavik over 4 years ago - 2 comments

#33 - pywebcopy/configs.py, setup_paths method changes the working directory

Issue - State: closed - Opened by TonySchneider over 4 years ago - 3 comments

#32 - Major Bug Fix and Minor Tweaks

Pull Request - State: closed - Opened by kdmoss over 4 years ago

#31 - program download method

Issue - State: closed - Opened by marshonhuckleberry over 4 years ago - 3 comments

#30 - how to use cookies?

Issue - State: closed - Opened by marshonhuckleberry over 4 years ago - 9 comments

#29 - how to change user agent?

Issue - State: closed - Opened by marshonhuckleberry over 4 years ago - 1 comment

#28 - site restrictions

Issue - State: open - Opened by marshonhuckleberry over 4 years ago - 5 comments

#27 - save_website/crawl() does not download PDF

Issue - State: closed - Opened by chstrehlow over 4 years ago - 6 comments