Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / medialab/minet issues and pull requests
#897 - Add capabilities to retry fetch/crawl/resolve etc. from an already written report so errors might tried again
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#896 - Add --retries flag to fetch
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#895 - Way to enable extension in non persistent contexts
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#894 - Adblock and automatic consent option for screenshot command
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#893 - The typed cli_args conundrum and the action kwargs
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor
#892 - Yt channelvideo time
Pull Request -
State: closed - Opened by bmaz over 1 year ago
#891 - Multi execution capabilities for easy unit test on trivial commands
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor
#890 - request known_encoding kwarg should be utf8 by default?
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#889 - CallbackResultType output should always be optional
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing
#888 - Distinguish between hostnames not existing and network outage
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#887 - v1 blockade
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#886 - Improve UrlCache add_many & enable contains_many etc.
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#885 - Weird output while scrapping Instagram posts
Issue -
State: closed - Opened by florianezanella over 1 year ago
- 16 comments
Labels: bug
#884 - rather than remove occasionally missing keys, change dict method
Pull Request -
State: closed - Opened by kat-kel over 1 year ago
#883 - Avoid key error when Buzzsumo sends data that's missing certain fields
Pull Request -
State: closed - Opened by kat-kel over 1 year ago
#882 - Rollback to default threads being inferred from number of cores rather than hardcoded 25
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#881 - Sqlite temp dir issues
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#880 - create exact url command
Pull Request -
State: closed - Opened by kat-kel over 1 year ago
#879 - Improve progress bar ETA
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#878 - tiktok search-videos retrieve only 12 results at most
Issue -
State: closed - Opened by csamson-sf over 1 year ago
- 4 comments
Labels: bug
#877 - Crawl job output does not indicate file encoding
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#876 - Rename minet fetch --filename to --filename-column
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor
#875 - crawler depth indicator
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#874 - crawler input should disregard empty urls
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: bug
#873 - crawler error message should be clearer when target is not found inside module
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#872 - Add -s/--select to crawl command
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#871 - Alias x for twitter commands?
Issue -
State: closed - Opened by boogheta over 1 year ago
- 2 comments
#870 - [twitter scrape] media_urls is not populated anymore
Issue -
State: closed - Opened by boogheta over 1 year ago
Labels: bug
#869 - Instagram comments
Issue -
State: closed - Opened by ZeynepP over 1 year ago
- 9 comments
Labels: bug
#868 - Bench url cache
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization
#867 - Operational Error SQLite database is locked
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#866 - Hyphe crawl targets should include the webentity to avoid redirection mismatch
Issue -
State: open - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: bug
#865 - Is this useful to vacuum sqlar at end?
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization
#864 - hyphe start page spider should canonicalize to be consistent with process
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#863 - --sqlar does not work on resume
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#862 - pycurl segfautls on centos-like
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#861 - sqlite backend writer for large numbers of files
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#860 - There is probably something wrong with urllib3 & --compressed-transfer right now
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#859 - num_pools should always be greater than the number of threads at least
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization
#858 - pycurl errors to handle
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#857 - --only-html flag for crawlers
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#856 - --in-memory flag for crawler
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#855 - max size option for transfers
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: optimization
#854 - Use head for resolve by default
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization
#853 - Make pyinstaller build pycurl for the linux build
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#852 - Try to play with accept compressed requests
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization
#851 - Filename too long issues
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#850 - Move httpheadersdict import to types
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#849 - Focus crawler should use new extraction scheme
Issue -
State: open - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: enhancement
#848 - Option for the hyphe spider not to emit internal links like recent hyphe setting
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#847 - Maybe 1024 num_pools is a bit high on resources
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization
#846 - Crawlers should be able to defer processing errors
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#845 - Better path mangling to make sure import_target works
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#844 - Errored crawl result should not display degree
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#843 - drain_conn rather than closing when status is 3xx
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#842 - Response should only take body and http headers dict
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#841 - Move hyphe-crawl command to hyphe subpackage
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#840 - Move how to cite section upper and add interrogation mark
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: documentation
#839 - force_select_one
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#838 - Upgrade ebbe
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#837 - Typical scraper for rss feeds
Pull Request -
State: closed - Opened by 16arpi over 1 year ago
- 1 comment
#836 - dataclass as cli_args namespace?
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: refactor
#835 - dust the url-parse command
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor
#834 - url cache should canonicalize
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#833 - link extraction
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#832 - We should probably internalize our crawler queue
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#831 - Decide if Spider.tabulate should take data or result
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor
#830 - Could refactor examples as command kwarg
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor
#829 - Playwright ublock origin
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#828 - Experiment with a Hyphe spider
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: enhancement, investigation
#827 - Add rss urls scraper
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#826 - dump-queue command does not work with dfs
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#825 - crawl command should run callback after verbose print
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#824 - Pyinstaller browsers issues
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor
#823 - Explore possibility of customizing rc_key / ConfigAction prefix
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#822 - Automatic --cookie help
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor
#821 - Refactor crawl_command not to set missing flags to default resulting in crawler kwargs init
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#820 - crawl_command accept_input=False does not work
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#819 - crawl_action target typing does not allow factory
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing
#818 - crawl_command adjustments
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#817 - Relax crawl_command typings
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing
#816 - Fix typos
Pull Request -
State: closed - Opened by kianmeng over 1 year ago
- 1 comment
#815 - Twitter Scraper seems down because of 404
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#814 - Executor and crawler should have a kwarg to defer invalid statuses as errored results
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#813 - loading_bar.print should have more console kwargs
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#812 - Progress bar total might be wrong when reading from stdin
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: bug
#811 - ThreadsafeBrowser
Issue -
State: closed - Opened by 3mora2 over 1 year ago
- 4 comments
Labels: question
#810 - Select specific fields for trafilatura extraction
Pull Request -
State: closed - Opened by 16arpi over 1 year ago
#809 - Flags to turn regexs into negative filters (focus crawler)
Pull Request -
State: closed - Opened by 16arpi over 1 year ago
- 1 comment
#808 - Crawl command should work without target as basic crawler
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#807 - Some crawl_command kwarg are not tied to their resolution
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug
#806 - Rename crawl_action spiders to target & allow instantiated spiders/crawler
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#805 - Export WonderfulSoup from minet.scrape
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#804 - removing the flag --keep-irrelevant for the focus crawler
Pull Request -
State: closed - Opened by 16arpi over 1 year ago
#803 - Fix yt cli docs
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: bug, documentation
#802 - Drop focus-crawl command --keep-irrelevant flag
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#801 - Add crawl command flag that can disable data writing
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#800 - Wonderfulsoup could type .get to avoid issues with id/class
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor
#799 - focus-crawl should export full network link data & we should force -u
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug, enhancement
#798 - Maybe it would be better to instantiate crawler executor in start?
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor