Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / medialab/minet issues and pull requests

#896 - Add --retries flag to fetch

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#895 - Way to enable extension in non persistent contexts

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#894 - Adblock and automatic consent option for screenshot command

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#893 - The typed cli_args conundrum and the action kwargs

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor

#892 - Yt channelvideo time

Pull Request - State: closed - Opened by bmaz over 1 year ago

#891 - Multi execution capabilities for easy unit test on trivial commands

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor

#890 - request known_encoding kwarg should be utf8 by default?

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#889 - CallbackResultType output should always be optional

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing

#888 - Distinguish between hostnames not existing and network outage

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#887 - v1 blockade

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#886 - Improve UrlCache add_many & enable contains_many etc.

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#885 - Weird output while scrapping Instagram posts

Issue - State: closed - Opened by florianezanella over 1 year ago - 16 comments
Labels: bug

#884 - rather than remove occasionally missing keys, change dict method

Pull Request - State: closed - Opened by kat-kel over 1 year ago

#883 - Avoid key error when Buzzsumo sends data that's missing certain fields

Pull Request - State: closed - Opened by kat-kel over 1 year ago

#881 - Sqlite temp dir issues

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#880 - create exact url command

Pull Request - State: closed - Opened by kat-kel over 1 year ago

#879 - Improve progress bar ETA

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#878 - tiktok search-videos retrieve only 12 results at most

Issue - State: closed - Opened by csamson-sf over 1 year ago - 4 comments
Labels: bug

#877 - Crawl job output does not indicate file encoding

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#876 - Rename minet fetch --filename to --filename-column

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor

#875 - crawler depth indicator

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#874 - crawler input should disregard empty urls

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: bug

#872 - Add -s/--select to crawl command

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#871 - Alias x for twitter commands?

Issue - State: closed - Opened by boogheta over 1 year ago - 2 comments

#870 - [twitter scrape] media_urls is not populated anymore

Issue - State: closed - Opened by boogheta over 1 year ago
Labels: bug

#869 - Instagram comments

Issue - State: closed - Opened by ZeynepP over 1 year ago - 9 comments
Labels: bug

#868 - Bench url cache

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization

#867 - Operational Error SQLite database is locked

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#866 - Hyphe crawl targets should include the webentity to avoid redirection mismatch

Issue - State: open - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: bug

#865 - Is this useful to vacuum sqlar at end?

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization

#863 - --sqlar does not work on resume

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#862 - pycurl segfautls on centos-like

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#861 - sqlite backend writer for large numbers of files

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#859 - num_pools should always be greater than the number of threads at least

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization

#858 - pycurl errors to handle

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#857 - --only-html flag for crawlers

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#856 - --in-memory flag for crawler

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#855 - max size option for transfers

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: optimization

#854 - Use head for resolve by default

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization

#853 - Make pyinstaller build pycurl for the linux build

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#852 - Try to play with accept compressed requests

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: investigation, optimization

#851 - Filename too long issues

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#850 - Move httpheadersdict import to types

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#849 - Focus crawler should use new extraction scheme

Issue - State: open - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: enhancement

#848 - Option for the hyphe spider not to emit internal links like recent hyphe setting

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#847 - Maybe 1024 num_pools is a bit high on resources

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: optimization

#846 - Crawlers should be able to defer processing errors

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#845 - Better path mangling to make sure import_target works

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#844 - Errored crawl result should not display degree

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#843 - drain_conn rather than closing when status is 3xx

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#842 - Response should only take body and http headers dict

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#841 - Move hyphe-crawl command to hyphe subpackage

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#840 - Move how to cite section upper and add interrogation mark

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: documentation

#839 - force_select_one

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#838 - Upgrade ebbe

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#837 - Typical scraper for rss feeds

Pull Request - State: closed - Opened by 16arpi over 1 year ago - 1 comment

#836 - dataclass as cli_args namespace?

Issue - State: closed - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: refactor

#835 - dust the url-parse command

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor

#834 - url cache should canonicalize

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#833 - link extraction

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#832 - We should probably internalize our crawler queue

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#831 - Decide if Spider.tabulate should take data or result

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor

#830 - Could refactor examples as command kwarg

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor

#829 - Playwright ublock origin

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#828 - Experiment with a Hyphe spider

Issue - State: closed - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: enhancement, investigation

#827 - Add rss urls scraper

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#826 - dump-queue command does not work with dfs

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#825 - crawl command should run callback after verbose print

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#824 - Pyinstaller browsers issues

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement, refactor

#823 - Explore possibility of customizing rc_key / ConfigAction prefix

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#822 - Automatic --cookie help

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: refactor

#820 - crawl_command accept_input=False does not work

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#819 - crawl_action target typing does not allow factory

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing

#818 - crawl_command adjustments

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#817 - Relax crawl_command typings

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: typing

#816 - Fix typos

Pull Request - State: closed - Opened by kianmeng over 1 year ago - 1 comment

#815 - Twitter Scraper seems down because of 404

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#813 - loading_bar.print should have more console kwargs

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#812 - Progress bar total might be wrong when reading from stdin

Issue - State: closed - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: bug

#811 - ThreadsafeBrowser

Issue - State: closed - Opened by 3mora2 over 1 year ago - 4 comments
Labels: question

#810 - Select specific fields for trafilatura extraction

Pull Request - State: closed - Opened by 16arpi over 1 year ago

#809 - Flags to turn regexs into negative filters (focus crawler)

Pull Request - State: closed - Opened by 16arpi over 1 year ago - 1 comment

#808 - Crawl command should work without target as basic crawler

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#807 - Some crawl_command kwarg are not tied to their resolution

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug

#806 - Rename crawl_action spiders to target & allow instantiated spiders/crawler

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#805 - Export WonderfulSoup from minet.scrape

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#804 - removing the flag --keep-irrelevant for the focus crawler

Pull Request - State: closed - Opened by 16arpi over 1 year ago

#803 - Fix yt cli docs

Issue - State: closed - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: bug, documentation

#802 - Drop focus-crawl command --keep-irrelevant flag

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#801 - Add crawl command flag that can disable data writing

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#800 - Wonderfulsoup could type .get to avoid issues with id/class

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor

#799 - focus-crawl should export full network link data & we should force -u

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug, enhancement

#798 - Maybe it would be better to instantiate crawler executor in start?

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: refactor