Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / medialab/minet issues and pull requests

#997 - Reddit

Pull Request - State: open - Opened by jpontoire 29 days ago

#996 - Experiment with importlib.LazyLoader

Issue - State: open - Opened by Yomguithereal about 1 month ago
Labels: investigation, optimization

#995 - Drop commands relying on the Facebook legacy mobile website scraper

Issue - State: open - Opened by Yomguithereal about 2 months ago
Labels: refactor

#994 - Add instagram search by location id.

Pull Request - State: closed - Opened by uf0 about 2 months ago - 3 comments

#993 - update tiktok api url

Pull Request - State: closed - Opened by uf0 about 2 months ago - 2 comments

#992 - update tiktok api url

Pull Request - State: closed - Opened by uf0 about 2 months ago

#991 - Upgrade trafilatura to v2

Issue - State: open - Opened by Yomguithereal about 2 months ago
Labels: enhancement

#990 - Twitter search query cannot use double quotes

Issue - State: closed - Opened by Tyrannas 3 months ago - 3 comments
Labels: bug

#989 - when i try to extract comments from instagram post i get running time error

Issue - State: closed - Opened by wacns 4 months ago - 7 comments
Labels: bug

#988 - Fix typos again

Pull Request - State: closed - Opened by kianmeng 4 months ago - 1 comment

#987 - Refactor RequestRetrying to avoid it altogether

Issue - State: closed - Opened by Yomguithereal 4 months ago
Labels: refactor

#986 - Upgrade ural

Issue - State: closed - Opened by Yomguithereal 5 months ago

#985 - Upgrade trafilatura

Issue - State: closed - Opened by Yomguithereal 5 months ago
Labels: enhancement, refactor

#984 - Retire minet.buzzsumo, minet.crowdtangle

Issue - State: closed - Opened by Yomguithereal 5 months ago
Labels: refactor

#983 - Forward SelectionError to minet.scrape

Issue - State: closed - Opened by Yomguithereal 7 months ago
Labels: dx

#982 - Command to add jobs to a crawler's queue

Issue - State: open - Opened by Yomguithereal 7 months ago
Labels: enhancement

#981 - CrawlJob data type should not be wrapped in an optional by default

Issue - State: closed - Opened by Yomguithereal 7 months ago
Labels: typing

#980 - Adjust twitter scraper retryer and rate limit (again)

Issue - State: closed - Opened by Yomguithereal 7 months ago
Labels: bug

#979 - Add more automatic context when Spider.process raises

Issue - State: closed - Opened by Yomguithereal 7 months ago
Labels: dx

#978 - KeyError: 'expanded_url' with minet twitter scrape tweets

Issue - State: closed - Opened by lakonis 7 months ago - 5 comments

#977 - Issues (core dump or cannot unpack non-iterable FocusCrawlInfo object)

Issue - State: closed - Opened by TeaS0710 7 months ago - 1 comment
Labels: bug

#976 - potential changes in rate limit of twitter public API

Issue - State: closed - Opened by taniki 8 months ago - 3 comments
Labels: bug

#975 - When -c is not specified, we should default to test all available browsers instead of only firefox

Issue - State: open - Opened by Yomguithereal 8 months ago - 1 comment
Labels: enhancement, dx

#974 - -I should default to "downloaded" in scrape and extract

Issue - State: closed - Opened by Yomguithereal 8 months ago - 1 comment
Labels: enhancement, dx

#973 - "Invalid Twitter cookie!" error (possibly due to migration from twitter.com to x.com ?)

Issue - State: closed - Opened by leomignot 9 months ago - 3 comments
Labels: bug

#972 - tiktok search-videos error

Issue - State: closed - Opened by csamson-sf 9 months ago - 3 comments
Labels: bug

#971 - Spider process exceptions should at least be raised with some context around them

Issue - State: closed - Opened by Yomguithereal 9 months ago
Labels: enhancement, dx

#970 - Add FORWARD_SPIDER option

Issue - State: closed - Opened by Yomguithereal 9 months ago
Labels: enhancement, dx

#969 - Error on wikipedia pageviews

Issue - State: open - Opened by bmaz 9 months ago - 1 comment
Labels: bug

#968 - Scrapping 1000's of comments on Instagram

Issue - State: open - Opened by Geminy3 9 months ago - 3 comments
Labels: bug, question

#967 - Retrieve videos from instagram hashtag function

Issue - State: open - Opened by Tyrannas 10 months ago - 9 comments
Labels: bug, enhancement

#966 - Draw edges kwarg

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement

#964 - Add LoadingBar.track

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor

#963 - ThreadsafeBrowser enhancements

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement

#961 - Refactor Crawler request_args as inheritance

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: refactor

#960 - Upgrade rich and other deps

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement

#959 - Upgrade trafilatura and deal with lxml_html_clean

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor

#958 - Spider process error should lead to errorred crawl?

Issue - State: closed - Opened by Yomguithereal 10 months ago - 1 comment
Labels: bug

#957 - Add a playwright version of the crawler

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement

#956 - Upgrade to min version py3.8

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor

#955 - Method of CrawlJob to get an identical CrawlTarget to retry

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement

#954 - There should be a Crawler side global callback for each job

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement

#953 - os.makedirs is already threadsafe

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor, optimization

#952 - Move away from lxml as default soup engine?

Issue - State: closed - Opened by Yomguithereal 10 months ago
Labels: discussion

#950 - Add some crawler level job filter

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement

#949 - Make minet installable without playwright

Issue - State: open - Opened by Yomguithereal 10 months ago
Labels: dx

#948 - Youtube improvements

Issue - State: closed - Opened by Yomguithereal 11 months ago
Labels: enhancement

#947 - A challenges challenge

Issue - State: open - Opened by Yomguithereal 11 months ago
Labels: bug

#946 - Multithreaded API clients connection pools should allow more connections

Issue - State: closed - Opened by Yomguithereal 11 months ago
Labels: optimization

#945 - Add a "minet hal" module?

Issue - State: open - Opened by boogheta 11 months ago
Labels: enhancement

#944 - x.com urls are not usually recognized

Issue - State: closed - Opened by Yomguithereal 11 months ago
Labels: bug

#943 - Command tw tweets should not require API key

Issue - State: closed - Opened by Yomguithereal 11 months ago
Labels: bug

#942 - ErroredCrawlReponse should de facto None response attributes

Issue - State: closed - Opened by Yomguithereal 11 months ago
Labels: bug

#941 - User friendly error message when spider returns a non-2-tuple

Issue - State: closed - Opened by Yomguithereal 12 months ago
Labels: dx

#938 - Adding builtin scraper for Europresse

Pull Request - State: closed - Opened by bmaz 12 months ago

#937 - lzma issues

Issue - State: closed - Opened by Yomguithereal 12 months ago
Labels: bug

#936 - /usr/local/bin does not exist on recent mac installs

Issue - State: closed - Opened by Yomguithereal 12 months ago
Labels: bug

#935 - Telegram channel-messages crashes on string parsing

Issue - State: closed - Opened by Yomguithereal 12 months ago
Labels: bug

#934 - Stop condition of twitter scraper is not working anymore

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug

#933 - Try to scrape twitter embed for tweets rather than using the Guest scraper

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement, investigation

#932 - Return None if TwitterGuestAPIScraper.tweet does not find tweet

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug

#931 - Faster rate limit for TwitterGuestAPIScraper

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#929 - Add --raw flag to yt captions

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#927 - Crawl jobs output should have a timestamp column

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#926 - Add linktr.ee scraper

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#923 - documentation is gone

Issue - State: closed - Opened by uf0 about 1 year ago - 3 comments

#922 - Scrape command -m, -e and --field

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#921 - Add flag to yt captions command to emit one lossy line per video instead

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#920 - Harmonize fetch command behavior wrt variadicity, it confuses people

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#919 - Add timeout kwargs to crawl_command

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#916 - Add way to feed data to particular spider from crawl command

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement

#915 - Support mixed function/Spider instances dicts in crawl command

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug

#913 - Invalid spider target error is not clear

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation

#911 - Add some diagram for the crawler's lifecyle and architecture

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: documentation

#910 - Doc often use #.scrape instead of #.scrape_one

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation

#909 - Refactor mediacloud client and unify auth errors

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: refactor

#908 - The twitter scraper is erratic because of new data format again

Issue - State: closed - Opened by Yomguithereal about 1 year ago - 1 comment
Labels: bug

#907 - GitHub broke custom html id in markdown rendering

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation

#906 - Integrate a file writer to the http executor like the crawler? also integrate the folder strategy?

Issue - State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement, refactor

#905 - Force users to rely on -o/--output on windows

Issue - State: closed - Opened by Yomguithereal about 1 year ago - 1 comment
Labels: bug, enhancement

#904 - Issue when asking for text or soup of empty bodies

Issue - State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug

#903 - conform buzzsumo to TabularRecord dataclass type

Pull Request - State: closed - Opened by kat-kel over 1 year ago

#902 - Move nested total of progress bar to upper column?

Issue - State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#900 - --scraped-column-prefix

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement

#899 - Rework scrape command semantics

Issue - State: closed - Opened by Yomguithereal over 1 year ago - 1 comment
Labels: enhancement, refactor

#898 - Filename was not renamed to path in scrape/extract?

Issue - State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug, documentation