Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / medialab/minet issues and pull requests
#997 - Reddit
Pull Request -
State: open - Opened by jpontoire 29 days ago
#996 - Experiment with importlib.LazyLoader
Issue -
State: open - Opened by Yomguithereal about 1 month ago
Labels: investigation, optimization
#995 - Drop commands relying on the Facebook legacy mobile website scraper
Issue -
State: open - Opened by Yomguithereal about 2 months ago
Labels: refactor
#994 - Add instagram search by location id.
Pull Request -
State: closed - Opened by uf0 about 2 months ago
- 3 comments
#993 - update tiktok api url
Pull Request -
State: closed - Opened by uf0 about 2 months ago
- 2 comments
#992 - update tiktok api url
Pull Request -
State: closed - Opened by uf0 about 2 months ago
#991 - Upgrade trafilatura to v2
Issue -
State: open - Opened by Yomguithereal about 2 months ago
Labels: enhancement
#990 - Twitter search query cannot use double quotes
Issue -
State: closed - Opened by Tyrannas 3 months ago
- 3 comments
Labels: bug
#989 - when i try to extract comments from instagram post i get running time error
Issue -
State: closed - Opened by wacns 4 months ago
- 7 comments
Labels: bug
#988 - Fix typos again
Pull Request -
State: closed - Opened by kianmeng 4 months ago
- 1 comment
#987 - Refactor RequestRetrying to avoid it altogether
Issue -
State: closed - Opened by Yomguithereal 4 months ago
Labels: refactor
#986 - Upgrade ural
Issue -
State: closed - Opened by Yomguithereal 5 months ago
#985 - Upgrade trafilatura
Issue -
State: closed - Opened by Yomguithereal 5 months ago
Labels: enhancement, refactor
#984 - Retire minet.buzzsumo, minet.crowdtangle
Issue -
State: closed - Opened by Yomguithereal 5 months ago
Labels: refactor
#983 - Forward SelectionError to minet.scrape
Issue -
State: closed - Opened by Yomguithereal 7 months ago
Labels: dx
#982 - Command to add jobs to a crawler's queue
Issue -
State: open - Opened by Yomguithereal 7 months ago
Labels: enhancement
#981 - CrawlJob data type should not be wrapped in an optional by default
Issue -
State: closed - Opened by Yomguithereal 7 months ago
Labels: typing
#980 - Adjust twitter scraper retryer and rate limit (again)
Issue -
State: closed - Opened by Yomguithereal 7 months ago
Labels: bug
#979 - Add more automatic context when Spider.process raises
Issue -
State: closed - Opened by Yomguithereal 7 months ago
Labels: dx
#978 - KeyError: 'expanded_url' with minet twitter scrape tweets
Issue -
State: closed - Opened by lakonis 7 months ago
- 5 comments
#977 - Issues (core dump or cannot unpack non-iterable FocusCrawlInfo object)
Issue -
State: closed - Opened by TeaS0710 7 months ago
- 1 comment
Labels: bug
#976 - potential changes in rate limit of twitter public API
Issue -
State: closed - Opened by taniki 8 months ago
- 3 comments
Labels: bug
#975 - When -c is not specified, we should default to test all available browsers instead of only firefox
Issue -
State: open - Opened by Yomguithereal 8 months ago
- 1 comment
Labels: enhancement, dx
#974 - -I should default to "downloaded" in scrape and extract
Issue -
State: closed - Opened by Yomguithereal 8 months ago
- 1 comment
Labels: enhancement, dx
#973 - "Invalid Twitter cookie!" error (possibly due to migration from twitter.com to x.com ?)
Issue -
State: closed - Opened by leomignot 9 months ago
- 3 comments
Labels: bug
#972 - tiktok search-videos error
Issue -
State: closed - Opened by csamson-sf 9 months ago
- 3 comments
Labels: bug
#971 - Spider process exceptions should at least be raised with some context around them
Issue -
State: closed - Opened by Yomguithereal 9 months ago
Labels: enhancement, dx
#970 - Add FORWARD_SPIDER option
Issue -
State: closed - Opened by Yomguithereal 9 months ago
Labels: enhancement, dx
#969 - Error on wikipedia pageviews
Issue -
State: open - Opened by bmaz 9 months ago
- 1 comment
Labels: bug
#968 - Scrapping 1000's of comments on Instagram
Issue -
State: open - Opened by Geminy3 9 months ago
- 3 comments
Labels: bug, question
#967 - Retrieve videos from instagram hashtag function
Issue -
State: open - Opened by Tyrannas 10 months ago
- 9 comments
Labels: bug, enhancement
#966 - Draw edges kwarg
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement
#965 - Improve ThreadsafeBrowser.request stability by retrying content acquisition if needed
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: bug
#964 - Add LoadingBar.track
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor
#963 - ThreadsafeBrowser enhancements
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement
#962 - instagram post-infos should have line parity in the output and increase a stat rather than log
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: bug
#961 - Refactor Crawler request_args as inheritance
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: refactor
#960 - Upgrade rich and other deps
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement
#959 - Upgrade trafilatura and deal with lxml_html_clean
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor
#958 - Spider process error should lead to errorred crawl?
Issue -
State: closed - Opened by Yomguithereal 10 months ago
- 1 comment
Labels: bug
#957 - Add a playwright version of the crawler
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement
#956 - Upgrade to min version py3.8
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor
#955 - Method of CrawlJob to get an identical CrawlTarget to retry
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: enhancement
#954 - There should be a Crawler side global callback for each job
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement
#953 - os.makedirs is already threadsafe
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: refactor, optimization
#952 - Move away from lxml as default soup engine?
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: discussion
#951 - Crawler should lazily open data files to be written until there is actually something to write
Issue -
State: closed - Opened by Yomguithereal 10 months ago
Labels: bug
#950 - Add some crawler level job filter
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: enhancement
#949 - Make minet installable without playwright
Issue -
State: open - Opened by Yomguithereal 10 months ago
Labels: dx
#948 - Youtube improvements
Issue -
State: closed - Opened by Yomguithereal 11 months ago
Labels: enhancement
#947 - A challenges challenge
Issue -
State: open - Opened by Yomguithereal 11 months ago
Labels: bug
#946 - Multithreaded API clients connection pools should allow more connections
Issue -
State: closed - Opened by Yomguithereal 11 months ago
Labels: optimization
#945 - Add a "minet hal" module?
Issue -
State: open - Opened by boogheta 11 months ago
Labels: enhancement
#944 - x.com urls are not usually recognized
Issue -
State: closed - Opened by Yomguithereal 11 months ago
Labels: bug
#943 - Command tw tweets should not require API key
Issue -
State: closed - Opened by Yomguithereal 11 months ago
Labels: bug
#942 - ErroredCrawlReponse should de facto None response attributes
Issue -
State: closed - Opened by Yomguithereal 11 months ago
Labels: bug
#941 - User friendly error message when spider returns a non-2-tuple
Issue -
State: closed - Opened by Yomguithereal 12 months ago
Labels: dx
#940 - Adding a scraper for Facebook users' hometown and current city information
Pull Request -
State: closed - Opened by camillechanial 12 months ago
#939 - path column is not very useful when using --glob file on scrape/extract
Issue -
State: open - Opened by Yomguithereal 12 months ago
Labels: bug
#938 - Adding builtin scraper for Europresse
Pull Request -
State: closed - Opened by bmaz 12 months ago
#937 - lzma issues
Issue -
State: closed - Opened by Yomguithereal 12 months ago
Labels: bug
#936 - /usr/local/bin does not exist on recent mac installs
Issue -
State: closed - Opened by Yomguithereal 12 months ago
Labels: bug
#935 - Telegram channel-messages crashes on string parsing
Issue -
State: closed - Opened by Yomguithereal 12 months ago
Labels: bug
#934 - Stop condition of twitter scraper is not working anymore
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#933 - Try to scrape twitter embed for tweets rather than using the Guest scraper
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement, investigation
#932 - Return None if TwitterGuestAPIScraper.tweet does not find tweet
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#931 - Faster rate limit for TwitterGuestAPIScraper
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#930 - Should json encode the query sent in GET param when scraping twitter search
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#929 - Add --raw flag to yt captions
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#928 - Possible issue with jobs duplication if crawler crashes somewhere around the processing?
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: bug
#927 - Crawl jobs output should have a timestamp column
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#926 - Add linktr.ee scraper
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#925 - The crawler might ask a spider to process an already visited url even if the end_url is already in cache
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: bug
#924 - crawl_command unique & url_cache don't work well together because of double del
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#923 - documentation is gone
Issue -
State: closed - Opened by uf0 about 1 year ago
- 3 comments
#922 - Scrape command -m, -e and --field
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#921 - Add flag to yt captions command to emit one lossy line per video instead
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#920 - Harmonize fetch command behavior wrt variadicity, it confuses people
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#919 - Add timeout kwargs to crawl_command
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#918 - Add extraction method to scrape command related to social network account displayed on homepages
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#917 - Threading & Processing race condition over written file in scrape and extract command
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#916 - Add way to feed data to particular spider from crawl command
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#915 - Support mixed function/Spider instances dicts in crawl command
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#914 - Add method to response to return stripped body and/or add middleware to edit response body on the flight
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement
#913 - Invalid spider target error is not clear
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation
#912 - Decide whether iterable/singular polymorphism is a good idea for next target enqueuing from spiders
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: question, typing
#911 - Add some diagram for the crawler's lifecyle and architecture
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: documentation
#910 - Doc often use #.scrape instead of #.scrape_one
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation
#909 - Refactor mediacloud client and unify auth errors
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: refactor
#908 - The twitter scraper is erratic because of new data format again
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
- 1 comment
Labels: bug
#907 - GitHub broke custom html id in markdown rendering
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug, documentation
#906 - Integrate a file writer to the http executor like the crawler? also integrate the folder strategy?
Issue -
State: open - Opened by Yomguithereal about 1 year ago
Labels: enhancement, refactor
#905 - Force users to rely on -o/--output on windows
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
- 1 comment
Labels: bug, enhancement
#904 - Issue when asking for text or soup of empty bodies
Issue -
State: closed - Opened by Yomguithereal about 1 year ago
Labels: bug
#903 - conform buzzsumo to TabularRecord dataclass type
Pull Request -
State: closed - Opened by kat-kel over 1 year ago
#902 - Move nested total of progress bar to upper column?
Issue -
State: open - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#901 - Scrape & extract should produce ordered output by default and have a --unordered flag
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#900 - --scraped-column-prefix
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: enhancement
#899 - Rework scrape command semantics
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
- 1 comment
Labels: enhancement, refactor
#898 - Filename was not renamed to path in scrape/extract?
Issue -
State: closed - Opened by Yomguithereal over 1 year ago
Labels: bug, documentation