Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / ropensci/textreuse issues and pull requests
#97 - New Maintainer Welcome :-)
Issue -
State: closed - Opened by maelle 5 months ago
Labels: help wanted
#96 - Deprecated feature reported in lsh-function
Issue -
State: open - Opened by Lrantala over 1 year ago
- 1 comment
#95 - Punctuation align_local
Issue -
State: open - Opened by EtienneFerrandi about 3 years ago
#94 - align_local documentation
Issue -
State: closed - Opened by under-score almost 4 years ago
- 2 comments
#93 - Adding new documents to a LSH object
Issue -
State: open - Opened by retrography about 4 years ago
#92 - Parallel lsh_compare
Issue -
State: closed - Opened by retrography about 4 years ago
#91 - Move "lsh_buckets" class to the left
Pull Request -
State: open - Opened by romainfrancois over 4 years ago
- 1 comment
#90 - Inconsistent skipping behavior in TextReuseCorpus
Issue -
State: open - Opened by tylerandrewscott over 4 years ago
- 3 comments
#89 - Added encoding argument to TextReuseCorpus and TextReuseTextDocument
Pull Request -
State: open - Opened by davidfuhry over 4 years ago
- 1 comment
#88 - Short documents and skip_grams assertion do not match
Issue -
State: open - Opened by awagner-mainz almost 5 years ago
- 2 comments
#87 - Add official docs url to description
Pull Request -
State: open - Opened by jeroen almost 5 years ago
- 1 comment
#86 - jaccard_similarity result?
Issue -
State: closed - Opened by bihappywater almost 5 years ago
- 1 comment
#85 - record linkage using textreuse
Issue -
State: closed - Opened by bihappywater over 5 years ago
- 1 comment
#84 - Apparently `min(bitwXor(h, i)` is fed a double at times when it wants integers
Issue -
State: closed - Opened by ghost almost 6 years ago
- 6 comments
#83 - Appveyor webhook
Issue -
State: closed - Opened by maelle about 6 years ago
- 4 comments
#82 - Error when calculating local_alignment
Issue -
State: open - Opened by ManuelBurghardt over 6 years ago
- 1 comment
#81 - Added rOpenSci review badge
Pull Request -
State: closed - Opened by karthik almost 7 years ago
- 1 comment
#80 - Operation with unevaluated n_call
Pull Request -
State: closed - Opened by quartin about 7 years ago
- 2 comments
#79 - Database backends
Issue -
State: open - Opened by lmullen about 7 years ago
- 1 comment
#78 - Implement a method like the one described in Smith, Cordell, Mullen
Issue -
State: open - Opened by lmullen about 7 years ago
#77 - Add Rcpp interrupts
Issue -
State: open - Opened by lmullen about 7 years ago
#76 - Typo: search for "two few words"
Issue -
State: open - Opened by lmullen about 7 years ago
#75 - Error when using functions from tokenizers package
Issue -
State: closed - Opened by mdlincoln about 7 years ago
- 2 comments
#74 - Merging of corpora
Issue -
State: closed - Opened by Ninoninoninonino over 7 years ago
- 2 comments
#73 - can this packages support chinese corpus
Issue -
State: closed - Opened by AlexYoung757 over 7 years ago
- 1 comment
#72 - Question: Reuse minhash functions
Issue -
State: closed - Opened by iainmwallace over 7 years ago
- 2 comments
#71 - Implement earth mover distances
Issue -
State: open - Opened by lmullen about 8 years ago
#70 - Depend on LSHR package
Issue -
State: open - Opened by lmullen about 8 years ago
#69 - Parallelize lsh_compare()
Issue -
State: open - Opened by lmullen over 8 years ago
- 1 comment
#68 - Extra newline for print method for local alignments
Issue -
State: open - Opened by lmullen over 8 years ago
#67 - Redo matrix methods
Issue -
State: open - Opened by lmullen over 8 years ago
#66 - Some problem with lsh() function and data_frame?
Issue -
State: closed - Opened by vmustafa over 8 years ago
- 6 comments
#65 - Problem with converting to matrix
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#64 - Try to build a Corpus from character vector and got an error
Issue -
State: closed - Opened by pommedeterresautee over 8 years ago
- 3 comments
#63 - Set interactive = FALSE in all vignettes
Issue -
State: closed - Opened by lmullen over 8 years ago
#62 - Re-documents imported/exported functions with roxygen 5.0
Issue -
State: closed - Opened by lmullen over 8 years ago
#61 - switch from CharacterVector to a string vector
Pull Request -
State: closed - Opened by Ironholds over 8 years ago
- 1 comment
#60 - Parallelize wordcount.TextReuseCorpus?
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#59 - Define variables in sw_matrix only once
Pull Request -
State: closed - Opened by noamross over 8 years ago
- 1 comment
#58 - Function to query potential matches for just one (or more) documents from buckets
Issue -
State: closed - Opened by lmullen over 8 years ago
#57 - TextReuseCorpus does not always emit warnings when skipping short documents
Issue -
State: closed - Opened by lmullen over 8 years ago
#56 - Add citation to original lsh/minhash paper
Issue -
State: closed - Opened by lmullen over 8 years ago
#55 - Fix bug with blank ID in skipped documents
Issue -
State: closed - Opened by lmullen over 8 years ago
#54 - Parallelize text reuse corpus?
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#53 - Keep more information in alignment objects
Issue -
State: closed - Opened by lmullen over 8 years ago
#52 - Function to write alignment object to a file
Issue -
State: closed - Opened by lmullen over 8 years ago
#51 - Add a minhashes element to a document/corpus
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#50 - Add vignette pipeable
Issue -
State: closed - Opened by lmullen over 8 years ago
#49 - Implement Smith-Waterman local sequence alignment
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#48 - Performance regression in skipping documents?
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#47 - Reimplement lsh with dplyr rather than package hash
Issue -
State: closed - Opened by lmullen over 8 years ago
- 1 comment
#46 - Replace checks for columns with class identifier
Issue -
State: closed - Opened by lmullen over 8 years ago
#45 - Better handling of documents where number of words is less than n in n-grams
Issue -
State: closed - Opened by lmullen over 8 years ago
#44 - Create corpus from character vector (or possibly a list)
Issue -
State: closed - Opened by lmullen over 8 years ago
#43 - Fix/docs spellcheck
Pull Request -
State: closed - Opened by ashander over 8 years ago
- 3 comments
#42 - Typo in `dir` parameter of TextReuseCorpus
Issue -
State: closed - Opened by lmullen over 8 years ago
#41 - Hashing LSH buckets
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#40 - Progress bar for tokenize() and rehash()
Issue -
State: closed - Opened by lmullen almost 9 years ago
#39 - Add checks for hashes when using TextReuseTextDocument methods
Issue -
State: closed - Opened by lmullen almost 9 years ago
#38 - Tokenize() and rehash() should write their names to meta()
Issue -
State: closed - Opened by lmullen almost 9 years ago
#37 - Put all comparison functions in same documentation
Issue -
State: closed - Opened by lmullen almost 9 years ago
#36 - Write vignettes and README
Issue -
State: closed - Opened by lmullen almost 9 years ago
#35 - Standardize candidates functions
Issue -
State: closed - Opened by lmullen almost 9 years ago
#34 - Function to calculate number of bands and rows
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#33 - Reimplement lsh as a method that can work for documents or corpora
Issue -
State: closed - Opened by lmullen almost 9 years ago
#32 - Functions to chunk documents
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#31 - Save hash_func and tokenizer in meta of corpus or document
Issue -
State: closed - Opened by lmullen almost 9 years ago
#30 - Make pairwise_cf parallel
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#29 - In vignette, go into clustering functions
Issue -
State: closed - Opened by lmullen almost 9 years ago
#28 - Add dedup function
Issue -
State: closed - Opened by lmullen almost 9 years ago
#27 - Fix pairwise_cf so it works with corpus objects
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 2 comments
#26 - Add `as` functions
Issue -
State: closed - Opened by lmullen almost 9 years ago
#25 - Idea about skip ngrams and numbers
Issue -
State: closed - Opened by lmullen almost 9 years ago
#24 - Implement retokenizing function
Issue -
State: closed - Opened by lmullen almost 9 years ago
#23 - Jaccard similarity should use hashes not n-grams
Issue -
State: closed - Opened by lmullen almost 9 years ago
#22 - Check the validity of the hash_string function
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#21 - ngrams tokenizer is super slow
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 2 comments
#20 - Provide a set of tokenization functions
Issue -
State: closed - Opened by lmullen almost 9 years ago
#19 - Abstract out design decisions
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#18 - Add jaccard_dissimilarity
Issue -
State: closed - Opened by lmullen almost 9 years ago
#17 - Add word count method
Issue -
State: closed - Opened by lmullen almost 9 years ago
#16 - Remove OCR quality measures into their own package
Issue -
State: closed - Opened by lmullen almost 9 years ago
- 1 comment
#15 - Remove heavy OCR files
Issue -
State: closed - Opened by lmullen about 9 years ago
- 1 comment
#14 - Depend on CRAN version of stringr
Issue -
State: closed - Opened by lmullen about 9 years ago
- 2 comments
#13 - Create a corpus of sample data
Issue -
State: closed - Opened by lmullen about 9 years ago
- 2 comments
#12 - Rename jaccard_coef
Issue -
State: closed - Opened by lmullen about 9 years ago
#11 - Add "enhances"
Issue -
State: closed - Opened by lmullen over 9 years ago
- 1 comment
#10 - Should corpus_cf be called pairwise_cf?
Issue -
State: closed - Opened by lmullen over 9 years ago
#9 - Bag similarity
Issue -
State: closed - Opened by lmullen over 9 years ago
#8 - Document the meta() and content() functions from NLP
Issue -
State: closed - Opened by lmullen over 9 years ago
#7 - Implement sum of matches
Issue -
State: closed - Opened by lmullen over 9 years ago
#6 - Implement outer for lists
Issue -
State: closed - Opened by lmullen over 9 years ago
#5 - Implement corpus index of ngrams
Issue -
State: closed - Opened by lmullen over 9 years ago
#4 - Hash n-grams
Issue -
State: closed - Opened by lmullen over 9 years ago
#3 - Implement minhash
Issue -
State: closed - Opened by lmullen over 9 years ago
- 2 comments
#2 - Implement Jaccard coefficient
Issue -
State: closed - Opened by lmullen over 9 years ago
#1 - Measure OCR accuracy
Issue -
State: closed - Opened by lmullen over 9 years ago
- 2 comments