Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ropensci/textreuse issues and pull requests

#97 - New Maintainer Welcome :-)

Issue - State: closed - Opened by maelle 5 months ago
Labels: help wanted

#96 - Deprecated feature reported in lsh-function

Issue - State: open - Opened by Lrantala over 1 year ago - 1 comment

#95 - Punctuation align_local

Issue - State: open - Opened by EtienneFerrandi about 3 years ago

#94 - align_local documentation

Issue - State: closed - Opened by under-score almost 4 years ago - 2 comments

#93 - Adding new documents to a LSH object

Issue - State: open - Opened by retrography about 4 years ago

#92 - Parallel lsh_compare

Issue - State: closed - Opened by retrography about 4 years ago

#91 - Move "lsh_buckets" class to the left

Pull Request - State: open - Opened by romainfrancois over 4 years ago - 1 comment

#90 - Inconsistent skipping behavior in TextReuseCorpus

Issue - State: open - Opened by tylerandrewscott over 4 years ago - 3 comments

#89 - Added encoding argument to TextReuseCorpus and TextReuseTextDocument

Pull Request - State: open - Opened by davidfuhry over 4 years ago - 1 comment

#88 - Short documents and skip_grams assertion do not match

Issue - State: open - Opened by awagner-mainz almost 5 years ago - 2 comments

#87 - Add official docs url to description

Pull Request - State: open - Opened by jeroen almost 5 years ago - 1 comment

#86 - jaccard_similarity result?

Issue - State: closed - Opened by bihappywater almost 5 years ago - 1 comment

#85 - record linkage using textreuse

Issue - State: closed - Opened by bihappywater over 5 years ago - 1 comment

#84 - Apparently `min(bitwXor(h, i)` is fed a double at times when it wants integers

Issue - State: closed - Opened by ghost almost 6 years ago - 6 comments

#83 - Appveyor webhook

Issue - State: closed - Opened by maelle about 6 years ago - 4 comments

#82 - Error when calculating local_alignment

Issue - State: open - Opened by ManuelBurghardt over 6 years ago - 1 comment

#81 - Added rOpenSci review badge

Pull Request - State: closed - Opened by karthik almost 7 years ago - 1 comment

#80 - Operation with unevaluated n_call

Pull Request - State: closed - Opened by quartin about 7 years ago - 2 comments

#79 - Database backends

Issue - State: open - Opened by lmullen about 7 years ago - 1 comment

#77 - Add Rcpp interrupts

Issue - State: open - Opened by lmullen about 7 years ago

#76 - Typo: search for "two few words"

Issue - State: open - Opened by lmullen about 7 years ago

#75 - Error when using functions from tokenizers package

Issue - State: closed - Opened by mdlincoln about 7 years ago - 2 comments

#74 - Merging of corpora

Issue - State: closed - Opened by Ninoninoninonino over 7 years ago - 2 comments

#73 - can this packages support chinese corpus

Issue - State: closed - Opened by AlexYoung757 over 7 years ago - 1 comment

#72 - Question: Reuse minhash functions

Issue - State: closed - Opened by iainmwallace over 7 years ago - 2 comments

#71 - Implement earth mover distances

Issue - State: open - Opened by lmullen about 8 years ago

#70 - Depend on LSHR package

Issue - State: open - Opened by lmullen about 8 years ago

#69 - Parallelize lsh_compare()

Issue - State: open - Opened by lmullen over 8 years ago - 1 comment

#68 - Extra newline for print method for local alignments

Issue - State: open - Opened by lmullen over 8 years ago

#67 - Redo matrix methods

Issue - State: open - Opened by lmullen over 8 years ago

#66 - Some problem with lsh() function and data_frame?

Issue - State: closed - Opened by vmustafa over 8 years ago - 6 comments

#65 - Problem with converting to matrix

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#64 - Try to build a Corpus from character vector and got an error

Issue - State: closed - Opened by pommedeterresautee over 8 years ago - 3 comments

#63 - Set interactive = FALSE in all vignettes

Issue - State: closed - Opened by lmullen over 8 years ago

#62 - Re-documents imported/exported functions with roxygen 5.0

Issue - State: closed - Opened by lmullen over 8 years ago

#61 - switch from CharacterVector to a string vector

Pull Request - State: closed - Opened by Ironholds over 8 years ago - 1 comment

#60 - Parallelize wordcount.TextReuseCorpus?

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#59 - Define variables in sw_matrix only once

Pull Request - State: closed - Opened by noamross over 8 years ago - 1 comment

#56 - Add citation to original lsh/minhash paper

Issue - State: closed - Opened by lmullen over 8 years ago

#55 - Fix bug with blank ID in skipped documents

Issue - State: closed - Opened by lmullen over 8 years ago

#54 - Parallelize text reuse corpus?

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#53 - Keep more information in alignment objects

Issue - State: closed - Opened by lmullen over 8 years ago

#52 - Function to write alignment object to a file

Issue - State: closed - Opened by lmullen over 8 years ago

#51 - Add a minhashes element to a document/corpus

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#50 - Add vignette pipeable

Issue - State: closed - Opened by lmullen over 8 years ago

#49 - Implement Smith-Waterman local sequence alignment

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#48 - Performance regression in skipping documents?

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#47 - Reimplement lsh with dplyr rather than package hash

Issue - State: closed - Opened by lmullen over 8 years ago - 1 comment

#46 - Replace checks for columns with class identifier

Issue - State: closed - Opened by lmullen over 8 years ago

#44 - Create corpus from character vector (or possibly a list)

Issue - State: closed - Opened by lmullen over 8 years ago

#43 - Fix/docs spellcheck

Pull Request - State: closed - Opened by ashander over 8 years ago - 3 comments

#42 - Typo in `dir` parameter of TextReuseCorpus

Issue - State: closed - Opened by lmullen over 8 years ago

#41 - Hashing LSH buckets

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#40 - Progress bar for tokenize() and rehash()

Issue - State: closed - Opened by lmullen almost 9 years ago

#39 - Add checks for hashes when using TextReuseTextDocument methods

Issue - State: closed - Opened by lmullen almost 9 years ago

#38 - Tokenize() and rehash() should write their names to meta()

Issue - State: closed - Opened by lmullen almost 9 years ago

#37 - Put all comparison functions in same documentation

Issue - State: closed - Opened by lmullen almost 9 years ago

#36 - Write vignettes and README

Issue - State: closed - Opened by lmullen almost 9 years ago

#35 - Standardize candidates functions

Issue - State: closed - Opened by lmullen almost 9 years ago

#34 - Function to calculate number of bands and rows

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#33 - Reimplement lsh as a method that can work for documents or corpora

Issue - State: closed - Opened by lmullen almost 9 years ago

#32 - Functions to chunk documents

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#31 - Save hash_func and tokenizer in meta of corpus or document

Issue - State: closed - Opened by lmullen almost 9 years ago

#30 - Make pairwise_cf parallel

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#29 - In vignette, go into clustering functions

Issue - State: closed - Opened by lmullen almost 9 years ago

#28 - Add dedup function

Issue - State: closed - Opened by lmullen almost 9 years ago

#27 - Fix pairwise_cf so it works with corpus objects

Issue - State: closed - Opened by lmullen almost 9 years ago - 2 comments

#26 - Add `as` functions

Issue - State: closed - Opened by lmullen almost 9 years ago

#25 - Idea about skip ngrams and numbers

Issue - State: closed - Opened by lmullen almost 9 years ago

#24 - Implement retokenizing function

Issue - State: closed - Opened by lmullen almost 9 years ago

#23 - Jaccard similarity should use hashes not n-grams

Issue - State: closed - Opened by lmullen almost 9 years ago

#22 - Check the validity of the hash_string function

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#21 - ngrams tokenizer is super slow

Issue - State: closed - Opened by lmullen almost 9 years ago - 2 comments

#20 - Provide a set of tokenization functions

Issue - State: closed - Opened by lmullen almost 9 years ago

#19 - Abstract out design decisions

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#18 - Add jaccard_dissimilarity

Issue - State: closed - Opened by lmullen almost 9 years ago

#17 - Add word count method

Issue - State: closed - Opened by lmullen almost 9 years ago

#16 - Remove OCR quality measures into their own package

Issue - State: closed - Opened by lmullen almost 9 years ago - 1 comment

#15 - Remove heavy OCR files

Issue - State: closed - Opened by lmullen about 9 years ago - 1 comment

#14 - Depend on CRAN version of stringr

Issue - State: closed - Opened by lmullen about 9 years ago - 2 comments

#13 - Create a corpus of sample data

Issue - State: closed - Opened by lmullen about 9 years ago - 2 comments

#12 - Rename jaccard_coef

Issue - State: closed - Opened by lmullen about 9 years ago

#11 - Add "enhances"

Issue - State: closed - Opened by lmullen over 9 years ago - 1 comment

#10 - Should corpus_cf be called pairwise_cf?

Issue - State: closed - Opened by lmullen over 9 years ago

#9 - Bag similarity

Issue - State: closed - Opened by lmullen over 9 years ago

#8 - Document the meta() and content() functions from NLP

Issue - State: closed - Opened by lmullen over 9 years ago

#7 - Implement sum of matches

Issue - State: closed - Opened by lmullen over 9 years ago

#6 - Implement outer for lists

Issue - State: closed - Opened by lmullen over 9 years ago

#5 - Implement corpus index of ngrams

Issue - State: closed - Opened by lmullen over 9 years ago

#4 - Hash n-grams

Issue - State: closed - Opened by lmullen over 9 years ago

#3 - Implement minhash

Issue - State: closed - Opened by lmullen over 9 years ago - 2 comments

#2 - Implement Jaccard coefficient

Issue - State: closed - Opened by lmullen over 9 years ago

#1 - Measure OCR accuracy

Issue - State: closed - Opened by lmullen over 9 years ago - 2 comments