Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / karlhigley/lexrank-summarizer issues and pull requests

#44 - Use accumulators to quantify boilerplate removal

Issue - State: open - Opened by karlhigley almost 9 years ago

#43 - Use GraphX .reverse method to generate bidirectional edges

Issue - State: open - Opened by karlhigley almost 9 years ago

#42 - Use Spark's Dataframes API

Issue - State: open - Opened by karlhigley almost 9 years ago - 1 comment

#41 - Maintain the order of excerpted sentences

Issue - State: open - Opened by karlhigley almost 9 years ago

#40 - Add a link to the LSH paper that explains the pooling trick

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#39 - Update the README to reflect dynamic stopword filtering

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#38 - Remove obsolete option for number of LSH buckets

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#37 - Switch from a stopword list to dynamically identified stopwords

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#36 - Replace explicit removal of zeros with conversion to SparseVector

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#35 - Remove extraneous launch script

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#34 - Stop combining documents with the same identifier during pre-processing

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#33 - Revise description of SRP-LSH boilerplate filtering

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#32 - Add LSH cosine estimation method of graph building to LexRank

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#31 - Compute stopwords from the corpus on the fly

Issue - State: closed - Opened by karlhigley about 9 years ago

#30 - Represent similarities as Floats instead of Doubles in LexRank

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#29 - Update to Spark 1.5.0

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#28 - Consolidate input document content by doc ID

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#27 - Rename CosineLSH to SignRandomProjectionLSH

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#26 - Update to Spark 1.4.1

Pull Request - State: closed - Opened by karlhigley about 9 years ago

#25 - Add a configuration option for the number of LSH buckets

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#24 - Cache (LSH signature, feature vector) pairs

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#23 - Disentangle/test similarity computation and Lexrank model

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#22 - Test and refactor featurization code

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#21 - Add basic tests for CosineLSH model

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#20 - Precompute size of sparsified matrices (instead of auto-computation)

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#19 - Improve tokenization to reduce dimensionality

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#18 - Fix broken variable reference from repartitioning code

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#17 - Use minPartitions argument when reading file instead of repartitioning

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#15 - Properly parse input lines with text containing tabs

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#14 - Combine input entries with the same identifier into a single document

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#13 - Apply Kryo serialization

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#12 - Adjust spacing and driver class name in README

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#11 - Remove extraneous imports from LexRank model

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#9 - Move featurization to separate class and excerpt selection to Driver

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#8 - Amend script to allow 2g memory for driver and executors

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#7 - Straighten out package name, split into separate files

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#6 - Add corpus-level boilerplate filtering

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#5 - Add a link to the LexRank paper in the README

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#4 - Avoid creating graph edges between different documents

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#3 - Extract featurization into a companion object utility function

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#2 - Fix a typo in the previous edge filtering refactor

Pull Request - State: closed - Opened by karlhigley over 9 years ago

#1 - Pre-filter the graph edges rather than filtering the graph itself

Pull Request - State: closed - Opened by karlhigley over 9 years ago