Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tim-sh/embeddings issues and pull requests

#18 - feat: max pooling aggregation strategy

Pull Request - State: closed - Opened by tim-sh 3 months ago

#17 - fix: deduplicate n-grams

Pull Request - State: closed - Opened by tim-sh 3 months ago

#16 - feat: store embeddings and combine using weighted mean

Pull Request - State: closed - Opened by tim-sh 3 months ago

#15 - feat: require self-contained docs

Pull Request - State: closed - Opened by tim-sh 3 months ago

#14 - fix: correctly attach tfIdf to each ngram

Pull Request - State: closed - Opened by tim-sh 3 months ago

#13 - Improve precision of vector computations

Pull Request - State: closed - Opened by tim-sh 4 months ago

#12 - Basic CLI frontend, API-usage reduction, more config options

Pull Request - State: closed - Opened by tim-sh 4 months ago

#11 - Compute embeddings and find most similar document

Pull Request - State: closed - Opened by tim-sh 4 months ago

#10 - Document library

Pull Request - State: closed - Opened by tim-sh 4 months ago

#9 - Transform to lowercase for deduplication

Pull Request - State: closed - Opened by tim-sh 4 months ago

#8 - Transform tokens to n-grams

Pull Request - State: closed - Opened by tim-sh 4 months ago

#7 - Remove stopwords

Pull Request - State: closed - Opened by tim-sh 4 months ago

#6 - Tokenize text

Pull Request - State: closed - Opened by tim-sh 4 months ago

#4 - Filter out 'omitted' stack frames

Pull Request - State: closed - Opened by tim-sh 4 months ago

#3 - Remove code delimiters before stack frames

Pull Request - State: closed - Opened by tim-sh 4 months ago

#2 - Add PR workflow

Pull Request - State: closed - Opened by tim-sh 4 months ago

#1 - GitHub Issue → text pipeline

Pull Request - State: closed - Opened by tim-sh 4 months ago