Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / moj-analytical-services/splink issues and pull requests
#2600 - [FEAT] Profiling: array size
Issue -
State: open - Opened by samnlindsay 2 days ago
Labels: enhancement, profiling
#2599 - Update the version of bundled vega package
Issue -
State: open - Opened by hedsnz 3 days ago
#2598 - country flags
Pull Request -
State: closed - Opened by RobinL 3 days ago
#2597 - add ontario
Pull Request -
State: closed - Opened by RobinL 3 days ago
#2596 - update use cases
Pull Request -
State: closed - Opened by RobinL 6 days ago
#2595 - [FEAT] Include the option to retain the __splink__blocked_id_pairs table
Issue -
State: open - Opened by fscholes 8 days ago
- 1 comment
Labels: enhancement
#2593 - ICS use case of splink
Pull Request -
State: closed - Opened by BenNBEIS 9 days ago
#2591 - add modify settings exampel to cookbook
Pull Request -
State: closed - Opened by RobinL 17 days ago
#2590 - Update index.md
Pull Request -
State: closed - Opened by gidelpanta 17 days ago
#2589 - Bug - Realtime cache collision
Pull Request -
State: closed - Opened by ADBond 17 days ago
Labels: bug, caching
#2588 - Should `BlockingRule`s store identifier column information?
Issue -
State: open - Opened by ADBond 21 days ago
Labels: blocking, maintenance
#2587 - Add ArrayIntersect default
Pull Request -
State: closed - Opened by RossKen 21 days ago
#2586 - ColumnExpression - `NULLIF`
Pull Request -
State: closed - Opened by ADBond 21 days ago
Labels: enhancement
#2585 - `ColumnExpression` first/last index
Pull Request -
State: closed - Opened by ADBond 22 days ago
Labels: enhancement
#2584 - TF adjustments not displayed and most likely not applied
Issue -
State: open - Opened by pierpaolocreanza 23 days ago
- 2 comments
#2583 - [FEAT] Specify logger within SettingsCreator
Issue -
State: open - Opened by will-holley 24 days ago
- 1 comment
Labels: enhancement
#2581 - Error in Pipeline from estimate_m_from_label_column with Spark Backend
Issue -
State: closed - Opened by ktzsh 28 days ago
- 8 comments
Labels: bug
#2580 - Improve runtimes but 'pushing up' common Case Statements into precomputed values
Issue -
State: open - Opened by RobinL 29 days ago
- 1 comment
#2578 - One to one clustering
Pull Request -
State: open - Opened by aymonwuolanne about 1 month ago
- 9 comments
#2577 - Fix spark database double-quoting
Pull Request -
State: closed - Opened by julijonas about 1 month ago
#2576 - Spark database is double-quoted when it's not a valid identifier
Issue -
State: closed - Opened by julijonas about 1 month ago
Labels: bug
#2572 - Remove unused binder files
Pull Request -
State: closed - Opened by RobinL about 1 month ago
#2571 - Bump jinja2 from 3.0.3 to 3.1.5 in /scripts
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
- 1 comment
Labels: dependencies, python
#2570 - [FEAT] datetime range columns support
Issue -
State: open - Opened by happysalada about 2 months ago
Labels: enhancement
#2568 - Return Similarity scores from Linker
Issue -
State: closed - Opened by hananshandler about 2 months ago
- 1 comment
#2567 - add ukhsa
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2565 - fix typo
Pull Request -
State: closed - Opened by RossKen about 2 months ago
#2563 - [FEAT] block_on Method Should Support Arrays And SUBSTR
Issue -
State: open - Opened by ModeMonkey about 2 months ago
Labels: enhancement
#2562 - [FEAT] One-to-one clustering
Issue -
State: open - Opened by aymonwuolanne about 2 months ago
- 11 comments
Labels: enhancement
#2561 - [DOCS] Use block on rather than sql strings in 50k example
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2560 - Handling order of list arguments when producing comparison levels
Issue -
State: open - Opened by medwar99 about 2 months ago
- 1 comment
Labels: bug
#2559 - add SAIL SERP usage
Pull Request -
State: closed - Opened by medwar99 about 2 months ago
#2558 - add dfe
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2557 - Improve llm prompts
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2556 - added dod
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2555 - Add rationale for training
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2554 - Fix formatting of docs
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2553 - Modelling guide
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2552 - Add gn group
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2551 - Make igraph explicitly non-optional
Pull Request -
State: closed - Opened by ADBond about 2 months ago
Labels: dependencies
#2549 - add knowledgebase
Pull Request -
State: closed - Opened by RobinL about 2 months ago
#2547 - Fix reference to similarity_jar_location
Pull Request -
State: closed - Opened by julijonas about 2 months ago
#2546 - Add Spark support for PairwiseStringDistanceFunction
Pull Request -
State: closed - Opened by zmbc 2 months ago
- 1 comment
#2545 - [FEAT] Generalize `link_type`
Issue -
State: open - Opened by zmbc 2 months ago
Labels: enhancement
#2544 - Link to custom GPT
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2543 - improve llm prompt
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2542 - Fix typos in docs
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2541 - Llm prompt to docs
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2540 - Storing outcomes of similarity functions across a comparison to avoid re-compute
Issue -
State: open - Opened by lamaeldo 2 months ago
- 1 comment
Labels: enhancement
#2539 - Optimise speed of training u
Issue -
State: open - Opened by RobinL 2 months ago
- 1 comment
#2538 - Add speed tests to docs
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2537 - 4.0.6 release
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2536 - ModuleNotFoundError: No Module named 'splink.duckdb'
Issue -
State: closed - Opened by jpasner 2 months ago
- 2 comments
Labels: bug
#2534 - Unlinkables chart cuts off for high match weights
Issue -
State: open - Opened by RobinL 2 months ago
Labels: good first issue
#2533 - Use __eq__ to allow InputColumns to be compared without having to call quote() or unquote()
Issue -
State: open - Opened by RobinL 2 months ago
- 1 comment
#2532 - Make `Settings._columns_used_by_comparisons` unquoted
Pull Request -
State: closed - Opened by ADBond 2 months ago
Labels: maintenance
#2531 - [FEAT] Labelling tool - Hide Splink predictions from the page by default
Issue -
State: open - Opened by RobinL 2 months ago
#2529 - Splink comparison viewer barplot and waterfall chart don't agree on match probability
Issue -
State: open - Opened by francisduval 2 months ago
- 2 comments
Labels: bug
#2527 - Explicit tf columns select
Pull Request -
State: closed - Opened by ADBond 2 months ago
#2525 - Avoid bug with checkpointing by switching to parquet
Pull Request -
State: closed - Opened by RobinL 2 months ago
#2524 - Problems with `checkpoint` and `cluster_at_multiple_thresholds`
Issue -
State: closed - Opened by RobinL 2 months ago
#2522 - Test duration of debug_mode tests significantly increasing total time of test suite
Issue -
State: open - Opened by RobinL 2 months ago
- 1 comment
#2521 - Test python 3.13
Pull Request -
State: closed - Opened by ADBond 2 months ago
Labels: dependencies, continuous integration
#2520 - Deprecation warning for python 3.8
Pull Request -
State: closed - Opened by ADBond 2 months ago
- 3 comments
Labels: dependencies
#2518 - Constrain dev pandas version
Pull Request -
State: closed - Opened by ADBond 3 months ago
Labels: dependencies, maintenance
#2517 - Pairwise string distance comparison
Pull Request -
State: closed - Opened by zmbc 3 months ago
- 3 comments
#2516 - Add poetry configuration to conda script, bump versions
Pull Request -
State: closed - Opened by zmbc 3 months ago
#2515 - Realtime test sometime failing?
Issue -
State: closed - Opened by ADBond 3 months ago
- 11 comments
Labels: testing
#2514 - Update lockfile + fixes for latest package versions
Pull Request -
State: closed - Opened by ADBond 3 months ago
Labels: dependencies, testing, maintenance
#2513 - Update CONTRIBUTING.md with correct link
Pull Request -
State: closed - Opened by zmbc 3 months ago
#2511 - New sqlglot breaks custom Spark dialect
Issue -
State: closed - Opened by ADBond 3 months ago
- 3 comments
Labels: bug, dependencies
#2510 - Bug - get columns of DuckDB frame even when table is empty
Pull Request -
State: closed - Opened by ADBond 3 months ago
- 1 comment
#2506 - With Splink 4.0.5, cluster_pairwise_predictions_at_threshold(...) throws an error if df_predict is empty
Issue -
State: closed - Opened by thibault1024 3 months ago
- 1 comment
Labels: bug
#2505 - Streamline docs
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2504 - Spark test session handling
Pull Request -
State: closed - Opened by ADBond 3 months ago
Labels: spark, testing
#2503 - Fix count_comparisons_from_blocking_rule
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2502 - [Term frequency adjustments ] Would like the ability to include pre-computed term frequency adjustments (e.g. from the general population)
Issue -
State: closed - Opened by pierpaolocreanza 3 months ago
- 2 comments
Labels: enhancement
#2501 - count_comparisons_from_blocking_rule gets incorrect answer if blocking conditions are surrounded with brackets
Issue -
State: closed - Opened by RobinL 3 months ago
#2500 - remove unnecessary import
Pull Request -
State: closed - Opened by lubrst 3 months ago
- 1 comment
#2499 - ModuleNotFoundError: No module named "pyarrow"
Issue -
State: closed - Opened by lubrst 3 months ago
- 1 comment
Labels: bug
#2498 - Improve compare two records
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2497 - Update changelog
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2496 - Athena: Running predict `link_only` when one input is empty should return an empty result set, or raise an error if an empty input is not valid
Issue -
State: open - Opened by alanakilleen 3 months ago
Labels: bug
#2495 - 4.0.5 release
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2493 - Compare two records - allow dataframes to be registered
Pull Request -
State: closed - Opened by RobinL 3 months ago
#2492 - Use Trusted Publisher action for publishing to pypi
Issue -
State: open - Opened by Thomas-Hirsch 3 months ago
#2491 - Tests fail on latest packages
Issue -
State: closed - Opened by ADBond 3 months ago
- 2 comments
Labels: dependencies, testing
#2490 - input_table_aliases order os lables
Issue -
State: open - Opened by mjdias 3 months ago
Labels: bug
#2489 - Specify version range for `pytest-cov` in CI
Pull Request -
State: closed - Opened by ADBond 3 months ago
Labels: bug, dependencies, continuous integration
#2488 - Less caching in debug mode
Pull Request -
State: closed - Opened by ADBond 3 months ago
Labels: bug, caching, debug_mode
#2487 - Error when using Clickhouse WITH FILL + INTERPOLATE
Issue -
State: closed - Opened by brunorpinho 3 months ago
Labels: bug
#2486 - Issue with Registering Splink UDFs in Dataproc PySpark Job on GCP
Issue -
State: open - Opened by jessicadahdouh 3 months ago
#2485 - Fix clustering in debug mode
Pull Request -
State: closed - Opened by ADBond 3 months ago