Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / moj-analytical-services/splink issues and pull requests

#1705 - Implement datediff using sqlglot (Splink4)

Pull Request - State: closed - Opened by RobinL over 1 year ago - 8 comments

#1704 - InputColumn fails on column name with space when using Spark SQL dialect

Issue - State: closed - Opened by RobinL over 1 year ago - 1 comment

#1703 - Splink4 Comparison Levels: Clarify and improve existing code

Pull Request - State: closed - Opened by RobinL over 1 year ago - 1 comment

#1702 - Corrected docstring to match connected components algorithm

Pull Request - State: closed - Opened by zslade over 1 year ago

#1701 - BlockingRule: Refactor to enable better iteration

Pull Request - State: closed - Opened by RobinL over 1 year ago - 5 comments

#1700 - BlockingRule: Clarify name of sql property

Pull Request - State: closed - Opened by RobinL over 1 year ago - 2 comments

#1699 - (WIP) Refactor blocking rule for clarity

Pull Request - State: closed - Opened by RobinL over 1 year ago - 1 comment

#1698 - Fix spark fixture

Pull Request - State: closed - Opened by ADBond over 1 year ago
Labels: testing

#1697 - FYI: paper using combo of embeddings and comparisons

Issue - State: open - Opened by NickCrews over 1 year ago - 1 comment

#1696 - [Dev] Enqueue multiple SQLs

Issue - State: open - Opened by ADBond over 1 year ago
Labels: dev, refactoring

#1695 - Refactor `block_using_rules_sql` to follow normal pattern and avoid confusion

Pull Request - State: closed - Opened by RobinL over 1 year ago - 2 comments

#1694 - fix docstrings to use .to_dict() instead of .spec

Pull Request - State: closed - Opened by aymonwuolanne over 1 year ago

#1693 - Explicitly cast postgres function return values

Pull Request - State: closed - Opened by sluhn-harrisr over 1 year ago - 3 comments

#1692 - Refactor of 1664: add ability to do efficient blocking based on list/array intersections

Pull Request - State: closed - Opened by RobinL over 1 year ago - 8 comments

#1691 - Fix typos

Pull Request - State: closed - Opened by ADBond over 1 year ago

#1690 - Added a Changelog

Pull Request - State: closed - Opened by ADBond over 1 year ago
Labels: documentation, user experience

#1689 - Comparison level builders initial work

Pull Request - State: closed - Opened by ADBond over 1 year ago - 4 comments
Labels: comparison levels, splink4, refactoring

#1688 - InputColumn methods could be properties

Issue - State: closed - Opened by ADBond over 1 year ago - 1 comment
Labels: refactoring

#1687 - [DOCS] Broken link to performance guide

Issue - State: closed - Opened by ADBond over 1 year ago - 4 comments
Labels: documentation

#1686 - `_names_of_tables_created_by_splink` is unused

Issue - State: closed - Opened by OlivierBinette over 1 year ago - 2 comments

#1685 - DB type error when creating ave_months_between function in postgres 9.4

Issue - State: closed - Opened by sluhn-harrisr over 1 year ago - 1 comment
Labels: bug, postgres

#1682 - typo

Pull Request - State: closed - Opened by sama-ds over 1 year ago

#1681 - Splink4 comparison levels

Pull Request - State: closed - Opened by RobinL over 1 year ago

#1680 - Connection refused using SparkLinker

Issue - State: closed - Opened by msp297 over 1 year ago - 1 comment

#1679 - [FEAT] Cross-Backend / Lazily Evaluated Comparisons and Blocking Rule builders

Issue - State: closed - Opened by ThomasHepworth over 1 year ago - 8 comments
Labels: enhancement, Interface/API improvement, backend_change, comparison levels, blocking, splink4

#1678 - [MAINT] `linker.py` cleaning/refactoring

Issue - State: open - Opened by ThomasHepworth over 1 year ago - 3 comments
Labels: enhancement, splink4, refactoring

#1677 - Cluster metrics

Pull Request - State: closed - Opened by zslade over 1 year ago - 2 comments

#1676 - Check for single comparison or single valid dataframe column

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 1 comment

#1675 - Clean load settings

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 2 comments

#1674 - improve the settings validation documentation

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1673 - [FEAT] - Validate numeric values in the settings dictionary

Issue - State: open - Opened by ThomasHepworth over 1 year ago
Labels: enhancement, validation, settings

#1672 - fixed null level issue for composing comparison levels

Pull Request - State: closed - Opened by aymonwuolanne over 1 year ago - 2 comments

#1670 - [MAINT] Consolidate our multiple linkers/introducing lazy factories

Issue - State: closed - Opened by ThomasHepworth over 1 year ago - 2 comments
Labels: enhancement, splink4

#1669 - [MAINT] Settings restructuring - bringing everything into a centralised settings object

Issue - State: open - Opened by ThomasHepworth over 1 year ago - 1 comment
Labels: enhancement, splink4, settings

#1668 - Automatically detect blocking rules for prediction and blocking rules for EM training

Pull Request - State: closed - Opened by RobinL over 1 year ago - 3 comments

#1667 - Compute the cost of combinations of blocking rules

Pull Request - State: closed - Opened by RobinL over 1 year ago - 2 comments

#1666 - [FEAT] Diagnostics mode to get summary breakdowns of SQL execution

Issue - State: open - Opened by ThomasHepworth over 1 year ago - 1 comment
Labels: enhancement, profiling, performance, user experience

#1664 - Add ability to do efficient blocking based on list/array intersections

Pull Request - State: closed - Opened by nerskin over 1 year ago - 12 comments

#1663 - Linker API subset pages in docs have all methods

Issue - State: open - Opened by ADBond over 1 year ago - 5 comments
Labels: documentation

#1662 - Update missingness.py

Pull Request - State: closed - Opened by samnlindsay over 1 year ago - 2 comments

#1661 - `completeness_data` unused `cols` argument

Issue - State: closed - Opened by samnlindsay over 1 year ago
Labels: bug, profiling, charts

#1659 - fix duplicate doc files

Pull Request - State: closed - Opened by RossKen over 1 year ago

#1658 - [MAINT] Prefix non-user facing functions with an underscore

Issue - State: open - Opened by RobinL over 1 year ago
Labels: enhancement

#1656 - Bump urllib3 from 1.26.16 to 1.26.18

Pull Request - State: closed - Opened by dependabot[bot] over 1 year ago
Labels: dependencies

#1655 - Update binder to point to splink repo

Pull Request - State: closed - Opened by RobinL over 1 year ago - 1 comment

#1654 - (WIP) Binder config

Pull Request - State: closed - Opened by RobinL over 1 year ago - 2 comments

#1653 - Lambda default warning

Pull Request - State: closed - Opened by RossKen over 1 year ago - 1 comment

#1652 - Fix issue 1651 - comparison viewer bars sorted improperly

Pull Request - State: closed - Opened by RobinL over 1 year ago - 2 comments

#1651 - BUG: `comparison_viewer_dashboard` vectors are not synced with histogram

Issue - State: closed - Opened by NickCrews over 1 year ago - 2 comments

#1650 - implement a janitorclass to automatically format class attributes

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 1 comment

#1649 - [FEAT] use base 10 for log odds

Issue - State: closed - Opened by NickCrews over 1 year ago - 4 comments
Labels: enhancement

#1648 - add settings validation docs

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 1 comment

#1646 - Fix bug with labelling tool where it didn't work offline

Pull Request - State: closed - Opened by RobinL over 1 year ago - 1 comment

#1645 - The Labelling tool does not work offline as intended

Issue - State: closed - Opened by RobinL over 1 year ago

#1644 - Splink 4: Backwards-incompatible API changes

Issue - State: open - Opened by RobinL over 1 year ago - 9 comments
Labels: splink4

#1643 - [FEAT] Addition to training rules topic guide to prevent violating independence assumption

Issue - State: open - Opened by sama-ds over 1 year ago - 1 comment
Labels: enhancement

#1642 - [DEPS] Update sqlglot to >=13.0.0

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1640 - [FEAT] Date comparison without datediffs

Issue - State: open - Opened by samnlindsay over 1 year ago
Labels: enhancement

#1639 - Migrate splink_vis_utils.js changes to upstream repo

Pull Request - State: closed - Opened by RobinL over 1 year ago - 4 comments

#1638 - [FEAT] Improved docs on estimate_probability_two_random_records_match

Issue - State: open - Opened by sama-ds over 1 year ago - 1 comment
Labels: enhancement

#1637 - V3.9.8

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1636 - [MAINT] Refactor and clean our settings validation logs

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 3 comments

#1635 - fix problem with csv overwriting in spark

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1633 - [FEAT] Charts gallery - Precision Recall

Issue - State: open - Opened by RossKen over 1 year ago
Labels: documentation, enhancement, charts

#1632 - [FEAT] Charts gallery - Cluster Studio

Issue - State: open - Opened by RossKen over 1 year ago
Labels: documentation, enhancement, charts

#1631 - [FEAT] Charts gallery - Comparison Viewer dashboard

Issue - State: open - Opened by RossKen over 1 year ago
Labels: documentation, enhancement, charts

#1630 - Bump urllib3 from 1.26.16 to 1.26.17

Pull Request - State: closed - Opened by dependabot[bot] over 1 year ago - 1 comment
Labels: dependencies

#1624 - update ctl docs

Pull Request - State: closed - Opened by afua-moj over 1 year ago

#1622 - [DEPS] Update scala jars and adjust dependencies

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago
Labels: dependencies, spark

#1620 - Perform spellcheck

Pull Request - State: closed - Opened by zslade over 1 year ago

#1619 - [BUG] History of probability_two_random_records_match not shown correctly in tooltip

Issue - State: open - Opened by RobinL over 1 year ago
Labels: good first issue

#1617 - Installation docs tweak

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1616 - enable_splink() in databricks fails

Issue - State: closed - Opened by robertwhiffin over 1 year ago - 3 comments

#1614 - [FEAT] viz blocking rules using upset chart

Issue - State: closed - Opened by NickCrews over 1 year ago - 4 comments
Labels: enhancement, graphs, blocking

#1611 - Check input frames have same columns - missingness

Pull Request - State: closed - Opened by ADBond over 1 year ago

#1610 - 3.9.7

Pull Request - State: closed - Opened by RossKen over 1 year ago

#1605 - Duplicate docs files

Issue - State: closed - Opened by ADBond over 1 year ago
Labels: documentation

#1604 - Fix tf table init on settings load

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 4 comments

#1600 - [DEPS] Drop support for py 3.7

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 2 comments

#1599 - Add roc chart to gallery

Pull Request - State: closed - Opened by RossKen over 1 year ago

#1598 - [FEAT] Add AUC to ROC chart

Issue - State: open - Opened by RossKen over 1 year ago
Labels: enhancement, charts

#1597 - [FEAT] Reassess benchmarking

Issue - State: open - Opened by RossKen over 1 year ago
Labels: enhancement, continuous integration

#1596 - Comparison viewer - handle comparisons with spacey names + fix waterfall tooltip

Pull Request - State: closed - Opened by ADBond over 1 year ago - 1 comment
Labels: charts

#1595 - add path arguments to automated tests

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 1 comment

#1594 - Only run tests if files under `splink` or `tests` have been edited

Issue - State: closed - Opened by ThomasHepworth over 1 year ago
Labels: enhancement, continuous integration

#1593 - Fix local docs table gen

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 2 comments

#1592 - SQLite example notebook fix

Pull Request - State: closed - Opened by ADBond over 1 year ago - 2 comments
Labels: continuous integration

#1591 - add initial block on docs

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 2 comments

#1590 - Stopped repeat installs if already in docs-venv

Pull Request - State: closed - Opened by zslade over 1 year ago

#1589 - run the linter

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago - 1 comment

#1588 - Add Spellchecker

Pull Request - State: closed - Opened by zslade over 1 year ago - 2 comments

#1587 - [FEAT] Spellchecker for Docs

Issue - State: closed - Opened by zslade over 1 year ago - 1 comment
Labels: enhancement

#1586 - add charts dev guide

Pull Request - State: closed - Opened by RossKen over 1 year ago - 1 comment

#1585 - `group` -> `cluster`

Pull Request - State: closed - Opened by ThomasHepworth over 1 year ago

#1584 - Unable to create `SparkLinker` on Databricks

Issue - State: closed - Opened by cswaney over 1 year ago - 2 comments