TF adjustments not displayed and most likely not applied #2584
Comments
Apologies for not replying sooner. This is probably the most complex custom comparison I've ever seen! I think it's unlikely that the term frequency configuration 'plays' nicely with some of the other operators (e.g. And). Sorry, this is not documented anywhere; it's just an unusual use case. I think that to get this going you'd need to use the low-level dictionary representation:
I'm not on my dev machine at the moment, but tomorrow I will try to add a couple of additional pointers and ways to debug this.
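For readers unfamiliar with the low-level form, here is a minimal sketch of a comparison written directly as a dictionary, with term frequency adjustments on two levels. It is not the code from the original comment. The column names `full_name` and `surname` are hypothetical, and the keys (`output_column_name`, `comparison_levels`, `sql_condition`, `label_for_charts`, `is_null_level`, `tf_adjustment_column`, `tf_adjustment_weight`) are the settings-dictionary keys as I understand them, so check them against the Splink version you are running.

```python
# Hypothetical comparison spec written as a plain dict.
# Column names (full_name, surname) are illustrative assumptions.
name_comparison = {
    "output_column_name": "full_name",
    "comparison_levels": [
        {
            "sql_condition": '"full_name_l" IS NULL OR "full_name_r" IS NULL',
            "label_for_charts": "Null",
            "is_null_level": True,
        },
        {
            "sql_condition": '"full_name_l" = "full_name_r"',
            "label_for_charts": "Exact match on full name",
            # TF adjustment is set per level, directly on the level dict
            "tf_adjustment_column": "full_name",
            "tf_adjustment_weight": 1.0,
        },
        {
            "sql_condition": '"surname_l" = "surname_r"',
            "label_for_charts": "Exact match on surname",
            "tf_adjustment_column": "surname",
            "tf_adjustment_weight": 1.0,
        },
        {"sql_condition": "ELSE", "label_for_charts": "All other comparisons"},
    ],
}

# Labels of the levels that carry a TF adjustment column
tf_levels = [
    level["label_for_charts"]
    for level in name_comparison["comparison_levels"]
    if "tf_adjustment_column" in level
]
print(tf_levels)
```

Writing the levels out this way puts the TF keys exactly where you place them, rather than relying on how helper functions and composed operators propagate them.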
Sorry for the delay. When you use the comparison/comparison level libraries, these are really just helper functions that generate the underlying settings dictionary for you. That is, Splink always uses the spec-as-a-dict under the hood; it's just fiddly to write, so we help people out. So the way to know what Splink is going to do is to look at the dict representation:
Which gives you:
This confirms my suspicion that the following keys:
To be clear, this is a bug, so thank you for bringing it to our attention. It's probably quite a fiddly one to fix, though.
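As a rough way to run this kind of check yourself, the sketch below walks a comparison's dictionary representation and reports which levels actually carry term frequency keys. The helper `summarise_tf_settings` and the example dictionary are hypothetical and written for illustration. The key names (`tf_adjustment_column`, `tf_adjustment_weight`, `tf_minimum_u_value`) are assumed to match the settings dictionary of your Splink version.

```python
from typing import Any

# Keys assumed to control term frequency adjustments on a comparison level
TF_KEYS = ("tf_adjustment_column", "tf_adjustment_weight", "tf_minimum_u_value")


def summarise_tf_settings(comparison: dict[str, Any]) -> list[tuple[str, dict[str, Any]]]:
    """Return (label, tf_settings) for every level in a comparison dict."""
    summary = []
    for level in comparison.get("comparison_levels", []):
        # Fall back to the SQL condition when a level has no chart label
        label = level.get("label_for_charts", level.get("sql_condition", "<unlabelled>"))
        tf_settings = {key: level[key] for key in TF_KEYS if key in level}
        summary.append((label, tf_settings))
    return summary


# Hypothetical dict: the second TF level has lost its keys, as in the reported bug
example = {
    "output_column_name": "full_name",
    "comparison_levels": [
        {"sql_condition": '"full_name_l" IS NULL OR "full_name_r" IS NULL',
         "label_for_charts": "Null", "is_null_level": True},
        {"sql_condition": '"full_name_l" = "full_name_r"',
         "label_for_charts": "Exact match on full name",
         "tf_adjustment_column": "full_name"},
        {"sql_condition": '"surname_l" = "surname_r"',
         "label_for_charts": "Exact match on surname"},
        {"sql_condition": "ELSE", "label_for_charts": "All other comparisons"},
    ],
}

for label, tf_settings in summarise_tf_settings(example):
    print(f"{label}: {tf_settings or 'no TF keys present'}")
```

If a level that was configured with a term frequency adjustment shows no TF keys here, the adjustment was dropped before the dictionary was built, which would explain why it never appears in the dashboard.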
So, I defined a name comparison composed of various levels, several of which have term frequency (TF) adjustments. However, when I inspect the comparison viewer dashboard, I only find evidence of TF adjustments being applied to the first level, the exact match on full name. How can I be sure that these adjustments are indeed being used to compute the final match score? Or is their absence from the dashboard enough evidence that something isn't working?
My comparison:
And here are examples of two levels, both of which should have TF adjustments, but where only the one for the full name is showing up.
Originally posted by @pierpaolocreanza in #2523
I cross posted here as an issue rather than a discussion question