To give another example: I'm using the Splink package to do probabilistic record linkage. At one point I run the linker.training.estimate_probability_two_random_records_match() function, which calculates the size of a Cartesian product and stores it in an interim variable. My code runs fine in native Python but returns the following error when I run it through reticulate:
ValueError: Deterministic matching rules led to more observed matches than is consistent with supplied recall. With these rules, recall must be at least -0.00.
A line of Splink code before this error message is:
When I display the value of num_total_comparisons, it shows as:
-2147483648
This is -2^31, the minimum value for a 32-bit signed integer. When I reduce the size of my data frame so that the Cartesian product of the record counts is less than 2^31, it runs fine.
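To make the threshold concrete, here is a minimal sketch in plain Python of the range check implied above. The helper name fits_in_int32 and the record counts are hypothetical, chosen only for illustration; this does not reproduce Splink's or reticulate's internals, it only shows where a comparison count stops fitting in a 32-bit signed integer.

```python
import math

INT32_MIN = -2**31      # -2147483648, the value shown above
INT32_MAX = 2**31 - 1   #  2147483647


def fits_in_int32(value: int) -> bool:
    """Return True if value is representable as a 32-bit signed integer."""
    return INT32_MIN <= value <= INT32_MAX


# Largest n for which n * n still fits in a 32-bit signed integer.
largest_safe_n = math.isqrt(INT32_MAX)

# Hypothetical record counts, for illustration only.
for n_records in (largest_safe_n, largest_safe_n + 1, 50_000):
    total = n_records * n_records  # Python ints do not overflow
    print(n_records, total, fits_in_int32(total))
```

Native Python integers are arbitrary precision, so the product itself is always computed correctly there; the check only indicates when the value would no longer survive a conversion to a 32-bit integer type.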
Let me know if you need any more info. Thanks.
The easiest way of seeing this is to run: