Duplicate entries created in Merge table #920

samuelbray32 · 2024-04-10T22:21:37Z

Describe the bug

Found that some of the entries in SpikeSortingOutput.CuratedSpikeSorting are duplicated with different merge_ids. This should be prevented by the UUID hashing in Merge._merge_insert().
Possible that these were due to a change in how we hash Fault-permit insert and remove mutual exclusivity protections on Merge #824
- If so, should _merge_insert check for a matching entry in the part table for a given source primary key before generating a UUID and inserting? This would help prevent overlapping entries going forward in case of other hash method changes
- Side note: Essentially this is the inverse problem of Merge key conflict in PositionOutput #915

To Reproduce
Example duplicate key:

part_key = {
    "curation_id": 1,
    "nwb_file_name": "Winnie20220717_.nwb",
    "sort_group_id": 13,
    "sort_interval_name": "12_lineartrack",
    "preproc_params_name": "franklab_tetrode_hippocampus",
    "team_name": "ms_stim",
    "sorter": "mountainsort4",
    "sorter_params_name": "franklab_tetrode_hippocampus_30KHz_tmp",
    "artifact_removed_interval_list_name": "Winnie20220717_.nwb_12_lineartrack_13_franklab_tetrode_hippocampus_ampl_2000_prop_75_artifact_removed_valid_times",
}

SpikeSortingOutput().CuratedSpikeSorting() & part_key

The text was updated successfully, but these errors were encountered:

samuelbray32 · 2024-04-10T22:28:55Z

Can confirm the difference is due to inclusion of the source table in the hash generation. duplicate entry UUIDs match hash results with and without the source table respectively.

Solution: Implement check for existing key in part table prior to UUID generation and insert

samuelbray32 added bug Something isn't working spike sorting merge To do with merge tables labels Apr 10, 2024

samuelbray32 self-assigned this Apr 10, 2024

samuelbray32 mentioned this issue Apr 10, 2024

Check for entry in merge part table prior to insert #922

Merged

4 tasks

edeno closed this as completed in #922 Apr 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Duplicate entries created in Merge table #920

Duplicate entries created in Merge table #920

samuelbray32 commented Apr 10, 2024

samuelbray32 commented Apr 10, 2024

Duplicate entries created in Merge table #920

Duplicate entries created in Merge table #920

Comments

samuelbray32 commented Apr 10, 2024

samuelbray32 commented Apr 10, 2024