Hm, so far I understood duplicates as being identical words/meanings with different variants of kanji-kana-combinations.
I might be wrong here, but the example of つとめる with different kanji… so far I thought of it as a slightly different meaning - a nuance only, but yet different. A homophone but not the same word…
Will the entire dictionary entry of 務める disappear and come up as variant in 勤める instead?
I‘m thinking also of words like きく (聞く、聴く) that are almost identical, but actually aren‘t…
If they are currently in renshuu with the same definition, then yes, they'll be blended. As far as renshuu is concerned, they are the same.
In the first case, it looks like renshuu's definitions need to be adjusted (I can handle this tomorrow) so they do appear as distinct entries. I do have the full list of merges (it's maybe 500 sets long, although most of those are 御 variants, and can be easily dismissed), and we can do a spot check on the remaining ones (like this) to see if any other definition changes need to go into place before this happens.
Excellent find! I'll trim down the list tomorrow and we can see if any of the other ones need to be adjusted.
Update: There are about 190 pairs with same definition on renshuu, but different kanji, so I'll need to go through those first :(. It's too long to post in here, but I'll try to get this done as quickly as I can so we can continue with this.
All "mixed" pairs finished. I still need to figure out how to best handle the number/kanji words, but after that, will be good to get a test suite set up!
Also, if one or two people are up for even MORE data sharing, I'd like to take one user's data (just the duplicate term data) and post it here so everyone here can see what I'm looking at, and I can get better feedback on the process that you will all take when this is rolled out. Just one or two users would be enough (older accounts are better, heh.)
If that‘s still, or rather again, helpful after the cleanup of „mixed“ definitions, I‘d be happy to provide my data. I‘m not sure about how many relevant terms I have though…
I had noticed a couple duplicate words in my schedules but I thought I misread something which led to me getting questions wrong cus I thought i already did that word so it can't be the same- lol. I'd be happy to help out, I just don't know exactly how to so I'm just copying everyone I see here, lol