It looks like the original piece data simply doesn't contain the 白 element. It only has the 日 part, which is where the disconnect comes from.
I made a quick addition for those kanji but did not have time to verify it. Unfortunately, there is no programmatic way to scan for these "absences".