1 week later and my script has compared almost 2M bib records, finding about 13,000 mismatches for our staff to check and possibly correct. Still churning. Maybe I should have parallelized it? But I'm not sure if either Alma or Evergreen would have been very happy with that.
Err, math is hard. 2 weeks later, for a total of 3 weeks.
Should have just dumped the entire MARC dataset from Evergreen to remove it as a bottleneck. Oh well. Next time.
code4lib.social is a GLAM-themed Mastodon Instance.