Duplicate detection is one of the things any serious reference manager should offer. Zotero users have been clamouring for it since the early days. There are basically two ways to implement it: as a preflight check, warning the user when they are about to add a potential duplicate; and as an after the fact scan, which enables users to weed duplicate items from their library.
The most recent version of Zotero takes the second route: a posthoc duplicate detection mechanism. Though definitely better than nothing, and with an elegant merging solution, the interface is still far from perfect and yields a lot of false positives, making it somewhat difficult to use. Besides, it is slow, because it tries to compare everything with everything, which amounts to a huge amount of operations even in moderately sized libraries. Although it is good to have at least something, what seems to me have been overlooked is that prevention is better than cure, and that a quick check before adding new items to the library would help users a lot. Continue reading
The linguistic blogosphere featured some posts recently on the topic of phonosymbolism, phonosemantics, and Chinese characters. It started with a 