1

Why is uniq ignoring Unicode and lines with a single letter?

view story
linux-howto

http://unix.stackexchange.com – I'm trying to combine both the American and British dictionaries into one large dictionaries, and I'm trying to remove all the duplicates from the superset, but it seems that uniq is not outputting words like, "épée" or single letters. This is what I've tried using: LC_COLLATE=en_US.UTF-8 cat american-english british-english |sort|uniq -u > unique_sorted_combined_dict If I just do this: LC_COLLATE=en_US.UTF-8 cat american-english british-english |sort > sorted_combined_dict "épée" and other such words do show up, along with single letters. Is there something I'm missing here (HowTos)