I am trying to divide a file using the number of words as a condition. Alternatively, I would at least like to be able to retrieve the first x words of a given file. Any tips?
Thanks in advance.
on 09/23/2010 – Made popular on 09/23/2010
I have a file with words, some of them repeats. I need to write one liner that shows me the first 5 words with the biggest number of aparition, and will show me like this: number of aparitions space word. Any ideea?
I have huge (100 Gbytes) files which I need to post process after they have been generated.
I wonder if there is an easy way to split the file and then process each file coming form split file.
I mean doing it automatically on one command line without waiting for split output and then start the processing commands.
Here is an example where I want to split a file and the count number of McDo
If i use
wc -m filename
it will generate the number of characters and
wc -w filename
will generate number of words if i used this info by dividing number of characters/number of words it will give me misleading result as number of character will include spaces and punctuation as well as words count any advice ?