I have a large text file containing over 180k lines and another text file containing about 1k. I would like to remove lines in the 180k-line file that exist in the 1k-line file. I thought there was a simple way to do this but I haven't come across it yet. Any advice? Thanks :)
I have a text file which is having 30000 lines in it. I have to create a xml file for each 10000 lines until all the lines in the text files are written. Also please help how can i get number of lines in the text file in a shell variable?
I have a 42M line text file. Th first nine characters of each line are a numeric key. What is the most efficient way to extract only the lines whose key exists in another list of about 1.5M keys? Both the file and the list of keys are sorted.
I have a huge text file generated from our system and i need a script that asks the user to specify which file to open, Once the file is open, the script should extract ALL of the XML in that file ONLY and then write it to another file.
The XML needs to be extracted in to the new file, currently it is not arranged very well, so will need some line breaks once i have completed the latte
I have several (427 to be precise) text files with a million lines each containing 31 numbers separated be spaces (possible double spaces). However there might be some data corruption and there may be lines containing junk.
I now want to check if every line satisfies the property of containing 31 items separated by spaces (I assume that those items are numbers.