I need a piece of software that will thwart evil.
I have about a hundred text files, each containing a list of about eighty numbered questions, roughly 4000 words in length. Most of them seem to be identical text-wise, but I've turned up one that has some very minor differences.
I need a way to search through all these files and figure out if there are any others with different text. I really don't want to have to skim 1600 pages of this stuff - I'm afraid I'll miss something, and I've got more important things to be doing. I guess conceivably I could put a script together myself, but I've only recently started playing with regular expressions, and it might take longer than doing the project by hand.
The thwarting evil part comes in here: We were sent these files by a horrible law firm representing a horrible corporation, who inserted these discrepancies precisely to waste our time. The horrible corporation needs representation because they recently -ed over several hundred desperately poor people, a number of whom are now homeless. There will come a reckoning. But first I need, like, a Python script or something.
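For what it's worth, here is a minimal sketch of one way to do the first pass, not a definitive tool. It assumes the files are plain UTF-8 text sitting in one folder with a `.txt` extension (both assumptions; adjust the `pattern` argument otherwise). The function names `group_by_content` and `report` are made up for illustration. Each file's whitespace is collapsed so that line-wrapping differences don't count, then files are grouped by a SHA-256 hash of the result. The biggest group is presumably the standard text, and anything outside it deserves a closer look.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def group_by_content(folder, pattern="*.txt"):
    """Map a SHA-256 digest to the names of files whose normalized text has it."""
    groups = defaultdict(list)
    for path in sorted(Path(folder).glob(pattern)):
        text = path.read_text(encoding="utf-8", errors="replace")
        # Collapse runs of whitespace so differences in line wrapping are ignored.
        normalized = " ".join(text.split())
        digest = hashlib.sha256(normalized.encode("utf-8")).hexdigest()
        groups[digest].append(path.name)
    return dict(groups)


def report(folder, pattern="*.txt"):
    """Print each group of identical files, largest group first, and return them."""
    groups = group_by_content(folder, pattern)
    ordered = sorted(groups.values(), key=len, reverse=True)
    for names in ordered:
        print(len(names), "file(s):", ", ".join(names))
    return ordered
```

Calling `report("path/to/folder")` prints one line per distinct version of the text. This only says which files differ, not where; a per-question comparison or a diff tool is still needed to locate the changed wording.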

no subject
Date: 2010-11-08 12:55 am (UTC)

no subject
Date: 2010-11-09 12:08 am (UTC)

no subject
Date: 2010-11-08 10:00 am (UTC)
If they're numbered, and each number should be the same, one ouuuuught to be able to put them in a table, with one column for each file and each question or line in its own cell. I believe a spreadsheet can evaluate equality of text strings, not just numbers. Fill down. Set a conditional format so that equal displays as green and unequal displays as red. Scan down for red.
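The same column-per-file idea can be sketched in Python instead of a spreadsheet. This is only a rough sketch under stated assumptions: each question is assumed to start a line with its number followed by `.` or `)` (the real files may be numbered differently), and the names `split_questions`, `find_discrepancies`, and `load_folder` are invented here for illustration. Each file becomes a column keyed by question number, and a cell counts as "red" when it differs from the most common wording of that question.

```python
import re
from collections import Counter
from pathlib import Path

# Assumption: a question starts a line with its number, e.g. "12." or "12)".
QUESTION_START = re.compile(r"^\s*(\d+)[.)]\s+", re.MULTILINE)


def split_questions(text):
    """Return {question_number: whitespace-normalized question text}."""
    matches = list(QUESTION_START.finditer(text))
    questions = {}
    for i, m in enumerate(matches):
        end = matches[i + 1].start() if i + 1 < len(matches) else len(text)
        questions[int(m.group(1))] = " ".join(text[m.end():end].split())
    return questions


def find_discrepancies(texts):
    """texts maps filename -> contents. Return (filename, number, text) for
    every cell that differs from the majority wording of that question."""
    table = {name: split_questions(body) for name, body in texts.items()}
    numbers = sorted({n for column in table.values() for n in column})
    red_cells = []
    for n in numbers:
        # One spreadsheet row: the same question number across every file.
        row = {name: column.get(n) for name, column in table.items()}
        majority, _ = Counter(row.values()).most_common(1)[0]
        for name, cell in sorted(row.items()):
            if cell != majority:
                red_cells.append((name, n, cell))
    return red_cells


def load_folder(folder, pattern="*.txt"):
    """Read every matching file in a folder into a {filename: contents} dict."""
    return {p.name: p.read_text(encoding="utf-8", errors="replace")
            for p in sorted(Path(folder).glob(pattern))}
```

Something like `find_discrepancies(load_folder("path/to/folder"))` would then list the odd cells. A question missing from a file shows up with `None` as its text. Comparing against the majority wording assumes the altered files are the minority, which matches the description in the post but is still an assumption.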
uncouth
Date: 2010-11-08 09:57 pm (UTC)
* Kidding. Seriously.
Re: uncouth
Date: 2010-11-09 12:09 am (UTC)

Re: uncouth
Date: 2010-11-10 01:33 am (UTC)

no subject
Date: 2010-11-08 08:55 pm (UTC)

no subject
Date: 2010-11-10 09:40 am (UTC)