WCopyfind is a windows-based program that examines a collection of document files, looking for similarities. It extracts the text portions of those documents and scours them for matching words in phrases of a specified minimum length. When it finds two files that share enough words in those phrases, WCopyfind generates html report files. These reports contain the document text with the matching phrases underlined.
What WCopyfind can do:
It can find documents that share large amounts of text. This result may indicate that one file is a copy or partial copy of the other, or that they are both copies or partial copies of a third document. WCopyfind can presently handle: .docx, .doc, .txt, .htm, .html, and .pdf formats. It will also try to find text in other file formats, but there are no guarantees it will succeed.
What WCopyfind cannot do:
It cannot search the web or internet to find matching documents for you. You must specify which documents it compares. Those documents can be local ones—on your computer or a file server—or web-resident html or text documents that are pointed to by local internet shortcuts. If you suspect that a particular web page has been copied, you must create an internet shortcut to that page and include that shortcut in the collection of documents that you give to WCopyfind.
Copyfind is a command-line program that examines collections of document files, looking for similarities. Copyfind runs only under Windows (at present) and has the same internal comparison engine as WCopyfind. Unlike WCopyfind, however, Copyfind reads instructions from the command line and is therefore much more flexible and capable of more complicated comparison activities.