Count the word frequency in text files
WordFrequencyCounter counts the frequency of words from a single file, multiple files or the clipboard. The many options make it a very useful word counting tool for language analysis and learning.
WordFrequencyCounter enables you to:
Define words. A word is made up of characters from an alphabet, but there are some characters that you might or might not want to include in a word definition, such as & or -.
Define word separators. Word separators are used to divide language into individual words (text segmentation). The space character and punctuation (in the English language) are the most important word separators, but you also need to decide whether you want to use characters such as & or - as separators. Using our word counting software you can add and remove characters which are used as word separators.
Count words from the clipboard, directories and sub-directories. You can count word frequencies from a single file, the clipboard or all files in a directory and its sub-directories. You can also ignore unimportant files.
Count word frequencies of ASCII and Unicode text files. WordFrequencyCounter works with both Unicode and ASCII text files.
Count words in English and other languages. As WordFrequencyCounter works with Unicode files, you can count word frequencies of words in any alphabet.
Automatically exclude words from the scan. You can load a word list, or enter words individually, to specify the words that you want excluded from the frequency count.
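To picture how the choice of word separators changes a count, here is a minimal Python sketch. It is not WordFrequencyCounter's own code; the function name count_words, the DEFAULT_SEPARATORS string and the case_sensitive flag are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical default separators: whitespace plus common English punctuation.
DEFAULT_SEPARATORS = " \t\n.,;:!?\"()"

def count_words(text, separators=DEFAULT_SEPARATORS, case_sensitive=False):
    """Count words after treating every character in `separators` as a word break."""
    if not case_sensitive:
        text = text.lower()
    # Replace each separator with a space, then split on runs of whitespace.
    # Note: split() also breaks on any whitespace not listed in `separators`.
    table = str.maketrans({ch: " " for ch in separators})
    return Counter(text.translate(table).split())

sample = "Salt & pepper - and salt."

# With the defaults, & and - are not separators, so they are counted as words.
symbols_as_words = count_words(sample)

# Adding & and - to the separators removes them from the count.
symbols_as_separators = count_words(sample, DEFAULT_SEPARATORS + "&-")

print(symbols_as_words)
print(symbols_as_separators)
```

In the first count "&" and "-" each appear as a word with frequency 1; in the second they disappear and only "salt", "pepper" and "and" remain.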
WordFrequencyCounter settings are split into three areas: General, Scan options and Skip words.
Scan options and Skip words are optional and allow you to fully configure scanning parameters.
Firstly, decide on the files that you wish to scan.
WordFrequencyCounter will only scan text files, which may be either ASCII or Unicode. A text file contains only the characters which make up words and no other hidden characters. A Microsoft Word file, for example, is not a text file because it contains many hidden characters which format the text, set the font size, page layout and so on. See "Counting words in non-text files". "What is a text file" gives a more
detailed explanation of what is and what isn't a text file.
You can scan a single file, the contents of the clipboard or entire directories. When counting the words in multiple files, it is possible to exclude ranges of files depending on their extension.
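The idea of scanning a directory tree while excluding files by extension can be sketched in Python as follows. This is only an illustration under stated assumptions, not the program's implementation: the function name files_to_scan and the example extensions are hypothetical.

```python
import tempfile
from pathlib import Path

def files_to_scan(root, excluded_extensions=(".exe", ".jpg")):
    """List files under `root` and its sub-directories, skipping excluded extensions."""
    excluded = {ext.lower() for ext in excluded_extensions}
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() not in excluded
    )

# Usage: build a small directory tree and list the files that would be scanned.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "sub").mkdir()
    (root / "notes.txt").write_text("hello", encoding="utf-8")
    (root / "sub" / "more.txt").write_text("world", encoding="utf-8")
    (root / "photo.jpg").write_bytes(b"\xff\xd8")
    found = [p.relative_to(root).as_posix() for p in files_to_scan(root)]

print(found)
```

Here the two .txt files are kept, including the one in the sub-directory, and photo.jpg is skipped because of its extension.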
Secondly, decide on the scan options.
The most important of these allow word separators (text segmentation characters) to be chosen, and
characters that make up words to be included or ignored.
There are additional parameters such as case sensitivity and line length which can optionally be
specified.
Thirdly, optionally create a list of any words to be excluded.
Sometimes it is useful to exclude words such as 'and', 'the', 'in' and so on when analysing word frequencies.
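As a rough illustration of what a skip-word list does to a frequency count, consider the Python sketch below. The helper remove_skip_words is a hypothetical name, and matching skip words without regard to case is an assumption rather than documented behaviour of the program.

```python
from collections import Counter

def remove_skip_words(counts, skip_words):
    """Return a copy of `counts` without any word that appears in `skip_words`."""
    # Assumption: skip words are matched case-insensitively.
    skip = {w.lower() for w in skip_words}
    return Counter({w: n for w, n in counts.items() if w.lower() not in skip})

counts = Counter("the cat and the hat in the box".split())
filtered = remove_skip_words(counts, ["and", "the", "in"])

print(filtered)
```

Without the skip list, "the" dominates the count with a frequency of 3; with it, only "cat", "hat" and "box" remain.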
You can experiment with the free download, which is a fully functional demo.