The Project Gutenberg eBook of Moby Multiple Language Lists of Common Words

This ebook is for the use of anyone anywhere in the United States and most other parts of the world at no cost and with almost no restrictions whatsoever. You may copy it, give it away or re-use it under the terms of the Project Gutenberg License included with this ebook or online at book.klll.cc. If you are not located in the United States, you will have to check the laws of the country where you are located before using this eBook.

Title: Moby Multiple Language Lists of Common Words

Author: Grady Ward

Release date: May 1, 2002 [eBook #3206]
Most recently updated: August 28, 2025

Language: English

Credits: Produced by Mike Pullen

*** START OF THE PROJECT GUTENBERG EBOOK MOBY MULTIPLE LANGUAGE LISTS OF COMMON WORDS ***

MOBY (tm) LANGUAGE II


MOBY (tm) LANGUAGE II DOCUMENTATION NOTES

This documentation, the software and/or database are:

Public Domain material by grant from the author, January, 2001.


HISTORICAL NOTE:

The Ward word lists were some of the largest public domain word lists in the world, at the time they were added to the Project Gutenberg collection in 2007. These word lists do not contain 8-bit accented characters or Unicode, as would be found in a more recent Project Gutenberg eBook. Instead, the lists include phonetic spelling, utilizing backslashes and other characters to indicate where accents would normally occur. There is no detailed guide on how these extra characters were used, and therefore it is likely infeasible to map from the word lists back to a correct representation of the word (i.e., to map from a word list entry with slashes or other characters, back to the actual non-English word with accents or other non-ASCII characters).

These lists may still be useful, but they are no longer the state-of-the-art in word lists. In the time since the lists were created, it has become much easier for anyone with interests to make their own lists of unique words from the Project Gutenberg collection or other sources.

Moby (tm) Language II for MSDOS operating systems is compressed and distributed as a single zip file. After decompression the language files included with this product is in ordinary ASCII format with CRLF (ASCII 13/10) delimiters.


MOBY Language II CONTENTS

French Language list
German Language list
Italian Language list
Japanese Language list
Spanish Language list

Quick Start

  1. Insure you have at least 3Mb of free disk space to hold the contents of this zip file.
  2. Create a destination directory to hold the files listed above.
  3. On the PG Catalog page click on the selection "More Files". You will see a "files.zip" folder in the list. Move this zipped folder to your computer. On your computer open "files.zip", double click on its "files" subdirectory and copy the contents into the destination directory on your computer.

Word lists in five of the world's great languages:

FRENCH    number of words  138257  size in bytes   1524757
GERMAN    number of words  159809  size in bytes   2055986
ITALIAN   number of words   60453  size in bytes    561981
JAPANESE  number of words  115523  size in bytes    934783
SPANISH   number of words   86059  size in bytes    850523

Total     number of words  560101  size in bytes   5928030

Once decompressed, the vocabulary files may be viewed and used just as any TEXT-type file might.