OpenCyc Flat Files
The zip archive below consists of the files we extracted from OpenCyc. Each file has the following format:
(case <mt>)
<fact>
<fact>
…
Each fact <fact> was explicitly found in microtheory <mt>. If you scrutinize the flat files, you will notice that some of them are empty except for the microtheory name. Moreover, you will notice that the number of axioms in these flat files is far short of the contents listed on the OpenCyc web site. There are several reasons for this:
- Our extraction process is imperfect.
- We do not bother extracting provenance information; for us it is of limited utility, and if we ever need it (e.g., reporting an ontological bug), one can use the OpenCyc browser to find it easily.
Moreover, we converted all strings to ASCII encoding from Unicode. Since most of our machines are 32-bits, and many people using our software still live in a 32-bit world, we are obliged to squeeze where we can. Our apologies to any people, places, or linguistic/cultural entities whose names we have inadvertently mangled in the process.
Opencyc-flatfiles.zip: Extracted 2/14/07, from OpenCyc 1.02.