ISO 639-3 Downloads
- Terms of Use
- Set of Complete Code Tables
- ISO 639-3 Code Set
- Language Names Index
- Macrolanguage Mappings
- Deprecated Code Element Mappings
This page offers links to downloadable versions of the complete ISO 639-3 code set, a language names index, the mapping of macrolanguages to individual languages, and the mapping of retired code elements to current code elements. The Code Set table and the Language Names Index table are formatted as tab-delimited, UTF-8 text files. These tables are also offered as ISO/IEC 8859-1 encoded text files, though these simplified encodings should not be considered normative. The remaining two files map between identifiers; they are presently encoded in ISO/IEC 8859-1, as currently all data in these tables can be correctly encoded in the more limited character set. The first line of each file contains the column names rather than the first row of data.
Reference Names
The "Ref_Name" column in the download tables contains a reference name by which this language is identified in the standard. The Reference Name is employed for ease of use of the code set, and does not imply it is to be preferred in any application to any other name that may be associated with the particular code element as given in the Language Names Index.
Note: We wish to avoid use of any pejorative name as the Reference Name. If you are aware of any instance in which a code element uses a pejorative or derogatory name as the Reference Name, please bring this to our attention at the contact email address below.
ISO 639-3 Code Tables Terms of Use
The ISO 639-3 code set may be downloaded and incorporated into software products, web-based systems, digital devices, etc., either commercial or non-commercial, provided that:
- attribution is given www.iso639-3.sil.org as the source of the codes;
- the identifiers of the code set are not modified or extended except as may be privately agreed using the Private Use Area (range qaa to qtz), and then such extensions shall not be distributed publicly;
- the product, system, or device does not provide a means to redistribute the code set.
Expansions to the information provided by the standard (e.g., population data, names in other languages, geographic coordinates, etc.) may be made and distributed as long as such added information is clearly identified as not being part of the standard itself. The ISO 639-3 website is the only authorized distribution site for the ISO 639-3 code tables.
For any questions about whether a particular use is covered by these guidelines, contact the Registration Authority at [email protected].
Complete Set of Tables
A complete set of all current code tables, containing the main ISO 639-3 table, the Language Names Index Table, the Macrolanguage Mappings, and the Retired Code Element Mappings, is available here:
ISO 639-3 Code Set
NOTE: The "Ref_Name" column in this table contains a reference name by which this language is identified in the standard. The Reference Name is employed for ease of use of the code set, and does not imply it is to be preferred in any application to any other name that may be associated with the particular code element as given in the Language Names Index.
The complete code table of active code elements may be downloaded by clicking the following link.
- Download ISO 639-3 code set UTF-8
For users with systems unable to utilize Unicode character encoding, the code set table is also offered in a simplified version in Latin-1 (ISO/IEC 8859-1).
The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded (Comment column added 18 Oct 2007):
Language Names Index
In ISO 639-2, there are multiple name forms for some identified languages. The ISO 639-3 code tables now include a language name index with multiple names associated many code elements (primarily in English forms or variant anglicized spellings of indigenous names). The reference name from the Ref_Name field of the main table is included as an entry in this table, thus every code element has at least one row occurrence in the Language Names Index table. The name appears in two forms, a "print" form used in most contexts, and an inverted form which fronts a language name root, e.g., "Isthmus Zapotec" and "Zapotec, Isthmus". Where there is no root part to the name, the Print_Name and the Inverted_Name contain identical strings. The Language Names Index may be downloaded by clicking the following link
- Download ISO 639-3 Language Names Index UTF-8
For users with systems unable to utilize Unicode character encoding, the Language Names Index table is also offered in a simplified version in Latin-1 (ISO/IEC 8859-1).
The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:
Macrolanguage Mappings
The complete set of mappings from macrolanguages to the individual languages that comprise them may be downloaded by clicking the following link.
- Download ISO 639-3 macrolanguage mappings
The table has three columns (this is a change from previous versions of this table). The first identifies a macrolanguage and the second identifies one of its members. The third specifies the status of the individual member language, as being Active or Deprecated (Retired). (This last column is actually redundant, but indicates to the user which table will contain the identifier as primary key: the main code set table for active code elements, or the retirement mappings table for deprecated code elements.) Thus a given macrolanguage has as many rows as it has individual languages that are its members. The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded:
Deprecated (Retired) Code Element Mappings
The annual update to the 639-3 code set will include a complete listing of the code elements that have been deprecated with instructions on how to update existing data. Although the word "retired" was previously used for codes no longer in use, we now use the word "deprecated" as the code will continue to have the same meaning that was originally established for it. Deprecated codes are not reused for another meaning in the code set.
Since the initial release of ISO/FDIS 639-3 and prior to the release of ISO 639-3, there was one list of retirements (deprecations), a correction to the alignment between ISO 639-3 and ISO 639-2. It is included in the Deprecated Code Element Mappings because it has been a source of confusion for users. The Deprecated Code Element Mappings table may be downloaded by clicking the following link.
- Download ISO 639-3 deprecated code mappings UTF-8
For users with systems unable to utilize Unicode character encoding, the deprecated code mappings table is also offered in a simplified version in Latin-1 (ISO/IEC 8859-1).
The table has five columns; the first has the affected identifier, the second has a coded value for the reason the deprecation was necessary, the third contains a single identifier if the deprecated identifier maps unambiguously to another identifier, the fourth contains a prose statement about what should be done to update a code element split, and the fifth gives the date the change was made effective. The following declaration is a sample formal definition for a SQL database table into which the tab-delimited file can be loaded: