The following information and examples may guide those who wish to explore licensing options for the AAT, ULAN, or TGN. (CONA is in development and not yet available for licensing.) The Getty vocabularies change and grow over time; thousands of records are edited or added each year. Implementors are encouraged to refresh the data over time.

Licensing: The Getty vocabulary data is copyrighted by the J. Paul Getty Trust. All rights reserved. Your institution must sign a license and pay the required fees in order to use the data. The data is compiled from various contributors using published sources, which must be cited along with the J. Paul Getty Trust when the data is displayed.

The data files are released annually in July (barring unforeseen technical difficulties). If you wish to license multiple vocabularies, you must obtain a separate license for each one. Licensing agreements are for a fixed five-year term; annual data updates are now being offered free of charge during that five-year term. The license fee must be paid and the license agreement signed prior to the transfer of data. At the end of five years, you may renew your license for an additional five-year term upon payment of a fee. Contact us at vocab@getty.edu, subject line: Licensing, to learn the amount of the fees and terms of licenses. Please include an explanation of how you intend to use the data and whether your institution is for-profit or not-for-profit.

Formats of the releases: The next data release will take place in July 2012.

The data is released in data files only. No software is provided. Only full sets of data are available for licensing; we do not provide subsets of the data.

Web services: Data files may also be accessed through Web services APIs (application programming interfaces), which provide access to data that is refreshed every two weeks. Note that these files contain "in process" data, meaning it could change prior to the annual July release. The APIs are available to any institution that holds a valid license. Please read details in the Web Services User's Instructions (pdf).

Changes to the data structure: Changes have been made in the structure of all three vocabularies to allow for the seamless integration of multilingual terms. One significant change is that the Qualifier field is now linked to language (which has multiple occurrences per Term), to better accommodate qualifiers in multilingual data. Originally, there was only one qualifier per term; however, the same term may be appropriate for multiple languages, while the qualifier for different languages should be different (e.g., English: gouache (paint), Spanish: gouache (pintura); French: gouache (pientre)). In other words, while language was always repeating for each Term, now with the revised data structure, the Qualifier will also be repeating for each term. In addition, the Term Type was moved to the language link (because, for example, the same term could be a Descriptor in one language, but a Used For term in another language). A Language Status flag was added, which will eventually be used to flag loan terms. The Scope Note was made multilingual, so there may be multiple occurrences of SN, and SN is linked to language. Another change is that the Hierarchical Relationship Type is now linked to the hierarchical relationship as a separate field, whereas it had previously been stored in the relationship historical flag field. The Hierarchical Relationship Type records the thesaural standards codes for the type of relationship between child and parent: BTP (part/whole or partitive), BTG (genus/species or generic), or BTI (instance of).

When were the changes implemented? The new data structure is included in the July 2011 licensed data file releases.

Data dictionaries: Note that new data dictionaries available below are current for the 2011 release. The documentation does not give step-by-step instructions on how to construct a database or interface based on the data files; analysis and a competent programmer will be required to implement the vocabulary data files. The Getty does not provide technical support. For details regarding data content and editorial rules, see the Editorial Guidelines.

Persistent IDs: We have implemented new functionalities in our editorial system that will result in more persistent IDs for our vocabulary records. Previously, although each record had a unique numeric ID, the ID would change when new records were "merged" with existing records, and in other rare situations. While licensees of Getty vocabulary data received annual mappings of old IDs to new ones, our user community was anxious to have a more persistent ID for the Getty vocabulary records over time. The new merge process, implemented in January 2008, results in the ID of the original vocabulary record being maintained when a new record is merged into it. Other editorial situations may occasionally require the generation of new IDs (e.g., when one existing record is divided into two records); for these rare cases, a mapping of the old IDs to the new ones will continue to be published with the annual releases.

Data currently available: The data currently available for licensing was cut in May 2011. The next data release will be in July 2012.



Download sample records from the Art & Architecture Thesaurus (AAT). The full AAT is a hierarchical vocabulary of around 34,000 records, including 134,000 terms, descriptions, bibliographic citations, and other information relating to fine art, architecture, decorative arts, archival materials, archaeology, and other material culture. The full XML data file for the AAT is 20,425 kilobytes in size. The sample available here is a small subset of the AAT. The next data release will take place in July 2012. For further information, see About the AAT.

AAT Sample Data
Format options:

 

Data Dictionary for the
AAT Data Release

 

 

 

XML UTF-8 (2.38 MB)

 

XML format PDF

Relational Table UTF-8 (40.4 MB)

 

Relational Table format PDF



Download sample records from the Union List of Artist Names (ULAN). The full ULAN is a vocabulary of around 162,000 records, including 453,000 names and biographical and bibliographic information for artists, architects, firms, shops, and art repositories, including a wealth of variant names, pseudonyms, and language variants. The full XML data file for the ULAN is 97,383 kilobytes in size. The sample available here is a small subset of the ULAN. The next data release will take place in July 2012. For further information, see About the ULAN.

ULAN Sample Data
Format options:

 

Data Dictionary for the
ULAN Data Release

 

 

 

XML UTF-8 (14 MB)

 

XML format PDF

Relational Table UTF-8 (92 MB)

 

Relational Table format PDF



Download sample records from the Getty Thesaurus of Geographic Names (TGN). The full TGN is a hierarchical vocabulary of around 895,000 records, including 1.1 million names, place types, coordinates, and descriptive notes, focusing on places important for the study of art and architecture. TGN is not a GIS; coordinates are included for many records, but they are for finding purposes only (i.e., to find the place on a map). The full XML data file for the TGN is 133,312 kilobytes in size. The sample available here is a small subset of the TGN. The next data release will take place in July 2012. For further information, see About the TGN.

TGN Sample Data
Format options:

 

Data Dictionary for the
TGN Data Release

 

 

 

XML UTF-8 (1.10 MB)

 

XML format PDF

Relational Table UTF-8 (7.86 MB)

 

Relational Table format PDF



Data is copyrighted: The Art & Architecture Thesaurus® (AAT), the Union List of Artists Names ® (ULAN), and the Getty Thesaurus of Geographic Names® (TGN) are copyrighted by the J. Paul Getty Trust. Companies and institutions interested in regular or extensive use of the vocabularies should explore licensing options by sending an email to vocab@getty.edu.

No warranties by Getty: The databases are provided "as is." Getty disclaims all other warranties, either express or implied, including, but not limited to, implied warranties of merchantability and fitness for a particular purpose, with respect to the databases.



Revised 31 May 2011