This is a generic format that minimizes the effort involved in receiving and loading the data, and reduces the likelihood of errors being introduced during exchange. Tab-delimited formats are preferable to comma-separated formats, as commas appear regularly within the distributed data and, though they can be "commented out", doing so leaves a greater opportunity for error than the use of a tab-delimited format. Tab-delimited formats can be easily exported from all commonly used spreadsheet programs.
5.3.1.2 The file should be entitled "[ProviderName]_AllTitles_[YYYY-MM-DD].txt".
For example, JSTOR_AllTitles_2008-12-01.txt.
5.3.1.3 The provider name should be the web domain at which your data is hosted (but without the punctuation).
For example, jstor or ebscohost. This ensures that your data is clearly distinguished from data provided by others with similar package names. Also, the file name should be consistent for each metadata file deposited.
5.3.1.4 Separate files should be produced for each package of content that the provider offers.
Each file should be named as customers would expect to see the collection labelled in the knowledge base, using the syntax "[ProviderName]_[CollectionName]_[YYYY-MM-DD].txt". For example, JSTOR_Arts&SciencesV_2008-12-01.txt. Providers and recipients can agree in advance how best to present complex collection names.
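The naming syntax can be illustrated with a short Python sketch. The function names `clean_part` and `kbart_file_name` are hypothetical, and stripping whitespace and underscores from each part is an assumption made here so that the underscore separator stays unambiguous; providers and recipients may agree on different rules for complex collection names.

```python
import re
from datetime import date
from typing import Optional


def clean_part(text: str) -> str:
    """Drop whitespace and underscores, which would clash with the '_' separator.

    This cleaning rule is an assumption of this sketch, not a stated requirement.
    """
    return re.sub(r"[\s_]+", "", text)


def kbart_file_name(provider: str, collection: str = "AllTitles",
                    when: Optional[date] = None) -> str:
    """Build "[ProviderName]_[CollectionName]_[YYYY-MM-DD].txt"."""
    when = when or date.today()
    # date.isoformat() yields the YYYY-MM-DD form used in the file name.
    return f"{clean_part(provider)}_{clean_part(collection)}_{when.isoformat()}.txt"


print(kbart_file_name("JSTOR", when=date(2008, 12, 1)))
print(kbart_file_name("JSTOR", "Arts & Sciences V", date(2008, 12, 1)))
```

Generating the name from one function, rather than typing it by hand, is one way to keep file names consistent from one deposit to the next.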
5.3.1.5 All metadata should be provided as plain text.
If metadata is provided in a format that does support additional style or formatting, it should be presented without those enhancements. Data should not include colors, typefaces, italics, or other markup.
5.3.1.6 Text should be encoded as UTF-8.
The UTF-8 character set is well supported and encompasses the writing systems of many languages. This is also a common output option for programs such as Microsoft Excel.
5.3.1.7 One publication should be given in each line of the file, with a column for each field given in Section 5.3.2, Data Fields.
5.3.1.8 Data should be provided with column headers (see Section 5.3.2) and without a blank row between the column header and the first row of content.
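As a rough illustration of the layout described so far (plain text, UTF-8, tab-delimited, one publication per line, a header row with no blank row after it), the following Python sketch may help. The three column labels are an illustrative subset only; the authoritative list is in Section 5.3.2. The function names and the sample row are placeholders invented for this example.

```python
# Illustrative subset of column labels; see Section 5.3.2 for the full list.
COLUMNS = ["publication_title", "print_identifier", "online_identifier"]


def clean(value) -> str:
    """Collapse tabs, line breaks, and repeated spaces into single spaces.

    A stray tab or newline inside a value would break the
    one-publication-per-line, tab-delimited layout.
    """
    return " ".join(str(value).split())


def to_kbart_text(rows) -> str:
    """Return the header row followed immediately by one line per publication."""
    lines = ["\t".join(COLUMNS)]
    for row in rows:
        lines.append("\t".join(clean(row.get(col, "")) for col in COLUMNS))
    return "\n".join(lines) + "\n"


def write_kbart_file(rows, path) -> None:
    """Write the rows as plain UTF-8 text, with no styling of any kind."""
    with open(path, "w", encoding="utf-8", newline="") as handle:
        handle.write(to_kbart_text(rows))


# Placeholder data: commas in a title need no special handling in a tab-delimited file.
sample = [{"publication_title": "Example Journal, Series A",
           "print_identifier": "1234-5679"}]
print(to_kbart_text(sample), end="")
```

Missing values are written as empty fields so that every row keeps the same number of columns as the header.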
5.3.1.9 A title should be listed twice if there is a coverage gap of greater than or equal to 12 months, with only the coverage field changing.
Greater granularity in reporting data coverage gaps is desirable, and should be agreed with the link resolver supplier if it can be supported.
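The 12-month rule can be sketched in Python as follows. This is only one possible reading: the helper names are hypothetical, coverage runs are assumed to be closed (start date, end date) pairs, and the gap is counted in whole calendar months from the end of one run to the start of the next. Open-ended coverage is not handled.

```python
from datetime import date


def months_between(end: date, start: date) -> int:
    """Whole calendar months from the end of one run to the start of the next."""
    months = (start.year - end.year) * 12 + (start.month - end.month)
    if start.day < end.day:
        months -= 1  # the final month is not yet complete
    return months


def coverage_rows(runs, min_gap_months=12):
    """Merge runs separated by a shorter gap; keep longer gaps as separate rows.

    Each tuple returned corresponds to one line for the title in the file,
    differing from the others only in its coverage.
    """
    rows = []
    for start, end in sorted(runs):
        if rows and months_between(rows[-1][1], start) < min_gap_months:
            # Gap below the threshold: extend the previous row instead.
            rows[-1] = (rows[-1][0], max(rows[-1][1], end))
        else:
            rows.append((start, end))
    return rows


runs = [
    (date(1990, 1, 1), date(1995, 12, 31)),
    (date(1996, 3, 1), date(1999, 12, 31)),   # about 2 months after the first run
    (date(2002, 1, 1), date(2008, 12, 31)),   # about 2 years after the second run
]
for start, end in coverage_rows(runs):
    print(start.isoformat(), end.isoformat())
```

Lowering `min_gap_months` gives the finer granularity mentioned above, where the link resolver supplier can support it.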
5.3.1.10 All rows should be consistent in terms of format.
For example, ISSN should always be expressed as nine characters with a hyphen separator, and date fields should always be in the format described in Section 5.3.2.
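One way to enforce the ISSN part of this rule before export is to normalize every value to the nine-character hyphenated form. The sketch below is a minimal Python example; the function name `normalize_issn` is hypothetical, it checks the shape only (it does not verify the check digit), and the sample values are placeholders.

```python
import re

# Eight significant characters: seven digits, then a digit or "X".
_ISSN_BODY = re.compile(r"[0-9]{7}[0-9X]")


def normalize_issn(raw: str) -> str:
    """Return the ISSN as nine characters: four, a hyphen, then four."""
    # Discard spaces, hyphens, and any other separators before checking.
    compact = re.sub(r"[^0-9Xx]", "", raw).upper()
    if not _ISSN_BODY.fullmatch(compact):
        raise ValueError(f"not an ISSN: {raw!r}")
    return f"{compact[:4]}-{compact[4:]}"


print(normalize_issn("12345679"))
print(normalize_issn(" 1234-567x "))
```

Raising an error on malformed values, rather than passing them through, makes inconsistent rows visible before the file is distributed.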
5.3.1.11 The metadata file should be supplied in alphabetical order by title to ensure ease of checking and import by knowledge base developers.
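A minimal Python sketch of the ordering step is shown below. It assumes that rows are dictionaries with a `publication_title` key (an illustrative label; see Section 5.3.2) and that "alphabetical" means a simple case-insensitive comparison. Treatment of leading articles or accented characters is not addressed here and would need to be agreed separately.

```python
def sort_by_title(rows, title_field="publication_title"):
    """Return the rows in case-insensitive alphabetical order by title."""
    return sorted(rows, key=lambda row: row[title_field].casefold())


# Placeholder titles for illustration only.
rows = [
    {"publication_title": "zebra studies"},
    {"publication_title": "Annals of Examples"},
    {"publication_title": "british Journal of Samples"},
]
for row in sort_by_title(rows):
    print(row["publication_title"])
```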