ISD Discussion Document |
Changes from 1.01:
Internet Stamp Databases - Format Description ver 1.02 Martti Tolvanen 1994-02-16
This document (format.isd) describes the format used in the Internet Stamp Databases (ISD) and gives some guidelines for creating their content. Another document (ideology.isd - still not assembled from previous discussions) will give more information about what the ISD are, what they can be used for and why they took the form they currently have.
Acknowledgements:
A major part of the ideas and some of the writing up has come from (in alphabetical order) Dave Mills, Björn Munch and Michael Rys. Steve Anderson and Andy Goodall have provided valuable criticism.
ISD listings will be provided one country per file (or one country plus dependencies etc.). Each ISD file has a named curator who coordinates (and probably also does) all the work on his/her country. See Appendix I for the presently active curators.
ISD files are database-like text files that contain one stamp or variant described per one formatted line. Optional comment lines can be dispersed anywhere between the stamp data lines. Comment lines can contain free text or some parsable special information. Section 2 discusses comment lines in detail and section 3 deals with data lines (the actual stamp descriptions).
The files must be limited to printable 7-bit characters for maximal portability. Individual curators may adopt various standards for coding "national" characters within a particular country (examples: n~ for Spanish "enye", or ISO 10 for writing Scandinavian characters as }{| ][\, or LaTeX codes). Any such standards will be documented within the ISD files.
Empty lines are allowed anywhere in the file. No leading spaces are allowed on any line.
The line terminators used in the ISD files will depend on the ftp site where they are stored. The first home for them will be nic.funet.fi (a.k.a. ftp.funet.fi), in the directory /pub/doc/mail/stamps and its subdirectories. That is a unix machine, so the lines will terminate in LF. If the files are transferred in ascii (text) mode, the line terminators should become automatically transformed to conform to the style of your own computer.
Different data items (fields) are separated by commas in the list. The fields may optionally be surrounded by double quotes ("). No commas are allowed within the fields, unless in a field within double quotes. No double quotes are allowed within the fields. The fields should be presented without leading or trailing blanks, meaning that there will be no fixed lengths. The correct number and order of fields must be kept in each data record.
Each data line should be contain a unique description of the stamp/variant with no references to other data lines. Knowing previously defined abbreviations may be required.
***writing going on here***
so that single lines will remain understandable. The data lines should be complete that one can identify most stamps just by seeing the line (and understanding the abbreviations it contains) *and* by having *any* catalog for the country in question. Recognizing special variants, especially different dies, may require having a special catalog that lists those variants (sorry, you can't have everything in an ASCII listing).
There is no universal sorting order in which stamps should be presented, each curator is free to arrange them as (s)he likes.
For file size considerations, most properties of stamp variants are recommended to be presented in a coded or abbreviated form instead of the fully written values. Whenever possible, the abbreviations should have some resemblance to the full forms so that the listings will remain more usable even if no database tools are used. Example: iT for imperforate at top instead of "A".
All abbreviations used must be stated explicitly in a parsable format within the listing (see section 2.4).
Most non-distinctive data should be omitted, like perforation in the case of issues where no perforation differences exist. This is both for brevity and for avoiding copyright problems.
Recommendations for comment lines:
%%: Internet Stamp Databases, reproduction for commercial use prohibited
%Country: Estonia
%Country-prefix: ee
%Last-update: 1993-11-16
%ISD-version: 1.02
%Curator: Martti Tolvanen
%e-mail: Martti.Tolvanen@helsinki.fi
%Catalog-used: none
%Archive: anonymous ftp at nic.funet.fi under /pub/doc/mail/stamps
%Data-records:
%Comment-lines:
%Fields: Categ,ID,Shade,Paper,Perf,Wmk,Other,Date,Denom,Color,Descr,Qty,Alt-ID
%Field-lengths-max:
%Field-lengths-min:
%Field-lengths-avrg:
%Currency: up to June 21 1992 1 rouble = 100 kopeks
%Currency: starting from June 22 1992 1 kroon (crown) = 100 senti (cents)
% The change was made at 10 roubles to 1 kroon.
% Now the kroon is supposed to be coupled to Deutschmark.
%Copyright (C) Martti Tolvanen 1993
%
% This listing covers the post-independence issues of Estonia in
% as much detail as I know.
%
% The numbering is one of my own, simply chronological.
%
% The quantities printed are taken from an exhibit by Eesti Post
% in a stamp show in Helsinki on Nov 6th 1993.
%
% Booklets and mini-sheets are put in separate categories BK and MS
% where numbers are not consecutive. Instead, the ID numbers are taken
% from the first stamp of the BK/MS.
%
% I'll be happy to receive any comments, corrections or new information.
%
% Abbreviation format: %Abb:Field,Abbreviation,Meaning,Type
% "Field" tells you in which part of the data the abbreviation is
% applicable.
% "Type" can be C/P/S (complete/prefix/suffix) for indicating
% whether there may or may not be other characters on either
% side of the abbreviation.
% See file format.isd in nic.funet.fi for more info on the format
% (it's not there yet, though.)
%
%Abb:Categ,BK,Booklet,C
%Abb:Paper,lN,non-phos. paper,C
%Abb:Paper,lP,phos. paper,C
%Abb:Descr,CoA,coat-of-arms,
%Abb:Other,di,die ,P
....
Notes:
The key for putting the comments back where they belong without manual editing is a "comment place marker", the characters %% followed by the ID number range where the comment applies and terminated by a colon. Examples of syntactically correct place markers:
%% 1-4:
%%1-4:
%% 1:
%% :
When referring to stamps outside the "main" category, like postage dues, the appropriate category identifier (see section 3.2.1) must preceed the ID numbers. Examples:
%% PD: (applies to all postage dues)
%% PD10-15:
Question: should this appear as PD10-PD15?
Use of the place markers:
A place marker line may or may not contain free text after the colon. Any single-% lines after the place marker until the next %%-line (in the context of comments separated from data) are taken to constitute a comment block that moves together with the place marker line.
When comments are restored into a re-sorted list or a partial list, the idea is to put each comment block in front of the first occurence of a data line that falls within the ID-range of the place marker.
(Why do I take up the idea of comment blocks again just after telling I've abandoned it? It's because currently the abbreviation lines can't show ranges and yet they may be placed within the data.)
Suggestion for an abbreviation syntax:
%AbbXX:Field,Abbreviation,Meaning,Type
Examples:
%Abb:Perf,L,Line perforation,P
%Abb:Perf,I,Imperforate,C
Then another rhetoric question: Shall we presume that the pretty printers and whatever programs we have for expanding the %Abb- lines will automatically add the field name into a verbose description so that
%Abb:Wmk,LC,Large Crown,
is OK, or do we have to include the field name in the definition like in the following?
%Abb:Wmk,LC,Wmk Large Crown,
I'm more in favour of the latter style, it would leave more choices to the curator, like enabling the alternatives:
%Abbde:Wmk,LC,Wz. grosse Krone,
%Abbes:Wmk,LC,Fil. corona grande,
%Abbse:Wmk,LC,Vm. stor krona,
One final comment: now that the ranges are not supported in the abbreviations, there is not much point for Michael to introduce the shades in pseudo-abbreviations. Or can some of you figure out a mechanism for taking the range from a previous %%-line?
The basic ISD identifier (index key) for each listed stamp variant is formed from the fields 1-7 (Category, ID#, Shade, Paper, Perforation, Watermark, Other) separated by dashes, with trailing dashes removed. Therefore each data record must have a unique combination of these fields. Exception: the dash between category and ID is left out, so that most IDs wouldn't be like -245
As an aside, this index key is meant to be used in cross- reference files for various alternative catalog numbers (in the case of copyright-protected numbers the xrefs can be constructed for private use only and must not to be distributed over the Internet) or for xref to any data that may appear in the future, like picture archives of stamps.
For compactness of the xref files it is highly recommended that the data presented within fields 1-7 be abbreviated as much as possible.
The "Other" field can list several properties at once and can be expanded so that any strange properties that we haven't been able to think of right now can be incorporated there, so the field structure should be stable.
Fields 8-11 (Date, Denomination, Color, Description) are the ones that the user needs for finding the stamp in her/his catalog, the "verbal cross-reference". These fields should be presented clearly enough so that all the "basic stamps" can be identified in any catalog, even though such clarity cannot be achieved in all of the special property fields.
Field 13 (Alt-ID) is meant for providing a direct catalog cross- reference when the numbering used in the ISD comes from a real catalog.
(No category indicated=normal mail)
%Abb:Categ,AM,Air Mail,C
%Abb:Categ,O,Official,C
%Abb:Categ,PD,Postage Due,C
%Abb:Categ,E,Express,C
%Abb:Categ,AE,Air Mail Express,C
%Abb:Categ,MS,Mini Sheet,C
%Abb:Categ,BK,Booklet,C
%Abb:Categ,MI,Military Post,C
%Abb:Categ,TG,Telegraph,C
(No final spaces in "Air Mail " etc. in this version.) What did I miss? (Not counting country-specific ones.)
Each curator decides which categories, if any, are used in her/his listing and documents them in the header.
Any letters etc. that may be part of a catalog number of a variant should be kept out of the main ID. If the curator wants to reproduce the catalog numbering in complete detail, the variant descriptors should be put in the Alt-ID field (the last one).
An ideal numbering would IMHO assign different numbers only for types that are different in major design, color and denomination. All other differences (perf, watermark, luminescence, shades, dies and plates etc) would be listed under the same ID number with the variant properties given in their appropriate fields (3-7). In this way each user would have an easy way to skip any type of variants that (s)he is not interested in. However, if a listing uses an existing catalog numbering, there may be different main IDs given to stamps with different watermark, perforation, or tagging.
Question: should we require that the properties are given in some standard order (like alphabetically) in the rare cases where several apply to one variant?
We'd like to have all of the properties that are to be used in the "Other" field included in a universal list to avoid naming collisions. Here's what we have until now: (should do them in the abbreviation style)
bp[X] | back printing |
---|---|
diX | Die/Type |
erX | Error |
foX | format |
guX | Gum |
ha[X] | halved (L/R) |
pc[X] | Precancel (argument specifying the type of precancel if several are available) |
plX | plate |
prX | printing method and/or house |
rp[X] | Reprint (x could indicate year) |
X represents any string consisting of upper case letters and numbers.
Any more ideas of other properties that might be needed for distinguishing any variants of any country?
If there are variants that differ in the currency unit presentation, like certain Spanish general Franco stamps (PTAS vs. PTS), we feel the distinction is too easy to miss (by the human eye) in the denomination field and it should rather appear in the actual stamp description (see 3.2.10 below). If such differences are distinguishing for variants within one main ID number, they would be best listed as die differences.
Denominations of overprinted issues are recommended to be indicated with a slash: 0.60/0.15 (new/old). If there is a danger of confusion between a fraction and an overprint (like 1/2), currency indicators will help (1(kr)/2(kr)). A curator may choose some other style for showing overprints, as long as it is documented and constistently used so that the overprint collector can find what he needs.
Surcharges for charity etc. are given in the style 2.10+0.40 if the charity value shows on the stamp, and in the style 1.50(+8.50) if it doesn't. Mini-sheet stamps where a premium is paid over the face value may be problematic with regard to the division of the premium to individual stamps.
For bicoloured issues the colours should be given outer/inner. The notation Multicolour/blue+ can be used to indicate the predominant colour.
It's a good idea to give some short indication of set relations here so that one sees which stamps belong together without checking the dates. A bad example from consecutive Finnish stamps of 1934-5:
Kivi Calonius Porthan Chydenius
...becomes much better when the Red Cross set is clearly marked
Kivi centenary RC: Calonius RC: Porthan RC: Chydenius
Question mark after a catalog name indicates that a final permission for the use of numbering is not sure yet.
Country | Numbering | Language(s) | Curator | Status |
---|---|---|---|---|
Andorra (Spanish) | Edifil (?) | English | Martti Tolvanen | not started |
Estonia | self-made | English | Martti Tolvanen | complete 1991-93 |
Finland | Norma | Finnish and English | Martti Tolvanen | main types complete |
France | Yvert | English | Dave Mills | 1849-1991 nearly complete |
Greece | self-made | English | Steve Anderson | 50 stamps ready |
Ireland | McDonnell (?) | English | Steve Anderson | not started |
Netherlands | NVPH (?) | English | Ed Voermans | just starting Assisted by Twan Laan (specialized perforation and paper variants) |
Norway | NK | English | Björn Munch | complete 1855-1969 & 1989-94 |
Peru | ?? | English | Steve Anderson | not started |
Schleswig | self-made | English | Björn Munch | complete |
Spain | Edifil (?) | English | Martti Tolvanen | not started |
Switzerland | Zumstein (?) | German and English | Michael Rys | ready (just waits for Zumstein's permission) |
UK | Stanley Gibbons | English | Andy Goodall | Complete to 1992 |
If you are interested in the ISD project, Mail one of us :-