Well, you created yet another PICA+ serialization format, so I would like to add its documentation to http://format.gbv.de/pica and support it in PICA::Data (see gbv/PICA-Data#83).
As far as I understand the script, PICA+ records are first transformed to XML with `scripts/pica2xml.pl`. There are examples of this XML format in `scripts/test` and in `test`. As far as I can tell, the format includes:
- root element `collection` with (optional?) attribute `count`
  - repeatable element `record`
    - element `header` with mandatory attribute `status`, having one of the values `deleted` or `upsert`
      - element `identifier` with the PPN
    - element `metadata`
      - repeatable element `datafield` with attributes `tag`, `fulltag` (mandatory) and `occurrence` (optional)
        - element `subfield` with mandatory attribute `code`
      - repeatable optional element `item` with mandatory attribute `epn`
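To make the outline above concrete, here is a minimal Python sketch that parses one record of this first variant. The sample document is my own reconstruction from the outline, not a file from the repository: all values are invented, I assume there is no XML namespace, the `fulltag` value `045Q/01` is a guess at how tag and occurrence are combined, and `item` is left empty because its content is not described here.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample following the outline; all values are invented.
SAMPLE = """<collection count="1">
  <record>
    <header status="upsert">
      <identifier>123456789</identifier>
    </header>
    <metadata>
      <datafield tag="021A" fulltag="021A">
        <subfield code="a">Example title</subfield>
      </datafield>
      <datafield tag="045Q" fulltag="045Q/01" occurrence="01">
        <subfield code="a">Example class</subfield>
      </datafield>
      <item epn="987654321"/>
    </metadata>
  </record>
</collection>"""


def parse_records(xml_text):
    """Return one dict per <record> with status, PPN, fields and EPNs."""
    root = ET.fromstring(xml_text)
    records = []
    for rec in root.iter("record"):
        header = rec.find("header")
        fields = []
        for df in rec.findall("metadata/datafield"):
            fields.append({
                "tag": df.get("tag"),
                "fulltag": df.get("fulltag"),
                "occurrence": df.get("occurrence"),  # None if absent
                "subfields": [(sf.get("code"), sf.text or "")
                              for sf in df.findall("subfield")],
            })
        records.append({
            "status": header.get("status"),
            "ppn": header.findtext("identifier"),
            "fields": fields,
            "epns": [it.get("epn") for it in rec.findall("metadata/item")],
        })
    return records
```

Calling `parse_records(SAMPLE)` yields a list with one dict per record, which is roughly the shape a reader in PICA::Data would have to produce as well.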
Some files use a slightly different form
- root element `collection` with (optional?) attribute `count`
  - repeatable element `record`
    - element `status` having one of the values `deleted` or `upsert`
    - element `hrid` with the PPN
    - element `metadata`
      - repeatable element `datafield` with attributes `tag`, `fulltag` (mandatory) and `occurrence` (optional)
        - element `subfield` with mandatory attribute `code`
      - repeatable optional element `item` with mandatory attribute `epn`
  - optional (?) element `rawrecord` with full record (syntax of this is another issue)
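As far as the two outlines go, the variants differ mainly in where status and PPN live. A reader could hide that difference with a small helper like the sketch below. The function name `status_and_ppn` and the sample records are hypothetical, and I again assume no XML namespace.

```python
import xml.etree.ElementTree as ET


def status_and_ppn(record):
    """Return (status, ppn) for a <record> element of either variant."""
    header = record.find("header")
    if header is not None:
        # First variant: <header status="..."><identifier>PPN</identifier></header>
        return header.get("status"), header.findtext("identifier")
    # Second variant: <status> and <hrid> as direct children of <record>
    return record.findtext("status"), record.findtext("hrid")


# Hypothetical records, one per variant; the PPN is invented.
VARIANT_1 = ET.fromstring(
    '<record><header status="deleted">'
    "<identifier>123456789</identifier></header></record>"
)
VARIANT_2 = ET.fromstring(
    "<record><status>deleted</status><hrid>123456789</hrid></record>"
)
```

Both `status_and_ppn(VARIANT_1)` and `status_and_ppn(VARIANT_2)` return the same pair, which suggests that consolidating the two forms would not lose information for these two values.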
Questions:
- why not PPXML or an extension (well I guess it's too late now)
- why two variants? could both at least be consolidated?
- what happens when a record contains multiple level 1 records? can `datafield` and `item` be mixed or is the format limited to one ILN?
- why are x-occurrences not included in `fulltag` (e.g. "209Ax00/01" for field 209Ax/01 with $x=00). For some fields on level 2 subfield `$x` is crucial to distinguish the meaning of the field, see formal specification at https://format.gbv.de/schema/avram/specification#field-identifier
- last but not least: what would be a proper name for the format? How about PICA Import XML (PIXML)?
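Regarding the `fulltag` question above, this is a small sketch of what I have in mind. The function `extended_fulltag` is hypothetical and only encodes my reading of the example "209Ax00/01": tag, then `x` plus the value of subfield `$x` for level 2 fields, then `/` plus the occurrence. Restricting the `x` part to tags starting with `2` is my assumption.

```python
def extended_fulltag(tag, occurrence=None, x=None):
    """Build a field identifier that also carries the $x value.

    Assumption: the $x part is only added for level 2 fields,
    i.e. tags starting with "2".
    """
    result = tag
    if x is not None and tag.startswith("2"):
        result += "x" + x
    if occurrence:
        result += "/" + occurrence
    return result
```

With this, `extended_fulltag("209A", occurrence="01", x="00")` gives `209Ax00/01`, while fields without `$x` or outside level 2 keep the plain tag with optional occurrence.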