75 | Beskrivelse
76 |
77 |
87 | Format
88 |
89 |
97 | Kolofon
98 |
99 |
109 | 110 | Mere om objektet 111 | 112 |
113 | 114 |
115 |
116 |
117 |
Tryggelev Nor ligger på Langelands vestkyst. Indtil 2008 lå der en 11 | langdysse på toppen af klinten. Dyssen lå langs stranden. Gennem mange 12 | år har havet arbejdet sig ind på kysten. Til sidst lå alle randstenene 13 | fra den ene langside af dyssen nede på stranden i en lang 14 | række. Langdyssen er bare én af Danmarks cirka 2500 fredede dysser og 15 | jættestuer. Oprindelig var der ti gange så mange. Der bliver passet godt 16 | på de tilbageværende, men i nogle tilfælde kan lovgivningen alene ikke 17 | forhindre ødelæggelserne. Det gælder, når det er naturen selv, som 18 | nedbryder fortidsminderne.
19 |Langdyssen ved Tryggelev Nor blev ikke kun angrebet af havet. På 23 | landsiden var dyssen også under angreb. Her havde en greve fra 24 | Tranekær nemlig fjernet randstenene fra den anden langside i den 25 | sidste halvdel af 1800-tallet. Stenene blev kløvet og brugt til at 26 | bygge et stendige. Samtidig havde greven bygget sig en jagthytte på 27 | stedet. I den forbindelse var gravkammeret blevet indrettet som 28 | viktualiekælder, altså en slags naturligt køleskab. I 2006 gnavede 29 | en storm endnu et stykke af klinten. Selve gravkammeret var i fare 30 | for at falde ned og til fare for publikum. Det var ikke muligt at 31 | kystsikre klinten effektivt. I 2008 blev gravkammeret derfor skilt 32 | ad og flyttet i sikkerhed.
33 |35 | Skrevet af Jørgen Westphal 36 |
37 |This document is a part of Royal 18 | Danish Library's APIs, and in particular The 20 | documentation on how use our texts. See also Licences 22 | & Legalese and 23 | Caveats
24 | 25 | 26 | 88 |91 |92 |
All the texts that can be searched in using the API are in Text 100 | Encoding Initiative, TEI for short, markup.
101 | 102 |The solfware used for indexing is described in the documentation of the project 103 | SOLR and Snippets
104 | 105 |Read chapter 5, it 140 | is so good!They are indexed and searchable in 141 | principle. However, the user interface only support them 142 | in table of contents and quotation services.
paragraph, level, which implies 155 |
Note that this document does not define or describe all 182 | fields in the index. The index is far too rich for that, but I 183 | believe that it contains what it takes to use 184 | it. The thing I have left out is basically more of the same.
185 | 186 |Finally, all fields are not available for all editions, 187 | because the heterogeneity of the data, or wishes from the 188 | projects contributing data.
189 | 190 |
196 | ID and Relations fields197 | |
198 | ||
label | 202 |description | 203 |values | 204 |
---|---|---|
id |
208 | The ID of 209 | the record. It identifies the collection, the TEI file and 210 | is constructed as a string concatenation of that basename 211 | with the xml:id of the the content indexed and some other 212 | stuff. | 213 |
214 | string215 | |
216 |
volume_id_ssi |
220 | The ID of 221 | the volume that contain the node | 222 ||
part_of_ssim |
226 |
227 |
228 | Array of IDs of trunk nodes being containers of the node
229 | at hand. Typically containing
230 |
231 |
|
241 | |
245 | Filter fields246 | |
247 | ||
label | 251 |description | 252 |values | 253 |
cat_ssi |
256 | Category of 257 | a text. Use when limiting searches to works or to find 258 | volumes or find author portraits (biographies), omit 259 | otherwise. | 260 |
261 | 262 | work 263 | author 264 | period 265 |266 | |
267 |
is_editorial_ssi |
270 | The contents 271 | originator is someone else than the author. In this service 272 | it is typically forewords, prefaces, comments etc in a 273 | scientific edition. | 274 |
275 | 276 | yes 277 | no 278 |279 | |
280 |
type_ssi |
283 | Node type 284 | in document. A trunk node can be a whole work, a chapter 285 | etc, whereas a leaf could a paragraph of prose, a stanza (or 286 | strophe) of poetry or a speak in a dialog in a scenic 287 | work. For historical reasons, whole texts have 288 | type_ssi:work. A type_ssi:trunk will yield a 289 | result set comprising chapters or section of some kind. | 290 |
291 | 292 | work 293 | trunk 294 | leaf 295 | volume 296 |297 | |
298 |
is_monograph_ssi |
303 | A monograph 304 | in text service is perhaps not what you expect (on the other 305 | hand, what you expect is a monograph in text service). A 306 | monograph is a volume with only one work. | 307 |
308 | 309 | yes 310 | no 311 |312 | |
313 |
314 |
genre_ssi |
318 | Genre of a 319 | leaf node. Note that this is not the genre of a work, but 320 | the structure of the paragraph level markup. If there is a 321 | song in a scenic work, the speak in question might be 322 | classified as containing mostlty poetry. Available for all editions except GV. | 323 |
324 | 325 | prose 326 | poetry 327 | play 328 |329 | |
330 |
subcollection_ssi |
334 | Filter with respect to collection. 335 | 336 | 337 | public-index.kb.dk contains all these editions. | 338 |
339 | 340 | adl 341 | gv 342 | jura 343 | letters 344 | lh 345 | sks 346 | tfs 347 |348 | |
349 |
352 | Sort fields353 | |
354 | ||
359 | 360 | position_isi 361 |362 | |
363 | The position 364 | of the current node along the sibling xpath axis in the 365 | document. Sorting with respect to this field will guarantee 366 | that the result is presented in document order. (We cannot 367 | use page number, which might be a roman numeral or an arabic 368 | one. Also, we need to take into account leaf 369 | nodes within pages.) | 370 |
371 | 372 | integer 373 |374 | |
375 |
379 | Search fields380 | |
381 | ||
label | 385 |description | 386 |values | 387 |
work_title_tesim |
390 | Misc. metadata 391 | fields. There are more of them, but they should be self 392 | explanatory. | 393 | just plain text |
394 |
volume_title_tesim |
397 | ||
work_title_tesim |
400 | ||
author_name_tesim |
403 | The 404 | author(s) of a document. For messages it is assumed that 405 | author is a synonym of sender. | 406 ||
text_tesim |
409 | The text | 410 | just plain text |
411 |
prose_extract_tesim 414 | verse_extract_tesim 415 | performance_extract_tesim 416 | |
417 | The text, as text_tesim, split up into fields according to its form. The to fields get their content from <p> ... </p>, <lg> ... </lg> and <sp> ... </sp> respectively. | 418 | just plain text |
419 |
contains_ssi |
423 | We measures the length of the texts in prose_extract_tesim 424 | verse_extract_tesim 425 | performance_extract_tesim, whichever is the longest is used to assign the value of this field. | 426 |
427 | 428 | prose 429 | poetry 430 | play 431 |432 | |
433 |
speaker_tesim |
436 | The name of a character uttering something in a dialogue | 437 | just plain text |
438 |
page_ssi |
442 | The page number where a leaf node (paragraph, speak or strophe) starts. | 443 |
444 | string (either integer 445 | or roman numerals)446 | |
447 |
451 | person_name_ssim 452 | person_name_tesim 453 | |
454 | Name of 455 | persons mentioned in works, or, in case of letters, name of 456 | the recipient. The field can be accessed both as text 457 | (tesim) and string (ssim). The names in these fields are 458 | normalized to last name first (LNF) format. Also, the 459 | normalized form usually hits variants, such as Shakespeare, 460 | William hits William Shakespeare, and Jesus hits Kristus 461 | (Danish for Christ) as well. But only in these fields, there 462 | is no query expansion for the full text. | 463 ||
other_location_ssim 468 | other_location_tesim sender_location_tesim | 469 |Names of 470 | places mentioned in works, or, in case of letters, the 471 | residence of the sender. The field can be accessed both as 472 | text (tesim) and string (ssim). The place names are usually 473 | normalized. For instance, a search in these field for 474 | Danmark hits Dannemark as well. The reverse is not true, a 475 | search for Dannemark hits only the word Dannemark in the 476 | full text (see text_tesim above). 477 | sender_location_tesim applies to letters 478 | only. | 479 ||
484 | bible_ref_ssim 485 | bible_ref_tesim |
486 | References 487 | to the bible mentioned in works. The field can be accessed 488 | both as text (tesim) and string (ssim). The references is 489 | using standard Danish abbreviations, like 1 Mos; 1 Kor 490 | 13,12; 1 Mos 2,7; Matt 16,18; Sl; Åb; ApG; Joh 1,14; Jak; 491 | Job. In many cases use bible_ref_ssim and then search 492 | for the exact string "1 Kor 13,12". The references are 493 | standardized annotations but in the full texts (of Grundtvig 494 | and Kierkegaard) may just allude to a place in the 495 | Bible. | 496 ||
year_itsi | 501 |Year of 502 | release, publication or, in case of a message, the year it 503 | was sent. | 504 |long int | 505 |
525 | type_ssi:work AND is_editorial_ssi:no 526 |527 |
540 | author_name_tesim:munch 541 | AND 542 | type_ssi:work 543 |544 |
558 | genre_ssi:play 559 | AND 560 | subcollection_ssi:adl 561 | AND 562 | author_name_tesim:jeppe 563 |564 |
577 | genre_ssi:play 578 | AND 579 | subcollection_ssi:adl 580 | AND 581 | speaker_tesim:jeppe 582 |583 |
599 | type_ssi:leaf 600 | AND 601 | genre_ssi:poetry 602 | AND 603 | subcollection_ssi:adl 604 | AND 605 | author_name_tesim:grundtvig 606 | AND 607 | text_tesim:hjerte 608 | AND 609 | text_tesim:smerte 610 |611 |
622 | genre_ssi:play 623 | AND 624 | subcollection_ssi:adl 625 | AND 626 | text_tesim:mester erich 627 | AND 628 | author_name_tesim:holberg 629 |630 |
643 | subcollection_ssi:letters 644 | AND 645 | author_name_tesim:georg brandes 646 | AND 647 | sender_location_tesim:berlin 648 |649 |
662 | subcollection_ssi:letters 663 | AND 664 | sender_location_tesim:paris 665 | AND 666 | year_itsi:[1000 TO 1850] 667 |668 |
684 | author_name_tesim:holberg 685 |
688 | {!join to=id from=part_of_ssim}genre_ssi:poetry 689 |690 |
703 | year_itsi desc 704 |705 |
727 | subcollection_ssi:gv 728 | AND 729 | verse_extract_tesim:helvede 730 | AND 731 | type_ssi:work 732 |733 | field list 734 |
735 | id year_itsi 736 |737 | sort by ascending 738 |
739 | year_itsi asc 740 |741 |
754 | subcollection_ssi:gv 755 | AND 756 | text_tesim:helvede 757 | AND 758 | type_ssi:work 759 | AND 760 | genre_ssi:poetry 761 |
764 | subcollection_ssi:gv 765 | AND 766 | text_tesim:helvede 767 | AND 768 | type_ssi:leaf 769 | AND 770 | genre_ssi:poetry 771 |
776 |777 |
804 | volume_id_ssi:adl-texts-munp1-root 805 | AND 806 | text_tesim:regn 807 | AND 808 | genre_ssi:poetry 809 |810 |
816 | position_isi desc 817 |818 |
842 | volume_id_ssi:adl-texts-munp1-root 843 | AND 844 | text_tesim:regn 845 |846 |
851 | {!join to=id from=part_of_ssim}genre_ssi:poetry 852 |853 |
859 | position_isi asc 860 |861 |
For now we see only a 868 | reflection as in a mirror; then we shall see face to 869 | face.) in the works of N.F.S. Grundtvig. 870 | 871 | try it! 880 |
885 | bible_ref_ssim:"1 Kor 13,12" 886 | AND 887 | subcollection_ssi:gv 888 | AND 889 | is_editorial_ssi:no 890 |891 |
896 | year_itsi asc 897 |898 |
903 | {!join to=volume_id_ssi from=part_of_ssim}genre_ssi:prose 904 |905 |
920 | {!join to=volume_id_ssi from=part_of_ssim}genre_ssi:poetry 921 |922 |
You cannot use the index-test instance outside our 935 | network. Forget this if you are not developer at kb.dk
936 | 937 | 943 |This document was authored by
964 |Sigfrid Lundberg
965 | The Royal Danish Library
966 | Denmark
who also wrote the indexer. However, a large number of people 969 | has contributed to this by coding services on top the 970 | index. That process has required clarifications of this document 971 | and modification of the index. This is the fruit of a teamwork.
972 | 973 |This document is a part of Royal 13 | Danish Library's APIs, and in particular The 15 | documentation on how use our image based resources.. See also Licences 17 | & Legalese and 18 | Caveats
19 | 20 | 52 |55 |56 |
More to come
62 | 63 |See COP 65 | SOLR data in our public index.
66 | 67 |The viewer-js used in here isn't compatible with the needles we are asking for in our kml
7 | 13 | 14 | 15 | -------------------------------------------------------------------------------- /links.md: -------------------------------------------------------------------------------- 1 | 2 | # Links in COP 3 | 4 | The objects are presented as parts of what we refer to as 5 | editions. Each edition is typically the fruit of a digitisation 6 | project within a collection. 7 | 8 | 9 | 10 | 11 | 12 | pamphlets 13 | manus 14 | images 15 | editions 16 | letters 17 | maps 18 | books 19 | 20 | 21 | 22 | | edition | description | 23 | |:--------|:------------| 24 | | /editions/any/2009/jul/editions | The edition of all editions | 25 | | /books/judsam/2010/maj/jstryk | Judaistisk Samling: Tidlige & sjældne tryk | 26 | | /books/ortsam/2011/mar/ostryk | Tidlige tryk i Orientalsk Samling | 27 | | /images/billed/2010/okt/billeder | Billeder | 28 | | /images/billed/2014/jun/hca | H.C. Andersens Papirklip | 29 | | /letters/judsam/2011/mar/dsa | David Simonsens Arkiv | 30 | | /manus/judsam/2009/sep/dsh | David Simonsens Håndskrifter | 31 | | /manus/judsam/2010/maj/jsmss | Judaistisk Samling: Håndskrifter | 32 | | /manus/musman/2010/dec/viser | DFS | 33 | | /manus/ortsam/2009/okt/orientalia | Oriental Collection: Manuscripts | 34 | | /manus/vmanus/2011/dec/ha | Vesterlandske håndskrifter | 35 | | /maps/kortsa/2012/jul/kortatlas | Kort og Atlas | 36 | | /pamphlets/dasmaa/2008/feb/daellsvarehus | Varehuskataloger | 37 | | /pamphlets/dasmaa/2008/feb/partiprogrammer | Partiprogram | 38 | | /pamphlets/dasmaa/2012/jul/smaatryk | Småtryk | 39 | -------------------------------------------------------------------------------- /metadata-formats.md: -------------------------------------------------------------------------------- 1 | [READ ME](README.md) - [OAI Dissemination](oai-pmh.md) - [Web services in COP](cop-backend.md) - [Aerial Photography](geographic-data.md) - [Image delivery](image-delivery.md) - [Metadata Formats](metadata-formats.md) - [Text Corpora](text-corpora.md) 2 | 3 | # The Metadata Formats Used in Syndication and Dissemination 4 | 5 | Through the `format` CGI variable, data can be syndicated in the 6 | following formats: `kml`, `rss`, `solr` and `mods`. 7 | 8 | ## format=kml 9 | 10 | The [KML](https://developers.google.com/kml/documentation/) feed has 11 | not been designed to be consumed directly by external software like 12 | [Google MAPS](http://maps.google.com/), and as of writing this it is 13 | not known if that works. 14 | 15 | ## format=rss 16 | 17 | [RSS 2.0](https://cyber.harvard.edu/rss/rss.html) is the main 18 | format. This feed includes Open Search, geo and GeoRSS extensions. For 19 | example 20 | 21 | ``` 22 |