1. Introduction
This specification is a supplementary specification to the CLARIN-FCS Core specification and defines additional Data View to be used in CLARIN-FCS. This specification defines the supplementary Data Views. For detailed information about the CLARIN-FCS interface specification, see CLARIN-FCS-Core 2.0.
1.1. Terminology
The key words MUST
, MUST NOT
, REQUIRED
, SHALL
, SHALL NOT
, SHOULD
, SHOULD NOT
, RECOMMENDED
, MAY
, and OPTIONAL
in this document are to be interpreted as in RFC2119.
1.2. Normative References
- RFC2119
-
Key words for use in RFCs to Indicate Requirement Levels, IETF RFC 2119, March 1997, https://www.ietf.org/rfc/rfc2119.html
- XML-Namespaces
-
Namespaces in XML 1.0 (Third Edition), W3C, 8 December 2009, http://www.w3.org/TR/2009/REC-xml-names-20091208/
- CLARIN-FCS-Core 2.0
-
CLARIN Federated Content Search (CLARIN-FCS) - Core 2.0, SCCTC FCS Task-Force, May 2017, https://trac.clarin.eu/wiki/FCS/Specification, https://github.com/clarin-eric/fcs-misc/tree/main/fcs-core-2.0, https://office.clarin.eu/v/CE-2017-1046-FCS-Specification-v20230426.pdf
1.3. Non-Normative References
- RFC6838
-
Media Type Specifications and Registration Procedures, IETF RFC 6838, January 2013, https://www.ietf.org/rfc/rfc6838.html
- RFC3023
-
XML Media Types, IETF RFC 3023, January 2001, https://www.ietf.org/rfc/rfc3023.html
- KML
-
Keyhole Markup Language (KML), Open Geospatial Consortium, 2008, https://www.opengeospatial.org/standards/kml
1.4. Typographic and XML Namespace conventions
The following typographic conventions for XML fragments will be used throughout this specification:
-
<prefix:Element>
An XML element with the Generic Identifier Element that is bound to an XML namespace denoted by the prefix prefix.
-
@attr
An XML attribute with the name attr.
-
string
The literal string must be used either as element content or attribute value.
Endpoints and Clients MUST
adhere to the XML-Namespaces specification. The CLARIN-FCS interface specification generally does not dictate whether XML elements should be serialized in their prefixed or non-prefixed syntax, but Endpoints MUST
ensure that the correct XML namespace is used for elements and that XML namespaces are declared correctly. Clients MUST
be agnostic regarding syntax for serializing the XML elements, i.e. if the prefixed or un-prefixed variant was used, and SHOULD
operate solely on expanded names, i.e. pairs of namespace name and local name.
The following XML namespace names and prefixes are used throughout this specification. The column "Recommended Syntax" indicates which syntax variant SHOULD
be used by the Endpoint to serialize the XML response.
Prefix | Namespace Name | Comment | Recommended Syntax |
---|---|---|---|
|
CLARIN-FCS Resources |
prefixed |
|
|
Component Metadata |
un-prefixed |
|
|
Keyhole Markup Language |
un-prefixed |
2. Data Views
A Data View serves as a container for representing search results within CLARIN-FCS. Data Views are designed to allow for different representations of results. This specification defines supplementary Data Views beyond the Generic Hits Data View and Advanced Data View, that is defined as part of the CLARIN-FCS Core 2.0 specification. For detailed information as to what Data Views are and how they are integrated in CLARIN-FCS, see CLARIN-FCS-Core 2.0.
Note
|
The examples in the following sections show only the payload with the enclosing <fcs:DataView> element of a Data View. Of course, the Data View must be embedded either in a <fcs:Resource> or a <fcs:ResourceFragment> element. The @pid and @ref attributes have been omitted for all inline payload types.
|
2.1. Generic Hits (HITS)
The Generic Hits (HITS) Data View is an integral part of the Core specification and serves as the as the most basic agreement in CLARIN-FCS for the serialization of search results. For details about this Data View, see the Core specification CLARIN-FCS-Core 1.0 or 2.0, Section "Generic Hits (HITS)".
2.2. Advanced (ADV)
The Advanced (ADV) Data View is an integral part of the Core 2.0 specification and serves as the as the agreement in CLARIN-FCS for the serialization of CLARIN-FCS 2.0 Advanced Search results. For details about this Data View, see the Core specification CLARIN-FCS-Core 2.0, Section "Advanced (ADV)".
2.3. Component Metadata (CMDI)
Description |
A CMDI metadata record |
---|---|
MIME type |
|
Payload Disposition |
inline or reference |
Payload Delivery |
send-by-default ( |
Recommended Short Identifier |
|
The Component Metadata Data View allows the embedding of a CMDI metadata record that is applicable to the specific context into the Endpoint response, e.g. metadata about the resource in which the hit was produced. If this CMDI record is applicable for the entire Resource, it SHOULD
be put in a <fcs:DataView>
element below the <fcs:Resource>
element. If it is applicable to the Resource Fragment, i.e. it contains more specialized metadata than the metadata for the encompassing resource, it SHOULD
be put in a <fcs:DataView>
element below the <fcs:ResourceFragment>
element. Endpoints SHOULD
provide the payload inline, but Endpoints MAY
also use the reference method. If an Endpoint uses the reference method, the CMDI metadata record MUST
be downloadable without any restrictions.
<!-- potential @pid and @ref attributes omitted -->
<fcs:DataView type="application/x-cmdi+xml">
<cmdi:CMD xmlns:cmdi="http://www.clarin.eu/cmd/" CMDVersion="1.1">
<!-- content omitted -->
</cmdi:CMD>
</fcs:DataView>
<!-- potential @pid attribute omitted -->
<fcs:DataView type="application/x-cmdi+xml" ref="http://repos.example.org/resources/4711/0815.cmdi" />
2.4. Images (IMG)
Description |
An image related to the hit |
---|---|
MIME type |
|
Payload Disposition |
reference |
Payload Delivery |
need-to-request ( |
Recommended Short Identifier |
|
The Image Data View provides an image that is relevant to the hit, e.g. a facsimile of the source of a transcription. Endpoints MUST
provide the payload by the reference method and the image file SHOULD
be downloadable without any restrictions.
<!-- potential @pid attribute omitted -->
<fcs:DataView type="image/png" ref="http://repos.example.org/resources/4711/0815.png" />
2.5. Geolocation (GEO)
Description |
An geographic location related to the hit |
---|---|
MIME type |
|
Payload Disposition |
inline |
Payload Delivery |
need-to-request ( |
Recommended Short Identifier |
|
The Geolocation Data View allows to geolocalize a hit. If MUST
be encoded using the XML representation of the Keyhole Markup Language (KML). The KML fragment MUST
comply with the specification as defined by [ref:KML].
<!-- potential @pid and @ref attributes omitted -->
<fcs:DataView type="application/vnd.google-earth.kml+xml">
<kml:kml xmlns:kml="http://www.opengis.net/kml/2.2">
<kml:Placemark>
<kml:name>IDS Mannheim</kml:name>
<kml:description>Institut für Deutsche Sprache, R5 6-13, 68161 Mannheim, Germany</kml:description>
<kml:Point>
<kml:coordinates>8.4719510,49.4883700,0</kml:coordinates>
</kml:Point>
</kml:Placemark>
</kml:kml>
</fcs:DataView>
Changelog
2023-06-12 — Conversion to AsciiDoc and Migration of specification documents to Github
-
Convert specification documents for FCS Core 2.0, Core 1.0, DataView and AAI to AsciiDoc
-
Migrate from CLARIN Trac to CLARIN Github
-
Add Github Actions workflow to automate build process
2017-06-13 — Last version on Trac
-
Proofing, typo fixes, updated links
2014-05-05 — Data Views specification
-
Add full specification