As an Open Data Hub maintainer i want a precise message specification on top of which build and broadcast all messages #42

Luscha · 2023-10-26T07:17:48Z

As an Open Data Hub maintainer i want a precise message specification on top of which build and broadcast all messages.

Questions to be defined:

do we want to assign an ID to each message?
which fields do we want to have in each message (e.g. origin, data collector, timestamp, history, etc.)?

This is mainly a documentation issue.

sseppi · 2023-10-30T14:25:27Z

@Luscha can we discuss this issue a bit more into detail on Monday, in order to define the priority and the right Milestone?

clezag · 2024-07-16T11:40:25Z

The current format (by convention - as defined in WriterRoute) is

{
string provider
string timestamp
string rawdata
}

provider is in format provider/dataset
timestamp is in format ISO 8601 (not sure how timezones are handled - has to be tested)
rawdata is a string, could be a raw json or yaml, or base64 encoded whatever. max 16MB due to mongodb limitations

Reasonable candidates to be included are:

id (a unique ID propagated as correlation ID in rabbitmq, so that we can track a datapoint from start to finish)
content type to enable handling of specific data formats, and document what blobs actually are

Luscha added documentation Improvements or additions to documentation priority/high labels Oct 26, 2023

sseppi added question Further information is requested and removed question Further information is requested labels Oct 30, 2023

sseppi added this to the Infrastructure 2.0 Production Ready milestone Nov 6, 2023

Provide feedback