Key concepts
Enriching event data with the latest contextual data is crucial for processing real-time business insights.
Low latency enrichment architecture
OOTB Joule has provided the heavy lifting required to deploy a low-latency solution so that developers can focus on building advanced use case that required addition contextual data. The Joule contextual data architecture supports various low-latency in-memory data stores that reduce the inherent I/O overhead of out-of-process databases and thus enable stream based enrichment.
Example
The below example enriches events using the linked reference data and metrics. Reference data is imported at process startup using a provided parquet file. Metrics are calculated after 1 minute using a three minute tumbling window approach.
Output
The above example generates the following output, in this case CSV.
General Enricher DSL
The enricher processor provide users to the ability to enrich an event with multiple data elements from various data sources through the use of enhanced mapping.
Example
Top level attributes
Two key attributes are required for the enricher processor; one is to define which fields to enrich whereas the other provides the data store binding.
Fields Attribute
Enrichment is applied at the field level whereby each returned data element is added to the defined field either as map of values or as a domain object.
The field attribute is logical organised as three definition type:
Query approach
Response approach
Binding store
Query approach
Contextual data is retrieved using one of two methods, by key or by query.
By Key
Using the key based look up approach enables you to perform a look up against a store using either the primary key or the key within a caching solution.
Example returns specific attributes from ReferenceDataObject
Example returns a ReferenceDataObject as a linked object
See ReferenceDataObject for further information
By Query
To fine tune your enrichment process you can define a query rather than a strict key based look up. This would provide you with a greater flexibility to drive further pipeline processing. Below represents a OQL based query using an in-memory cache solution.
Attributes
Response Approach
On a successful data retrieval the response object, ReferenceDataObejct, key values are added directly in to the event or added as an object.
Either one of the attributes must be provided.
with values
Add selected attributes to the event.
all attributes
Add all attributes to the event.
as object
Add the returned object to the event using the field name.
Binding store
Bind the field configuration to a data store using a logical store name. This would either be custom or using the pre-defined stores. A custom store should be defined under the Stores Attribute, see section for more details.
Supported stores
JouleDB
MetricsDB
If either one of these are provided there is no need to specify the stores attribute.
Stores Attribute
Bind the processor to one or more linked data stores using a logical store name mapped to a set of configuration attributes. The defined store name configuration needs to have been provided, see reference data documentation for further details.
Example
The example below uses the nasdaqIndexCompanies
as a logical name to bind the reference data lookup criteria to be performed.
Attributes
ReferenceDataObject
For further information read Reference Data documentation
Last updated