Field Tokeniser
Tokenise attribute values in to component parts
Objective
Split complex aggregated event attributes in to independent attributes using custom field tokeniser plugins
Uses
The uses of this filter is very much depended upon context to which Joule is deployed within. As such it relies entirely upon custom implementations developed by business developers.
Listed below are various uses for this type filter:
Split an address provided as a single string in to independent address components.
Get a device code from a mobile IMEI code.
Tokenise sentences ready for LLM processing
Example
This example take an aggregated value of longitude and latitude and split it to independent fields and added to the StreamEvent
object. This uses the custom plugin LatitudeLongitudeDecoder
provided in the telco project, see code below.
Attributes schema
Attribute | Description | Data Type | Required |
---|---|---|---|
tokenizers | Map of field and tokenizers. Tokenizers are provided by supporting business developers. | Map<String, FieldTokenizer> |
Plugin code
The plugin code extracts a latitude and longitude string, splits it by the comma, and converts each part into separate numeric event attributes.
A FieldTokenizer API is provided for developers to build and deploy custom implementation.
Last updated