1-4 Convert Data To UTF-8
This example demonstrates the ConvertCharacterSet processor and shows how to convert data from another character set to UTF-8.
NiFi currently has no way to detect a file's character set automatically, so the input character set must be configured explicitly. Related NiFi issues: 1874, 4550.
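
Since the character set cannot be detected automatically, a rough pre-check outside NiFi can at least tell you whether a file is already valid UTF-8. The sketch below is a plain Python illustration, not part of the NiFi flow; the helper name `is_valid_utf8` is hypothetical. It only answers "is this valid UTF-8?" and does not identify which other character set a file uses.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")  # strict decoding: raises on any invalid byte sequence
        return True
    except UnicodeDecodeError:
        return False


# "café" encoded as UTF-8 (é is the two bytes 0xC3 0xA9)
utf8_sample = b"caf\xc3\xa9"
# "café" encoded as Windows-1252 (é is the single byte 0xE9)
cp1252_sample = b"caf\xe9"

already_utf8 = is_valid_utf8(utf8_sample)
needs_conversion = not is_valid_utf8(cp1252_sample)
```

A file that fails this check still needs its actual character set chosen by hand in the Input Character Set property.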
- Download Template

Processors:
- GetFile – reads files from a watched directory
  - Properties
    - Input Directory: /some/path
- ConvertCharacterSet
  - Properties
    - Input Character Set: Windows-1252 (or whatever your content is)
    - Output Character Set: UTF-8
- PutMarkLogic
  - Properties
    - DatabaseClient Service: (your MarkLogic DatabaseClient Service)
    - URI Attribute Name: filename
    - URI Prefix: /files/
  - Settings
    - Automatically Terminate Relationships: failure, success
![](../images/04-001 ConvertCharacterSet.png)
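
For intuition about what the conversion step does to the bytes, here is a minimal Python sketch of the same idea: decode the content using the input character set, then re-encode it as UTF-8. This is an illustration under stated assumptions, not NiFi's implementation; the function name `convert_to_utf8` and the sample bytes are hypothetical.

```python
def convert_to_utf8(data: bytes, input_charset: str = "windows-1252") -> bytes:
    """Decode bytes from input_charset and re-encode them as UTF-8 (hypothetical helper)."""
    # Raises UnicodeDecodeError if the bytes are not valid in input_charset.
    text = data.decode(input_charset)
    return text.encode("utf-8")


# "café" in Windows-1252: é is the single byte 0xE9
source = b"caf\xe9"
converted = convert_to_utf8(source)
```

If the input character set is wrong, the result may still be valid UTF-8 but contain the wrong characters, which is why the Input Character Set property has to match the actual content.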