Navigation

1-4 Convert Data To UTF-8

This example demonstrates the ConvertCharacterSet processor and shows how to convert data from another character set to UTF-8.

There is currently no way to automatically detect the character set. Related NiFi issues: 1874, 4550.

  • Download Template
  • Processors:
    • GetFile – reads files from a watched directory
      • Properties
        • Input Directory: /some/path
    • ConvertCharacterSet
      • Properties
        • Input Character Set: Windows-1252: (or whatever your content is)
        • Output Character Set: UTF-8
    • PutMarkLogic
      • Properties
        • DatabaseClient Service: (your MarkLogic DatabaseClient Service)
        • URI Attribute Name: filename
        • URI Prefix: /files/
      • Settings
        • Automatically Terminate Relationships: failure, success