Ideas

Treasure Data's primary idea portal. 

Submit your ideas & feature requests directly to our product requirements team! We look forward to hearing from you.

Support Japanese morphological analyzer/Tokenizer UDF in Hive

Support Japanese morphological analyzer/Tokenizer UDF in Hive using Kuromoji or something.

The UDF is beneficial for making feature vectors from documents.

  • Makoto Yui
  • Jun 10 2016
  • Shipped
  • Y.Kentaro (@yoshi_ken) commented
    June 10, 2016 23:36

    It also needs to bundle some dictionary like IPA, Unidic, NAIST.