Treasure Data's primary idea portal. 

Submit your ideas & feature requests directly to our product requirements team! We look forward to hearing from you.

customize dictionary of hivemall tokenize_ja

This is a customer's request.

The customer can do morphological analysis using tokenize_ja, but current behavior is not suitable for the customer (e.g. 「二番目」is separated 「二」and「番」,「目」)
So it's better if we can set customize dictionary optionally.

  • Keisuke Noda
  • Feb 14 2017
  • Shipped