Skip to content

jxpress/kuromoji4s

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

kuromoji4s

kuromoji4s is a wrapper library of kuromoji (http://www.atilika.org/) for scala

Requirements

  • Java 7 or later
  • scala 2.11.7 or later
  • sbt 0.13.8 or later

Library Dependencies

resolvers += "Maven Repository on Github" at "https://jxpress.github.io/mvnrepos/"

libraryDependencies += "net.jxpress" % "kuromoji4s_2.11" % "0.0.6"

Limitations

Only kuromoji-ipadic is supported in the current version (0.0.6)

if you want to use neologd dictionary, you need to be the following additional dependency:

libraryDependencies += "net.jxpress" % "kuromoji-neologd-dic_2.11" % "0.9.0"  from "https://github.com/jxpress/kuromoji-neologd-dic/releases/download/0.9.0/kuromoji-neologd-dic_2.11-0.0.9.jar"

Then you can create a neologd tokenizer as follows:

import net.jxpress.kuromoji4s.ipadic.Tokenizer
import net.jxpress.kuromoji4s.DictionaryType.IPA.Neologd

val tokenizer = Tokenizer(Neologd)

The detail for kuomoji-neologd-dic is here

Sample code

see here