Description
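When analyzing "测试123EDF", I want to avoid the pinyin being split into multiple tokens. Expected result: "ceshi123EDF".
Observed problem: the digits are split apart, "EDF" is split apart, and the pinyin is split into "ce" and "shi".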
Steps to reproduce
Index settings:
PUT /my_index
{
  "settings": {
    "analysis": {
      "analyzer": {
        "pinyin_analyzer": {
          "tokenizer": "my_pinyin_tokenizer"
        }
      },
      "tokenizer": {
        "my_pinyin_tokenizer": {
          "type": "pinyin",
          "keep_first_letter": false,
          "keep_separate_first_letter": false,
          "keep_full_pinyin": true,
          "limit_first_letter_length": 16,
          "lowercase": true,
          "none_chinese_pinyin_tokenize": true
        }
      }
    }
  }
}
Analyzer test:
GET /my_index/_analyze
{
  "analyzer": "pinyin_analyzer",
  "text": "理财123EDF"
}
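For comparison, here is a sketch of a tokenizer configuration that might get closer to the single-token result described above. It has not been verified. It assumes the plugin version in use supports `keep_joined_full_pinyin`, `keep_none_chinese_in_joined_full_pinyin` and `keep_none_chinese_together`, and that these options behave as described in the plugin README. The index name `my_index_joined` is hypothetical. Two of the settings in the original configuration appear to work against the expected result: `keep_full_pinyin: true` emits one token per syllable, and `lowercase: true` would turn "EDF" into "edf".

```
PUT /my_index_joined
{
  "settings": {
    "analysis": {
      "analyzer": {
        "pinyin_analyzer": {
          "tokenizer": "my_pinyin_tokenizer"
        }
      },
      "tokenizer": {
        "my_pinyin_tokenizer": {
          "type": "pinyin",
          "keep_first_letter": false,
          "keep_separate_first_letter": false,
          "keep_full_pinyin": false,
          "keep_joined_full_pinyin": true,
          "keep_none_chinese": true,
          "keep_none_chinese_together": true,
          "keep_none_chinese_in_joined_full_pinyin": true,
          "none_chinese_pinyin_tokenize": false,
          "lowercase": false
        }
      }
    }
  }
}
```

Running the same `_analyze` request against this index shows whether the joined token is produced. Depending on the plugin version, the non-Chinese run "123EDF" may still be emitted as additional standalone tokens alongside the joined one.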
Environment