Does any one know how to expand queries using Wordnet with Lucene 3.6?(有谁知道如何使用带有 Lucene 3.6 的 Wordnet 扩展查询?)
问题描述
我在 org.apache.lucene.analysis.synonym 中找到了 WordnetSynonymParser 类,但在 API 和 google 中都没有使用示例.有谁有这方面的经验吗?
I've found the class WordnetSynonymParser in org.apache.lucene.analysis.synonym but there aren't examples of its usage neither in the API nor in google. Does any one have experience with it?
谢谢!
编辑:我知道以前有类 SynExpand,但是在 3.6 版本中它消失了...
EDIT: I know that there used to be the class SynExpand, but with version 3.6 it disappeared...
我试试:
try {
FileReader rulesReader = new FileReader("wn/wn_s.pl");
SynonymMap.Builder parser = null;
parser = new WordnetSynonymParser(true, true, analyzer);
((WordnetSynonymParser)parser).add(rulesReader);
synonymMap = parser.build();
} catch (Exception e) {
e.printStackTrace();
System.exit(1);
}
但我收到以下错误:
java.text.ParseException: Invalid synonym rule at line 109
at org.apache.lucene.analysis.synonym.WordnetSynonymParser.add(WordnetSynonymParser.java:75)
at pirServer.QueryClassifier.<init>(QueryClassifier.java:77)
at pirServer.PIRServer.main(PIRServer.java:32)
Caused by: java.lang.IllegalArgumentException: term: course of action analyzed to a token with posinc != 1
at org.apache.lucene.analysis.synonym.SynonymMap$Builder.analyze(SynonymMap.java:131)
at org.apache.lucene.analysis.synonym.WordnetSynonymParser.parseSynonym(WordnetSynonymParser.java:92)
at org.apache.lucene.analysis.synonym.WordnetSynonymParser.add(WordnetSynonymParser.java:67)
... 2 more
推荐答案
我正在做类似的事情,只是阅读了文档 - 所以 SynonymFilter 文档中的相关警告非常新鲜:
I am working on a similar thing and just read the documentation - so a relevant caution from the SynonymFilter doc is very fresh:
""此令牌流无法正确处理位置增量!= 1,即您应该在过滤掉停用词之前放置此过滤器""
""This token stream cannot properly handle position increments != 1, ie, you should place this filter before filtering out stop words""
http://lucene.apache.org/core/3_6_0/api/all/org/apache/lucene/analysis/synonym/SynonymFilter.html
您传递给 WordNetSynonymParser 的分析器(您没有在帖子中描述)可能会删除停用词(大多数情况下都是如此),从而导致:
It's possible that the analyzer you're passing (which you fail to describe in your post) to the WordNetSynonymParser does remove stop words (as is the case for most of them) causing:
java.lang.IllegalArgumentException: term: 使用 posinc != 1 分析到令牌的操作过程
java.lang.IllegalArgumentException: term: course of action analyzed to a token with posinc != 1
这篇关于有谁知道如何使用带有 Lucene 3.6 的 Wordnet 扩展查询?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持编程学习网!
本文标题为:有谁知道如何使用带有 Lucene 3.6 的 Wordnet 扩展查询?
基础教程推荐
- 在 Libgdx 中处理屏幕的正确方法 2022-01-01
- 无法使用修饰符“public final"访问 java.util.Ha 2022-01-01
- 降序排序:Java Map 2022-01-01
- FirebaseListAdapter 不推送聊天应用程序的单个项目 - Firebase-Ui 3.1 2022-01-01
- 减少 JVM 暂停时间 >1 秒使用 UseConcMarkSweepGC 2022-01-01
- 设置 bean 时出现 Nullpointerexception 2022-01-01
- “未找到匹配项"使用 matcher 的 group 方法时 2022-01-01
- 如何使用 Java 创建 X509 证书? 2022-01-01
- Java:带有char数组的println给出乱码 2022-01-01
- Java Keytool 导入证书后出错,"keytool error: java.io.FileNotFoundException &拒绝访问" 2022-01-01