Package | Description |
---|---|
messif.buckets.storage.impl |
Implementations of physical bucket storage.
|
messif.objects.impl |
Implementation of basic data objects.
|
messif.objects.text |
Support for text data.
|
Modifier and Type | Class and Description |
---|---|
class |
DatabaseStorage<T>
Database-based storage.
|
class |
MemoryStorage<T>
Memory based storage.
|
Modifier and Type | Method and Description |
---|---|
IntStorageIndexed<java.lang.String> |
MetaObjectProfiSCT.DatabaseSupport.getWordIndex()
Returns the current word-index instance.
|
Modifier and Type | Method and Description |
---|---|
protected ObjectIntMultiVectorJaccard |
MetaObjectProfiSCT.convertWordsToIdentifiers(WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.Object title,
java.lang.Object keyword,
java.lang.Object search)
Convert the given title, key and additional words to a int multi-vector
object with Jaccard distance function.
|
static CophirXmlParser |
CophirXmlParser.create(java.io.File file,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Factory method that parses the given CoPhIR XML file.
|
static CophirXmlParser |
CophirXmlParser.create(java.io.File xmlDir,
java.lang.String identifier,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Factory method that parses a CoPhIR XML file with the given identifier.
|
static TextDescriptorFactory<ObjectIntMultiVectorJaccard> |
ObjectIntMultiVectorJaccard.createTextDescriptorFactory(java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Creates a factory method that converts the given array of strings into
ObjectIntMultiVectorJaccard . |
static TextDescriptorFactory<ObjectIntMultiVectorJaccard> |
ObjectIntMultiVectorJaccard.createTextDescriptorFactory(java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Creates a factory method that converts the given array of strings into
ObjectIntMultiVectorJaccard . |
static java.util.Map<java.lang.String,DatabaseStorage.ColumnConvertor<MetaObjectProfiSCT>> |
MetaObjectProfiSCT.DatabaseSupport.getDBColumnMap(boolean addTextStreamColumn,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
boolean useLinkTable)
Returns the database column definitions for the
MetaObjectProfiSCT object. |
java.lang.String[] |
MetaObjectProfiSCT.getKeywordWords(IntStorageIndexed<java.lang.String> wordIndex)
Returns the key words of this object.
|
java.lang.String[] |
MetaObjectProfiSCT.getSearchWords(IntStorageIndexed<java.lang.String> wordIndex)
Returns the search words of this object.
|
static DatabaseStorage.ColumnConvertor<MetaObjectProfiSCT> |
MetaObjectProfiSCT.DatabaseSupport.getTextStreamColumnConvertor(Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
boolean useLinkTable)
Returns the database column convertor for creating the
MetaObjectProfiSCT object
from the text stream. |
java.lang.String[] |
MetaObjectProfiSCT.getTitleWords(IntStorageIndexed<java.lang.String> wordIndex)
Returns the title words of this object.
|
static ObjectIntMultiVectorJaccard |
CophirXmlParser.parseKeyWordsType(java.lang.String[] texts,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Parse the keywords descriptor data.
|
Constructor and Description |
---|
CophirXmlParser(Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new handler for parsing CoPhIR XML files.
|
CophirXmlParser(Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.String csvObjectName)
Creates a new handler for parsing CoPhIR XML files.
|
MetaObjectPixMacSCT.MultiWeightIgnoreProviderPixMac(float[] weights,
float ignoreWeight,
java.lang.String[] ignoredKeywords,
IntStorageIndexed<java.lang.String> keyWordIndex)
Creates a new instance of MultiWeightProvider with the the given array of weights.
|
MetaObjectPixMacSCT(java.io.BufferedReader stream,
IntStorageIndexed<java.lang.String> keyWordIndex)
Creates a new instance of MetaObjectPixMacShapeAndColor.
|
MetaObjectPixMacSCT(java.io.BufferedReader stream,
IntStorageIndexed<java.lang.String> keyWordIndex,
boolean readAdditionalKeyWords)
Creates a new instance of MetaObjectPixMacShapeAndColor.
|
MetaObjectPixMacSCT(MetaObjectPixMacSCT object,
IntStorageIndexed<java.lang.String> keyWordIndex,
java.lang.String[] titleWords,
java.lang.String[] keyWords)
Creates a new instance of MetaObjectPixMacShapeAndColor from the given
MetaObject
and given set of keywords. |
MetaObjectPixMacSCT(MetaObjectPixMacSCT object,
IntStorageIndexed<java.lang.String> keyWordIndex,
java.lang.String[] titleWords,
java.lang.String[] keyWords,
java.lang.String searchWords)
Creates a new instance of MetaObjectPixMacShapeAndColor from the given
MetaObject
and given set of keywords. |
MetaObjectPixMacSCT(MetaObjectPixMacSCT object,
IntStorageIndexed<java.lang.String> keyWordIndex,
java.lang.String[] titleWords,
java.lang.String[] keyWords,
java.lang.String[] searchWords)
Creates a new instance of MetaObjectPixMacShapeAndColor from the given
MetaObject
and given set of keywords. |
MetaObjectPixMacShapeAndColor(java.io.BufferedReader stream,
IntStorageIndexed<java.lang.String> keyWordIndex)
Creates a new instance of MetaObjectPixMacShapeAndColor.
|
MetaObjectPixMacShapeAndColor(MetaObject object,
IntStorageIndexed<java.lang.String> keyWordIndex,
java.lang.String... keyWords)
Creates a new instance of MetaObjectPixMacShapeAndColor from the given
MetaObject
and given set of keywords. |
MetaObjectProfiSCT.DatabaseSupport(java.lang.String dbConnUrl,
java.util.Properties dbConnInfo,
java.lang.String dbDriverClass,
java.lang.String tableName,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of DatabaseSupport.
|
MetaObjectProfiSCT.DatabaseSupport(java.lang.String dbConnUrl,
java.util.Properties dbConnInfo,
java.lang.String dbDriverClass,
java.lang.String tableName,
java.lang.String wordLinkTable,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of DatabaseSupport.
|
MetaObjectProfiSCT.DatabaseSupport(java.lang.String dbConnUrl,
java.util.Properties dbConnInfo,
java.lang.String dbDriverClass,
java.lang.String tableName,
java.lang.String wordLinkTable,
java.lang.String wordFrequencyTable,
java.lang.String stopwordTable,
java.lang.String[] stopwordCategories,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of DatabaseSupport.
|
MetaObjectProfiSCT.MetaObjectProfiSCTKwDistCosine(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.Float keywordsWeight,
float[] keywordLayerWeights)
Creates a new instance of MetaObjectProfiSCTKwDistCosine.
|
MetaObjectProfiSCT.MetaObjectProfiSCTKwDistJaccard(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.Float keywordsWeight,
float[] keywordLayerWeights)
Creates a new instance of MetaObjectProfiSCTKwDistJaccard.
|
MetaObjectProfiSCT.MultiWeightIgnoreProviderProfi(float[] weights,
float ignoreWeight,
java.lang.String[] ignoredKeywords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> keyWordIndex)
Creates a new instance of MultiWeightProvider with the the given array of weights.
|
MetaObjectProfiSCT(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of MetaObjectProfiSCT.
|
MetaObjectProfiSCT(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.String searchString)
Creates a new instance of MetaObjectProfiSCT.
|
MetaObjectProfiSCT(MetaObjectProfiSCT object,
java.lang.String[] titleWords,
java.lang.String[] keywordWords,
java.lang.String[] searchWords,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of MetaObjectProfiSCT from the given
MetaObjectProfiSCT
and given title words, key words, and search words. |
MetaObjectProfiSCT(MetaObjectProfiSCT object,
java.lang.String titleString,
java.lang.String keywordString,
java.lang.String searchString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of MetaObjectProfiSCT from the given
MetaObjectProfiSCT
and given set of keywords. |
MetaObjectProfiSCT(MetaObjectProfiSCT object,
java.lang.String titleString,
java.lang.String keywordString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of MetaObjectProfiSCT from the given
MetaObjectProfiSCT
and given set of keywords. |
MetaObjectProfiSCT(MetaObjectProfiSCT object,
java.lang.String searchString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Creates a new instance of MetaObjectProfiSCT from the given
MetaObjectProfiSCT
and given set of keywords. |
MetaObjectProfiSCTiDIM.MetaObjectProfiSCTiDIMKwDistCosine(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.Float keywordsWeight,
float[] keywordLayerWeights)
Creates a new instance of MetaObjectProfiSCTiDIMKwDistCosine.
|
MetaObjectProfiSCTiDIM(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex) |
MetaObjectProfiSCTiDIM(java.io.BufferedReader stream,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
java.lang.String searchString) |
MetaObjectProfiSCTiDIM(MetaObjectProfiSCT object,
java.lang.String titleString,
java.lang.String keywordString,
java.lang.String searchString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex) |
MetaObjectProfiSCTiDIM(MetaObjectProfiSCT object,
java.lang.String titleString,
java.lang.String keywordString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex) |
MetaObjectProfiSCTiDIM(MetaObjectProfiSCT object,
java.lang.String searchString,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex) |
Modifier and Type | Method and Description |
---|---|
static java.lang.String[] |
TextConversion.identifiersToWords(IntStorageIndexed<java.lang.String> wordIndex,
int[] ids)
Convert the given array of word identifiers to words using the given storage.
|
static int[][] |
TextConversion.textsToWordIdentifiersMultiIndex(java.lang.String[] strings,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Transforms multiple strings of words into multi-array of addresses.
|
static int[][] |
TextConversion.textsToWordIdentifiersMultiIndex(java.lang.String[] strings,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Transforms multiple strings of words into multi-array of addresses.
|
static int[] |
TextConversion.textToWordIdentifiers(java.lang.String string,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> ignoreWords,
java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Transforms a string of words into array of addresses.
|
static int[] |
TextConversion.textToWordIdentifiers(java.lang.String string,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> ignoreWords,
java.util.Set<java.lang.String> stopWords,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex)
Transforms a string of words into array of addresses.
|
static int[] |
TextConversion.textToWordIdentifiersMultiIndex(java.lang.String string,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> ignoreWords,
java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Transforms a string of words into array of addresses.
|
static int[] |
TextConversion.textToWordIdentifiersMultiIndex(java.lang.String string,
java.lang.String stringSplitRegexp,
java.util.Set<java.lang.String> ignoreWords,
java.util.Set<java.lang.String> stopWords,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> writableWordIndex,
IntStorageIndexed<java.lang.String>[] readonlyWordIndexes)
Transforms a string of words into array of addresses.
|
static int[] |
TextConversion.wordsToIdentifiers(java.lang.String[] words,
java.util.Set<java.lang.String> ignoreWords,
java.util.Set<java.lang.String> stopWords,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
boolean normalize)
Transforms a list of words into array of addresses.
|
static int[] |
TextConversion.wordsToIdentifiers(java.lang.String[] words,
java.util.Set<java.lang.String> ignoreWords,
WordExpander expander,
Stemmer stemmer,
IntStorageIndexed<java.lang.String> wordIndex,
boolean normalize)
Transforms a list of words into array of addresses.
|
static int |
TextConversion.wordsToIdentifiersRead(java.util.Collection<java.lang.String> words,
IntStorageIndexed<java.lang.String> wordIndex,
int[] identifiers,
int index)
Transforms a list of words into array of addresses by reading the given word index.
|
static int |
TextConversion.wordsToIdentifiersStore(java.util.Collection<java.lang.String> words,
IntStorageIndexed<java.lang.String> wordIndex,
int[] identifiers,
int index)
Transforms a list of words into array of addresses by storing the words
into the given word index and retrieving the generated identifiers.
|