Oracle Secure Enterprise Search Java API Reference 10g Release 1 (10.1.8) B32260-01 |
CrawlingThreadService is an interface that a crawler plugin uses to perform crawl-related tasks. Its execution context is specific to the crawling thread that invokes the plugin's crawl() method.
Field Summary
static int | DOC_EXCLUDED_BY_MIMETYPE | Document excluded by MIME type.
static int | DOC_EXCLUDED_BY_SIZE | Document excluded by document size.
static int | DOC_EXCLUDED_BY_URL_BOUNDARY | Document excluded by URL boundary.
static int | DOC_INCLUDED | Document should be included.
Method Summary
int | checkDocumentExcluded(DocumentMetadata meta) | Checks whether the document should be crawled. The check stops at the first rule that excludes the document, and only the status code for that rule is returned.
java.lang.String | inferMimeType(java.lang.String url) | Infers the MIME type from the URL suffix.
void | markStatusNotChanged(DocumentMetadata meta) | Marks a URL entry as not requiring any changes or updates.
void | submitForProcessing(DocumentContainer target) | Submits the document for processing.
Field Detail
public static final int DOC_INCLUDED
public static final int DOC_EXCLUDED_BY_URL_BOUNDARY
public static final int DOC_EXCLUDED_BY_MIMETYPE
public static final int DOC_EXCLUDED_BY_SIZE
Method Detail
public void submitForProcessing(DocumentContainer target) throws ProcessingException
Parameters:
target - the document container holding the content and metadata
Throws:
ProcessingException
public void markStatusNotChanged(DocumentMetadata meta) throws ProcessingException
Parameters:
meta - the metadata object corresponding to the URL entry
Throws:
ProcessingException
public int checkDocumentExcluded(DocumentMetadata meta)
Internal exclusion checking always happens when documents are submitted.
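Because exclusion checking also runs at submission time, a plugin would presumably call checkDocumentExcluded beforehand only to avoid fetching content that is going to be rejected anyway. The following is a minimal sketch of that pattern, not the product's implementation. Meta and Service are hypothetical stand-ins for DocumentMetadata and CrawlingThreadService, and the numeric constant values are placeholders, since this reference does not list the real ones.

```java
import java.util.ArrayList;
import java.util.List;

final class ExclusionCheckSketch {

    /** Hypothetical stand-in for DocumentMetadata; only the URL is modeled. */
    interface Meta {
        String url();
    }

    /**
     * Hypothetical stand-in for CrawlingThreadService. The numeric values are
     * placeholders chosen for this sketch, not the real constant values.
     */
    interface Service {
        int DOC_INCLUDED = 0;
        int DOC_EXCLUDED_BY_URL_BOUNDARY = 1;
        int DOC_EXCLUDED_BY_MIMETYPE = 2;
        int DOC_EXCLUDED_BY_SIZE = 3;

        int checkDocumentExcluded(Meta meta);
    }

    /** Maps a status code to a short reason suitable for a log line. */
    static String describe(int status) {
        switch (status) {
            case Service.DOC_INCLUDED:
                return "included";
            case Service.DOC_EXCLUDED_BY_URL_BOUNDARY:
                return "excluded by URL boundary";
            case Service.DOC_EXCLUDED_BY_MIMETYPE:
                return "excluded by MIME type";
            case Service.DOC_EXCLUDED_BY_SIZE:
                return "excluded by size";
            default:
                return "unknown status " + status;
        }
    }

    /** Returns the URLs that pass the pre-check, in their original order. */
    static List<String> selectForFetch(Service service, List<Meta> candidates) {
        List<String> accepted = new ArrayList<>();
        for (Meta meta : candidates) {
            int status = service.checkDocumentExcluded(meta);
            if (status == Service.DOC_INCLUDED) {
                accepted.add(meta.url());
            }
        }
        return accepted;
    }

    public static void main(String[] args) {
        // Fake service for the demo: rejects .exe URLs, accepts everything else.
        Service fake = meta -> meta.url().endsWith(".exe")
                ? Service.DOC_EXCLUDED_BY_MIMETYPE
                : Service.DOC_INCLUDED;
        List<Meta> candidates = new ArrayList<>();
        candidates.add(() -> "http://example.com/a.html");
        candidates.add(() -> "http://example.com/setup.exe");
        for (Meta meta : candidates) {
            System.out.println(meta.url() + ": " + describe(fake.checkDocumentExcluded(meta)));
        }
    }
}
```

The sketch compares against the named constants rather than literal integers, so it does not depend on the actual values of the status codes.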
public java.lang.String inferMimeType(java.lang.String url)
Parameters:
url - the URL whose suffix is examined
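To illustrate what suffix-based MIME type inference can look like, here is a small sketch under stated assumptions. It is not the logic behind inferMimeType: the class name MimeSuffixSketch, the method guessFromSuffix, the suffix table, and the choice to return null for an unknown suffix are all hypothetical.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

final class MimeSuffixSketch {

    // Illustrative table only; the mapping used by the real service is not documented here.
    private static final Map<String, String> SUFFIX_TO_MIME = new HashMap<>();
    static {
        SUFFIX_TO_MIME.put("html", "text/html");
        SUFFIX_TO_MIME.put("htm", "text/html");
        SUFFIX_TO_MIME.put("txt", "text/plain");
        SUFFIX_TO_MIME.put("pdf", "application/pdf");
    }

    /**
     * Guesses a MIME type from the file suffix of the URL path.
     * Returns null when there is no suffix or the suffix is not in the table
     * (an assumption of this sketch).
     */
    static String guessFromSuffix(String url) {
        if (url == null) {
            return null;
        }
        // Drop the query string and fragment so they cannot hide the suffix.
        String path = url;
        int cut = path.indexOf('?');
        if (cut >= 0) {
            path = path.substring(0, cut);
        }
        cut = path.indexOf('#');
        if (cut >= 0) {
            path = path.substring(0, cut);
        }
        int slash = path.lastIndexOf('/');
        int dot = path.lastIndexOf('.');
        // The dot must belong to the last path segment and be followed by text.
        if (dot <= slash || dot == path.length() - 1) {
            return null;
        }
        String suffix = path.substring(dot + 1).toLowerCase(Locale.ROOT);
        return SUFFIX_TO_MIME.get(suffix);
    }

    public static void main(String[] args) {
        System.out.println(guessFromSuffix("http://example.com/docs/report.PDF?x=1"));
        System.out.println(guessFromSuffix("http://example.com/"));
    }
}
```

The suffix is lowercased before lookup so that report.PDF and report.pdf resolve to the same entry, and dots in the host name are ignored because only the last path segment is inspected.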