|
Oracle Secure Enterprise Search Java API Reference 10g Release 1 (10.1.8) B32260-01 |
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||||
CrawlingThreadService is an interface used by a crawler plugin to perform crawl related tasks. It has execution context specific to the crawling thread that invokes the plugin crawl() method
| Field Summary | |
static int |
DOC_EXCLUDED_BY_MIMETYPEdocument excluded by mimetype |
static int |
DOC_EXCLUDED_BY_SIZEdocument excluced by document size |
static int |
DOC_EXCLUDED_BY_URL_BOUNDARYdocument excluded by url boundary |
static int |
DOC_INCLUDEDdocument should be included |
| Method Summary | |
int |
checkDocumentExcluded(DocumentMetadata meta)check if the document should be crawled or not the check stops if one rule excludes the document and only status code for this rule is returned. |
java.lang.String |
inferMimeType(java.lang.String url)check the mime type based on the URL suffix. |
void |
markStatusNotChanged(DocumentMetadata meta)mark a url entry as not requiring any changes or updates. |
void |
submitForProcessing(DocumentContainer target)submit the document for processing. |
| Field Detail |
public static final int DOC_INCLUDED
public static final int DOC_EXCLUDED_BY_URL_BOUNDARY
public static final int DOC_EXCLUDED_BY_MIMETYPE
public static final int DOC_EXCLUDED_BY_SIZE
| Method Detail |
public void submitForProcessing(DocumentContainer target)
throws ProcessingException
target - the document container containing the content and metadata.ProcessingException
public void markStatusNotChanged(DocumentMetadata meta)
throws ProcessingException
meta - the metadata object corresponding to the url entryProcessingExceptionpublic int checkDocumentExcluded(DocumentMetadata meta)
The internal exclusion chekcing always happens during submiting the documents.
public java.lang.String inferMimeType(java.lang.String url)
url -
|
Oracle Secure Enterprise Search Java API Reference 10g Release 1 (10.1.8) B32260-01 |
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||||