Orchestrator
This module contains functionality related to the orchestrator module for extraction.orchestrators.basic.
BasicDatasourceOrchestrator
Bases: BaseDatasourceOrchestrator
Orchestrator for multi-datasource content processing.
Source code in src/extraction/orchestrators/basic/orchestrator.py
full_refresh_sync()
async
Extract and process content from all datasources.
Processes each configured datasource to extract documents and clean content.
Source code in src/extraction/orchestrators/basic/orchestrator.py
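The behaviour described for full_refresh_sync() can be pictured with a minimal sketch. It is not the code in orchestrator.py: the names DatasourceManager, InMemoryManager, SketchOrchestrator and run_demo are hypothetical, and the return type (a flat list of cleaned strings) is an assumption, since this page does not state what the method returns.

```python
import asyncio
from dataclasses import dataclass, field
from typing import List, Protocol


class DatasourceManager(Protocol):
    """Hypothetical interface; the real manager API is not shown on this page."""

    async def full_refresh_sync(self) -> List[str]: ...


@dataclass
class InMemoryManager:
    """Toy manager that 'extracts' preloaded raw documents and cleans them."""

    raw_documents: List[str] = field(default_factory=list)

    async def full_refresh_sync(self) -> List[str]:
        # Stand-in for extraction plus cleaning: trim whitespace, drop empties.
        await asyncio.sleep(0)
        return [doc.strip() for doc in self.raw_documents if doc.strip()]


class SketchOrchestrator:
    """Assumed shape of an orchestrator over several datasource managers."""

    def __init__(self, managers: List[DatasourceManager]) -> None:
        self.managers = managers

    async def full_refresh_sync(self) -> List[str]:
        # Process each configured datasource in turn and collect its output.
        documents: List[str] = []
        for manager in self.managers:
            documents.extend(await manager.full_refresh_sync())
        return documents


def run_demo() -> List[str]:
    orchestrator = SketchOrchestrator(
        [
            InMemoryManager(["  first page ", ""]),
            InMemoryManager(["second page\n"]),
        ]
    )
    # The method is a coroutine, so it needs an event loop to run.
    return asyncio.run(orchestrator.full_refresh_sync())
```

Because the method is async, callers either await it from another coroutine or drive it with asyncio.run(), as run_demo() does. The sketch visits datasources sequentially; whether the real orchestrator does so is not stated here.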
incremental_sync()
async
Not implemented yet.
Source code in src/extraction/orchestrators/basic/orchestrator.py
BasicDatasourceOrchestratorFactory
Bases: Factory
Factory for creating BasicDatasourceOrchestrator instances.
Creates orchestrator instances configured with appropriate datasource managers based on the provided extraction configuration.
Source code in src/extraction/orchestrators/basic/orchestrator.py
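As an illustration of the factory described above, the following sketch builds an orchestrator whose managers are chosen from an extraction configuration. Everything in it is an assumption made for the example: ExtractionConfig, DatasourceConfig, StubManager, MANAGER_REGISTRY, the datasource names "wiki" and "tickets", and the create() classmethod are hypothetical and may not match the real Factory base class or configuration schema.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class DatasourceConfig:
    name: str  # hypothetical field identifying the datasource type


@dataclass
class ExtractionConfig:
    datasources: List[DatasourceConfig]  # hypothetical configuration shape


@dataclass
class StubManager:
    datasource_name: str


@dataclass
class SketchOrchestrator:
    managers: List[StubManager]


# Hypothetical registry mapping a datasource name to a manager builder.
MANAGER_REGISTRY: Dict[str, Callable[[DatasourceConfig], StubManager]] = {
    "wiki": lambda cfg: StubManager(cfg.name),
    "tickets": lambda cfg: StubManager(cfg.name),
}


class SketchOrchestratorFactory:
    @classmethod
    def create(cls, configuration: ExtractionConfig) -> SketchOrchestrator:
        managers: List[StubManager] = []
        for datasource in configuration.datasources:
            builder = MANAGER_REGISTRY.get(datasource.name)
            if builder is None:
                # Fail early on configuration that names an unknown datasource.
                raise ValueError(f"unknown datasource: {datasource.name}")
            managers.append(builder(datasource))
        return SketchOrchestrator(managers)
```

Keeping the name-to-builder mapping in one registry means a new datasource can be supported by adding an entry, without touching the orchestrator itself.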