Embed
This module contains functionality related to the the embed
script.
Embed
This script is used to process datasources documents and embed them into a vector storage. In summary, this script reads, cleans, splits, and embeds datasources documents into a vector storage. To run the script execute the following command from the root directory of the project:
python src/embed.py
main(injector)
Execute embedding workflow with validation.
Parameters: |
|
---|
Note
Exits with code 100 if collection already exists
Source code in src/embed.py
44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 |
|
run_embedding(injector)
async
Process and embed documents from datasources.
Parameters: |
|
---|
Note
Executes extraction, embedding and storage operations Exits with code 0 on success
Source code in src/embed.py
22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 |
|