Loading
Fix: Use singleton for embedding and summary operations
Changed generate_embeddings() and query_embeddings() in embeddings.py to use AiServiceContainer singleton instead of creating new EmbeddingsManager instances. This ensures the embedding model is loaded once and reused across all documents. Changed summarize_document() in summarize.py to use AiServiceContainer for storage instead of creating new AiStorage instance when storage is None. These changes eliminate the 'Loading weights' progress bar that was appearing for every document processed. The model is now loaded once at startup. Also updated docstrings to mark 'storage' parameter as deprecated since singleton is now used.