Comment by Sterling Blue on "Building a Data Ingestion Solution for Amazon Bedrock Knowledge Bases"

Thanks for your message. You are right that off-hour ingestion is more of a blanket statement, as it really depends on the usage pattern, the type of data, and the complexity of the ingestion pipeline. It's more of an art than a rule, and I've not seen an AWS reference that provides substantial guidance yet.

So far I've worked on a few RAG systems (not exclusively on AWS) with relatively static data that can afford scheduled ingestion. But I've not had the opportunity to work on one with real-time ingestion requirements just yet.

I know of the Bedrock KB streaming data ingestion feature that was announced at re:Invent 2024, but I have not had a chance to experiment with it. It would be interesting to see how this changes things.
