Hello @Jeff Copeland and welcome back to Microsoft Q&A. I'm sorry you had a negative experience.
The task described will be very difficult to implement in Azure Data Factory. There is no feature that fits all the requirements.
The difficulty is in part due to compatibility of activities with cosmos.
Tools for checking existance:
- validation activity (only available for structured stream)
- get metadata activity (not available for cosmos)
Tools for comparing content:
- get metadata activity (only available for structured stream)
- lookup activity (available for SQL API and for structured stream)
While the lookup activity is available for cosmos, the lookup activity is limited in how much it returns. It stops at 4 MB or 5000 rows. I do not know the size of your documents, so I cannot say whether this will work for you.
Given all these limitations, I need to get a little creative in a work-around.
One idea is to copy the two documents to blob, and compare the MD5. This is not an efficient approach.
Another idea is to craft some custom code for the job and run it as custom activity. With custom code you can run without ADF if you want.
My colleague suggested using Cosmos change feed. I need to research this a little more and discuss with you whether the change feed can be used as part of the solution.