-
Finding duplicates among 7000+ videos
Dear forum members,
We have a rather large and daunting potential project. Our client has 7000+ video files on their Brightcove.com account. Client needs us to find duplicate videos so they can be taken down from the account. The client has downloaded all 7000+ videos from their Brightcove account via the API. When searching for dupes, we cannot use file names because each time a video is uploaded to Brightcove, it assigns a 13 digit number to the renditions and this is how a video file is named when you download it. Therefore, right now we have a hard drive with 7000+ MP4s with file names such as 2591587223001.MP4—which is what we have to work with.
We are hoping to find an application or service that can find possible duplicate videos by analyzing the content of each and create some kind of report that we can then use to make it easier to manually identify the *definite* duplicates.
We understand that no tool will be able to find duplicates with 100% accuracy. Nevertheless, such tool will help us narrow it down to a more manageable level.
Hoping to hear feedback from anyone that can point us in the right direction or that may have some experience with such task.
Thank you in advance!
Xavier