M3S is an end-to-end capability for collecting, translating, searching and organizing content from a range of media types in multiple languages. Sources include content from the World Wide Web, broadcast media, YouTube videos, Twitter and Facebook. M3S integrates and manages the media analysis process from beginning to end — from data collection and processing, to automated triage and retrieval, to machine-assisted translation and support for human translation, to export and dissemination. The system's automatic analysis of content supports effective retrieval and triage for human analysts who must deal with overwhelming volumes of continuously accumulating media.
M3S continuously captures content from user-selected sites, channels and social media users into an archive that can be shared by multiple distributed user groups. The captured media is archived and versioned for later use. Internal links are preserved in harvested Web pages so users can navigate within the archive.
Using state-of-the-art human language technologies, M3S converts speech into text, identifies and extracts text from web pages and social media platforms, and automatically tags named entities (people, places and organizations). The extracted text is then automatically translated into English using machine translation software. English speakers can use the machine translation to get the gist of an article, broadcast or post; linguists and analysts can correct the machine translation and add analytical commentary.
M3S supports more than 40 languages for text-based sources, and 16 languages for speech-based sources.
M3S not only harvests the text of social media postings, but also gathers metadata such as geotags, hashtags, user references, URLs, topic threading information and retweet information. Using this information, the system can accurately reconstruct interactions across networks, setting the stage for social media analytics. Relationships between participants are presented as visualizations of explicit connections (such as "follower-followed") or implicit connections indicated by use of hashtags, retweets or sharing of content.