The project comprises of 6 work packages
WP1: Develop Video Summarization Algorithms (VSA) for amateur social media video to display local event highlights as they occur anywhere in the world
The first brute force strategy to create a GIF summary by down-sampling videos, will look at the different resolutions that could be proposed to the user with the requirement that the information displayed is of good quality for a quick understanding of the content of the input video. Software 1.0 will be used to process a database of videos and create short summaries at different resolutions. The analysis of the low level content (e.g. motion, colour) of the video can be performed with metrics defined in Information theory. We propose to create a smart software prototype for creating summaries that will use these metrics for measuring automatically the content of videos. These metrics will be used to automatically select what are the images in the video that should be retained to be part of the summary, and also this will help in selecting the best spatial resolution automatically. Some artifacts (hand shake, blur, and occlusion) that often occur in amateur videos will be dealt with to improve the quality of the summaries.
WP2: Develop automatic local 3D Scene Rendering Algorithms (SRA) leveraging public geo-located social media photos of a particular location.
Merging several images or videos to create an augmented image can be a step further towards creating a good quality summary. This work package led by TCD will look at merging several images and/or videos recorded at the same place at the same time for creating a 3D rendering of a scene. Using location information embedded with the input images and videos, and potentially using additional 3D content available (e.g. Google Maps in 3D), this work package will look at computing local descriptors in the images, suitable for image stitching and 3D reconstruction, but also for image classification (WP5). We propose to use a modeling based on the Generalised Relaxed Radon Transform (GR2T) to estimate a probability density function of the 3D location and colour. An animated gif will then be created by moving a virtual camera into the scene and a perceptual testing using questionnaires and eye tracking technique will be used to assess the quality of the rendering. The path of the virtual camera will be automatically chosen such that the summary is both informative, and visually pleasing. Metrics from information theory will be used to assess the information content of the summary.
WP3: Trajectory Mining
Mathematical methods and prototypes for mining and predicting local communities workflows through contextualized trajectory pattern mining applied to social network data from Tapastreet. For that, the following sub-tasks are considered: (i) community detection, (ii) places of interest (POI) characterization, and finally (iii) constrained-based mining of contextualized trajectories. The goal is to provide a valuable input for WP5 which entails making recommendations to social media end users w.r.t. their social trajectories and context, the recommendation could be a breaking news events (whose detection is handled in WP4) and is triggered by a fix on the persons location.
WP4: Event Detection
Develop a computing solution for (i) event detection and (ii) sources of trust identification in geolocalized social media data streams. It is a centre piece between WP3 and WP5: Geo-localized breaking events are detected and recommended to a user entering (or predicted to enter in) the corresponding location. For task (i), we design/use data-mining techniques to detect/predict unexpected events/patterns in data streams in presence of concept drift. We propose to model task (ii) as an original problem of temporal dependency discovery between some topics from different social media and bring an algorithmic solution through a graph-mining algorithm.
Identification of a recommendation strategy and the design of a recommender prototype for geo-located social media users in a geo-local context. Thanks to the results of WP3 and WP4, we make use of user trajectories stratified by demography, characterized points of interest, and trusted breaking news. Here we close the loop on WP3&4 who's learnings are applied and built into this recommender system prototype. The strategy will be based on real data supplied by Tapastreet, expertise from Tapastreet's machine learning department and knowledge and expertise from INSA de Lyon.
WP6: Develop strategies and methods for implementation of Video Summarisation and Automated 3D Scene Rendering into Tapastreet’s social media search engine platform
Research effort in WP6 led by Tapastreet and will be looking at what strategy (e.g. cloud computing, parallel processing, GPU processing, etc.) can be used for a fast reliable implementation of WP1 & WP2 into the Tapastreet platform. Video summarization will need to be processed rapidly, and therefore some solutions may be better adapted than others. A hierarchical approach can be considered where about simple summaries are proposed and then replaced when the smarter ones becomes available. Beside the strategy for processing the information in a timely fashion, this workpackage will look at storage of the summaries, and easy access via mobile platform.