Research
Ongoing projects
Visual Intelligence and Machine Perception group
Since early 2018, I am the principal investigator of the Visual Intelligence and Machine Perception (VIMP) group at UniPD. We conduct research in computer vision, applied machine (deep) learning, natural language processing and multimedia. We aim at developing artificially intelligent systems to help computers perform visual perception and recognition tasks. You can know more about our ongoing research projects, publications and activities by visiting our research group web page. |
Completed projects (but not forgotten)
EAGLE – Exploiting semAntic and social knowledGe for visuaL rEcognition
The Marie Curie IOF grant EAGLE started in October 2014. The aim of this three-years project is to effectively exploit semantic and social knowledge for visual recognition. Marie Curie individual fellowships are highly prestigious and competitive and are meant to support the best, most promising European researchers. Project page.
Advisors: A. Del Bimbo (U Florence), Li Fei-Fei (Stanford) |
Tag refinement and localization in web videos
We present a novel data-driven approach for automatic video annotation that effectively increases the number of tags originally provided by users, and localizes them temporally, associating tags to shots (…).
Collaborators: M. Bertini, G. Serra, A. Del Bimbo |
Data-driven methods for social media analysis and annotation
We are working at a framework for evaluating nearest-neighbor methods for tag refinement and automatic image annotation. We performed extensive and rigorous evaluation using standard large-scale datasets to show that the performance of these methods is comparable to that of more complex and computationally intensive approaches (…).
Collaborators: T. Uricchio, M. Bertini, A. Del Bimbo |
A SIFT-based method for copy-move forgery detection
Understanding if a digital image is authentic or not, is a key purpose of image forensics. We proposed an effective approach, based on SIFT features, for copy-move detection and localization (…).
Collaborators: I. Amerini, G. Serra, R. Caldelli, L. Del Tongo, A. Del Bimbo |
A Context-Dependent Kernel for Logo Recognition
We contributed through this work to the design of a variational framework able to recognize multiple instances of logos in image archives. Logos are seen as constellations of local features and matched by minimizing an energy function which captures feature co-occurrence/geometry (…).
Collaborators: H. Sahbi (Telecom ParisTech), G. Serra, A. Del Bimbo |
Human action categorization in unconstrained videos
We introduced a novel local descriptor based on both image gradient and optic flow to respectively model the appearance and motion of human actions. We used also radius-based clustering with soft assignment in order to create an effective codebook model (…).
Collaborators: L. Seidenari, G. Serra, M. Bertini, A. Del Bimbo |
Automatic trademark detection and recognition in sports videos
We developed a semi-automatic system for detection and recognition of pre-defined brands and trademarks in broadcast TV. The number of appearances of a logo, its position, size and duration is recorded to derive indexes and statistics that can be used for marketing analysis (…).
Collaborators: M. Bertini, A. Del Bimbo, A. D. Bagdanov, A. Jain |