Copyright disclaimer: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.

Journal papers

  • Support Vector Motion Clustering
    I.A. Lawal, F. Poiesi, D. Anguita, A. Cavallaro
    IEEE Trans. on Circuits and Systems for Video Technology, vol. 27, no. 11, pp. 2395-2408, Nov 2017
  • Tracking multiple high-density homogeneous targets
    F. Poiesi, A. Cavallaro
    IEEE Trans. on Circuits and Systems for Video Technology, vol. 25, no. 4, pp. 623-637, Apr 2015
  • Predicting and recognizing human interactions in public spaces
    F. Poiesi, A. Cavallaro
    Journal of Real-Time Image Processing, Springer, vol. 10, no. 4, pp. 785-803, Dec 2015
  • Measures of effective video tracking
    T. Nawaz, F. Poiesi, A. Cavallaro
    IEEE Trans. on Image Processing, vol. 23, no. 1, pp. 376-388, Jan 2014
  • Multi-target tracking on confidence maps: an application to people tracking
    F. Poiesi, R. Mazzon, A. Cavallaro
    Computer Vision and Image Understanding, Elsevier, vol. 117, no. 10, pp. 1257-1272, Oct 2013

Book chapters and Magazines

  • Point Clouds from Smartphones - Cheap and Rapid 3D Modelling
    E. Nocerino, F. Poiesi, F. Remondino, L. Van Gool
    GIM International, Mar 2018
  • Towards cognitive and perceptive video systems
    T. Akgun, C. Attwood, A. Cavallaro, C. Fabre, F. Poiesi, P. Szczuko
    Human Behaviour Understanding in Networked Sensing, Springer, Dec 2014
  • Multi-target tracking in video
    F. Poiesi, A. Cavallaro
    Academic Press Library in Signal Processing: Volume 4, (Ed. S. Theodoridis), Elsevier, Sep 2013

Conference papers

  • Seamless bare-hand interaction in Mixed Reality
    C. Battisti, S. Messelodi, F. Poiesi
    IEEE Proc. of Int'l Symposium on Mixed and Augmented Reality (ISMAR), Munich, DE, Oct 2018 (Poster)
  • A distributed vision-based consensus model for aerial-robotic teams
    F. Poiesi, A. Cavallaro
    IEEE Proc. of Intelligent Robots and Systems (IROS), Madrid, ES, Oct 2018 (Oral)
  • Distributed data exchange with Leap Motion
    M. Pani, F. Poiesi
    Int. Conference on Augmented Reality, Virtual Reality and Computer Graphics, Lecce, IT, Jun 2018 (Oral)
  • 3Dnow: image-based 3D reconstruction and modeling via web
    Y. Tefera, F. Poiesi, D. Morabito, F. Remondino, E. Nocerino, P. Chippendale
    Towards photogrammetry 2020, Trento, IT, Jun 2018 (Oral)
  • Cloud-based collaborative 3D reconstruction using smartphones
    F. Poiesi, A. Locher, P. Chippendale, E. Nocerino, F. Remondino, L. Van Gool
    European Conference on Visual Media Production (CVMP), London, UK, Dec 2017 (Oral)
  • Towards gesture-based multi-user interactions in collaborative virtual environments
    N. Pretto, F. Poiesi
    Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W8, 203-208, 2017 (Oral)
  • 3D Reconstruction with a collaborative approach based on smartphones and a cloud-based server
    E. Nocerino, F. Poiesi, A. Locher, Y.T. Tefera, F. Remondino, P. Chippendale, L. Van Gool
    Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W8, 187-194, 2017 (Oral)
  • A Smartphone-based pipeline for the creative industry - The REPLICATE project
    E. Nocerino, F. Lago, D. Morabito, F. Remondino, L. Porzi, F. Poiesi, S. Rota Bulò, P. Chippendale, A. Locher, M. Havlena, L. Van Gool, M. Eder, A. Fotschl, A. Hilsmann, L. Kausch, P. Eisert
    Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLII-2/W3, 535-541, 2017 (Oral)
  • Online multi-target tracking with strong and weak detections
    R. Sanchez Matilla, F. Poiesi, A. Cavallaro
    European Conference on Computer Vision (ECCV): Benchmarking Multi-target Tracking: MOTChallenge 2016, Amsterdam, NL, Oct 2016 (Oral)
  • Detection of fast incoming objects with a moving camera
    F. Poiesi, A. Cavallaro
    British Machine Vision Conference (BMVC), York, UK, Sep 2016 (Oral)
  • Distributed vision-based flying cameras to film a moving target
    F. Poiesi, A. Cavallaro
    IEEE Proc. of Intelligent Robots and Systems (IROS), Hamburg, DE, Sep 2015 (Oral)
  • Self-positioning of a team of flying smart cameras
    F. Poiesi, A. Cavallaro
    IEEE Proc. of Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), Singapore, Apr 2015 (Oral)
  • MTTV: an interactive trajectory visualization and analysis tool
    F. Poiesi, A. Cavallaro
    Proc. of Information Visualization Theory and Applications (IVAPP), Berlin, DE, Mar 2015 (Poster)
  • Assessing tracking assessment measures
    T. Nawaz, F. Poiesi, A. Cavallaro
    IEEE Proc. of Image Processing (ICIP), Paris, FR, Oct 2014 (Poster)
  • Detection and tracking of groups in crowd
    R. Mazzon, F. Poiesi, A. Cavallaro
    IEEE Proc. of Advanced Video and Signal-Based Surveillance (AVSS), Krakow, PL, Aug 2013 (Poster)
  • Detector-less ball localization using context and motion flow analysis
    F. Poiesi, F. Daniyal, A. Cavallaro
    IEEE Proc. of Image Processing (ICIP), Hong Kong, CN, Sep 2010 (Poster)


  • 3D reconstruction in your pocket
    F. Poiesi, P. Chippendale
    Mathematics for Computer Vision, Trento, IT, Feb 2018
  • REPLICATE project
    M. Galanti, A. Ragazzon, F. Poiesi, P. Chippendale
    EuroVR, Laval, FR, Dec 2017
  • 360-video authoring pipeline using user-generated 3D models
    F. Poiesi, P. Chippendale, J. Ceballos, R. Harris, E. Nocerino, F. Remondino
    European Conference on Visual Media Production (CVMP), London, UK, Dec 2017
  • REPLICATE project
    A. Foetschl, F. Poiesi, P. Chippendale
    AWE Europe, Munich, DE, Oct 2017
  • Replicare in 3D
    F. Poiesi, E. Nocerino
    European Researchers' Night, Trento, IT, Sep 2017
    video (local news - RTTR)


  • Multi-target tracking and performance evaluation on videos
    F. Poiesi
    PhD Thesis, Queen Mary University of London, United Kingdom, Dec 2013
    Advisor: Prof. Andrea Cavallaro. Examiners: Dr. Krystian Mikolajczyk (University of Surrey, UK), Dr. Lewis Griffin (University College London, UK)
    Abstract - Multi-target tracking is the process that allows the extraction of object motion patterns of interest from a scene. Motion patterns are often described through metadata representing object locations and shape information. In the first part of this thesis we discuss the state-of-the-art methods aimed at accomplishing this task on monocular views and also analyse the methods for evaluating their performance. The second part of the thesis describes our research contribution to these topics.
    We begin by presenting a method for multi-target tracking based on track-before-detect (MT-TBD) formulated as a particle filter. The novelty involves the inclusion of the target identity (ID) into the particle state, which enables the algorithm to deal with an unknown and unlimited number of targets. We propose a probabilistic model of particle birth and death based on Markov Random Fields. This model allows us to overcome the problem of the mixing of IDs of close targets.
    We then propose three evaluation measures that take into account target-size variations, combine accuracy and cardinality errors, quantify long-term tracking accuracy at different accuracy levels, and evaluate ID changes relative to the duration of the track in which they occur. This set of measures does not require pre-setting of parameters and allows one to holistically evaluate tracking performance in an application-independent manner.
    Lastly, we present a framework for multi-target localisation applied on scenes with a high density of compact objects. Candidate target locations are initially generated by extracting object features from intensity maps using an iterative method based on a gradient-climbing technique and an isocontour slicing approach. A graph-based data association method for multi-target tracking is then applied to link valid candidate target locations over time and to discard those which are spurious. This method can deal with point targets having indistinguishable appearance and unpredictable motion.
    MT-TBD is evaluated and compared with state-of-the-art methods on real-world surveillance datasets (static and moving cameras) by using the proposed evaluation measures. In the case of online applications, the inclusion of the ID in the particle state is effective, but it does not allow the proposed tracker to outperform offline trackers. The proposed measures are compared with existing measures for multi-target tracking, and it is shown that the proposed ones maintain a comparatively reliable evaluation of the performance without prior knowledge about the application. The tracking of point targets in high-density scenes is evaluated on datasets containing insects and compared with MT-TBD and alternative multi-target trackers. The proposed solutions achieved the best results, especially in terms of ID maintenance on the targets.
  • Motion-based ball localisation through motion flow analysis
    F. Poiesi
    MSc Thesis, Università degli Studi di Brescia, Italy, Mar 2010
    Advisor: Prof. Riccardo Leonardi. Co-advisor: Prof. Andrea Cavallaro
    Abstract - We present a technique for estimating the location of the ball during a basketball game without using a detector based on appearance features. State-of-the-art methods for locating the ball generally estimate its position using spatial features such as color, shape and size. Moreover, several approaches perform additional temporal smoothing to filter out incorrect estimates. These methods depend on the initial detection phase, which relies on extracting visual features that are not reliable because the ball is frequently occluded and similar to the background. Unlike existing approaches, instead of using visual features associated with the ball, we estimate the ball candidates based on the location of the players and their motion during attack actions. Hence, we propose an approach for ball localization that uses contextual information, i.e. players' behavior, to estimate the approximate location of the ball. In this way, the technique overcomes the challenges due to frequent occlusions of the ball and its similarity in appearance to the background. Based on this assumption, we use the expected dynamics of the game and motion flow to estimate the regions where the players converge and the most probable region for the ball location. The most probable candidates for the ball location are extracted for each frame, and their temporal consistency is then validated using a Kalman filter. Finally, we test the proposed approach on a real basketball scenario, where the ball is most of the time either partially or completely occluded. Experimental results show that the location of the ball can be estimated with an average accuracy of 82.6%.
  • Development of an application for the visualisation of dynamic video summaries
    F. Poiesi
    BSc Thesis, Università degli Studi di Brescia, Italy, Nov 2007
    Advisor: Dr. Sergio Benini. Co-advisor: Dr. Pierangelo Migliorati
    Abstract - The large amount of multimedia content requires systems capable of automatically managing these data. Therefore, videos need to be summarised so that the desired content can be accessed quickly. There are two methods for automatic video summarisation: static summarisation and dynamic summarisation. The former uses key-frames, the latter uses short video clips (shots) to present the most informative content.
    In this thesis the summarisation is dynamic and assumed to be done upstream. We propose two systems for the generation of the output video. The first method is offline and can produce either high- or low-quality videos. High-quality videos require a full re-encoding of the shots. The second method is online and is performed with VideoLAN via a Java interface.
    Experiments show that the offline process produces a more pleasant summary at the cost of a longer processing time.