The dataset serves for checking the ability in detecting social interactions, i.e., people that talk with themselves.
The dataset represents a coffee-break scenario of a social event that lasted 4 days, captured by two cameras. The dataset is part of a social signaling project whose aim is to monitor how social relations evolve over time. Nowadays, only 2 sequences of a single day of a single camera have been annotated (but novel sequences are going to appear, keep in touch!). A psychologist annotated the videos indicating the groups present in the scenes, for a total of 45 frames for Seq1 and 75 frames for Seq2. The annotations have been done by analyzing each frame and a set of questionnaires that the subjects filled in. The dataset is still challenging from the tracking and head pose estimation point of view, due to multiple occlusions. Results of this dataset have been published in the referenced paper.