Paper of the day – A Survey on Data-driven Performance Tuning for Big Data Analytics Platforms

#paperoftheday

Title: A Survey on Data-driven Performance Tuning for Big Data Analytics Platforms

Venue: Big Data Research, vol. 25 (2021)

Authors: Rogério Luís de C.Costa, José Moreira, Paulo Pintor, Veronica dos Santos, Sérgio Lifschitz

Abstract: Many research works deal with big data platforms looking forward to data science and analytics. These are complex and usually distributed environments, composed of several systems and tools. As expected, there is a need for a closer look at performance issues.

In this work, we review performance tuning strategies in the big data environment. We focus on data-driven tuning techniques, discussing the use of database inspired approaches. Concerning big data and NoSQL stores, performance tuning issues are quite different from the so-called conventional systems. Many existing solutions are mostly ad-hoc activities that do not fit for multiple situations. But there are some categories of data-driven solutions that can be taken as guidelines and incorporated into general-purpose auto-tuning modules for big data systems.

We examine typical performance tuning actions, discussing available solutions to support some of the tuning process’s primary activities. We also discuss recent implementations of data-driven performance tuning solutions for big data platforms. We propose an initial classification based on the domain state-of-the-art and present selected tuning actions for large-scale data processing systems. Finally, we organized existing works towards self-tuning big data systems based on this classification and presented general and system-specific tuning recommendations. We found that most of the literature pieces evaluate the use of tuning actions at the physical design perspective, and there is a lack of self-tuning machine-learning-based solutions for big data systems.

More in: https://doi.org/10.1016/j.bdr.2021.100206

Paper of the day – Unsupervised Method for Video Action Segmentation Through Spatio-Temporal and Positional-Encoded Embeddings

 

#paperoftheday

Title: Unsupervised Method for Video Action Segmentation Through Spatio-Temporal and Positional-Encoded Embedding

Venue: ACM Multimedia Systems Conference (2022)

Authors: Guilherme de A. P. Marques, Antonio José G. Busson, Álan Lívio V. Guedes, Julio Cesar Duarte, Sérgio Colcher

Abstract: Action segmentation consists of temporally segmenting a video and labeling each segmented interval with a specific action label. In this work, we propose a novel action segmentation method that requires no prior video analysis and no annotated data. Our method involves extracting spatio-temporal features from videos using a pre-trained deep network. Data is then transformed using a positional encoder, and finally a clustering algorithm is applied, where each produced cluster presumably corresponds to a different single and distinguishable action. In experiments, we show that our method produces competitive results on the Breakfast and Inria Instructional Videos dataset benchmarks.

More in: https://doi.org/10.1145/3524273.3528187