A Cluster-Based Method for Action Segmentation Using Spatio-Temporal and Positional Encoded Embeddings