This question might be very specific application related but I was blocked and I thought this is a good place to ask. Let's say we have an LSTM in Keras that is sequence to sequence, for example Part of Speech Tagger. The last layer gives me, sequence of labels with the probability of each label. Consider the following predicted output;This question might be very specific applicatio