By learning cycles I mean the number of repetitions you performed when training a given gesture. AFAIK the trained samples are overwritten when you retrain a gesture, but if you want to, you should be able to concatenate samples from multiple training sets by using the import/export functionality.
For 2-3 gestures the recognizer should be sufficient, so I guess your gestures look too similar. This could also be due to the $P logic (it actually just sums up the minimal distances from every sample point to the closest point in the reference point cloud). E.g. if you have two circles, one starting at the top going to the left and another one starting at the bottom going to the left, they could be hard to differentiate.
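To illustrate the circle example, here is a rough Python sketch of the distance as I described it above (sum of nearest-point distances). It is a simplification and not the recognizer's actual code; as far as I know the real $P matching is somewhat more involved. The names `circle` and `cloud_distance` are made up for this example.

```python
import math

def circle(start_angle, n=32, clockwise=False):
    """Return n points on the unit circle, drawn from start_angle in the given direction."""
    step = (-2 * math.pi / n) if clockwise else (2 * math.pi / n)
    return [(math.cos(start_angle + i * step), math.sin(start_angle + i * step))
            for i in range(n)]

def cloud_distance(sample, reference):
    """Simplified point-cloud distance: for every sample point, add the
    distance to its closest reference point. Point order is ignored."""
    return sum(min(math.dist(p, q) for q in reference) for p in sample)

# Two circles that differ only in start point and drawing direction.
top = circle(math.pi / 2)                      # starts at the top
bottom = circle(-math.pi / 2, clockwise=True)  # starts at the bottom, other direction

# A clearly different gesture for comparison: a horizontal line.
line = [(-1 + 2 * i / 31, 0.0) for i in range(32)]

d_circles = cloud_distance(top, bottom)
d_line = cloud_distance(top, line)
print(d_circles, d_line)
```

Because the point order is ignored, both circles end up as practically the same point cloud and their distance is close to zero, while the line is clearly further away. So gestures that only differ in start point or direction are a bad fit for this kind of recognizer.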
It could also be helpful to play around with some of the options (e.g. the threshold).