Phase: calibration Camera idle
Recognized speech
Voice idle