Timer Module for a Voice Assistant

Your task is to build the module that allows a voice assistant to maintain timers. The timers should have a name, and it should be possible for a user to start as many timers as they want. This means, a user should be able to say:

"Hey Assistant, please start a 10-minute spaghetti timer."

During this timer is active, the user can start another timer:

"Hey Assistant, start a 3-minute green tea timer."

These timers are independent of each other. Once they expire, the assistant says:

"Your green tea timer is ready!"

And then later:

"Your spaghetti timer is ready!"

The part of the system you are concerned with looks as follows:

The voice command detection is responsible for listening for the activation command ("Hey Assistant"), and the subsequent interpretation of the command. It uses some support from the data center, which we are not concerned with here.
The text-to-speech engine runs locally. It receives strings that are then transformed into speech.

You should design the timer manager in the middle. It receives commands from the voice command detection, should maintain the control over the necessary timers, and produce commands for the text-to-speech engine.

From the voice command detection, you may receive the following commands:

Starting a new timer: The command is called timer_start and includes the name of the timer (string) and the duration already converted into seconds (as int).
Stopping a timer: The command is called timer_stop and includes the name of the timer (string) that should be stopped.
Status: The command is called timer_status. The user can either ask for a specific timer, then the name is set. If the name is not set, the device should present a list of all currently active timers.

The text-to-speech just accepts a command that contains a sentence in English. For the timers, you will probably produce the following texts:

"Your spaghetti timer is ready!"
"You have 30 seconds left on the spaghetti timer."
"You have 2 active timers, spaghetti and green tea."

Component Template

You can use our code template for components that contains a lot of the boilerplate code that is useful for any component that connects STMPY state machines with MQTT:

TimerManager.py

Template for a component with an MQTT client and STMPY state machines.

Logging

Logging is important to figure out what the system is doing, especially when something is going wrong. When configuring logging, we can select a log level, which determines which messages are printed. There are the following log levels:

DEBUG: Most fine-grained logging, printing everything
INFO: Only the most important informational log items
WARN: Show only warnings and errors.
ERROR: Show only error messages.

Different parts of our component have different log levels, the code below registers the log level for the main component (with name __name__):

debug_level = logging.DEBUG
logger = logging.getLogger(__name__)
logger.setLevel(debug_level)
ch = logging.StreamHandler()
ch.setLevel(debug_level)
formatter = logging.Formatter('%(asctime)s - %(name)-12s - %(levelname)-8s - %(message)s')
ch.setFormatter(formatter)
logger.addHandler(ch)

debug_level = logging.DEBUG
logger = logging.getLogger(__name__)
logger.setLevel(debug_level)
ch = logging.StreamHandler()
ch.setLevel(debug_level)
formatter = logging.Formatter('%(asctime)s - %(name)-12s - %(levelname)-8s - %(message)s')
ch.setFormatter(formatter)
logger.addHandler(ch)

To change the log level for STMPY state machine:

...
logger = logging.getLogger('stmpy')
...

...
logger = logging.getLogger('stmpy')
...

Communication: MQTT Client and JSON

The component has its own MQTT client that gets configured and started with the main component.

JSON

JSON is a way to serialize data into strings. Have a look at the notebook below:

Notebook about JSON

For the timer system, you can use the following JSON encodings:

{"command": "new_timer", "name": "spaghetti", "duration":50}

{"command": "new_timer", "name": "spaghetti", "duration":50}

{"command": "status_all_timers"}

{"command": "status_all_timers"}

{"command": "status_single_timer", "name": "spaghetti"}

{"command": "status_single_timer", "name": "spaghetti"}

Incoming Messages

Messages arrive in the components function on_message(). In this message, we need to determine what the incoming message is, get any data it may have as payload, and then react on it.

If the payload is given as JSON-encoded string, we can get back a dictionary with the following lines:

try:
    payload = json.loads(msg.payload.decode("utf-8"))
except Exception as err:
    self._logger.error('Message sent to topic {} had no valid JSON. Message ignored. {}'.format(msg.topic, err))
    return
command = payload.get('command')

try:
    payload = json.loads(msg.payload.decode("utf-8"))
except Exception as err:
    self._logger.error('Message sent to topic {} had no valid JSON. Message ignored. {}'.format(msg.topic, err))
    return
command = payload.get('command')

Since the message can be formatted wrong, we should wrap this in an exception handler. Based on the content or type of the message, we may for instance create a state machine session (see below), or send a message into a machine.

self.stm_driver.send('report', timer_name)

self.stm_driver.send('report', timer_name)

Publishing

Publishing works via the MQTT client, for example in the effect of a transition of the state machine. This works exactly like we used MQTT before, we get access to the client via the component's variable.

self.component.mqtt_client.publish('topic.../', message)

self.component.mqtt_client.publish('topic.../', message)

Sessions: Several Instances of State Machines

The behavior of a single timer is quite simple, and we can draw the following state machine that takes care of the necessary operations:

In our example it is important that the assistant can keep track of several timers at the same time. We want to do this by creating a single state machine instance for each timer. Each state machine can then take care of a single timer. These state machine instances are also called sessions. This may seem like an overhead for this simple scenario, since the timer is a relatively easy behavior, but this is a clean solution that also works well for much more complicated interactions.

In our component, we can create and start a new session with the following code:

timer_name = ...
duration = ...
timer_stm = TimerLogic(timer_name, duration, self)
self.stm_driver.add_machine(timer_stm)

timer_name = ...
duration = ...
timer_stm = TimerLogic(timer_name, duration, self)
self.stm_driver.add_machine(timer_stm)

Task: Create the Timer Component

Step 1: Sequence Diagrams

Create two sequence diagrams for the scenario with a 10-minute timer "spaghetti", and a 2-minute timer "green tea".

Create one version in which you show the MQTT messages, including publish and subscribe, including the MQTT broker as lifeline.
Create another, more high-level sequence diagram, in which you don't show the MQTT broker and only show messages between the system components, as if they were sending directly to each other.

Step 2: Start the Command Component

Because we don't have a speech-to-text engine, we send in the commands to the component from buttons of a GUI component. This is how it looks like:

Simple GUI to dispatch commands via MQTT.

Do the following:

Use mqtt20.iik.ntnu.no with port 1883 as broker.
Download the code of the command component and study it.
Adjust the topics.
Run the component and see commands coming in by using MQTTX for debugging.

Step 2: Implement the Components

Use the component template, and add the required code in the places marked with TODO.
Send Timer commands into the component using the GUI.
See if your component works as intended and sends the required messages to the Text To Speech component.
- Just emulate the Text To Speech Component by subscribing with MQTTX to the corresponding topic.
Reflect on your progress in a document.
Create a short and uncut demo video using two named timers, simulated via MQTTX and your timer manager component.