Introduction to Actions on Google

Adarsh Menon
4 min read · Aug 21, 2018

Actions on Google are essentially apps for the Google Assistant. They let users get things done through a conversational interface, from a quick command such as turning on some lights, doing a Google search, or adding an entry to your calendar, to a longer conversation such as playing a trivia game.

This is the written version of this video on YouTube. Watch it if you prefer that!

Why is voice becoming popular?

As you might have heard, voice is becoming increasingly popular. You might have seen the popular video where the Assistant places a call and sounds exactly like a human. People now use the Google Assistant or Amazon Alexa on a daily basis. It is estimated that by 2020, 30% of all web browsing sessions will be done without a screen. Voice is a faster and far more convenient way to get things done, and we now have the technology for it. Companies and brands are slowly deploying voice apps to provide their services. It’s up to us now to come up with innovative voice apps.

Here is a video by the famous entrepreneur Gary Vaynerchuk on Why Voice will Win. Watch it to understand the true power of voice and its impact in the next few years.

Introduction to Actions on Google

  • Actions on Google is a developer platform that lets you create software to extend the functionality of the Google Assistant.
  • It is available across more than 500 million devices, including smart speakers, phones, cars, TVs, headphones, and more.

What is an Action?

An Action is an entry point into an interaction (an app) that you build for the Assistant. Users can invoke your Action by typing or speaking to the Assistant.

For example, saying “Hey Google, talk to Random Words” invokes the Action named Random Words and starts a conversation with it.

An intent is a goal or task that the user wants to accomplish, like ordering coffee or setting a timer. In Actions on Google, an intent is represented as a unique identifier together with the user utterances that can trigger it.

Fulfillment is a service, app, feed, conversation, or other logic that handles an intent and carries out the corresponding Action. For every intent we need to define some form of fulfillment, which produces the reply to the user’s intent.
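To make the relationship between intents and fulfillment concrete, here is a minimal Python sketch. It is not the Actions on Google API: the `Intent` type, the `INTENTS` table, the intent names, and the exact-match lookup are all hypothetical stand-ins for what the platform does with real language understanding.

```python
from typing import Callable, List, NamedTuple


class Intent(NamedTuple):
    """Hypothetical intent: a unique identifier, sample utterances, a handler."""
    name: str
    utterances: List[str]
    fulfillment: Callable[[], str]


def order_coffee() -> str:
    # Fulfillment for the "order_coffee" intent (illustrative reply only).
    return "Okay, ordering your coffee."


def set_timer() -> str:
    # Fulfillment for the "set_timer" intent (illustrative reply only).
    return "Okay, starting a timer."


INTENTS = [
    Intent("order_coffee", ["order coffee", "get me a coffee"], order_coffee),
    Intent("set_timer", ["set a timer", "start a timer"], set_timer),
]


def respond(utterance: str) -> str:
    """Find the intent whose sample utterances include the input, then fulfill it."""
    normalized = utterance.strip().lower()
    for intent in INTENTS:
        if normalized in intent.utterances:
            return intent.fulfillment()
    # No intent matched: fall back to a generic reply.
    return "Sorry, I didn't get that."
```

Calling `respond("Set a timer")` runs the `set_timer` fulfillment, while an utterance that matches no intent gets the fallback reply. A real agent matches far more loosely than this exact-string comparison.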

What happens during a conversation?

  • Actions are run entirely in the cloud.
  • Every Action supports a specific intent and has a corresponding fulfillment that processes the intent.
  • The user’s device sends the user’s utterance to the Google Assistant, which routes it to your fulfillment service via HTTP POST requests.
  • Your fulfillment figures out a relevant response and sends that back to the Assistant, which ultimately returns it to the user.
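The round trip described above can be sketched as a function that takes the body of the HTTP POST and returns the body of the reply. This sketch assumes the Dialogflow v2 webhook JSON shape (`queryResult.intent.displayName` in the request, `fulfillmentText` in the response); the intent names and replies in `REPLIES` are made up for illustration, and a real fulfillment would run behind an HTTPS endpoint rather than as a bare function.

```python
import json

# Hypothetical replies keyed by intent name; not part of any Google API.
REPLIES = {
    "random_word": "Your word is: serendipity.",
    "goodbye": "Thanks for playing!",
}


def fulfill(post_body: str) -> str:
    """Turn the JSON body of a webhook POST into a JSON response body."""
    request = json.loads(post_body)
    # Assumed request shape: {"queryResult": {"intent": {"displayName": ...}}}
    intent_name = (
        request.get("queryResult", {}).get("intent", {}).get("displayName", "")
    )
    reply = REPLIES.get(intent_name, "Sorry, I can't help with that yet.")
    # The reply text travels back to the Assistant, which speaks it to the user.
    return json.dumps({"fulfillmentText": reply})


# Example request, roughly what the fulfillment would receive for one utterance.
sample_post = json.dumps(
    {
        "queryResult": {
            "queryText": "give me a word",
            "intent": {"displayName": "random_word"},
        }
    }
)
print(fulfill(sample_post))
```

Keeping the logic in a plain function like `fulfill` makes it easy to test without a web server; any HTTP framework can then wrap it.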


  • Dialogflow is a web-based service provided by Google that uses an agent to process user input. The service allows you to integrate conversational apps with the Assistant, as well as with other conversational platforms.
  • Dialogflow simplifies the task of understanding user input, extracting key words and phrases from the input, and returning responses using Natural Language Understanding (NLU). You can define how all this works within your Dialogflow agent.
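To give a feel for what “understanding user input and extracting key words” means, here is a toy Python sketch. Dialogflow does this with trained language models, not with a regular expression; the `set_timer` intent, the pattern, and the output shape below are assumptions chosen only to show the kind of structured result an agent hands to your fulfillment.

```python
import re
from typing import Dict, Optional

# Toy pattern for one hypothetical intent, "set_timer".
TIMER_PATTERN = re.compile(
    r"timer for (?P<amount>\d+) (?P<unit>second|minute|hour)s?"
)


def parse_utterance(utterance: str) -> Optional[Dict[str, object]]:
    """Map free text to an intent name plus extracted parameters, or None."""
    match = TIMER_PATTERN.search(utterance.lower())
    if match is None:
        return None
    return {
        "intent": "set_timer",
        "parameters": {
            "amount": int(match.group("amount")),
            "unit": match.group("unit"),
        },
    }
```

With this sketch, “Set a timer for 5 minutes” yields the intent `set_timer` with the parameters `amount` and `unit` filled in, while unrelated text yields `None`. The value of a service like Dialogflow is that it handles the many phrasings a fixed pattern like this would miss.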

Create your first Action

1. Enable necessary permissions
Go to the Activity Controls page and make sure the following permissions are enabled:

  • Web & App activity
  • Device Information
  • Voice & Audio Activity

2. Create an Actions project

  • Enter a project name. This name is for your own reference; you can set the external name that people will see later.
  • Click Create Project. You can skip the page that follows for now.
  • Click Build > Add Action, then click Add your first Action on the next page.
  • On the Custom Intent card, click Build.

3. Create a Dialogflow agent

  • Choose your account and accept the terms of service.
  • Click Create to create a new agent.

And that’s it! You have created your first Action!

In the next tutorial, I will show you how to create intents and have a conversation with the user.


