Recently, I thought it would be a cool idea to add a date-time stamp to the end of a Google Doc checklist item with Google Apps Script. So I knew when I completed a task.
I often share a project Google Doc with clients and then add my tasks to the document list. With Google’s new check box list item, I wanted to add the date and time that I completed the task when I checked the box.
The bad news is that there is no onEdit() trigger (like in Google Sheets) for the DocumentApp class that would listen for an edit of the document and see a change of the checked box from unchecked to checked and then apply the date-time stamp. 😢
All good, I settled for the next best thing! A menu item.
I love it when a Google Sheets blog post generates more curiosity and further questions. Apart from the almost endless source of new material to write about, it is great to see peoples curiosity for a topic deepen.
I wish I could reward you all for this curiosity every time, but sadly I seem to run out of hours of the day.
How would I modify this [the solution in the post] to get the value with the most occurrences?
I hastily wrote a solution but sadly ran out of time to provide any context to the solution. Further, on review of my solution, I discovered two much better approaches. This post answers Sue’s question more fully.
… in a complete platonic Goat-human interactive way, of course. 🐐
In this tutorial, I will provide two solutions. Both are interesting ways to solve the problem and provide some great insight into how to use certain functions in Google Sheets. You can choose which one makes the most sense for you in your project.
You can use the table of contents below to jump straight to the solution or follow along to learn some cool stuff about the QUERY, FILTER, INDEX, COUNTIF, MAX, ARRAYFORMULA and UNIQUE functions in Google Sheets.
If you are playing along, you can get the sheet with just the data here for you to work on.
Just go to File > Make a copy. So you have your own copy of the sheet to edit.
Table of Contents
The problem
Our task is to display the most frequently occurring word or words in a list of words.
Take this list of mystery words, for example.
If we were to do a manual count we would discover that the value “puzzling” was the most frequently occurring as it occurs 7 times.
Incidentally, if it was just a list of numbers we could get the most commonly occurring number simply with the MAX function.
So how do we solve this with a Google Sheets formula?
The Solution
As we mentioned earlier, we have two interesting solutions this problem. One we solve primarily using FILTER around a bunch of other functions and one around using the QUERY function on a lovingly hand-crafted array.
The FILTER solution
1
2
3
4
5
6
=UNIQUE(
FILTER(
A2:A,
COUNTIF(A2:A,A2:A)=MAX(COUNTIF(A2:A,A2:A))
)
)
Here you replace A2:A with the column that you want to find the most frequently occurring cell item in. This can be a single letter or word or phrase.
Simply copy this formula into your Google Sheet and change the range.
To quickly change the range you can select the cell with your new formula in it and hit ctrl + h. The find and replace menu will appear. Let’s say you want to change your range to H4:H200. Your Find and replace menu should look like this:
But I am sure you are just itching to learn how it all works so you can tweak and change it to other projects. So read on!
Formula Breakdown.
Get a frequency count for each value
1
=COUNTIF(A2:A,A2:A)
Our first task is to get a count of all the times each word appears in each cell. Essentially we want the total count of each word to appear next to the work.
To do this we use the COUNTIF function. This function takes a range as the first argument and a criterion to check as the second argument. If the criterion is true, it will count the value in the cell.
1
=COUNTIF(range,criterion)
Now, if you have used COUNTIF before you may not have seen another range being added as a criterion. We can apply a criterion as a range because the FILTER function (more on FILTER later) allows for array conditions or ranges of cells to be calculated.
Essentially inside FILTER, this COUNTIF will loop through each cell and conduct a count if the current value in column A matches any other values in the entire range.
We can now clearly see that “puzzling” is the most frequently occurring word, followed by “cryptic”.
Note! If you are playing along, you might have noticed the inclusion of ARRAYFORMULA in this formula. I’ve used this here to help simulate how the COUNTIF function actually works in the FILTER function when it iterates through the cells in the range.
If you want to try it out yourself, your formula should look like this:
1
=ARRAYFORMULA(COUNTIF(A2:A,A2:A))
Find the max frequency as a value
Next, we need to find the max value of our newly created range of count numbers for each word.
We can do this with the MAX function which conveniently takes a range of values or numbers and return the highest number in that range.
1
=MAX(your range of values)
Here we can apply our COUNTIF function we created above so MAX can find the highest value. On ya! Max!
1
=MAX(COUNTIF(A2:A,A2:A))
Again, if you are playing along, then you will need to apply the ARRAYFORMUAL function to COUNTIF for it to work on its own in order to simulate it in the FILTER function.
1
=MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A)))
Your result should look like this:
Filter only those most frequent words in the list
Now we can use these formulas in a FILTER function to only show the word or words that appear the most in your Google Sheets range.
The FILTER function takes a minimum of two arguments:
The range to be displayed.
The conditions that are used to filter the data. The condition starts with a range and then a criterion.
Which would result in displaying all the times that the most frequently occurring cell, well… occurs.
What if there is more than one Frequently occurring cell value?
I’m glad you asked.
The great part of this formula is that it will take this into account and include any cell value that is also the most common in the range.
If you have a good memory, you might have noticed that the word, “cryptic” occurs 6 times. Let’s add another “cryptic” to cellA20 and see what happens.
Cool, hey?
Only show each most frequent word once
We don’t want to see repetitions of the words. We get it. They are the most common. Let’s do something about that with the UNIQUE function.
UNIQUE takes a range and displays any value that occurs in the range only once.
1
=UNIQUE(range)
It is pretty cool in that it doesn’t discriminate if it is a number, date or text. It is also case sensitive.
Our final formula would now look like this (Keeping in mind that we have added “cryptic” now as a most frequent value).
1
2
3
4
5
6
=UNIQUE(
FILTER(
A2:A,
COUNTIF(A2:A,A2:A)=MAX(COUNTIF(A2:A,A2:A))
)
)
Bonus! Sorting your results
Having come this far I have now realised that my example is woefully ill-prepared to present the sorting of our results alphabetically because it is, well… already that way.
So for the sake of this short example, let’s go a little wild and sort from Z to A.
We can do this by wrapping our formula in the SORT function.
Sort takes at least 3 functions:
The range to sort – This will be the result of our formula.
The column to sort by – There is only one column for us. So this will be ‘1’.
Whether ascending or descending – Let’s go Z to A here.
1
=SORT(range tosort,column tosort by,order)
Our formula will now look like this:
1
2
3
4
5
6
7
8
9
10
=SORT(
UNIQUE(
FILTER(
A2:A,
COUNTIF(A2:A,A2:A)=MAX(COUNTIF(A2:A,A2:A))
)
),
1,
FALSE
)
Which will look like this spectacularly topsy-turvy hellscape:
Quickly now! Change the sort value to TRUE and we will speak no more of it.
The QUERY solution
1
2
3
4
5
6
7
8
=UNIQUE(
QUERY(
{A2:A,ARRAYFORMULA(COUNTIF(A2:A,A2:A))},
"SELECT Col1 WHERE Col2 = "&
MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A)))&
" "
)
)
Here you replace the A2:A with the range you want to run the formula. You will need to do this each of the three times this range occurs in the formula.
For a shortcut to do this check out the example in The FILTER solution chapter.
This approach has the benefit of being able to quickly display headers or even the count for each max value with simple changes to the query. More on this in the bonus material at the end.
Now I am sure that you are barely containing yourself to find out how this formula works so let’s get cracking.
Formula Breakdown.
This formula uses the QUERY Google Sheets function to show the most commonly occurring cell item in a range. To do this we build our own range based on the original A1:A range and a range generated from the total count of each cell value in the A1:A range as our second column.
We then dynamically insert the largest number of occurrences from this second column to only show those values that are most frequently used, before using the UNIQUE function to shot those values only once.
Let’s break it down into steps…
Get a frequency count for each value.
Our first task is to get the number of times each cell repeats in our list.
We did the same thing here as we did in the FILTER versions.
Make an array containing both the mystery word list and the total count for each word
Next, we need to create a new array containing our original list of mystery words and their corresponding count. In Google Sheets, we can create an array or range of data with curly braces ({}). We create two columns by putting our data between a comma (,).
1
={acol of data,another col of data}
Here is what our array is going to look like:
1
={A2:A,ARRAYFORMULA(COUNTIF(A2:A,A2:A))}
Which will produce this result in your Google Sheet:
Find the max frequency as a value
Just like in our FILTER version we need to get the highest value from our list frequency count for each cell value in the list.
Being a new language to learn it can take a little while to get the hang of, but one of the beauties of using the query language is that is pretty close to plain English to read.
So, let’s dive in.
Your first step is to ask what you want from the data. For us, we want to display the words in our first column where the total frequency count of the word is the highest.
In query language, our statement would look like this:
1
"SELECT Col1 WHERE Col2 = 7 "
Here we select all cells in column 1 where column 2 (our frequency sum col) is equal to 7.
Why 7? This is just a placeholder for our dynamic max count value we created earlier. We know that our current max count is 7 for our most frequently occurring word. We will update this in the next step. For now, it is just easier to read this way.
If you have tried out QUERY before you might be scratching your head over the ‘Col’ word instead of say, a column letter like A, B, C etc. We use the ‘Col#’ syntax when we are not referencing a ‘physical’ range on the sheet but a generated range like our array.
Let’s put our QUERY together now:
1
2
3
4
5
6
7
=QUERY(
{
A2:A,
ARRAYFORMULA(COUNTIF(A2:A,A2:A))
},
"SELECT Col1 WHERE Col2 = 7 "
)
Which will result in:
Adding the max frequency formula inside the query
One of the cool things about Google Sheets is that we can add dynamic values to strings of text in a cell. To insert a value or formula result in a string of text we use the ampersand character “&” .
1
=" some text "&aformula orvalue&" some more text "
Let’s go ahead and add our dynamic max frequency formula to our query.
1
2
3
4
5
6
7
8
9
=QUERY(
{
A2:A,
ARRAYFORMULA(COUNTIF(A2:A,A2:A))
},
"SELECT Col1 WHERE Col2 = "&
MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A)))
&" "
)
Complete the formula by only selecting UNIQUE values
Finally, we can use the UNIQUE function to only display the words once.
1
2
3
4
5
6
7
8
9
10
11
=UNIQUE(
QUERY(
{
A2:A,
ARRAYFORMULA(COUNTIF(A2:A,A2:A))
},
"SELECT Col1 WHERE Col2 = "&
MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A)))
&" "
)
)
BONUS! Expanding QUERY views for kicks
Another benefit of using QUERY is that we can quickly change how the data is displayed by making a few changes to the formula.
First off, for all of these examples, we will need to change our ranges to A1:A instead of A2:A. Go ahead and change this now (See if you can do it with find and replace).
All done? Let’s make some mods to our QUERY.
Add a header
The QUERY function has an optional third parameter that allows you to have a header. We can simply add one to our formula and we now have a header – keeping in mind that we have adjusted all of our ranges to start at A1, not A2.
1
2
3
4
5
6
7
8
9
10
11
12
=UNIQUE(
QUERY(
{
A1:A,
ARRAYFORMULA(COUNTIF(A1:A,A1:A))
},
"SELECT Col1 WHERE Col2 = "
&MAX(ARRAYFORMULA(COUNTIF(A1:A,A1:A)))
&" ",
1
)
)
And the result is:
Add the header and frequency count to the formula
Now that we have our headers in place, let’s add the frequency column so that we know how many times these max values have appeared.
All that we need to do here is to add Col2 to our SELECT stage. Don’t forget to separate the columns by a comma.
1
2
3
4
5
6
7
8
9
10
11
12
=UNIQUE(
QUERY(
{
A1:A,
ARRAYFORMULA(COUNTIF(A1:A,A1:A))
},
"SELECT Col1, Col2 WHERE Col2 = "
&MAX(ARRAYFORMULA(COUNTIF(A1:A,A1:A)))
&" ",
1
)
)
Which will now look like this:
Those headers don’t look great. Especially the frequency column. Let’s do something about that.
Add the header and frequency count to the formula with header LABELS
We can use the query language LABEL clause to create custom labels for any column.
Let’s change column one’s header to read ‘Top Mystery Word/s’and column two to read ‘Frequency’. To do this we add the LABEL clause followed by the column and then in single quotation (' ') we add our new label. We separate each label group with a comma.