I love it when a Google Sheets blog post generates more curiosity and further questions. Apart from the almost endless source of new material to write about, it is great to see peoples curiosity for a topic deepen.

I wish I could reward you all for this curiosity every time, but sadly I seem to run out of hours of the day.

Fortunately, I have found a little time today to put more details into a question that came from Sue in a recent post Get the Occurrence of a Value in a Range as a Percentage in Google Sheets. She asked:

How would I modify this [the solution in the post] to get the value with the most occurrences?

I hastily wrote a solution but sadly ran out of time to provide any context to the solution. Further, on review of my solution, I discovered two much better approaches. This post answers Sue’s question more fully.

Sue, thanks for the inspiration for this article.

… in a complete platonic Goat-human interactive way, of course. π

In this tutorial, I will provide two solutions. Both are interesting ways to solve the problem and provide some great insight into how to use certain functions in Google Sheets. You can choose which one makes the most sense for you in your project.

You can use the table of contents below to jump straight to the solution or follow along to learn some cool stuff about the QUERY, FILTER, INDEX, COUNTIF, MAX, ARRAYFORMULA and UNIQUE functions in Google Sheets.

If you are playing along, you can get the sheet with just the data here for you to work on.

Just go toΒ * File > Make a copy. *So you have your own copy of the sheet to edit.

Table of Contents

# The problem

Our task is to display the most frequently occurring word or words in a list of words.

Take this list of mystery words, for example.

If we were to do a manual count we would discover that the value **“puzzling”** was the most frequently occurring as it occurs 7 times.

Incidentally, if it was just a list of numbers we could get the most commonly occurring number simply with the MAX function.

So how do we solve this with a Google Sheets formula?

# The Solution

As we mentioned earlier, we have two interesting solutions this problem. One we solve primarily using FILTER around a bunch of other functions and one around using the QUERY function on a lovingly hand-crafted array.

## The FILTER solution

1 2 3 4 5 6 |
=UNIQUE( FILTER( A2:A, COUNTIF(A2:A,A2:A) = MAX(COUNTIF(A2:A,A2:A)) ) ) |

Here you replace **A2:A** with the column that you want to find the most frequently occurring cell item in. This can be a single letter or word or phrase.

Simply copy this formula into your Google Sheet and change the range.

To quickly change the range you can select the cell with your new formula in it and hitΒ *ctrl + h*. The find and replace menu will appear. Let’s say you want to change your range to H4:H200. Your Find and replace menu should look like this:

You are all done.

But I am sure you are just itching to learn how it all works so you can tweak and change it to other projects. So read on!

### Formula Breakdown.

#### Get a frequency count for each value

1 |
=COUNTIF(A2:A,A2:A) |

Our first task is to get a count of all the times each word appears in each cell. Essentially we want the total count of each word to appear next to the work.

To do this we use the COUNTIF function. This function takes a range as the first argument and a criterion to check as the second argument. If the criterion is true, it will count the value in the cell.

1 |
=COUNTIF(range, criterion) |

Now, if you have used COUNTIF before you may not have seen another range being added as a criterion. We can apply a criterion as a range because the FILTER function (*more on FILTER later*) allows for array conditions or ranges of cells to be calculated.

Essentially inside FILTER, this COUNTIF will loop through each cell and conduct a count if the current value in column *A* matches any other values in the entire range.

We can now clearly see that* “puzzling”* is the most frequently occurring word, followed by

*.*

**“cryptic”****Note!Β **If you are playing along, you might have noticed the inclusion of ARRAYFORMULA in this formula. I’ve used this here to help simulate how the COUNTIF function actually works in the FILTER function when it iterates through the cells in the range.Β

*If you want to try it out yourself, your formula should look like this:Β *

1 |
=ARRAYFORMULA(COUNTIF(A2:A,A2:A)) |

#### Find the max frequency as a value

Next, we need to find the max value of our newly created range of count numbers for each word.

We can do this with the MAX function which conveniently takes a range of values or numbers and return the highest number in that range.

1 |
=MAX(your range of values) |

Here we can apply our COUNTIF function we created above so MAX can find the highest value. On ya! Max!

1 |
=MAX(COUNTIF(A2:A,A2:A)) |

Again, if you are playing along, then you will need to apply the ARRAYFORMUAL function to COUNTIF for it to work on its own in order to simulate it in the FILTER function.

1 |
=MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A))) |

Your result should look like this:

#### Filter only those most frequent words in the list

Now we can use these formulas in a FILTER function to only show the word or words that appear the most in your Google Sheets range.

The FILTER function takes a minimum of two arguments:

- The range to be displayed.
- The conditions that are used to filter the data. The condition starts with a range and then a criterion.

1 |
=FILTER(range to be displayed, criterion range & criterion) |

In our FILTER formula we want to display * A2:A*. So that goes in the first argument.

Then in the next argument, we want to compare the count of each cell against the max value.

Our FILTER will then look like this:

1 |
=FILTER(A2:A,COUNTIF(A2:A,A2:A) = MAX(COUNTIF(A2:A,A2:A))) |

Which would result in displaying all the times that the most frequently occurring cell, well… occurs.

##### What if there is more than one Frequently occurring cell value?

I’m glad you asked.

The great part of this formula is that it will take this into account and include any cell value that is also the most common in the range.

If you have a good memory, you might have noticed that the word, **“cryptic”** occurs 6 times. Let’s add another **“cryptic”** to cell*Β A20* and see what happens.

Cool, hey?

#### Only show each most frequent word once

We don’t want to see repetitions of the words. We get it. They are the most common. Let’s do something about that with the UNIQUE function.

UNIQUE takes a range and displays any value that occurs in the range only once.

1 |
=UNIQUE(range) |

It is pretty cool in that it doesn’t discriminate if it is a number, date or text. It is also case sensitive.

Our final formula would now look like this (*Keeping in mind that we have addedΒ “cryptic”Β now as a most frequent value).*

1 2 3 4 5 6 |
=UNIQUE( FILTER( A2:A, COUNTIF(A2:A,A2:A) = MAX(COUNTIF(A2:A,A2:A)) ) ) |

#### Bonus! Sorting your results

Having come this far I have now realised that my example is woefully ill-prepared to present the sorting of our results alphabetically because it is, well… already that way.

So for the sake of this short example, let’s go a little wild and sort from Z to A.

We can do this by wrapping our formula in the SORT function.

Sort takes at least 3 functions:

**The range to sort**– This will be the result of our formula.**The column to sort by**– There is only one column for us. So this will be ‘1’.**Whether ascending or descending**– Let’s go Z to A here.

1 |
=SORT(range to sort, column to sort by, order) |

Our formula will now look like this:

1 2 3 4 5 6 7 8 9 10 |
=SORT( UNIQUE( FILTER( A2:A, COUNTIF(A2:A,A2:A) = MAX(COUNTIF(A2:A,A2:A)) ) ), 1, FALSE ) |

Which will look like this spectacularly topsy-turvy hellscape:

Quickly now! Change the sort value to TRUE and we will speak no more of it.

## The QUERY solution

1 2 3 4 5 6 7 8 |
=UNIQUE( QUERY( {A2:A,ARRAYFORMULA(COUNTIF(A2:A,A2:A))}, "SELECT Col1 WHERE Col2 = "& MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A)))& " " ) ) |

Here you replace the ** A2:A** with the range you want to run the formula. You will need to do this each of the three times this range occurs in the formula.

For a shortcut to do this check out the example in * The FILTER solution* chapter.

This approach has the benefit of being able to quickly display headers or even the count for each max value with simple changes to the query. More on this in the bonus material at the end.

Now I am sure that you are barely containing yourself to find out how this formula works so let’s get cracking.

### Formula Breakdown.

This formula uses the QUERY Google Sheets function to show the most commonly occurring cell item in a range. To do this we build our own range based on the original * A1:A* range and a range generated from the total count of each cell value in the

*range as our second column.*

**A1:A**We then dynamically insert the largest number of occurrences from this second column to only show those values that are most frequently used, before using the UNIQUE function to shot those values only once.

Let’s break it down into steps…

#### Get a frequency count for each value.

Our first task is to get the number of times each cell repeats in our list.

We did the same thing here as we did in the FILTER versions.

1 |
=ARRAYFORMULA(COUNTIF(A2:A,A2:A)) |

You can click on the link for the previous explanation.

#### Make an array containing both the mystery word list and the total count for each word

Next, we need to create a new array containing our original list of mystery words and their corresponding count. In Google Sheets, we can create an array or range of data with curly braces (`{}`

). We create two columns by putting our data between a comma (,).

1 |
={a col of data, another col of data} |

Here is what our array is going to look like:

1 |
={A2:A,ARRAYFORMULA(COUNTIF(A2:A,A2:A))} |

Which will produce this result in your Google Sheet:

#### Find the max frequency as a value

Just like in our FILTER version we need to get the highest value from our list frequency count for each cell value in the list.

1 |
=MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A))) |

Check out the explanation in the FILTER portion of this tutorial.

#### QUERY the newly built array where column two is equal to 7

We can finally get into the query. The QUERY function takes two mandatory arguments:

- The range to query. This is the two-column array we created.
- The query. This will be the query we will write below.

QUERY uses the Google Visualisation API Query Language. You can find some good use-case examples in the main docs.

Being a new language to learn it can take a little while to get the hang of, but one of the beauties of using the query language is that is pretty close to plain English to read.

So, let’s dive in.

Your first step is to ask what you want from the data. For us, we want to display the words in our first column where the total frequency count of the word is the highest.

In query language, our statement would look like this:

1 |
"SELECT Col1 WHERE Col2 = 7 " |

Here we select all cells in column 1Β where column 2 (*our frequency sum col*) is equal to 7.

Why 7? This is just a placeholder for our dynamic max count value we created earlier. We know that our current max count is 7 for our most frequently occurring word. We will update this in the next step. For now, it is just easier to read this way.

If you have tried out QUERY before you might be scratching your head over the ‘Col’ word instead of say, a column letter like A, B, C *etc*. We use the ‘Col#’ syntax when we are not referencing a ‘physical’ range on the sheet but a generated range like our array.

Let’s put our QUERY together now:

1 2 3 4 5 6 7 |
=QUERY( { A2:A, ARRAYFORMULA(COUNTIF(A2:A,A2:A)) }, "SELECT Col1 WHERE Col2 = 7 " ) |

Which will result in:

#### Adding the max frequency formula inside the query

One of the cool things about Google Sheets is that we can add dynamic values to strings of text in a cell. To insert a value or formula result in a string of text we use the ampersand character “`&`

” .

1 |
=" some text " & a formula or value & " some more text " |

Let’s go ahead and add our dynamic max frequency formula to our query.

1 2 3 4 5 6 7 8 9 |
=QUERY( { A2:A, ARRAYFORMULA(COUNTIF(A2:A,A2:A)) }, "SELECT Col1 WHERE Col2 = "& MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A))) &" " ) |

#### Complete the formula by only selecting UNIQUE values

Finally, we can use the UNIQUE function to only display the words once.

1 2 3 4 5 6 7 8 9 10 11 |
=UNIQUE( QUERY( { A2:A, ARRAYFORMULA(COUNTIF(A2:A,A2:A)) }, "SELECT Col1 WHERE Col2 = "& MAX(ARRAYFORMULA(COUNTIF(A2:A,A2:A))) &" " ) ) |

#### BONUS! Expanding QUERY views for kicks

Another benefit of using QUERY is that we can quickly change how the data is displayed by making a few changes to the formula.

First off, for all of these examples, we will need to change our ranges to A1:A instead of A2:A. Go ahead and change this now (*See if you can do it with find and replace*).

All done? Let’s make some mods to our QUERY.

##### Add a header

The QUERY function has an optional third parameter that allows you to have a header. We can simply add one to our formula and we now have a header – keeping in mind that we have adjusted all of our ranges to start at A1, not A2.

1 2 3 4 5 6 7 8 9 10 11 12 |
=UNIQUE( QUERY( { A1:A, ARRAYFORMULA(COUNTIF(A1:A,A1:A)) }, "SELECT Col1 WHERE Col2 = " & MAX(ARRAYFORMULA(COUNTIF(A1:A,A1:A))) &" ", 1 ) ) |

And the result is:

##### Add the header and frequency count to the formula

Now that we have our headers in place, let’s add the frequency column so that we know how many times these max values have appeared.

All that we need to do here is to add Col2 to our SELECT stage. Don’t forget to separate the columns by a comma.

1 2 3 4 5 6 7 8 9 10 11 12 |
=UNIQUE( QUERY( { A1:A, ARRAYFORMULA(COUNTIF(A1:A,A1:A)) }, "SELECT Col1, Col2 WHERE Col2 = " & MAX(ARRAYFORMULA(COUNTIF(A1:A,A1:A))) &" ", 1 ) ) |

Which will now look like this:

Those headers don’t look great. Especially the frequency column. Let’s do something about that.

##### Add the header and frequency count to the formula with header LABELS

We can use the query language LABEL clause to create custom labels for any column.

Let’s change column one’s header to read * ‘Top Mystery Word/s’ *and column two to read

**. To do this we add the LABEL clause followed by the column and then in single quotation (**

*‘Frequency’*`'Β Β '`

) we add our new label. We separate each label group with a comma.
1 |
LABEL Col1 'Top Mystery Word/s', Col2 'Frequency' |

Our formula will look like this now.

1 2 3 4 5 6 7 8 9 10 11 12 |
=UNIQUE( QUERY( { A1:A, ARRAYFORMULA(COUNTIF(A1:A,A1:A)) }, "SELECT Col1, Col2 WHERE Col2 = " & MAX(ARRAYFORMULA(COUNTIF(A1:A,A1:A))) &" LABEL Col1 'Top Mystery Word/s', Col2 'Frequency' ", 1 ) ) |

And our end result is this:

Isn’t it pretty?

# Conclusion

If you would like the complete Google Sheet with all the examples you can get it here:

Get the most common words in a column – complete examples.googleSheet

If you have found the tutorial helpful, why not shout me a coffee β? I'd really appreciate it.

I really love hearing how you use these projects or build on them in the comments below.

Did you enjoy the tutorial? Want to upskill and get a solid step-by-step course to become a pro at Google Sheets? Check out my course,**Google Sheets: Learn the Essentials with Three Detailed Projects.**Sign up today.

**Google Sheets Shortcuts mousepads**that I created from my store.

~Yagi

Hi I was wondering if this could be altered when there is more than one word in a cell such as sentences to find most commonly occuring words with frequency for a column of sentences?

Is it possible to write something similar that will pull the most 5 occurring or most 10 occurring results instead of just the singular most occurring value?