Unlock the full potential of your analytics. Dave Westby explains 8 key reasons to export GA4 data to BigQuery, from overcoming retention limits to fixing data sampling.
Click on the whiteboard image above to open a high-resolution version!
Hello and welcome to this edition of Whiteboard Friday. I'm Dave Westby, and I'm an analytics consultant at Piped Out.
So today, I'm going to be speaking about why you should be exporting your GA4 data to BigQuery if you're not already doing so.
Now the first thing to note here is that your GA4 data in BigQuery does not backdate. So if you set up that link between your GA4 and BigQuery, you'll only start collecting that data in BigQuery from today moving forward.
So that's the first really important point.
Advantage 1: It’s free to set up
So moving on to the first key advantage, it's actually free to set up this link between GA4 and BigQuery. The thing that you do need, though, is a Google Cloud account, and what you pay for is the storage of this data.
But luckily for you, it's super cheap. So, as an example, if you have a website with 30,000 sessions a month, for example, the storage for this would actually be free. So it doesn't cost you dear.
Advantage 2: You don’t need to work within the GA4 interface
So, coming on to the second point, you don't need to work with the GA4 interface when you're working with the raw GA4 BigQuery data.
A lot of the limitations that you have with the interface are actually solved by the fact that you're working with the raw data. So that's a great perk.
Advantage 3: Data retention
So coming on to the next point, which is data retention. So this is one of the biggest advantages of having your raw data in BigQuery. So with the raw data, you have that data stored forever. So that's great.
You just have access to it forever. Whereas with the interface, when you access any of the standard reports, it will use all-time data, which is fine. But if you segment it or you use Explorations, your user and event level data will be limited to 14 months. So that is a limiting factor there.
So that's a huge advantage.
Advantage 4: Sampling
Coming on to the next point, which is sampling. So if you've worked with the GA4 interface at all, you'll be used to seeing sample data. The same applies as well for the GA4 Looker Studio connector as well, and that can be a real bummer.
And this happens basically when there are more than 10 million events within a single request. That's when your data starts getting sampled. Whereas, obviously, if you're working with your raw data, then you're not going to be having that problem.
Advantage 5: Cardinality
Now moving on to the next point, which is cardinality. So this is when things get lumped under “Other.” Boo, no one likes it. This happens on the GA4 interface when you basically hit row limits. It just starts lumping everything under “Other.”
But again, when you're working with the raw data, you don't have that problem at all. So you can get far more granular with your data.
Advantage 6: Customization
Then moving on to the next point, so customization. So working with the raw GA4 BigQuery data, you can customize your data however you like.
And this comes in really handy, for example, if you've got spam traffic that you want to remove, you can easily exclude that. You can fix historical conversions that are broken, and you can make your own conversions as well. So it's very customizable.
Advantage 7: You can create your own sessions and user properties
Then coming on to the next point, which is you can create your own sessions and user properties. Now you can do this in the GA4 interface under Explorations. But there are two main limitations here.
The first one being that you have to use Explorations, and the second is that you can only create dimensions. And when you've created these as well, these only live in your Explorations. You can't have these in other places, if you want to view them in dashboards or whatever.
But obviously, with the raw GA4 BigQuery data, you can create these properties, and then you can use them wherever.
Advantage 8: You can create your own sessions and user properties
Then moving on to the next point, which is attribution.
So when you're working with the raw data, you can build all the attribution metrics however you like. So you've got your first click, your last click, your last non-direct click, linear times, etc. You can create all of those metrics, and then you've got access to them as well.
Advantage 9: You can combine your data with other sources
Then the final point is that you can combine your GA4 data far more easily with other data sources as well. So you can join it with your Search Console, Google Ads, or your CRM data as well. This just means you can get so much more out of your data.
It's hard to scale SEO with scattered data
Moz API unifies your SEO metrics in one place
So, just to wrap it all up, if there are only two things you take away from today, the first thing is that the export to BigQuery is free, and all you're paying for is the storage, which is extremely cheap. So it's definitely worth doing.
The second thing is that the BigQuery data does not backfill. So if you set it up today, you'll only start collecting that data from today onwards. So do it.
That's all from me. Yeah, hope you enjoyed it. Catch you soon.
The author's views are entirely their own (excluding the unlikely event of hypnosis) and may not always reflect the views of Moz.