I have a website with thousands of directories which I am trying to find out which are the most popular through using the following field
REGEXP_EXTRACT(Page,'(/[^/]+)')
This is working great following my previous question here, however there is a random extra string in the URL which is impacting the figures.
e.g
- sitename.com/directoryA returns the same page as
- sitename.com/randomstring/directoryA
This means that when I run my field some of the views that should be attributed to the /directoryA/ are being attributed to /randomstring and my table reports as follows:
- /randomstring = 1,000 views
- directoryA = 800 views
- directoryB = 850 views
- ...
Does anyone know how to clean the data in Google Data studio so that /randomstring is removed from the URLs before I run my REGEXP_EXTRACT code?
directoryA
anddirectoryB
? What makesdirectoryA
not random? ā Diego Queiroz