Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Your browser does not seem to support JavaScript. As a result, your viewing experience will be diminished, and you have been placed in read-only mode.

Please download a browser that supports JavaScript, or enable it if it's disabled (i.e. NoScript).

Moz Q&A is closed.

After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.

Regex in Disavow Files?

Intermediate & Advanced SEO

888

Fubra last edited by

Hi,

Will Regex expressions work in a disavow file?

If i include website.com/* will that work or would you recommend just website.com?

Thanks.
1 Reply Last reply
Reply Quote 0
davebuts last edited by

Hi Fubra,

You can disavow at a domain level, so no regex is required (and I don't think it will work).

Just add "domain:" before the domain, eg. domain:spammysite.com

Marie Haynes wrote a good guide to using the disavow tool here if you need any further information: https://azwa.1clkaccess.in/blog/guide-to-googles-disavow-tool

Cheers,

David
1 Reply Last reply
Reply Quote 1

Got a burning SEO question?

Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.

Start my free trial

Browse Questions

View

From

Sorted by

With category

Explore more categories

Related Questions

Disavow 401, 403, 410, 500, 502, 503

Dear people, I am cleaning my backlink profile and I am not sure if I should disavow links that drive you to a: 401, 403, 410, 500, 502, 503. I do understand that since last Penguin update, it won't be necessary, but I would like to be sure about it. Any hints out there? Thanks in advance 🙂
Intermediate & Advanced SEO | | Marta_King_ruiz

0
How important is the file extension in the URL for images?

I know that descriptive image file names are important for SEO. But how important is it to include .png, .jpg, .gif (or whatever file extension) in the url path? i.e. https://example.com/images/golden-retriever vs. https://example.com/images/golden-retriever.jpg Furthermore, since you can set the filename in the Content-Disposition response header, is there any need to include the descriptive filename in the URL path? Since I'm pulling most of our images from a database, it'd be much simpler to not care about simulating a filename, and just reference an image id in my templates. Example: 1. Browser requests GET /images/123456
2. Server responds with image setting both Content-Disposition, and Link (canonical) headers Content-Disposition: inline; filename="golden-retriever"
Link: <https: 123456="" example.com="" images="">; rel="canonical"</https:>
Intermediate & Advanced SEO | | dsbud

1
Hacked website - Dealing with 301 redirects and a large .htaccess file

One of my client's websites was recently hacked and I've been dealing with the after effects of it. The website is now clean of malware and I already appealed to Google about the malware issue. The current issue I have is dealing with the 20, 000+ crawl errors which are garbage links that were created from the hacking. How does one go about dealing with all the 301 redirects I need to create for all the 404 crawl errors? I'm already noticing an increased load time on the website due to having a rather large .htaccess file with a couple thousand 301 redirects done already which I fear will result in my client's website performance and SEO performance taking a hit as well.
Intermediate & Advanced SEO | | FPK

0
Large robots.txt file

We're looking at potentially creating a robots.txt with 1450 lines in it. This will remove 100k+ pages from the crawl that are all old pages (I know, the ideal would be to delete/noindex but not viable unfortunately) Now the issue i'm thinking is that a large robots.txt will either stop the robots.txt from being followed or will slow our crawl rate down. Does anybody have any experience with a robots.txt of that size?
Intermediate & Advanced SEO | | ThomasHarvey

0
Partial Match or RegEx in Search Console's URL Parameters Tool?

So I currently have approximately 1000 of these URLs indexed, when I only want roughly 100 of them. Let's say the URL is www.example.com/page.php?par1=ABC123=&par2=DEF456=&par3=GHI789= All the indexed URLs follow that same kinda format, but I only want to index the URLs that have a par1 of ABC (but that could be ABC123 or ABC456 or whatever). Using URL Parameters tool in Search Console, I can ask Googlebot to only crawl URLs with a specific value. But is there any way to get a partial match, using regex maybe? Am I wasting my time with Search Console, and should I just disallow any page.php without par1=ABC in robots.txt?
Intermediate & Advanced SEO | | Ria_

0
Is there a limit to images file names?

Hi, I have an eCommerce site with hundreds of product images. For management reasons files are named in length to have the product details in them.
Is there a limit for a filename length before it is considered ambiguous or spammy etc.?
(it usually ranges 50-70 chars). Thanks
Intermediate & Advanced SEO | | BeytzNet

0
Disavow Tool - WWW or Not?

Hi All, Just a quick question ... A shady domain linking to my website is indexed in Google for both example.com and www.example.com. If I wan't to disavow the entire domain, do I need to submit both: domain:www.example.com domain:example.com or just: domain:example.com Cheers!
Intermediate & Advanced SEO | | Carlos-R

0
Could you use a robots.txt file to disalow a duplicate content page from being crawled?

A website has duplicate content pages to make it easier for users to find the information from a couple spots in the site navigation. Site owner would like to keep it this way without hurting SEO. I've thought of using the robots.txt file to disallow search engines from crawling one of the pages. Would you think this is a workable/acceptable solution?
Intermediate & Advanced SEO | | gregelwell

0

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Moz Q&A is closed.

Regex in Disavow Files?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Disavow 401, 403, 410, 500, 502, 503

How important is the file extension in the URL for images?

Hacked website - Dealing with 301 redirects and a large .htaccess file

Large robots.txt file

Partial Match or RegEx in Search Console's URL Parameters Tool?

Is there a limit to images file names?

Disavow Tool - WWW or Not?

Could you use a robots.txt file to disalow a duplicate content page from being crawled?

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved