Hi There,
Since November 15, 2023, our Google Search Console has been reporting a large number of pages as "Indexed, though blocked by robots.txt".
Looking at the URLs, they all follow the same structure:
Any insight into the below would be greatly appreciated:
Thanks,
Reece
This is an accepted solution.
Hi @Reece3,
The URLs you mentioned, which contain "login_with_shop" in their structure, are associated with logins using the Shopify Shop App. This is quite common and not something to be overly concerned about.
The good news is, since these URLs are blocked by your site's robots.txt file, Google will not index them. The presence of these URLs in your Search Console as being blocked is actually a sign that your robots.txt file is functioning correctly. It's doing its job by telling search engine crawlers which pages not to index, which in this case includes these specific login URLs.
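For anyone who wants to check which URLs a robots.txt rule actually blocks, here is a minimal Python sketch using the standard library's `urllib.robotparser`. The `Disallow` rule, the `example-store.com` domain, and the sample URL paths are assumptions for illustration and are not copied from any real store's robots.txt. Keep in mind that a robots.txt rule controls crawling only, so a blocked URL can still be reported by Search Console as "Indexed, though blocked by robots.txt" if Google discovers it through links.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the real rule on a given store may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /services/login_with_shop
"""


def is_crawl_blocked(url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the robots.txt rules above forbid crawling the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Both URLs are made-up examples on a placeholder domain.
    login_url = "https://example-store.com/services/login_with_shop/authorize"
    product_url = "https://example-store.com/products/sample"
    print(login_url, "blocked:", is_crawl_blocked(login_url))
    print(product_url, "blocked:", is_crawl_blocked(product_url))
```

Note that `urllib.robotparser` does simple prefix matching and does not implement the wildcard extensions Google supports, so treat it as a rough check only.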
Thanks Lee, for the clarity here.
In three months, I have a site that has added 55,000 non-indexed pages from "login_with_shop," as seen below. While they might not be indexed, it's making a mess of GSC.
The URLs redirect to the below (I added xxxx for the domain and some other references)
It sure would be nice to not have this happen.
Have you found any solution to this? We now have around 500,000 pages with login_with_shop in the URL in our GSC. Luckily they are not indexed, but as you mentioned it's a mess in GSC and, as I understand it, it consumes crawl budget. Should we disallow them in robots.txt? Should we add noindex,nofollow to such pages? Any thoughts?
No solution to date.
Thanks for your clarification. However, we now see that pages with login_with_shop in their URL are being indexed by Google, even though they are blocked in robots.txt.
GSC shows them in exactly that section: "Indexed, though blocked by robots.txt". I believe we need a noindex,nofollow tag on such pages, but we are not sure how to add it. Any thoughts?
Has anyone else encountered this issue? What did you do?
Thanks
This is happening to us too! Def getting indexed!
Have you come across any solution to that?
Shopify support blames Google for indexing them, which does not make any sense as those URLs belong to Shopify. We can't figure out where they are coming from and why they are being indexed. The number of indexed URLs with login_with_shop has now jumped to 50,000. Instead of crawling and indexing valuable content, Google is crawling and indexing these useless URLs.
It's an absolute nightmare, and Shopify support seems not to care.
I have the same issue. I wonder whether submitting them for removal within GSC and then disallowing them would fix it? Has anyone contacted Shop about this?
I have contacted Shopify support, but I don't know if anything will come of it. They're getting back to me soon and I will update. I tried to find a bug reporting feature to send a more detailed report, but the only way to do that is to talk to support, so that's all we've got for now.
Update: Shopify support passed it on to their "support team" and then finally got back to me. They claimed this was an "intentional feature" and that nothing was wrong, and then sent me a copied and pasted help article about a totally unrelated issue. They then told me they would pass it on to the developers to look at, as if acknowledging that their response made no sense and that this clearly was not intended functionality. However, hilariously enough, at the same time they took our advice and added a noindex tag to those pages, and the issue seems to be gone entirely on my end. Are any of you still having this issue?
Yes, it's also fixed on my side. I had no issues with Shopify support: they told me the problem would be passed to the dev team, and a week later I got a reply that the issue was sorted. I don't see any noindex tags on those pages, but somehow they are all now in the noindex section in GSC, so it looks good to me. Problem solved.
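One possible explanation for not seeing a tag in the page source is that a noindex directive can also be sent as an `X-Robots-Tag` HTTP response header instead of a `<meta name="robots">` tag. Whether Shopify did this here is not confirmed anywhere in this thread. The sketch below, in Python with only the standard library, checks both places for a response you have already fetched; the function name `has_noindex` and the sample inputs are made up for illustration. It also only catches plain directives such as `noindex, nofollow`, not header values prefixed with a user agent (for example `googlebot: noindex`).

```python
from html.parser import HTMLParser


class _RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> / <meta name="googlebot">."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key.lower(): (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives.extend(d.strip().lower() for d in content.split(","))


def has_noindex(headers: dict, html: str) -> bool:
    """Return True if noindex appears in the X-Robots-Tag header or a robots meta tag."""
    header_directives = []
    for name, value in headers.items():
        # Header names are case-insensitive, so compare in lowercase.
        if name.lower() == "x-robots-tag":
            header_directives.extend(d.strip().lower() for d in value.split(","))

    parser = _RobotsMetaParser()
    parser.feed(html)
    return "noindex" in header_directives or "noindex" in parser.directives


if __name__ == "__main__":
    # Sample inputs are invented; substitute the headers and body of a real response.
    print(has_noindex({"X-Robots-Tag": "noindex, nofollow"}, "<html></html>"))
    print(has_noindex({}, '<head><meta name="robots" content="noindex"></head>'))
    print(has_noindex({}, "<p>No directives here</p>"))
```

Either form of noindex only takes effect if Google is allowed to crawl the URL, so a robots.txt block on the same path would prevent Google from ever seeing it.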
Do you have any idea what they did? I'm having the same issue and banging my head against the wall trying to talk to support.