Google Indexing domain/services/login_with_shop

Topic summary

Multiple Shopify store owners reported a significant issue where Google Search Console showed thousands to hundreds of thousands of pages with /services/login_with_shop/ URLs being indexed, despite being blocked by robots.txt. These URLs are associated with Shopify Shop App login flows and discount capture mechanisms.

Initial Assessment:

  • One expert initially reassured that robots.txt blocking should prevent indexing
  • However, multiple users confirmed these pages were actually appearing in GSC’s “Indexed, though blocked by robots.txt” section
  • The volume ranged from 55,000 to 500,000+ URLs per site over just a few months

Key Concerns:

  • URLs cluttering Google Search Console reports
  • Potential waste of crawl budget on non-valuable pages
  • Shopify support initially blamed Google and provided unhelpful responses

Resolution:

  • After escalation to Shopify’s development team, the issue was resolved
  • Shopify added <noindex> tags to these pages
  • Affected users confirmed the problem was fixed, with URLs moving to the “Noindex” section in GSC
  • Timeline: approximately one week after proper escalation to developers
Summarized with AI on November 1. AI used: claude-sonnet-4-5-20250929.

Hi There,

As of November 15, 2023 our Google Search Console has been recording a large number of indexed pages though blocked by robots.txt.

Diving into the urls they all follow the same structure:

  • https://[domain]/services/login_with_shop/authroize?target_origin=[URL]&api_key=[Key]&flow=discount&flow_version=EMAIL_CAPTURE&locale=en&anlytics_trace_id=[ID]&phone_capture=false&save_discount_at=authorize-success

Any insight into the below would be greatly appreciated:

  1. Are other stores noticing a similar indexing issue?
  2. How can we prevent these urls from being indexed?

Thanks,

Reece

2 Likes

Hi @Reece3 ,

The URLs you mentioned, which contain “login_with_shop” in their structure, are associated with logins using the Shopify Shop App. This is quite common and not something to be overly concerned about.

The good news is, since these URLs are blocked by your site’s robots.txt file, Google will not index them. The presence of these URLs in your Search Console as being blocked is actually a sign that your robots.txt file is functioning correctly. It’s doing its job by telling search engine crawlers which pages not to index, which in this case includes these specific login URLs.

1 Like

Thanks Lee, for the clarity here.

1 Like

In three months, I have a site that has added 55,000 non-indexed pages from “login_with_shop,” as seen below. While they might not be indexed, it’s making a mess of GSC.

The URLs redirect to the below (I added xxxx for the domain and some other references)

https://pay.shopify.com/pay/sdk-authorize?target_origin=https%3A%2F%2Fxxxxxxxxxxx.com&flow=discount&flow_version=PHONE_CAPTURE_SHOWN_AT_NEVER_SAVED_AT_AUTHORIZE_SUCCESS&locale=en&analytics_trace_id=xxxxxx-0a9c-4f68-b754-9ccfcb135de3&phone_capture=true&save_discount_at=authorize-success&response_mode=form_post&scope=openid+pay%3Asession_token+email+phone%3Averified&response_type=id_token&client_id=xxxxx-xxxxx-xxxx-xxxxx-xxxxx&redirect_uri=https%3A%2F%xxxxxxxxxxx.com%2Fservices%2Flogin_with_shop%2Fcallback&state=xxxxxxx%3D–xxxxxx&require_verification=false&uniq_token=xxxxx-xxxx-4f0c-a69a-xxxxx&visit_token=15aa5811-facd-48a9-9b9c-xxxxx&nonce=xxxxxx&session_token=xxxxxx

It sure would be nice to not have this happen.

Hi Jonlivingston

Have you found any solution to this? We now have around 500000 pages with login_with_shop in the URL in our GSC. Luckily they are not indexed but as you mentioned it’s a mess in GSC and as I understand it consumes a crawling budget. Should we disallow them in Robots? Should we add noindex,nofollow for such pages? any thoughts?

*Hi @Lee-Strong-SEO *

Thanks for your clarification. However, we now see that those pages with login_with_shop in their URL are being index by Google. Even though they are blocked in robots.

Google GSC shows them exactly in the right section - Indexed, though blocked by robots.txt. I believe we need noindex,nofollow tag in such pages but we are not sure how to add it. Any thoughts?

Has anyone else encountered such issue? What did you do?

Thanks

1 Like

This is happening to us too! Def getting indexed!

No solution to date.

Have you come across any solution to that?

Shopify support blames Google for indexing them, which does not make any sense as those URLs belong to Shopify. We can’t figure out where they are coming from and why they are being indexed. The number of indexed URLs with login_with_shop has now jumped to 50,000. Instead of crawling and indexing valuable content, Google is crawling and indexing these useless URLs.

It’s an absolute nightmare, and Shopify support seems not to care.

I have the same issue. I wonder if we submit them for removal within GSC that may fix it? Then disallow them.. Has anyone contacted Shop about this?

I have contacted shopify support but I don’t know if anything will come of it, they’re getting back to me soon and I will update. I tried to find a bug reporting feature to send a more detailed report but the only way to do that is to talk to support so that’s all we’ve got for now.

1 Like

Update: Shopify support passed it onto their “support team” and then finally got back to me. They claimed this was an “intentional feature” and that nothing was wrong and then sent me a copy / pasted help article of a totally unrelated issue that has nothing to do with this problem. And then told me they would pass it on to the DEVs to look at, as if acknowledging that their response made no sense and that this clearly was not intended functionality. However, at the same time hilariously enough they took our advice and added a tag to those pages, the issue seems to be gone entirely on my end. Are any of you still having this issue?

Yes, it’s also fixed on my side. No issues with Shopify support, they told me that this problem will be passed to DEV team and a week later I got a reply that this issue is now sorted. I don’t see any noindex tags in those pages but somehow they all are now in Noindex section in GSC so looks good to me. Problem solved.

Do you have any idea what they did? Having same issue and banging my head against the wall trying to walk to support