How to Handle Constantly Changing Product Catalog

Hey everyone,

I’m reaching out to my brand suppliers to request product data sheets, but I’m not expecting them to provide a fully up-to-date or automated solution. If you’ve done this before, what kind of responses do you typically get from brands? Do they provide well-structured CSVs, API access, or just outdated/incomplete spreadsheets?

Since I’m managing around 10,000 products across 8 brands, and a large percentage of the inventory turns over every month, I need a scalable and automated way to:

  • Keep the catalog updated with new product info.
  • Remove discontinued products automatically.
  • Pull missing data (descriptions, specs, images) from brand websites.
  • Format & bulk import everything into Shopify efficiently.

Current Situation & Challenges:

We purchase inventory directly from brand suppliers (not dropshipping).
We receive order forms (PDFs) that only list SKUs (no descriptions or images).
We need to scrape product descriptions, specs, and high-res images from brand websites.
We need a fast way to format & bulk import everything into Shopify while keeping it updated.

What I’ve Considered:- Matrixify (Excelify) – Good for structured bulk imports, but doesn’t help with missing data.

  • Stock Sync – Could work if suppliers provide live feeds, but many don’t.
  • Octoparse / Web Scraper – Might help pull missing descriptions/images, but needs automation.
  • Zapier / Make.com – Could automate some of the workflow but may not be enough.

**What I Need Advice On:**1. What kind of responses do you typically get when asking brands for product data sheets?

  1. Is there a Shopify app or tool that can handle this kind of product data import dynamically?
  2. Has anyone built a workflow for this kind of situation? (Scraping data → Formatting for Shopify → Importing/Updating Automatically)
  3. Any AI-based tools that could help with scraping, extracting, or updating product details?
  4. Would a custom script (e.g., Python with Selenium/BeautifulSoup) be the best approach, or is there a Shopify-friendly alternative?

If anyone has experience managing high-turnover inventory with large-scale imports, I’d love to hear what’s worked (or not worked) for you!

Thanks in advance! :rocket:

Hey! This resonated a lot. I’ve ran into almost the exact same pipeline: vendors send product sheets of SKUs and recommend we scrape their site for descriptions, specs, images. Sometimes we get image folders or manuals, but it’s all pretty inconsistent and non-standardized.

The hard part turned out not to be the Shopify import itself, but dealing with messy vendor data – missing images, inconsistent formats, edge cases, etc. We ended up writing vendor-specific scripts (scraping + some AI + human checks) just to normalize things before publishing.

Curious what solution you ended up with in the end.

Best approach is separating this into two steps: maintain a clean “source of truth” outside Shopify, then push updates in. Most suppliers won’t give consistent data, so standardize everything around SKU and keep a master sheet tracking new, updated, and discontinued items.

For discontinued products, regularly compare your current catalog to the latest supplier file and archive anything missing. Treat scraping/enrichment (descriptions, images) as a separate step so it’s easier to debug.

Once your data is clean, a bulk import/export app like Altera can make updating Shopify much faster, especially if you’re working in Excel.

Have you tried Retail Labels and Validation. It has product data health dashboard. You can export the problematic products → Edit in CSV or using LLMs and import again using Shopify’s product import. I hope this helps.