Using google spreadsheet to crawl our site

  • ステータス: Closed
  • 賞金: $150
  • 受け取ったエントリー: 7
  • 優勝者: erdyse

コンテスト概要

We are looking for the best google spreadsheet solution for crawling our websites and checking for specific tags and values. These values and tags must then be compared to a template. Your mission is to provide the best idea for this. The winner will get the assignment to implement the solution for us.

What we need

Your solution should manage up to 1 000 URL:s that needs to be checked, for each of our 9 websites (a total of 11 000 URL:s). Each of them needs to be checked that they have the correct tag. The check needs to be automatic, and we need to receive an alarm in the spreadsheet and by email if the correct tag is not in the url.

The tags we are checking for are canonical tag, metarobots and hreflang-tag.

There are many ways to set this up, and your mission is to find the best, easiest and most logical way.

Requirement for your solution

> There can be no installation of any software anywhere
> The solution must work directly in the spreadsheet
> If you choose javascript, that too must be directly in the spreadsheet
> We can’t be dependant on any third part
> We want only one column for “Status” and an then an IF-message that lets us know what the error is

Examples

Here are some examples of our URLS for our swedish site

https://www.footway.se/skor/herr/kangor-boots
https://www.footway.se/skor/kangor-boots/chelsea-boots
https://www.footway.se/skor/herr/kangor-boots/chelsea-boots
https://www.footway.se/skor/kangor-boots/chelsea-boots
https://www.footway.se/skor/herr/kangor-boots/chelsea-boots

You can also find the urls and current setup in this test file:

https://drive.google.com/drive/folders/18g2Pg_Zor1LDRgs_zZyO1Vo0Vk2jU7Na

The solution you provide needs to check these urls for the correct canonical tag, correct metarobot tag and correct hreflang tag. And then return a status message in a single column if all tags are met and correct. If not, the message must say which of the tags are not met.

It is up to you how to structure and present the solution for this. Keep in mind that there are 1 000 urls for 9 different websites.

Examples of what we want

A google spreadsheet solution
A solution with no installation of any software
An easy structure that is quick to understand
Google script
A crawler
Webscraping

Examples of what we don’t want

A programme that needs to be installed
An advanced programme that needs lots of manual work
A subscription to any seo service
Your own developed programme that requires installation

Make sure you follow instructions closely. Read them again now. And once more after. Do not start writing before you have a clear understanding for the guidelines. We will reject any entry that does not follow instructions. We also reject any entries telling us you're working on the project. For long term collaboration we highly value the ability to follow instructions.

We have limited capacity to answer questions or give individual feedback during the project, instead we will give collective feedback - this will then hopefully help you get a better understanding of what we are looking for.

The winner gets 150 USD, and an additional 100 USD later on for implementing the solution for us.

推奨スキル

採用者フィードバック

“Victor A presented a clear and precise idea for how to solve our tag checking problem. Great work.”

プロフィール画像 alexanderaberg, Sweden.

このコンテストのトップエントリー

エントリーをもっと表示

公開説明ボード

  • brohimumtaz
    brohimumtaz
    • 4年間前

    kindly see my Portfolio

    • 4年間前
  • alexanderaberg
    コンテスト所有者
    • 4年間前

    I see some recommendations of =IMPORTXML. This is a good idea and we have tried it. But the load of so many urls becomes to big for spreadsheet. It does not work with so many urls.

    If you can provide an idea around this, we'd be very glad!

    • 4年間前
  • erdyse
    erdyse
    • 4年間前

    Thank you. Will submit soon.

    • 4年間前
  • erdyse
    erdyse
    • 4年間前

    Your sites have multiple hreflang tags and yet your template has only one expected url for hreflang tag to compare to.

    • 4年間前
    1. alexanderaberg
      コンテスト所有者
      • 4年間前

      See my answer above.

      • 4年間前
  • erdyse
    erdyse
    • 4年間前

    Secondly, your brief does not explain what we are to check specifically for meta robots tag

    • 4年間前
    1. alexanderaberg
      コンテスト所有者
      • 4年間前

      See my answer above.

      • 4年間前
  • alexanderaberg
    コンテスト所有者
    • 4年間前

    As Victor asked below, we have several hreflang tags and same for meta robots. Remember that this contest is not about doing the crawling, it us about providing the easiest and best idea of HOW to do this. So don't spend time on the actual crawling now.

    • 4年間前

コメントをもっと見る

コンテストの開始方法

  • あなたのコンテストを投稿

    あなたのコンテストを投稿 速くて簡単

  • たくさんのエントリーを集めましょう

    たくさんのエントリーを集めましょう 世界中から

  • ベストエントリーをアワード

    ベストエントリーをアワード ファイルをダウンロード - 簡単!

コンテストを今すぐ投稿 または本日参加!