Scrape sitemap.xml from given url & save meta data to Excel file

Did you ever need to check all of your pages meta data?

This project can crawl & scrape all urls from sitemap.xml and save as .xls file

https://github.com/ivanmijatovic89/sitemap-to-excel

Crawler will go to each page from given sitemap.xml url and collect:

  • title
  • meta description
  • url
  • canonical
  • image
  • og:title
  • og:image
  • og:url
  • og:site_name
  • og:description
  • og:type
  • h1
  • h2 ( all h2 on page)

Script crawl urls asynchronously.


Run Script

php artisan seo:export "https://yourdomain.com/sitemap.xml"


File will be stored at

storage/app/{DOMAIN}-{YEAR}-{MONTH}-{DAY}.xls

#example
storage/app/yourdomain.com-2020-06-25.xls


Exported file: Export File

;