all cron’d up . . .

Well, here’s the long and short of my work tonight

mysql> select * from sitemap_ping;
+———–+—————+——–+———————+
| domain_id | search_engine | status | timestamp |
+———–+—————+——–+———————+
| 3 | Ask.com | 200 | 2008-04-30 00:15:01 |
| 3 | Google | 200 | 2008-04-30 00:15:01 |
| 3 | Live Search | 200 | 2008-04-30 00:15:01 |
| 3 | Moreover.com | 200 | 2008-04-30 00:15:02 |
| 3 | Yahoo! | 200 | 2008-04-30 00:15:02 |
. . .

Views helped quite a bit. I made two.

  1. latest_modification – gets the domain_id and the latest timestamp for each domain
  2. latest_sitemap_ping – get the domain_id and the timestamp of the latest sitemap_ping for each domain

Using those two views, I ping for

  1. every domain that has a latest_modification but that hasn’t ever been pinged
  2. every domain where the last modified is more recent than the last ping

Thinking I could combine the two queries, but for now, I am satisfied. I just wiped my traffic_log, so we’ll have to see if like moreover.com starts hitting me.

Not sure how expensive those queries get with like a million rows, but other than that, I am pretty well done with sitemap stuff. I am still unsure if it will do anything, i.e., will google (WLOG) come and spider as per the sitemap if I haven’t registered the domain with their webmaster tools? Guess we shall soon see.

Enjoy!

Earl

Leave a Reply