Well, here’s the long and short of my work tonight
mysql> select * from sitemap_ping;
+———–+—————+——–+———————+
| domain_id | search_engine | status | timestamp |
+———–+—————+——–+———————+
| 3 | Ask.com | 200 | 2008-04-30 00:15:01 |
| 3 | Google | 200 | 2008-04-30 00:15:01 |
| 3 | Live Search | 200 | 2008-04-30 00:15:01 |
| 3 | Moreover.com | 200 | 2008-04-30 00:15:02 |
| 3 | Yahoo! | 200 | 2008-04-30 00:15:02 |
. . .
Views helped quite a bit. I made two.
- latest_modification – gets the domain_id and the latest timestamp for each domain
- latest_sitemap_ping – get the domain_id and the timestamp of the latest sitemap_ping for each domain
Using those two views, I ping for
- every domain that has a latest_modification but that hasn’t ever been pinged
- every domain where the last modified is more recent than the last ping
Thinking I could combine the two queries, but for now, I am satisfied. I just wiped my traffic_log, so we’ll have to see if like moreover.com starts hitting me.
Not sure how expensive those queries get with like a million rows, but other than that, I am pretty well done with sitemap stuff. I am still unsure if it will do anything, i.e., will google (WLOG) come and spider as per the sitemap if I haven’t registered the domain with their webmaster tools? Guess we shall soon see.
Enjoy!
Earl