I love love love this stored procedure, it allows you to spider HTML and easily parse it using SQL Server.
I’m SUPER excited to get home from work today, because there is a new version of this sproc (with better NOCOUNT) that can decrease execution time from ~60s to ~2s.
FYI, I posted an update to SQLDOM today on Sourceforge (.927). http://sourceforge.net/projects/sqldom/files/SQLDOM_core_927.sql/download
Besides adding a parameter to allow retrieving a subset of the HTML from the DOM, I found that if the caller did not have SET NOCOUNT ON that the SQLDOM procedures could run much more slowly than expected. (For example, instead of processing a page like http://www.ocregister.com in 4 seconds, it could take like 30 seconds.)
This version explicitly sets SET NOCOUNT ON in each procedure to avoid the risk of this condition.