XHTMLizer on steroids
One of the coolest things with XHTML is that it is an XML language, so you can apply any kind of XML tools to it.
One of the terrible thing with XHTML (and XML more generally) is how hard it is sometimes to get it right.
One of the depressing thing with building tools based on XML for Web technologies is that most of the content out there is in HTML (or the tag soup that some people call with that name), or in ill-formed XHTML.