an enthusiastic austrian *nix/Linux user: Parsing HTML like XML as a DOM tree? Use HtmlCleaner