SEARCH
NEW RPMS
DIRECTORIES
ABOUT
FAQ
VARIOUS
BLOG
DONATE


YUM REPOSITORY

 
 

MAN page from openSUSE Tumbleweed perl-HTML-Tree-5.07-1.4.noarch.rpm

HTML::Tree

Section: User Contributed Perl Documentation (3)
Updated: 2017-09-01
Index 

NAME

HTML::Tree - build and scan parse-trees of HTML 

VERSION

This document describes version 5.07 ofHTML::Tree, released August 31, 2017as part of HTML-Tree. 

SYNOPSIS

    use HTML::TreeBuilder;    my $tree = HTML::TreeBuilder->new();    $tree->parse_file($filename);        # Then do something with the tree, using HTML::Element        # methods -- for example:    $tree->dump        # Finally:    $tree->delete;
 

DESCRIPTION

HTML-Tree is a suite of Perl modules for making parse trees out ofHTML source. It consists of mainly two modules, whose documentationyou should refer to: HTML::TreeBuilderand HTML::Element.

HTML::TreeBuilder is the module that builds the parse trees. (It usesHTML::Parser to do the work of breaking the HTML up into tokens.)

The tree that TreeBuilder builds for you is made up of objects of theclass HTML::Element.

If you find that you do not properly understand the documentationfor HTML::TreeBuilder and HTML::Element, it may be because you areunfamiliar with tree-shaped data structures, or with object-orientedmodules in general. Sean Burke has written some articles forThe Perl Journal ("www.tpj.com") that seek to provide that background.The full text of those articles is contained in this distribution, as:

HTML::Tree::AboutObjects
``User's View of Object-Oriented Modules'' from TPJ17.
HTML::Tree::AboutTrees
``Trees'' from TPJ18
HTML::Tree::Scanning
``Scanning HTML'' from TPJ19

Readers already familiar with object-oriented modules and tree-shapeddata structures should read just the last article. Readers withoutthat background should read the first, then the second, and then thethird. 

METHODS

All these methods simply redirect to the corresponding method inHTML::TreeBuilder. It's more efficient to use HTML::TreeBuilderdirectly, and skip loading HTML::Tree at all. 

new

Redirects to ``new'' in HTML::TreeBuilder. 

new_from_file

Redirects to ``new_from_file'' in HTML::TreeBuilder. 

new_from_content

Redirects to ``new_from_content'' in HTML::TreeBuilder. 

new_from_url

Redirects to ``new_from_url'' in HTML::TreeBuilder. 

SUPPORT

You can find documentation for this module with the perldoc command.

    perldoc HTML::Tree    You can also look for information at:
*
AnnoCPAN: Annotated CPAN documentation

<http://annocpan.org/dist/HTML-Tree>

*
CPAN Ratings

<http://cpanratings.perl.org/d/HTML-Tree>

*
RT: CPAN's request tracker

<http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-Tree>

*
Search CPAN

<http://search.cpan.org/dist/HTML-Tree>

*
Stack Overflow

<http://stackoverflow.com/questions/tagged/html-tree>

If you have a question about how to use HTML-Tree, Stack Overflow isthe place to ask it. Make sure you tag it both "perl" and "html-tree".

 

SEE ALSO

HTML::TreeBuilder, HTML::Element, HTML::Tagset,HTML::Parser, HTML::DOMbo

The book Perl & LWP by Sean M. Burke published byO'Reilly and Associates, 2002. ISBN: 0-596-00178-9

It has several chapters to do with HTML processing in general,and HTML-Tree specifically. There's more info at:

    http://www.oreilly.com/catalog/perllwp/    http://www.amazon.com/exec/obidos/ASIN/0596001789
 

SOURCE REPOSITORY

HTML-Tree is now maintained using Git. The main public repository is<https://github.com/kentfredric/HTML-Tree>.

The best way to send a patch is to make a pull request there. 

ACKNOWLEDGEMENTS

Thanks to Gisle Aas, Sean Burke and Andy Lester for their original work.

Thanks to Chicago Perl Mongers (http://chicago.pm.org) for theirpatches submitted to HTML::Tree as part of the Phalanx project(http://qa.perl.org/phalanx).

Thanks to the following people for additional patches and documentation:Terrence Brannon, Gordon Lack, Chris Madsen and Ricardo Signes. 

AUTHOR

Current maintainers:
*
Christopher J. Madsen "<perl AT cjmweb.net>"
*
Jeff Fearn "<jfearn AT cpan.org>"

Original HTML-Tree author:

*
Gisle Aas

Former maintainers:

*
Sean M. Burke
*
Andy Lester
*
Pete Krawczyk "<petek AT cpan.org>"

You can follow or contribute to HTML-Tree's development at<https://github.com/kentfredric/HTML-Tree>. 

COPYRIGHT AND LICENSE

Copyright 1995-1998 Gisle Aas, 1999-2004 Sean M. Burke,2005 Andy Lester, 2006 Pete Krawczyk, 2010 Jeff Fearn,2012 Christopher J. Madsen.(Except the articles contained in HTML::Tree::AboutObjects,HTML::Tree::AboutTrees, and HTML::Tree::Scanning, which are allcopyright 2000 The Perl Journal.)

Except for those three TPJ articles, the whole HTML-Tree distribution,of which this file is a part, is free software; you can redistributeit and/or modify it under the same terms as Perl itself.

Those three TPJ articles may be distributed under the same terms asPerl itself.

The programs in this library are distributed in the hope that theywill be useful, but without any warranty; without even the impliedwarranty of merchantability or fitness for a particular purpose.


 

Index

NAME
VERSION
SYNOPSIS
DESCRIPTION
METHODS
new
new_from_file
new_from_content
new_from_url
SUPPORT
SEE ALSO
SOURCE REPOSITORY
ACKNOWLEDGEMENTS
AUTHOR
COPYRIGHT AND LICENSE

This document was created byman2html,using the manual pages.