My function needs to get all first-level HTML tags from a piece of HTML so that I can work with them.
Here is the HTML document, summarized:
The breed was first...
Semencic credits his...
Appearance
The breed's distinctive...
It should be symmetrical...
Requirements
Prospective owners....
These dogs....
A Little Warning!
If you are considering...
According to...
Source: http://...
Although more suitable...
Now, I would like my output to be:
p
p
h1
p
p
figure
h1
p
p
h2
p
blockquote
p
But right now, it is:
h1
p
h1
p
h2
p
blockquote
p
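A few things are wrong:
- the figure tag is not shown
- only one p is picked out even when there are several sibling paragraphs
- the first p is not found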
$doc = new DOMDocument();
$doc->loadHTML( $this->post_content, LIBXML_HTML_NOIMPLIED | LIBXML_HTML_NODEFDTD );
$xpath = new DOMXPath( $doc );
$nodes = $xpath->query( "/*/*" );
foreach ( $nodes as $node ) {
    echo $node->nodeName;
    echo "\n";
    $this->add_part(
        md5( $node->textContent ),
        $node->nodeName
    );
}
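
One possible cause, offered as an assumption rather than a confirmed diagnosis: with LIBXML_HTML_NOIMPLIED the parser has no html/body wrapper, so a fragment with several top-level elements may end up with its first element treated as the document root, and "/*/*" then returns children of that element instead of the real first level. A minimal sketch that sidesteps this by letting the parser add html and body itself and querying "/html/body/*" could look like the following. The function name first_level_tags is hypothetical and not part of the original class.

```php
<?php
// Hypothetical helper: returns the tag names of the top-level elements
// of an HTML fragment, in document order.
function first_level_tags(string $html): array
{
    // Collect parser warnings internally instead of printing them;
    // older libxml versions warn about HTML5 tags such as <figure>.
    $previous = libxml_use_internal_errors(true);

    $doc = new DOMDocument();
    // No LIBXML_HTML_NOIMPLIED here: the parser wraps the fragment
    // in <html><body>...</body></html>, giving it a single root.
    $doc->loadHTML($html);

    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $tags = [];
    // Direct element children of <body> are the fragment's first level.
    foreach ($xpath->query('/html/body/*') as $node) {
        $tags[] = $node->nodeName;
    }
    return $tags;
}

// Usage with a made-up fragment (not the original post content).
$sample = '<p>a</p><p>b</p><h1>T</h1><p>c</p>'
        . '<figure><img src="x.png"></figure>'
        . '<blockquote><p>q</p></blockquote>';
echo implode("\n", first_level_tags($sample)), "\n";
```

Inside the original loop, the same idea would mean changing only the loadHTML flags and the XPath expression. Another commonly used variant keeps LIBXML_HTML_NOIMPLIED but wraps the fragment in a single container element before loading and then queries that container's children.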