如何在 Ruby中按名称删除某些HTML标记? 例如: string = "!DOCTYPE htmlhtmlbodyh1My First Heading/h1pMy first paragraph./p/body/html"string.magic_method("h1") #= "!DOCTYPE htmlhtmlbodypMy first paragraph./p/body/html" 我写了一
例如:
string = "<!DOCTYPE html><html><body><h1>My First Heading</h1><p>My first paragraph.</p></body></html>" string.magic_method("h1") #=> "<!DOCTYPE html><html><body><p>My first paragraph.</p></body></html>"
我写了一些正则表达式,但想知道是否有一个库或本机方法可以做同样的事情.
使用 Nokogiri:require 'nokogiri' doc = Nokogiri::HTML <<-_HTML_ <!DOCTYPE html><html><body><h1>My First Heading</h1><p>My first paragraph.</p></body></html> _HTML_ doc.at('h1') # => #(Element:0x4d2f006 { # name = "h1", # children = [ #(Text "My First Heading")] # }) doc.at('h1').unlink puts doc.to_html # >> <!DOCTYPE html> # >> <html><body><p>My first paragraph.</p></body></html>