Python的HTML解析 mechanize

jopen 13年前
     <p>当您希望与 Web 页面中找到的内容进行某种比较复杂的交互时,您需要使用 <strong>mechanize</strong> 库</p>    <p>示例代码:</p>    <pre class="brush:python; toolbar: true; auto-links: false;">import re from mechanize import Browser  br = Browser() br.open("http://www.example.com/") # follow second link with element text matching regular expression response1 = br.follow_link(text_regex=r"cheese\s*shop", nr=1) assert br.viewing_html() print br.title() print response1.geturl() print response1.info()  # headers print response1.read()  # body response1.close()  # (shown for clarity; in fact Browser does this for you)  br.select_form(name="order") # Browser passes through unknown attributes (including methods) # to the selected HTMLForm (from ClientForm). br["cheeses"] = ["mozzarella", "caerphilly"]  # (the method here is __setitem__) response2 = br.submit()  # submit current form  # print currently selected form (don't call .submit() on this, use br.submit()) print br.form</pre>    <p><strong>项目主页:</strong><a href="http://www.open-open.com/lib/view/home/1324371010780" target="_blank">http://www.open-open.com/lib/view/home/1324371010780</a></p>    <p></p>