[XML], [HTML] [Parsing]을 위한 [Python] 모듈. invaliad한 문서형식도 가능.
http://www.crummy.com/software/BeautifulSoup/
예제들
1 from BeautifulSoup import BeautifulSoup 2 soup = BeautifulSoup(html_string) 3 for tr in soup.findAll('tr') 4 print tr.find('th').contents+ [c.contents[0] for c in tr.findAll('td')]
CategoryProgramLibrary