最新文章专题视频专题问答1问答10问答100问答1000问答2000关键字专题1关键字专题50关键字专题500关键字专题1500TAG最新视频文章推荐1 推荐3 推荐5 推荐7 推荐9 推荐11 推荐13 推荐15 推荐17 推荐19 推荐21 推荐23 推荐25 推荐27 推荐29 推荐31 推荐33 推荐35 推荐37视频文章20视频文章30视频文章40视频文章50视频文章60 视频文章70视频文章80视频文章90视频文章100视频文章120视频文章140 视频2关键字专题关键字专题tag2tag3文章专题文章专题2文章索引1文章索引2文章索引3文章索引4文章索引5123456789101112131415文章专题3
当前位置: 首页 - 科技 - 知识百科 - 正文

python操作xml文件示例

来源:动视网 责编:小采 时间:2020-11-27 14:30:11
文档

python操作xml文件示例

python操作xml文件示例: 代码如下:def get_seed_data(filename):dom = minidom.parse(filename)root = dom.documentElementsystem_nodes = root.getElementsByTagName(system)k = 0seed_list = []for system_node in s
推荐度:
导读python操作xml文件示例: 代码如下:def get_seed_data(filename):dom = minidom.parse(filename)root = dom.documentElementsystem_nodes = root.getElementsByTagName(system)k = 0seed_list = []for system_node in s

代码如下:


def get_seed_data(filename):
dom = minidom.parse(filename)
root = dom.documentElement
system_nodes = root.getElementsByTagName("system")
k = 0
seed_list = []
for system_node in system_nodes:
#print system_node.nodeName+' id='+system_node.getAttribute('id')
system_id = system_node.getAttribute("id")
system_name = system_node.getAttribute("name")
#print 'system_name:%s'%system_name
section_nodes = system_node.getElementsByTagName("section")
for section_node in section_nodes:
section_id = section_node.getAttribute('id')
section_name = section_node.getAttribute('name')
#print ' '+section_node.nodeName+' id='+section_id+' name='+section_name
crawl_cycle_node = section_node.getElementsByTagName("crawl_cycle")
crawl_cycle = crawl_cycle_node[0].childNodes[0].nodeValue
#print ' '+crawl_cycle_node[0].nodeName+'='+crawl_cycle
seed_nodes = section_node.getElementsByTagName('seed')
for seed_node in seed_nodes:
seed = {}
seed['crawl_cycle'] = crawl_cycle
seed['system_id'] = int(system_id)
seed['system_name'] = system_name
seed['section_id'] = int(section_id)
seed['section_name'] = section_name
seed_id = seed_node.getAttribute('id')
seed['seed_id'] = int(seed_id)
#print ' '+seed_node.nodeName+' '+'id='+seed_id
userblog_url_node = seed_node.getElementsByTagName('userblog_url')
userblog_url = userblog_url_node[0].childNodes[0].nodeValue
seed['userblog_url'] = userblog_url
#print ' '+'userblog_url'+' '+userblog_url
print '-------------------------------------------'
print 'system_id:%d' % seed['system_id']
print 'system_name:%s'%seed['system_name']
print ' section_id:%d' % seed['section_id']
print ' section_name:%s' % seed['section_name']
print ' seed_id:%d' %seed['seed_id']
print ' userblog_url:%s' %seed['userblog_url']
print '========================='
seed_list.append(seed)
print seed_list[k]
k += 1
os.system('pause')
return seed_list

代码如下:








http://aaa.com.cn/loveissuuny


http://aaa.com.cn/loveissuuny


http://aaa.com.cn/sanxiazaixian





http://aaa.com.cn/twocold


http://aaa.com.cn/u/1233526741



文档

python操作xml文件示例

python操作xml文件示例: 代码如下:def get_seed_data(filename):dom = minidom.parse(filename)root = dom.documentElementsystem_nodes = root.getElementsByTagName(system)k = 0seed_list = []for system_node in s
推荐度:
标签: 例子 读取 示例
  • 热门焦点

最新推荐

猜你喜欢

热门推荐

专题
Top