python-docx识别表格在docx文档中的所在位置--688IT编程网

python中文文档python-docx识别表格在docx⽂档中的所在位置

由于⼯作需要提取⼀个word⽂档中的表格，及其所在的章节，普通的Document.paragraphs 和Document.tables⽆法满⾜需求。所以综合GitHub作者的代码及我⾃⼰的需求代码如下：

from docx.document import Document

l.table import CT_Tbl

l.text.paragraph import CT_P

from docx.table import _Cell, Table

paragraph import Paragraph

import docx

import openpyxl

import xlsxwriter

def iter_block_items(parent):

"""

Yield each paragraph and table child within *parent*, in document order.

Each returned value is an instance of either Table or Paragraph. *parent*

would most commonly be a reference to a main Document object, but

also works for a _Cell object, which itself can contain paragraphs and tables.

"""

if isinstance(parent, Document):

parent_elm = parent.element.body

elif isinstance(parent, _Cell):

parent_elm = parent._tc

else:

raise ValueError("something's not right")

for child in parent_elm.iterchildren():

if isinstance(child, CT_P):

yield Paragraph(child, parent)

elif isinstance(child, CT_Tbl):

yield Table(child, parent)

# table = Table(child, parent)

# for row ws:

# for cell lls:

# for paragraph in cell.paragraphs:

# yield paragraph

doc = docx.Document('C:\\Users\\Citect2016\\Desktop\\A19-42000.docx')

for block in iter_block_items(doc):

if block.style.name == 'Table Grid':

pass

if block.style.name == 'Heading 1':

pass

值得⼀提的是，本⽂的docx版本是0.8.6，适⽤于python3.x，各位道友请到官⽹⾃⾏下载。

发表评论

688IT编程网

python-docx识别表格在docx文档中的所在位置

发表评论

推荐文章

mongodb中match多个条件

纯数字正则表达式

zipkin tagquery用法

excel匹配正则 -回复

re正则匹配之findall

热门文章

java非负整数正则表达式

js 动态生成整数范围的正则

z正整数校验规则

生成2位随机整数的正则表达式

大于等于0的整数的正则

大于指定整数的数字正则表达式

阿里云密码正则表达式

el-form 密码正则表达

js 密码正则表达式

php密码正则

excel字母正则 -回复

shell 中括号正则

sn明细正则表达式

字母对称的正则表达式

shell akw 正则表达式

hive中的正则表达式

密码数字字母符号混合 java 正则

正则数字字母组合

组织机构代码正则

8位密码的正则表达式

最新文章

mongodb中match多个条件

excel匹配正则 -回复

re正则匹配之findall

数据库正则匹配数字

ue 匹配数字正则

ireport常用正则表达式

标签列表

688IT编程网

python-docx识别表格在docx文档中的所在位置

发表评论

推荐文章

mongodb中match多个条件

纯数字正则表达式

zipkin tagquery用法

excel匹配正则 -回复

re正则匹配之findall

热门文章

java非负整数正则表达式

js 动态生成整数范围的正则

z正整数校验规则

生成2位随机整数的正则表达式

大于等于0的整数的正则

大于指定整数的数字 正则表达式

阿里云密码正则表达式

el-form 密码正则表达

js 密码 正则表达式

php密码正则

excel字母正则 -回复

shell 中括号 正则

sn明细正则表达式

字母对称的正则表达式

shell akw 正则表达式

hive中的正则表达式

密码 数字字母符号混合 java 正则

正则数字字母组合

组织机构代码正则

8位密码的正则表达式

最新文章

mongodb中match多个条件

excel匹配正则 -回复

re正则匹配之findall

数据库正则匹配数字

ue 匹配数字 正则

ireport常用正则表达式

标签列表

大于指定整数的数字正则表达式

js 密码正则表达式

shell 中括号正则

密码数字字母符号混合 java 正则

ue 匹配数字正则