Reading a file in a certain format with Python (python read file in certain format) - 688IT编程网

I have files with the following format:

36.1 37.1 A: Hi, how are you?

39.1 40.1 B: I am ok!

I am using numpy.loadtxt() to read this file with dtype = np.dtype([('start', '|S1'), ('end', 'f8'), ('person', '|S1'), ('content', '|S100')])

The first three columns are fine, but the string part always has an issue: the format does not match. I guess this is because each speaker says a variable number of words. Does anyone know a better way to solve this?

Many thanks,
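One way to see where the mismatch comes from: numpy.loadtxt() splits each line on whitespace by default, so the free-text message is broken into a different number of fields on each line. The sketch below uses only str.split() on the two sample lines from the question; the variable names are illustrative and not part of the original post.

```python
lines = [
    "36.1 37.1 A: Hi, how are you?",
    "39.1 40.1 B: I am ok!",
]

# Splitting on every run of whitespace gives a different field count per line.
field_counts = [len(line.split()) for line in lines]
print(field_counts)  # [7, 6]

# Splitting at most three times keeps the whole message as the fourth field.
rows = [line.split(maxsplit=3) for line in lines]
print(rows[0])  # ['36.1', '37.1', 'A:', 'Hi, how are you?']
```

Limiting the number of splits is enough here because only the last field may contain spaces.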

Source: stackoverflow/questions/41965458

Updated: 2019-12-25 12:38

Best answer

I would recommend reading the text manually, without numpy, by simply iterating over the lines in the file.

    chats = []
    with open("", "r") as infile:
        for line in infile:
            if not line.strip():
                continue  # skip blank lines
            # split at the first colon only, so colons inside the message are kept
            header, content = line.split(":", 1)
            start, end, name = header.split()
            chats.append([start, end, name, content.strip()])

The file is opened and read line by line; start, end, name and content are appended as a sublist to the list chats.
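If the timestamps are needed as numbers, the same idea can be wrapped in a small function. This is a minimal sketch, not part of the original answer: parse_chat_lines is a hypothetical helper name, and it assumes every non-blank line consists of two numbers, a speaker label, a colon, and the message.

```python
def parse_chat_lines(lines):
    """Parse 'start end speaker: message' lines into tuples.

    parse_chat_lines is a hypothetical helper, not part of the original answer.
    """
    chats = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        # Split at the first colon only, so colons in the message are kept.
        header, _, content = line.partition(":")
        start, end, name = header.split()
        chats.append((float(start), float(end), name, content.strip()))
    return chats


sample = [
    "36.1 37.1 A: Hi, how are you?\n",
    "\n",
    "39.1 40.1 B: I am ok!\n",
]
chats = parse_chat_lines(sample)
print(chats)
```

Because the function accepts any iterable of strings, an open file object can be passed in place of the sample list.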

2017-01-31

Related Q&A

You can simply parse the CSV, make the necessary changes, and write it back out. (I have not tested this code, but it should look something like this.)

    import csv

    reader = csv.reader(open('IN.csv', 'r'))
    writer = csv.writer(open('OUT.csv', 'w'))
    for row in reader:
        # do stuff to the row here
        # row is just a list of items
        writer.writerow(row)

...
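The read-modify-write loop described above can be tried without touching the disk. The following is a sketch under stated assumptions: the in-memory buffers stand in for IN.csv and OUT.csv, and the upper-casing step is only an illustrative change to each row.

```python
import csv
import io

# In-memory stand-ins for IN.csv and OUT.csv so the sketch runs anywhere.
source = io.StringIO("name,score\nann,1\nbob,2\n")
target = io.StringIO()

reader = csv.reader(source)
writer = csv.writer(target, lineterminator="\n")
for row in reader:
    # Illustrative change: upper-case the first column of every row.
    row[0] = row[0].upper()
    writer.writerow(row)

print(target.getvalue())
```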

OK, after playing around a bit, I added a line that imports Scapy_Exception and ran my program. With that in place, it now catches the error:

    try:
       ...:     pkts = rdpcap('./')
       ...: except Scapy_Exception as msg:
       ...:     print msg, "

...


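When you work with CSV files, you should look at the csv module. I wrote some code that should do the trick. This code assumes "good data". If your data set may contain errors (for example, fewer than 13 columns or fewer than 326 data rows), some changes are needed. (Changed to be compatible with Python 2.6.6.)

    import csv

    with open('mydata.csv') as in_file:
        with open('outfile.csv', 'wb') as out_file:
            csv_reader = csv.reader(in_file,

...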

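You may want to read the text file line by line into a list. You can then export the data as a .csv file that Excel can read, or use a library such as Openpyxl to export it directly as an Excel file. For example, if you are happy to produce a .csv rather than an Excel file, this code does what you asked:

    fname = ""  # path to file
    csvname = ""  # path to output csv
    with open(fname) as f:  # reads the file
        content = f.readlines()  #

...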

Thanks to stovfl, I solved this. Here is the code that combines np.fromfile and reshape.

    import numpy as np

    cmorph = np.fromfile(file, dtype=np.float32, count=nvar*ntim*nlat*mlon).reshape((nvar*ntim, nlat, mlon))

...

You can try this.

    import json

    with open("json_data.json", mode='r', encoding='utf-8') as json_data:
        data = json.load(json_data)
    print(data)

json_data.json

    {
        "hello": 11,
        "world": 22,
        "json": 33
    }

Output

    {'hello': 11, 'world': 22, 'json': 33}

Make sure your

...

This should work:

    In [47]: strs = "4.500000+1 1.894719-3 4.600000+1 8.196721-3 4.700000+1 2.869539-3"

    In [48]: [float(x.replace("+", "e+").replace("-", "e-")) for x in strs.split()]
    Out[48]: [45.0, 0.001894719, 46.0, 0.008196721, 47.0, 0.002869539]

...
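The chained replace() calls above rewrite every sign in a token, so a value with a leading minus sign (for example -4.600000+1) would become an invalid literal. A regular expression that only touches the exponent sign avoids this. The sketch below is one possible variant; parse_compact_float and _COMPACT are hypothetical names, and the third sample token is added here for illustration.

```python
import re

# Matches a mantissa followed by a signed exponent with no 'e', e.g. '1.894719-3'.
_COMPACT = re.compile(r"^([+-]?\d*\.?\d+)([+-]\d+)$")


def parse_compact_float(token):
    """Convert '4.500000+1' style tokens to float (hypothetical helper)."""
    match = _COMPACT.match(token)
    if match is None:
        return float(token)  # already a normal float literal
    mantissa, exponent = match.groups()
    return float(mantissa + "e" + exponent)


strs = "4.500000+1 1.894719-3 -4.600000+1"
values = [parse_compact_float(tok) for tok in strs.split()]
print(values)
```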

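After reading the header as you have shown, you have the width (1024), the height (the next 1024), and the depth (255). To get the pixel data, it is easiest to read it byte by byte:

    def read_pgm(pgmf):
        """Return a raster of integers from a PGM as a list of lists."""
        assert pgmf.readline() == 'P5\n'
        (width, height) = [int(i) for i in pgmf.readline().spl

...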
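Since the snippet above is cut off, here is a self-contained sketch of the same byte-by-byte idea for Python 3. It is not the original answer's code: read_pgm_bytes is a hypothetical name, and it assumes a binary (P5) file with no comment lines, one header item per line, and a depth of at most 255, so each pixel is one byte.

```python
import io


def read_pgm_bytes(pgmf):
    """Read a binary (P5) PGM from a file object opened in 'rb' mode."""
    assert pgmf.readline() == b"P5\n"
    width, height = [int(i) for i in pgmf.readline().split()]
    depth = int(pgmf.readline())
    assert depth <= 255  # one byte per pixel
    raster = []
    for _ in range(height):
        row = list(pgmf.read(width))  # bytes -> list of ints on Python 3
        raster.append(row)
    return raster


# A tiny 3x2 image built in memory for demonstration.
data = b"P5\n3 2\n255\n" + bytes([0, 128, 255, 10, 20, 30])
raster = read_pgm_bytes(io.BytesIO(data))
print(raster)
```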

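You can use read_csv with the separator ][, which has to be escaped with \. Then replace the columns and values, and use dropna to remove the rows where every value is NaN:

    import pandas as pd
    from io import StringIO  # standard-library StringIO; the original imported it from pandas.compat

    temp=u"""[Header1][Header2][Header3][HeaderN]
    [=======][=======][=======][=======]
    [Value1][Value2][Value3][ValueN]
    [AnotherVal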
