python通信达数据_Python读取通达信数据--688IT编程网

python通信达数据_Python读取通达信数据

Python读取通达信数据

⼀、介绍

python获取股票数据的⽅法很多，其中Tushare 财经数据接⼝包很好⽤，当然，也可以通过通达信本地的数据获取，这样更为⽅便。⽇线数据存在这路径下

D:\通达信\vipdoc\sh\lday(我的通达信安装⽬录是D盘)

接着我们需要的就是解析这些数据，在分别存为

csv

格式的数据就⾏了，这样我们可以⽅便的⽤

pandas

或其他⽅法读取和分析。

通达信的⽇线数据格式如下：

每32个字节为⼀天数据每4个字节为⼀个字段，每个字段内低字节在前00

~ 03

字节：年⽉⽇,

整型04 ~ 07

字节：开盘价*100，

整型08 ~ 11

字节：最⾼价*100,

整型12 ~ 15

字节：最低价*100,

整型16 ~ 19

字节：收盘价*100,

整型20 ~ 23

字节：成交额(元)，float型24

~ 27

字节：成交量(股)，整型28

~ 31

字节：(保留)

打开⼀个

.day

的⽂件，发现是乱码，以⼆进制格式存储，那么我们只需按照上⾯存的字节数解析下就可以了。

先读取⼀天的数据

>>> f =

open('D:/通达信/vipdoc/sh/lday/sh000001.day',

'rb') >>> f.read(32)

b'\xa2\xde2\x01\x14\x9b\x03\x00\x0f\x9d\x03\x00\x8d\x91\x03\x00\xef\x93\x03\x00\xcb\xbc\x14Q\x9a\xfb|\x02-\x01Z\x02'

这应该就是⼀天的数据了，我们对这个数据进⾏解析，这⾥需要⽤到

struct

模块中的

unpack ⽅法

>>>

import struct >>> f =

open('D:/通达信/vipdoc/sh/lday/sh000001.day',

'rb') >>> li =

struct.unpack('lllllfll', li)

>>> data (20111010,

236308, 236815, 233869, 234479, 39926411264.0, 41745306, 39452973)

分别为⽇期，开盘，最⾼，最低，收盘，成交额，成交量，保留值

unpack⽤法：

前⼀个参数是格式，'lllllfii'

就是⼀个浮点数格式(f，这⾥对应⽇线数据中的成交额是float格式)和其他整形格式(i，这⾥对应⽇线数据中的其他数据是

int

格式)

那么剩下的问题不⼤了

⼆、完整代码

在 sh

⽬录下新建了个

pythondata

⽂件夹，注意⽂件路径分隔符是

import struct import datetime def stock_csv(filepath,

name): data = [] with open(filepath, 'rb') as f: file_object_path =

'D:/通达信/vipdoc/sh/pythondata/'

+ name +'.csv' file_object = open(file_object_path, 'w+') while

True: stock_date = f.read(4) stock_open = f.read(4) stock_high =

= f.read(4) stock_vol = f.read(4) stock_reservation = f.read(4) #

date,open,high,low,close,amount,vol,reservation if not stock_date:

break stock_date = struct.unpack("l", stock_date) #

4字节

如20091229 stock_open = struct.unpack("l",

stock_open)

#开盘价*100

stock_high = struct.unpack("l", stock_high)

#最⾼价*100

stock_low= struct.unpack("l", stock_low)

#最低价*100

stock_close = struct.unpack("l", stock_close)

#收盘价*100

stock_amount = struct.unpack("f", stock_amount)

#成交额

stock_vol = struct.unpack("l", stock_vol)

#成交量

stock_reservation = struct.unpack("l", stock_reservation)

#保留值

date_format =

datetime.datetime.strptime(str(stock_date[0]),'%Y%M%d')

#格式化⽇期

list=

date_format.strftime('%Y-%M-

%d')+","+str(stock_open[0]/100)+","+str(stock_high[0]/100.0)+","+str(stock_low[0]/100.0)+","+str(stock_close[0]/100.0)+","+str(s

file_object.writelines(list) file_object.close()

stock_csv('D:/通达信/vipdoc/sh/lday/sh000001.day',

'1')

运⾏下，打开

1.CSV ⽂件

三、批量解析

import os import struct import datetime def

stock_csv(filepath, name): data = [] with open(filepath, 'rb') as

f: file_object_path =

'D:/通达信/vipdoc/sh/pythondata/'

+ name +'.csv' file_object = open(file_object_path, 'w+') while

True: stock_date = f.read(4) stock_open = f.read(4) stock_high =

= f.read(4) stock_vol = f.read(4) stock_reservation = f.read(4) #

date,open,high,low,close,amount,vol,reservation if not stock_date:

break stock_date = struct.unpack("l", stock_date) #

4字节

如20091229 stock_open = struct.unpack("l",

stock_open)

#开盘价*100

stock_high = struct.unpack("l", stock_high)

#最⾼价*100

stock_low= struct.unpack("l", stock_low)

#最低价*100

stock_close = struct.unpack("l", stock_close)

#收盘价*100

stock_amount = struct.unpack("f", stock_amount)

#成交额

stock_vol = struct.unpack("l", stock_vol)

#成交量

stock_reservation = struct.unpack("l", stock_reservation)

#保留值

date_format =

datetime.datetime.strptime(str(stock_date[0]),'%Y%M%d')

#格式化⽇期

list=

date_format.strftime('%Y-%M-

%d')+","+str(stock_open[0]/100)+","+str(stock_high[0]/100.0)+","+str(stock_low[0]/100.0)+","+str(stock_close[0]/100.0)+","+str(s

file_object.writelines(list) file_object.close() path =

'D:/通达信/vipdoc/sh/lday/'

listfile =

writelines在python中的用法os.listdir('D:/通达信/vipdoc/sh/lday/') for i in listfile: stock_csv(path+i, i[:-4])运⾏下

688IT编程网

python通信达数据_Python读取通达信数据

发表评论

推荐文章

java正则表达式选择题

一种基于正则表达式的DBC文件解析及报文分析方法[发明专利]

工龄小数点提取

非零金额正则表达式

提取文本中数字的函数

热门文章

利用正则表达式实现文本数据提取与处理

正则表达式零宽断言详解

文本匹配规则

excel中使用正则

1-31正则表达式

anki之高级筛选

BUAA_OO_2021_第一单元总结

insert语句递增写法

sublime text 3在行前插入递增数字序号的方法

字符串只允许数字和英文的正则

powerbuilder 正则表达式

Shell脚本编写的高级技巧利用正则表达式进行字符串匹配

JAVA正则表达式的三种模式:贪婪,勉强和占有的讨论

go regexp匹配规则

oracle regexp_substr 实现原理

基本的元字符回溯引用和前后查匹配模式

elasticsearch query dsl正则

oracle sql正则表达式

GA-设置目标

仅匹配全角片假名的正则表达式

最新文章

java正则表达式选择题

工龄小数点提取

非零金额正则表达式

提取文本中数字的函数

vue数字相加小数点变长-概述说明以及解释

vue validate 正则验证小数长度

标签列表

688IT编程网

python通信达数据_Python读取通达信数据

发表评论

推荐文章

java正则表达式 选择题

一种基于正则表达式的DBC文件解析及报文分析方法[发明专利]

工龄小数点提取

非零金额 正则表达式

提取文本中数字的函数

热门文章

利用正则表达式实现文本数据提取与处理

正则表达式零宽断言详解

文本匹配规则

excel中使用正则

1-31正则表达式

anki之高级筛选

BUAA_OO_2021_第一单元总结

insert语句递增写法

sublime text 3在行前插入递增数字序号的方法

字符串只允许数字和英文的正则

powerbuilder 正则表达式

Shell脚本编写的高级技巧利用正则表达式进行字符串匹配

JAVA正则表达式的三种模式:贪婪,勉强和占有的讨论

go regexp匹配规则

oracle regexp_substr 实现原理

基本的元字符 回溯引用和前后查 匹配模式

elasticsearch query dsl正则

oracle sql正则表达式

GA-设置目标

仅匹配全角片假名的正则表达式

最新文章

java正则表达式 选择题

工龄小数点提取

非零金额 正则表达式

提取文本中数字的函数

vue数字相加小数点变长-概述说明以及解释

vue validate 正则验证小数长度

标签列表

java正则表达式选择题

非零金额正则表达式

基本的元字符回溯引用和前后查匹配模式

java正则表达式选择题

非零金额正则表达式