python解析日志,获取想要的数据--688IT编程网

python解析⽇志，获取想要的数据

最常用的排序算法由于⽼⼤需要对⽇志进⾏解析，获取到相应桩的信息，所以我写了个专门的解析脚本，就是执⾏的时间有点长，如果⽤java的话应该可以快2/3.练⼀下python.

在该脚本中遇到的问题就是des解密的时候有⼀个固定8位的key.当时使⽤32位的长key，每次都报错，后来发现可以先使⽤8位空key设置，再setKey(KEY)为32位的.解析截取到的数据时，看似json格式，其实并不是，所以单写了个解析⽅法.

# -*- coding: utf-8 -*-

import os

import linecache # 对⽂件进⾏⾏缓存，可以直接取到想要的⾏值

import base64 # base64解码

import pandas as pd # 进⾏格式化数据

excel常用函数公式表乘法from pyDes import *

from Crypto import Random

import sys

from urllib import parse # 进⾏urlencode

import time # 记录时间

import json # json转化

FILEDIR = r'E:\aaa'

KEY = '' # 秘钥

PILENUMBERS = ['1011895210701234176-1',

'1011895210701234176-2',

'1011895284617453568-1',

'1011895284617453568-2',

'1011895333984407552-1',

'1011895333984407552-2',

'1011895376523038720-1',

'1011895376523038720-2',

'1011895424895950848-1',

'1011895424895950848-2']

def read_file(filelist):

time = []

cipherText = []

plainText = []

for filename in filelist:

filename = FILEDIR + '\\' + filename

if ists(filename):

cache_data = lines(filename)

python解析json文件

for line in range(len(cache_data)):

if r'DES》》Base64后' in cache_data[line]:

str = cache_data[line][len('>DES》》Base64后>'):]

plain = des_decrypt(str)

s2 = analysis(plain)['pileNumber']

if s2 in PILENUMBERS:

time.append(cache_data[line + 3][0:len('2019-03-12 20:13:04')])

cipherText.append(str)

plainText.append(plain)

dataframe = pd.DataFrame({'时间': time, '密⽂': cipherText, '明⽂': plainText})

<_csv(FILEDIR + '\\' + 'wx.csv', index=False, sep=',', encoding='gbk')

# 获取⽂件夹下所有的⽂件

def file_name(filedir):

filelist = os.listdir(filedir)

return filelist

# des解密

def des_decrypt(str):

cipherX = des(key=' ', w().read(8), pad=None, padmode=PAD_PKCS5)

cipherX.setKey(KEY)

b = cipherX.decrypt(base64.b64decode(str))

return parse.unquote(bytes.decode(b)).replace('+', '')

# 解析{uid='61916',terminal='web',id='61916',pileNumber='1011895424895950848-2'}

yuan = r"{uid='61916',terminal='web',id='61916',pileNumber='1011895424895950848-2'}"

def analysis(str):

new_str = '{'

d = place('=', ':')

for i in range(len(d.split(','))):

if '{' in d.split(',')[i].split(':')[0]:

key = "'" + d.split(',')[i].split(':')[0].replace('{', '') + "'"

else:

key = "'" + d.split(',')[i].split(':')[0] + "'"

excel教程视频网资源if '}' in d.split(',')[i].split(':')[1]:

value = d.split(',')[i].split(':')[1].replace('}', '')

else:

value = d.split(',')[i].split(':')[1]

f = key + ':' + valuecss子元素hover父元素变颜

if 0 < i < len(d.split(',')):

new_str = new_str + ',' + f

j2seelif i == 0:

new_str = new_str + f

return json.loads((new_str + '}').replace("'", '"'))

# print(analysis(yuan)['pileNumber'])

if __name__ == '__main__':

print('开始时间:', time.strftime('%Y.%m.%d %H:%M:%S', time.localtime(time.time()))) read_file(file_name(FILEDIR))

print('结束时间:', time.strftime('%Y.%m.%d %H:%M:%S', time.localtime(time.time())))

688IT编程网

python解析日志,获取想要的数据

发表评论

推荐文章

随机森林算法介绍及R语言实现

基于随机森林优化的神经网络算法在冬小麦产量预测中的应用研究_百度文 ...

基于正则化贪心森林算法的情感分析方法研究

随机森林算法和grandientboosting算法

基于随机森林的图像分类算法研究

热门文章

随机森林特征选择原理

自动驾驶系统中的随机森林算法解析

随机森林算法及其在生物信息学中的应用

监督学习中的随机森林算法解析(六)

随机森林算法在数据分析中的应用

机器学习——随机森林,RandomForestClassifier参数含义详解

随机森林的算法

随机森林算法作用

监督学习中的随机森林算法解析(十)

随机森林算法案例

随机森林案例

二分类问题常用的模型

绘制ssd框架训练流程

一种基于信息熵和DTW的多维时间序列相似性度量算法

SVM训练过程范文

如何使用支持向量机进行股票预测与交易分析

二分类交叉熵损失函数binary

tinybert_训练中文文本分类模型_概述说明

基于门控可形变卷积和分层Transformer的图像修复模型及其应用

人工智能开发技术的测试和评估方法

最新文章

基于随机森林的数据分类算法改进

人工智能中的智能识别与分类技术

基于人工智能技术的随机森林算法在医疗数据挖掘中的应用

随机森林回归模型的建模步骤

r语言随机森林预测模型校准曲线

《2024年随机森林算法优化研究》范文

标签列表