Temporal pattern identification of time series data using pattern wavelets and genetic algo
基于Django印刷体维吾尔文识别系统的设计与实现

㊀第53卷第3期郑州大学学报(理学版)Vol.53No.3㊀2021年9月J.Zhengzhou Univ.(Nat.Sci.Ed.)Sep.2021收稿日期:2020-08-25基金项目:国家自然科学基金项目(61433012);国家 973 重点基础研究计划基金项目(2014CB340506)㊂作者简介:熊黎剑(1996 ),男,硕士研究生,主要从事OCR 研究,E-mail:xiong_lijian@;通信作者:吾守尔㊃斯拉木(1941 ),男,教授,中国工程院院士,主要从事多语种信息处理研究,E-mail:wushour@㊂基于Django 印刷体维吾尔文识别系统的设计与实现熊黎剑1,2,3,㊀吾守尔㊃斯拉木1,2,3,㊀许苗苗1,2,3(1.新疆大学信息科学与工程学院㊀新疆乌鲁木齐830046;2.新疆多语种信息技术实验室㊀新疆乌鲁木齐830046;3.新疆多语种信息技术研究中心㊀新疆乌鲁木齐830046)摘要:光学字符识别(optical character recognition,OCR)技术在图书数字化㊁文献管理等诸多领域得到了广泛应用,而相比于已十分成熟的中文㊁英文印刷体识别系统,小文种(维吾尔文)印刷体识别还有研究空间和实际应用需求㊂针对传统识别方法特征表示不足等问题,结合日益兴起的深度学习技术,采用Python 语言编程,选用经改进的卷积循环神经网络作为识别算法核心,并利用Django 设计系统框架㊂实验表明,印刷体维文识别系统的精度达到95.7%,平均速度达到12.5fps㊂该系统实现了端到端的维文整词识别㊂关键词:卷积循环神经网络;门控循环单元;连接时序分类器;印刷体维吾尔文中图分类号:TP391㊀㊀㊀㊀㊀文献标志码:A㊀㊀㊀㊀㊀文章编号:1671-6841(2021)03-0009-06DOI :10.13705/j.issn.1671-6841.20202770㊀引言随着信息化社会的不断推进,光学字符识别(optical character recognition,OCR)技术在各领域开花结果㊂印刷体文字识别在数字化办公㊁文献管理等方面均有良好的应用前景㊂相比于已成熟化的印刷体中㊁英文识别,印刷体维吾尔文识别还有研究空间[1]㊂维吾尔文多使用于我国新疆地区,包含32个字母,其中8个元音字母,24个辅音字母,词序是主语-谓语-宾语[2]㊂有一些维文字母的主体部分相同,仅依靠上下点的标记来区别不同字符[3]㊂同时,当字母出现在词前㊁词中㊁词末以及独立出现时,对应的写法也不同,切分不当会导致单词的改变,所以本文从整词识别入手㊂已有研究方法多以传统方法为主,如基于统计和结构的方法[4]㊁模板匹配法[5]等㊂这些方法往往需要较多的人工干预,包括手工设计特征和人工建立标准的匹配库等,因此效率不高㊂近些年以来,国内相关的维文OCR 系统是西安电子科技大学卢朝阳教授团队开发的维吾尔文识别软件㊂它的设计核心是:单词切分成字符再结合人工选取的特征(如方向线素特征和梯度特征),再用欧氏距离分类器[6-8],最终得到候选结果㊂2019年,该实验室又利用 翻字典 原理设计了从字符到单词的两级级联分类器[9],完成了维文单词的识别㊂以上方法均是手工选取特征结合分类器进行识别,在特征选择方面具有一定的局限性㊂本文选用经改进的卷积循环神经网络(convolutional recurrent neural network,CRNN)和连接时序分类器(connectionist temporal classification,CTC)作为文字识别的核心算法,Django 作为构建前后端的框架,搭建了完整的维文整词识别系统㊂1㊀算法原理1.1㊀卷积循环网络用于文字识别领域的卷积循环神经网络(CRNN)是由Shi 等提出的[10]㊂它由深层卷积网络(deep conv-郑州大学学报(理学版)第53卷olutional neural network,DCNN)加循环网络(recurrent neural network,RNN)构成㊂随着计算机视觉领域研究的持续火热,DCNN 经常被用于图像特征提取,同时,它也在目标检测[11-12]㊁情感分析[13]㊁图像处理[14]等方面表现优异,但是文字的检测与识别不同于一般的目标检测任务,基于特征(人工设计或CNN 得到)及分类的方法往往忽视了文本隐含上下文关联的特点㊂RNN 能处理序列信息,在原有的CRNN 中,RNN 部分是双向长短期记忆网络(bi-directioanl long short-term memory,BiLSTM),但其结构复杂,训练收敛速度慢㊂本文将BiLSTM 替换为更为简洁的双向门控循环神经单元网络(bi-directioanl gated recurrent unit,BiGRU)[15]㊂实验证明,改进后的CRNN 网络比原有网络收敛的速度更快,同时,在测试精度方面也有略微提升㊂1.2㊀门控循环神经单元网络(GRU )GRU 是在RNN 和LSTM 的基础上一步步演变而来的,LSTM 网络解决了RNN 在训练时容易出现梯度爆炸和梯度消失的问题,而相比LSTM 更为复杂的3门结构 输入门㊁忘记门和输出门,GRU 将其简化为2门结构 更新门和重置门,这样简洁的结构减少了网络训练收敛时间,具有更高的计算效率,提高了模型精度㊂GRU 内部结构如图1所示㊂图1㊀GRU 结构图Figure 1㊀Diagram of GRU structureGRU 的前向传播计算公式[15]为z t =σ(W z ㊃[h t -1,x t ]),r t=σ(W r ㊃[h t -1,x t ]),h t =tanh(W h ㊃[r t ∗h t -1,x t ]),h t =(1-z t )∗h t -1+z t ∗h t ,ìîíïïïïïï式中:z t 代表更新门;σ代表Sigmoid 激活函数;W z 代表更新门权重矩阵;h t -1代表t -1时刻隐藏层的输出;x t 代表t 时刻输入;[]表示两个向量相连;r t 代表重置门;W r 代表重置门权重矩阵;h t 代表t 时刻的候选隐藏层;tanh 代表双曲正切激活函数;W h 代表候选隐藏层权重矩阵;∗代表矩阵的哈达玛积;h t 代表t 时刻隐藏层的输出㊂图2㊀系统框架图Figure 2㊀Diagram of system frame重置门用来控制需要保留多少之前的信息,被忘记的历史信息越多,其值越小;更新门主要决定被添加到当前状态信息中的历史信息量,经过Sigmoid 函数激活,取值为0~1;这两个门共同决定了隐藏状态的输出㊂本文采用正向GRU 和反向GRU 结合成双向GRU(BiGRU),并用双层堆叠形式进行序列建模,其中隐藏层单元数为256㊂如图2所示㊂1.3㊀连接时序分类器连接时序分类器是一种用于解决不等长序列的输出问题(序列对齐问题)的算法,最早由Graves 提出,之后他又将CTC 成功应用于语音识别方面[16]㊂训练时无须切分语料,也不需要中间语音的表示,在测试集上错误率低至17.7%㊂该解码算法能有效解01㊀第3期熊黎剑,等:基于Django 印刷体维吾尔文识别系统的设计与实现决输入㊁输出序列不等长的问题㊂数学模型上,CTC 层也叫转录层,是根据上一层(RNN 层)输出长度为T 的预测序列{x 1,x 2, ,x T },去寻找具有最高概率的标签序列㊂p (l x )=ðπɪF -1(l )p (πx ),(1)p (πx )=ᵑT t =1y t πt ,(2)式(1)表示输出标签序列的概率是多条路径概率之和㊂其中:l 是标签序列;x 是输入序列;π是输出路径;F -1是标签到路径的映射㊂式(2)表示基于输入x 输出π路径的概率㊂其中:T 是输入序列长度;y t πt 是t 时刻输出π路径的概率㊂维吾尔文识别的一个CTC 实例㊂1)RNN 层输出标签序列,时序中没有标签的地方用 -代替;2)CTC 去除空白标签 - ,删除重复标签,只保留;3)整个过程可表示为㊀㊀(汉语意思为 建立 )㊂由上述实例可以看出,CTC 对齐前的输入序列长度为26,CTC 对齐后的输出序列长度为5,可见CTC 有效地解决了序列对齐问题㊂2㊀识别系统的设计与实现2.1㊀系统框架本文采用开源的Django 设计系统,遵循M (model)T(templete)V(view)模式㊂用户在浏览器端发送请求,通过urls.py 发给view 处理,view 再调用对应的templete 和model 进行处理㊂其中view 负责业务逻辑,templete(主要是HTML 文件)负责页面展示,model 负责数据库对象和业务对象㊂这种松耦合和相互独立的特性,易于开发和维护㊂系统流程如图3所示㊂2.2㊀系统功能与展示后台输入命令启动服务,然后在浏览器页面输入网址,开启Web 服务㊂1)上传功能㊂选择一张本地图像,点击提交,图像会自动上传到后台splite 数据库㊂2)识别功能㊂后台读取数据库中的图像,调用识别模块和计时模块,将识别结果(汉语意思为 创造力 )㊁时间(0.07s)㊁原图片及图片名返回前端页面并显示,如图4所示㊂图3㊀系统框架图Figure 3㊀Diagram of system frame 图4㊀系统展示图Figure 4㊀Photo of system3㊀实验与结果3.1㊀实验数据1)训练数据(合成数据)利用JAVA 语言编写脚本,合成了含32个维文字母(8个元音,24个辅音)在内的约10万张图片数据11郑州大学学报(理学版)第53卷(JPG 格式),以及对应的标签数据(TXT 格式)㊂同时,为了使训练样本更具代表性,本文对32个维文字母作了数据均衡处理㊂部分训练图片如图5所示㊂图5㊀部分训练数据Figure 5㊀The sample of training data2)测试数据(真实数据)从天山网(维文版)(http:ʊ /)中的不同栏目进行收集并制作成测试图片和标签㊂总数约1500张,部分测试数据图片如图6所示㊂图6㊀部分测试数据Figure 6㊀The sample of testing data3.2㊀实验设置为了验证系统的有效性,本文设置了对比实验㊂采用约10万张图片作为训练数据,分别在CRNN 和改进的CRNN(BiGRU)上训练,并将训练得到的模型文件分别在测试集上进行测试㊂实验中的PC 机主要配置为:Nvidia 独立显卡(1060Ti 6G 内存)等㊂所依赖的软件及环境为:Pycharm(社区版)编译工具㊁Ubuntu18.04操作系统㊁Python3.6.2编程语言㊁Pytorch1.2.0等㊂1)实验中精度的定义为A =(n t /n s )㊃100%,其中:n t 代表正确识别样本数;n s 代表样本总数;A 代表识别精度㊂2)实验中识别速度的定义为v =1/(t o -t i ),其中:t o 代表获得字符串时刻;t i 代表输入图片时刻;v 代表识别速度㊂3)实验中网络训练的损失函数定义为O =-ðI i ,l i ɪχlog p (l i y i ),其中:χ代表训练集;I i 代表输入图片;l i 代表标签序列;y i 代表循环层产生的概率预测序列㊂从损失函数可知,它直接从输入的维文印刷体图片I i 和对应的单词标签序列l i 中计算损失值,网络实现了字母免分割的训练㊂3.3㊀实验结果本实验对两种方法均进行了充分训练,当损失趋于收敛后,保留最终模型文件,其中CRNN(BiGRU)收敛速度更快㊂在测试集上,CRNN 的精度为94.1%,CRNN(BiGRU)的精度为95.7%,平均速度为12.5fps,表现出较好性能㊂究其原因,循环层由BiGRU 替换,简化了模型结构,加快了模型训练收敛速度,提高了计算效率㊂此外,训练数据均衡也使得识别率较为稳定㊂2131㊀第3期熊黎剑,等:基于Django印刷体维吾尔文识别系统的设计与实现4 结语针对传统维文识别方法特征表示不足和基于切分的识别方法易出错等问题,本文从整词识别入手,采用卷积神经网络自动提取文字的深层次抽象特征,并对循环层进行改进,用BiGRU替换原有的BiLSTM,改善了识别性能㊂引入连接时序分类器,很好地解决了维文字符难切分以及不等长输入输出问题㊂测试识别精度达到95.7%,平均速度达到12.5fps㊂最后,利用Django框架,设计了一个端到端的维文整词识别系统㊂因此,该系统具有一定的实际应用价值㊂然而,现有系统只能识别纯维文(不含数字㊁字符),从实际应用的角度来看,后续工作可将常用的符号和数字纳入识别系统,进一步完善该系统㊂参考文献:[1]㊀UBUL K,TURSUN G,AYSA A,et al.Script identification of multi-script documents:a survey[J].IEEE access,2017,5:6546-6559.[2]㊀彭勇,哈力旦㊃阿布都热依木,丁维超.基于改进单深层神经网络的自然场景中维吾尔文检测[J].计算机应用研究,2019,36(9):2876-2880.PENG Y,HALIDAN A,DING W C.Uyghur text detection in natural scene based on improved single deep neural network[J].Application research of computers,2019,36(9):2876-2880.[3]㊀艾力㊃居麦,哈力旦㊃A,黄浩.视频图像中维吾尔文字的识别研究[J].计算机工程与应用,2011,47(36):190-192.ELI J M,HALIDAN A,HUANG H.Recognition of extracting Uyghur texts from videos images[J].Computer engineering and applications,2011,47(36):190-192.[4]㊀买买提依明㊃哈斯木,吾守尔㊃斯拉木,维尼拉㊃木沙江,等.基于统计专用字符的维㊁哈㊁柯文文种识别研究[J].中文信息学报,2015,29(2):111-117.MAIMAITIYIMING H,WUSHOUER S,WEINILA M,et al.Unique character based statistical language identification for Uyghur,Kazak and Kyrgyz[J].Journal of Chinese information processing,2015,29(2):111-117.[5]㊀于丽,亚森㊃艾则孜.基于HOG特征和MLP分类器的印刷体维吾尔文识别方法[J].微型电脑应用,2017,33(6):30-33.YU L,YASEN A.A printed Uyghur recognition method based on HOG feature and MLP classifier[J].Microcomputer applica-tions,2017,33(6):30-33.[6]㊀许亚美.手写维吾尔文字识别若干关键技术研究[D].西安:西安电子科技大学,2014.XU Y M.A study of key techniques for Uighur handwriting recognition[D].Xiᶄan:Xidian University,2014.[7]㊀白云辉.印刷体维吾尔文单词识别[D].西安:西安电子科技大学,2014.BAI Y H.Printed Uyghur word recognition[D].Xiᶄan:Xidian University,2014.[8]㊀郎潇.基于切分的印刷体维吾尔文单词识别[D].西安:西安电子科技大学,2015.LANG X.Recognition of printed Uyghur words based on segmentation[D].Xiᶄan:Xidian University,2015.[9]㊀李旦旦.印刷体维吾尔文单词识别的分类器设计[D].西安:西安电子科技大学,2019.LI D D.Classifier design for printed Uyghur word recognition[D].Xiᶄan:Xidian University,2019.[10]SHI B G,BAI X,YAO C.An end-to-end trainable neural network for image-based sequence recognition and its application toscene text recognition[J].IEEE transaction on pattern analysis&machine intelligence,2017,39:2298-2304. [11]张建明,刘煊赫,吴宏林,等.面向小目标检测结合特征金字塔网络的SSD改进模型[J].郑州大学学报(理学版),2019,51(3):61-66,72.ZHANG J M,LIU X H,WU H L,et al.Improved SSD model with feature pyramid network for small object detection[J].Jour-nal of Zhengzhou university(natural science edition),2019,51(3):61-66,72.[12]佘颢,吴伶,单鲁泉.基于SSD网络模型改进的水稻害虫识别方法[J].郑州大学学报(理学版),2020,52(3):49-54.SHE H,WU L,SHAN L Q.Improved rice pest recognition based on SSD network model[J].Journal of Zhengzhou university (natural science edition),2020,52(3):49-54.[13]陈珂,梁斌,左敬龙,等.一种用于中文微博情感分析的多粒度门控卷积神经网络[J].郑州大学学报(理学版),2020,52(3):21-26,33.41郑州大学学报(理学版)第53卷CHEN K,LIANG B,ZUO J L,et al.Multiple grains-gated convolutional neural networks for Chinese microblog sentiment anal-ysis[J].Journal of Zhengzhou university(natural science edition),2020,52(3):21-26,33.[14]王知人,谷昊晟,任福全,等.基于深度卷积残差学习的图像超分辨[J].郑州大学学报(理学版),2020,52(3):42-48.WANG Z R,GU H S,REN F Q,et al.Residual learning of deep CNN for image super-resolution[J].Journal of Zhengzhou university(natural science edition),2020,52(3):42-48.[15]WANG Y S,LIAO W L,CHANG Y Q.Gated recurrent unit network-based short-term photovoltaic forecasting[J].Energies,2018,11(8):2163.[16]GRAVES A,MOHAMED A R,HINTON G.Speech recognition with deep recurrent neural networks[C]ʊIEEE InternationalConference on Acoustics,Speech and Signal Processing.New York:IEEE Press,2013:6645-6649.Design and Implementation of Printed Uyghur Recognition SystemBased on DjangoXIONG Lijian1,2,3,WUSHOR Silamu1,2,3,XU Miaomiao1,2,3(1.School of Information Science and Engineering,Xinjiang University,Urumqi830046,China;2.Xinjiang Multilingual Information Technology Laboratory,Urumqi830046,China;3.Xinjiang Multilingual Information Technology Research Center,Urumqi830046,China) Abstract:Optical character recognition(OCR)has been widely used in many fields such as book digiti-zation and document pared with the more mature Chinese and English printed recogni-tion system,there is still room for research and practical application of Uyghur printed recognition.Ai-ming at the problem of insufficient feature representation of traditional recognition methods,the rising deep learning technology was combined,the Python language programming was used,the improved conv-olutional recurrent neural network as the core of recognition algorithm was selected,and Django was used to design the system framework.The experimental results showed that the accuracy of the system was 95.7%and the average speed was12.5fps,which realized the end-to-end Uyghur whole word recogni-tion.Key words:convolutional recurrent neural network;gated recurrent unit;connectionist temporal classifi-cation;printed Uyghr(责任编辑:王浩毅㊀方惠敏)。
近40年全球海洋热浪的时空特征和机制

近40年全球海洋热浪的时空特征和机制The main focus of my question revolves around the spatial and temporal characteristics as well as the mechanisms behind global marine heatwaves over the past four decades.海洋热浪是指突然而且持续时间较长的海洋温度异常增加的现象。
它们可以导致生态系统和海洋生物多样性受到严重影响,对渔业、旅游业和经济产生负面影响。
近年来,关于全球海洋热浪的研究逐渐增多,人们开始关注其时空特征和形成机制。
Marine heatwaves are characterized by sudden and prolonged increases in sea temperature. They can have severe impacts on ecosystems and marine biodiversity, as well as negative effects on fisheries, tourism, and the economy. In recent years, there has been an increasing amount of research on global marine heatwaves, with a growing interest in understanding their spatial and temporal characteristics,as well as the mechanisms that drive their formation.为了研究海洋热浪的时空特征,科学家们使用了多种方法进行分析。
其中一种方法是利用卫星观测数据,通过观测海表温度变化来检测和分析海洋热浪事件。
母系控制 性别决定

There are currently four described species in the Nasonia genus N. vitripennis, N. longicornis, N. giraulti, and N. oneida. N. vitripennis is found worldwide; N. giraulti is found in eastern North America and N. longicornis is found in western North America
• Sex-specific differential splicing of Nvtra and the functional relationship of Nvtra and Nvdsx.(A)Relative levels of Nvtra mRNA after RNAi in control (light gray bar) and Nvtra dsRNA-injected (black bar) females in The late pupalstage. • Error bars represent SE. *P <0.001.
Nasonia
• Nasonia are a genus of small pteromalid parasitoid wasps that sting and lay eggs in the pupae of various flies. The fly species that Nasonia usually parasitize are primarily blowflies and fleshflies, making Nasonia a useful tool for biocontrol of these pest flies. The wasps are small, pinhead sized, and also referred to as jewel wasps. • The wasp genus has acquired genes from the Pox virus and from Wolbachia in less than 100,000 years.
大数据英文演讲 Big Data presention

GPS Data from Floating Bikes or Cars (Taxis)
GPS Data from Floating Cars is traced from vehicles equipped with GPS, and stored in text type, covering latitude & longitude, driving period, speed, and direction, etc.
Main application field
Urban function division; Boundaries of cities; Boundaries of commercial district
Volunteered Geographic Information (VGI)
添加标题
VGI generates from emergence of online service platform providing geographical location.
The research of big data in urban studies and planning practices
2017.12.22
Group 6
1 What is big data? 2 Behavior data acquisition and analysis 3 Spatial analysis 4 New methodologies with big data 5 Open research issues
Background
添加标题
Individual behavior and its spatio-temporal variation are main subjects and foundation in urban studies and planning practices. The following will particularly introduce some perspectives about them, as well as the main application fields of different types of big data.
大连市暴雨灾害特征及敏感区识别

第43卷第4期2020年12月Vol.43No.4Dec.22气象与减灾研究MeteorologyandDisasterReductionResearch徐建文,杨建莹,翁雪玲,等,2020.大连市暴雨灾害特征及敏感区识别气象与减灾研究,43(4)&14-320.Xu Jianwen,Yang Jianying,W eng Xueling,et al,2020.Characteristics of rainstorm disasters and identification of sensitive areas in Dalian"#Meteorology and Disaster Reduction Research,43(4):313-320.大连市暴雨灾害特征及敏感区识别徐建文x杨建莹2,翁雪玲x王丽娜】,王pp11.大连市气象服务中心,辽宁大连1160012.中国气象科学研究院,北京100081摘要:利用国家气象站和区域自动气象站的降水观测资料以及暴雨洪涝灾害情况资料,分析了2007-2019年大连地区暴雨气候特征时空分布,并进行了灾害评估和敏感区域识别。
结果表明:1)大连地区年平均暴雨和大暴雨日数在空间分布上东北部多于西南部,大暴雨出现的概率为0-40%。
年平均暴雨量和大暴雨量的极大值区域主要集中于庄河地区,年均暴雨强度也呈现从西到东逐渐递增趋势。
暴雨变异系数的高值中心出现在瓦房店西部地区,低值区出现在庄河地区。
2)大连市的暴雨灾害近80%为较严重灾害,最严重和严重暴雨灾害各占9%-10%。
3)普兰店西南部、庄河西部与大连市区为大连地区暴雨洪涝灾害相对敏感区域。
关键词:暴雨灾害,气候特征,等级,敏感区,大连中图分类号:P467文献标识码:B文章编号:10079033(2020)04-0314-07doi:10.12013/qxyjzyj2020-044Characteristics of Rainstorm Disasters andIdentification of Sensitive Areas in DalianXu Jianwen1,Yang Jianying2,Weng Xueling1,Wang Lina1,Wang Jianjian11.Dalian Meteorological Service Center,Dalian116001,China2.Chinese Academy of Meteorological Sciences,Beijing100081,ChinaAbstract:The temporal and spatial distribution of climatic characteristics,disaster assessment and sensitive area identification ofrainstormdisasterinDalianwerediscussedbyusingtheprecipitationdataandrainstormdisasterdataobtainedfromregional and national automatic weather stations from2007to2019.The results showed that:1)There was a gradual decrease of the average annual rainstorm and rainstorm days from northeast to southwest in Dalian!and the probability of heavy rainstorm was 0—40%.The regions of maximum annual rainstorm and heavy rainstorm were mainly concentrated in Zhuanghe,and the annual rainstorm intensity presented a gradual increasing trend from west to east.The high value center of the rainstorm variation coefficient appeared in the west of Wafangdian,while the low value area appeared in Zhuanghe.2)Nearly80%of the rain-stormdisastersinDalianweremoreseveredisasters and the most severe and severe rainstorm disasters accounted for9%to 10%,respectively.3)The southwest of Pulandian,the west of Zhuanghe and the urban area of Dalian were the most sensitive areastotherainstormdisasterinDalian.Keywords:rainstorm disasters;climatic characteristics;grade;sensitive area;Dalian收稿日期:2020-10-12;修订日期:2020-1126.基金项目:大连市气象局业务应用开发项目(编号:dlqk201910).作者简介:徐建文,硕士,工程师,主要从事气候资源与气候变化研究,E-mail:xujianwen0101@.徐建文等:大连市暴雨灾害特征及敏感区识别3150引言洪的气象灾害(葛全胜等,2008),也国气象灾害中影响最严重的灾害!的灾害给社会安成了严重的损失。
pajek时间网络使用方法

pajek时间网络使用教程时间网络(temporal network)分为两种类型一种是Time Events,在某个时间点发生改变;另一种是time intervals,在某个时间段网络状态维持不变。
文件后缀.tim。
文件显示时间网络以lin_文件为例该文件是以时间延续1.1读文件File/Network/Read->lin_1.2生成时间网络Network/Temporal Network/Generate in Time各参数含义:∗All – Generate all networks in specified times. 生成指定时间所有网络∗Only Different – Generate network in specified time only if the new network will differ in at least one vertex or line from the last network which was generated. 与上次网络相比发生变化时生成网络∗Interval – Generate network with vertices and lines present in selected interval 在选定的时间间隔生成网络此处我们选择第二项,各设置参数如下可以看到Network文件框中生成了169个网络每个网络就代表了该时刻网络的结构状态。
1.3绘制网络(1)选择从第二个网络开始(其实这是时间网络的初始网络),选择Draw/Network(2)点击Previous和Next可以动态绘制各时间节点的网络2.tim文件显示时间网络2.1读取时间文件File/Network/Read Time Events->carlyle.tim2.2生成时间网络与1.2相同2.3绘制网络与1.3相同参考文献:1 官网start with pajek教程http://mrvar.fdv.uni-lj.si/sola/info4/andrej/pajek_short.pdf p82-832 官网使用手册pajekman3 THA TCamp Pittsburgh 2013 Networking and the Digital Humanities Workshop./~lombardi/thatcamp/ carlyle.tim(该文件可用)paston.tim(该文件读取时出现内存错误)附录(1)time events网络Pajekman p9-10Read Time Events – Read temporal network described using time events. See Table 1.List of properties s can be empty as well. If several edges (arcs) can connect two vertices, additional tag like :k (k-th line) must be given to determine to which line the command applies. E.g. command HE:3 14 37 results in hiding the third edge connecting vertices 14 and 37.Example of time network described using time events:*Vertices 3*EventsTI 1A V 2 "b"TE 3HV 2TI 4A V 3 "e"TI 5A V 1 "a"TI 6AE 1 3 1TI 7SV 2AE 1 2 1TE 7DE 1 2DV 2TE 8DE 1 3TE 10HV 1TI 12SV 1TE 14DV 1See also other possibility: description of time network using time intervals.(2)time intervals 网络Pajekman p36-37Temporal Network – Operations that are specific to temporal networks.– Generate in Time – Generate network in specified time(s) or interval.Input first time, last time and step (integers).Additional parameters when vertices and lines are active should be given in network to perform this operation. They must be given between signs [ and ]:- is used to divide lower and upper limit of interval,, is used to separate intervals,* means infinity. Example:*Vertices 31 "a" [5-10,12-14]2 "b" [1-3,7]3 "e" [4-*]*Edges1 2 1 [7]1 3 1 [6-8]Vertex ’a’ is active from times 5 to 10, and 12 to 14, vertex ’b’ in times1 to 3 and in time 7, vertex ’e’ from time 4 on. Line from 1 to2 is active only in time 7, line from 1 to3 in times 6 to 8.The lines and vertices in a temporal network should satisfy the consistency condition: if a line is active in time t then also its end-verticesare active in time t. When generating time slices of a given temporalnetwork only ’consistent’ li nes are generated.Note that time records should always be written as last in the row where vertices / lines are defined.See also other possibility of describing time network: description of time network using time events.∗All – Generate all networks in specified times.∗Only Different – Generate network in specified time only if thenew network will differ in at least one vertex or line from the lastnetwork which was generated.∗Interval – Generate network with vertices and lines present inselected interval。
乳腺时间信号曲线分型标准
乳腺时间信号曲线分型标准## Breast Time Signal Curve Fractal Dimension Standard.### English Answer:Fractal Dimension Standard for Breast Time Signal Curves.Introduction:Fractal dimension is a mathematical measure of the complexity and irregularity of a shape or object. In the context of breast cancer detection, fractal dimension has been used to analyze time signal curves derived from mammograms and ultrasound images. These curves represent the temporal variation in the intensity of the signals received by the imaging device, and their fractal dimension can provide insights into the characteristics of the underlying breast tissue.Standard for Fractal Dimension of Breast Time Signal Curves:The standard for fractal dimension of breast time signal curves has been established based on extensive research and clinical studies. The following criteria are used to classify curves into different categories:Normal Breast Tissue: Fractal dimension values typically range from 1.2 to 1.5. Curves exhibit smooth and regular patterns.Benign Breast Lesions: Fractal dimension values can vary widely, but typically fall within a range of 1.5 to 1.7. Curves may exhibit some irregularities but generally maintain a relatively smooth shape.Malignant Breast Lesions: Fractal dimension values tend to be higher, often exceeding 1.7. Curves are more complex and irregular, with sharp transitions and abrupt changes in intensity.Applications in Breast Cancer Detection:The fractal dimension of breast time signal curves has several applications in breast cancer detection:Discrimination between Normal and Abnormal Tissue: Fractal dimension can help distinguish between normal breast tissue and abnormal lesions with a high degree of accuracy.Characterizing Breast Lesions: The fractal dimension of time signal curves can provide information about the nature of breast lesions, helping to differentiate between benign and malignant tumors.Monitoring Disease Progression: Serial measurements of fractal dimension over time can help monitor disease progression and evaluate treatment response.Computer-Aided Diagnosis (CAD): Fractal dimension is incorporated into CAD systems to assist radiologists in interpreting mammograms and ultrasound images, improvingthe sensitivity and specificity of breast cancer detection.Conclusion:The fractal dimension standard for breast time signal curves is a valuable tool for analyzing breast images and aiding in the detection and characterization of breast cancer. By assessing the complexity and irregularity of these curves, clinicians can gain insights into the underlying tissue structure and make more informed decisions about patient care.### 中文回答:乳腺时间信号曲线分型标准。
JVB2016-2017年(97-101卷)Journal of Vocational Behavior文献目录及摘要摘取
表格 1题⽬目作者Highlights AbstractVolume-1011Cross-culturalwork andfamilyresearch: Areview of theliterature跨⽂文化⼯工作与家庭研究:⽂文献综述KristenM.Shockley a, , , JillDouek b,ChristineR.Smith b,Peter P.Yu b,SonerDumani c,1,KimberlyA.French d, 1Highlights•Anglo regions have receivedthe most attention, followed byEastern Europe.•Many studies focus on meandifferences across cultures inwork-family constructs.•The review of theory suggeststhat collectivism is the mostcommonly used theoreticalexplanation.•There is a great deal ofinconsistency in the way theoryhas been applied.AbstractResearch aimed at understanding the intersection of employees'work and family lives has blossomed over the past few years, and,in more recent times, has begun to have an increasingly globalfocus. Conducting research in diverse cultural settings is importantgiven that work and family dynamics are entrenched in largersocietal contexts, such as gender role norms, national policies, andcultural values. However, the literature has not developed in aprogrammatic way, making it difficult to build upon the currentknowledge base. The goal of this study is to review extant cross-cultural work and family published research in an effort tosynthesize and assess the current state of the literature, with afocus on theoretical logic and methodology. We do this by reportingdescriptive statistics regarding which global regions have receivedthe most attention, the way culture is considered analytically andtheoretically, and information about measurement. Through thisreview we aim to provide scholars with a more completeunderstanding of the state of cross-cultural work-family knowledgeand offer recommendations for future research that will facilitatetheoretical advancement.KeywordsCross-cultural; Work-family; Global; Collectivism; Genderegalitarianism本⽂文旨在了了解员⼯工⼯工作与家庭⽣生活的交叉点的研究在过去⼏几年年⾥里里已经蓬勃发展,⽽而且在最近的⼀一段时间⾥里里,全球范围内的⼯工作重⼼心开始逐渐扩⼤大。
Eaton公司产品说明:听力障碍人士的火警报设备说明书
LISTING No.:7125-0785:0152CATEGORY:7125 - FIRE ALARM DEVICES FOR THE HEARING IMPAIRED LISTEE:Cooper Wheelock Inc. 7246 16th St. E., Ste. 105, Sarasota, FL, 34243Contact: Conover,ThomasDESIGN:Models *ET70-241575W, ET70-24MCW, ET70-24MCC, ET70-24MCWH, ET70-24MCCH, *ET80-241575W, ET80-24MCW, ET80-24MCWH, *ET90-241575W, ET90-24MCW, ET90-24MCC, ET90-24MCWH, ET90-24MCCH, E70-24MCW, E70-24MCC, E70-24MCWH, E70-24MCCH, E90-24MCW, E90-24MCC, E90-24MCWH, and E90-24MCCH multi-candela speaker/strobes. Models E60-R, E60-W, E60-24MCC-FW, E60-24MCCH-FR, E60-24MCCH-FW, E60EXT-R and E60-EXT-W multi-candelaspeaker/strobes. May be followed by suffixes to designate lens lettering and plate color. *Lens color maybe Clear, Red (R), Blue (B), Green (G), or Amber (A). EXT is an extender ring that mounts behind thespeaker. Intended for indoor use only. Refer to listee's data sheet for additional detailed productdescription and operational considerations.RATING:Electrical: 16-33 VDC, 25/70 VRMSFlash rate: 60 flashes/minCandela: MCW =15cd,30cd,75cd or 110cdMCC =15cd,30cd,75cd or 95cdMCWH = 135cd,185cdMCCH = 115cd,177cd*1575 = 15 cd/75cd on axisINSTALLATION:In accordance with listee's printed installation instructions, applicable codes and ordinances, and in amanner acceptable to the authority having jurisdiction. Models followed by W or WH are wall mounts; Cor CH are ceiling mounts. All models are for indoor use.MARKING:Listee's name, model number, electrical and candela rating, and UL label.APPROVAL:Listed as strobe speakers suitable for the hearing impaired when used in conjunction with separatelylisted electrically compatible fire alarm control units. For synchronization strobes, Models SM-12/24 orDSM-12/24 synchronized control modules (CSFM Listing No. 7300-0785:132) must be used. Refer tomanufacturer ’s Installation Manual for details.NOTES:These units do not generate a temporal pattern signal. If the distinctive three-pulse Temporal PatternFire Alarm Evacuation signal (for total evacuation) in accordance with NFPA 72, 2002 Edition is required,the appliance must be used with a fire alarm control unit that can generate the temporal pattern signal.LISTING SERVICEThis listing is based upon technical data submitted by the applicant. OSFM Fire Engineering staff has reviewed the test results and/or other data but does not make an independent verification of any claims. This listing is not anendorsement or recommendation of the item listed. This listing should not be used to verify correct operational requirements or installation criteria. Refer to listee's data sheet, installation instructions and/or other suitable information sources.Date Issued: 06/21/2022Listing Expires: 06/30/2023Authorized By: Victor Wong , Program CoordinatorFire Engineering & Investigations Division *Rev. 07-30-10 fmLISTING SERVICE。
基于拉格朗日场的多级运动特征暴力行为识别
文章编号:1003-0530(2022)07-1497-10第38卷第7期2022年7月信号处理Journal of Signal ProcessingVol.38No.7Jul.2022基于拉格朗日场的多级运动特征暴力行为识别娄久左德承张展刘宏伟(哈尔滨工业大学计算机科学与技术学院,黑龙江哈尔滨150001)摘要:针对暴力行为识别过程中缺乏描述不同时间尺度下暴力行为运动变化的问题,本文提出了一种基于拉格朗日场的多级运动特征暴力行为识别算法。
该算法将描述非线性粒子运动的拉格朗日场引入暴力行为分析过程中,首先通过构建基于光流的拉格朗日场来挖掘不同时间尺度下暴力行为运动特征,设计了基于拉格朗日场的多级运动模块,该模块可以根据输入光流序列长度,计算多级运动特征;然后构建了基于流量门控制机制的双流网络,将多级运动特征和RGB 图像特征融合;最后,利用LSTM 和全连接模型计算识别结果。
实验证明,该方法在公共暴力识别数据集上取得了很好的效果,特别是在真实监控场景的RWF -2000数据集上,暴力行为识别正确识别率可以达到88.4%,优于其他算法。
关键词:暴力行为识别;多级运动特征;双流网;拉格朗日场;光流中图分类号:TP3-05文献标识码:ADOI :10.16798/j.issn.1003-0530.2022.07.016引用格式:娄久,左德承,张展,等.基于拉格朗日场的多级运动特征暴力行为识别[J ].信号处理,2022,38(7):1497-1506.DOI :10.16798/j.issn.1003-0530.2022.07.016.Reference format :LOU Jiu ,ZUO Decheng ,ZHANG Zhan ,et al.Violence recognition based on multilevel -motion fea⁃tures of Lagrange field [J ].Journal of Signal Processing ,2022,38(7):1497-1506.DOI:10.16798/j.issn.1003-0530.2022.07.016.Violence Recognition Based on Multilevel -motion Features ofLagrange FieldLOU JiuZUO DechengZHANG ZhanLIU Hongwei(School of Computer Science and Technology ,Harbin Institute of Technology ,Harbin ,Heilongjiang 150001,China )Abstract:In different time scales in the process of violence recognition ,a multilevel -motion feature violence recognition algorithm based on Lagrange field is proposed in this paper.In this algorithm ,the Lagrange field describing the nonlinear particle motion is introduced into the process of violence analysis.The opt Lagrange field based on optical flow is constructed to mine the motion characteristics of violence at different time scales ,and a multilevel -motion module based onLagrange field is designed.The module can calculate the multilevel -motion characteristics according to the length of the input optical flow sequence ;Then ,a dual flow network based on flow gate control mechanism is constructed to fuse multilevel -motion features and RGB image features ;Finally ,the recognition results are calculated by using LSTM and full connection model.Experiments show that this method has achieved good results in public violence identification data set ,especially in RWF -2000data set of real monitoring scene ,the correct recognition rate of violence identification can reach 88.4%,which is better than other algorithms.Key words:violence recognition ;multilevel -motion features ;two -streams network ;Lagrange field ;optical flow收稿日期:2021-08-03;修回日期:2022-04-05基金项目:国家自然科学基金项目(62171155)信号处理第38卷1引言暴力行为是以人身、财产为侵害目标,采取暴力手段,对被害人的身心健康和生命财产安全造成极大的损害,直接危及人的生命、健康与自由的一种行为[1]。
- 1、下载文档前请自行甄别文档内容的完整性,平台不提供额外的编辑、内容补充、找答案等附加服务。
- 2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
- 3、如文档侵犯您的权益,请联系客服反馈,我们会尽快为您处理(人工客服工作时间:9:00-18:30)。
TEMPORAL PATTERN IDENTIFICATION OF TIME SERIES DATA USING PATTERN WAVELETS ANDGENETIC ALGORITHMSRICHARD J. POVINELLI AND XIN FENGDepartment of Electrical and Computer EngineeringMarquette University, P.O. Box 1881, Milwaukee, WI 53201-1881, USAE-mail: povinellir@Ph: 414.288.6820Fx: 414.288.5579ABSTRACT:A new method for temporal pattern matching of a time series isdeveloped using pattern wavelets and genetic algorithms. Thepattern wavelet is applied to the matching of an embedded timeseries. A problem-specific fitness factor is introduced in the newalgorithm, which is useful to construct a fitness function of thefeature space. A two-step process discovers the pattern wavelet thatyields high fitness value. The best temporal pattern matches arefound through a thresholding process. These matches are kept andthe future time series data point is used in the genetic algorithm'sfitness function. The algorithm has been successfully applied to theidentification of statistically significant temporal patterns infinancial time series data.Keywords: Temporal Pattern Identification, Genetic Algorithms, Pattern Recognition, Time Series Analysis, WaveletsINTRODUCTIONData mining is the exploration of data with the goal of discovering hidden structure. In many real-world applications, it is important to study the change of temporal features of a non-stationary time series, and identify the ones that are representing the significance of time instances. For example, it is critical in stock market applications that the patterns relating to sudden stock price changes be identified. Generally such time series are considered non-stationary. Traditional time series analysis employs statistical methods to model and explain the data and predict future values of the time series. It is not easy, however, to identify the critical temporal patterns of the time series using these traditional methods.Using a set of observations, in this paper, we present a new method for time series data mining. By introducing a pattern wavelet along with the use of a genetic algorithm (GA), temporal patterns can be effectively revealed in non-stationary time series.The paper is organized as follows. After presenting the problem statement, traditional ARMA modeling is reviewed. The ideas of temporal pattern matchingand the pattern wavelet are then discussed. Next, a detailed discussion of the new algorithm is provided. Finally, a presentation of the results and conclusions is given. PROBLEM STATEMENTLet Z = {z t , t = 1,…, N } be the non-stationary target time series, whose temporal features evolve over time. The task is to find an approach to characterize these changing temporal features.Applying traditional time series modeling to this problem involves finding solutions to the Box-Jenkins difference equation (Bowerman and O'Connell 1993) .()()φδθp t q t B z B a =+,where φp (B ) is the nonseasonal autoregressive operator of order p , θq (B ) is the nonseasonal moving average operator of order q , z t is the time series, a t is a sequence of random variables, δ is a constant term, and B is the backshift operator. The Box-Jenkins method is limited by the requirement of stationarity of the time series and normality and independence of the residuals. However, in most applications, these conditions are not met. One of the most severe drawbacks of this approach is the loss of the non-stationary characteristics we desire to identify.Our method takes a new approach. Let()z t T t t Q z z t N Q ==−++−,,,,,K K 111be the set of sub-time series of length Q embedded in Z, where Q ≤ N . Clearly, z t ⊆ Z , which may represent the changing temporal features or patterns of Z . We propose that by studying the embedding z t , the temporal features of Z may be identified. The method for eliciting the temporal features from the embedding z t arises from a study of wavelets and the wavelet transform. The wavelet transform is a natural extension of Fourier's work done in the early 19th century. Where Fourier's transform can find frequency information with no time reference or time information with no frequency, the wavelet transform provides both time and frequency information.Generally speaking, the wavelet transform matches a compactly supported function, called a wavelet, across both scale (frequency) and translation (time) (Polikar 1996). The Fourier transform matches an infinitely supported function across frequency (scale). Both use convolution of the basis function and the original time series. For the wavelet transform, it is provided for all scales.Next we introduce the so called “pattern wavelet” and “pattern wavelet transform”. This transform is an extension of a discrete form of the wavelet transform applied specifically to identifying temporal features.PATTERN WAVELETSBy relaxing the restrictions of the wavelet transform, the pattern wavelet transform is derived. Where the wavelet transform uses the convolution of the wavelet and thetime series, the pattern wavelet transform uses a subset of the convolution of the pattern wavelet and the time series. Also, where the wavelet is required to have a zero mean, the pattern wavelet is not. These relaxations yield a transform that identifies the temporal features discussed in the problem statement. A detailed explanation of the algorithm follows.Let f (p ,δ,Z,g ) be the pattern wavelet transform, where p ∈ P ⊆ ℜQ is the pattern wavelet, δ ∈ ℜ is a threshold parameter, and g = g (z t ) is a measure of fitness of the temporal feature. We want to find the optimal solution to the following problem(){}max ,,,,,p p p δδδf Z g P Q ∈⊆ℜ∈ℜ. (1)The pattern wavelet transform f (p ,δ,Z,g ) is the fitness of pattern p with threshold δ applied to time series Z with fitness measure g . The following definitions are needed for f .(){}r t N Q N Q r N Q r M t r t t r t t N Q r t r t N Q t r r ==−+=−+=−+−=≥+=−+=−+p z ,,,, : 111111112211K µσµµδσThe vector z t ⊆ Z is the embedded series of length Q, where Q ≤ N . The pattern factors r t , t = 1,…, N -Q +1, are elements of the vector r ∈ ℜN-Q+1 which consists of N -Q +1 inner products of the pattern wavelet p and the embedded time series z t . Also µr denotes the mean of r t , σr is the standard deviation of r t , and M is the pattern match set, which is defined as the set of all time instances t where the pattern factor r t is greater than or equal to the threshold µr + δσr . Finally, the pattern wavelet transform f is defined as the mean of g (z t ) for t ∈ M .()()()f Z g c M g z M t t Mp ,,,δµ≡=∈1 (2)where c (M ) is the cardinality of M . Also σM is the standard deviation of g (z t ) at times t ∈ M .()()()σµM t M t Mc M g z =−∈12It should be noted that the selection of fitness operator g in (2) is problem specific and is independent of the algorithm. It should be chosen a priori based on the types of hidden temporal features to be discovered.Because the maximization problem in (1) is complex and nonlinear, it is difficultto solve using traditional numerical optimization methods. To overcome theselimitations, a roulette wheel based GA with elitism (Goldberg 1989) searches for the optimal p and δ. Ideally p∈ℜQ and δ∈ℜ, for efficiency purposes p∈ [-ε, ε]Q andδ∈ [δ1, δ2]. These ranges are discrete due to the nature of the GA with a possible 2b unique values, where b is the number of bits used to represent p i and δ. Theparameters for the GA are Q, Z, g, b, and the population size. The parameter b isusually in the range of 4 to 16 and the population size is set to 30. The most eliteindividual is maintained from generation to generation without change. No mutationis used. The GA is shown below.Pattern Finding Genetic Algorithm1. Create an elite populationa) Randomly generate large population (10 times normal population size)b) Calculate fitnessc) Select the top 10th of the population to continue2. While all fitness have not convergeda) Perform roulette selection, save elite individualb) Crossover populationC)Calculate fitnessAPPLICATION RESULTSThe goal of this application is to find hidden temporal patterns in a certain stocktime series. Our experimental time series is the daily open stock price of the Quantum (QNTM, traded on the NASDAQ) time series Z = {z t, t = 1,…, N} with N=3,761. See Figure 1 for illustration. Obviously, this time series is non-stationary. Our special interest is to identify the temporal pattern that is related to a significant price change.ARMA ModelTwo ARMA models of the time series reveal essentially the same random walkcharacteristics. The models areFigure 1- Quantum Corp stock time seriesz z t t t =+−φε 1 (3)z z z t t t t =+−+−112φφε (4)z z t t t =+−1 ε (5) where φ= 0.99933 in (3) and φ = 0.045948 in (4). The φ in both models is statistically significant, but the autocorrelations of (3) show strong evidence of non-stationarity and the Ljung-Box test of the residuals indicates a lack of independence. The model (4) Ljung-Box test of the residuals indicates independence. By seeing that the φ≅ 1 in (3) and φ ≅ 0 in (4), both models become equivalent (5). The ARMA models provide little insight into hidden structure in the time series; the series is a random walk. On the other hand the method presented by the authors finds statistically significant structure as presented below. Pattern Wavelet ModelIn building the pattern wavelet model, the fitness operator g in (2) is chosen as()()g z B B Bz t Q Q Q t =−−−+−1.In our case we want to find features that indicate a fit ∆% after the end of the pattern match.We found c (M ) to be between 138 and 314, depending on the support of the pattern wavelet. The statistics for eight patterns are given in Table 1. The change in the stock price after a pattern match was between +0.7% and +1.5%, whereas the average change was +0.12%. This shows that there is a correlation between the patterns and the price changes. The standard deviation, though, is between 3% and 4% for the patterns and 3% for the average day. The µM of the matched patterns is between 5 to 12 higher than µg (Z ) of the whole time series. Two statistical tests are used to show significance of the results. The first test is the runs test. The test hypothesis is H 0: There is no difference between the matched time series and the remaining time series. H A : There is significant difference between the matched time series and the remaining time series. Our test uses a 1% probability of Type I error (α = 0.01). Table 1 shows that the null hypothesis can easily be rejected in all cases. The second statistical test is the difference of two independent means. The two populations are the transformed series and the whole time series. Although the two populations are probably dependent, this can be ignored because it makes the statistics more conservative, i.e., it will tend to overestimate the Type I error. The test hypothesis is H 0: µM - µg (Z ) = 0, H A : µM - µg (Z ) > 0. This test uses a 1% probability of Type I error (α = 0.01). Again, Table 1 shows that the null hypothesis can be very confidently rejected for all the patterns. The mean fitness of the time series µg (Z ) = 0.001179, and the σg (Z ) = 0.032931.TABLE 1 – STA TISTICAL SIGNIFICANCE OF RESULTSQ c(M ) µM σM Runstest α means test α 1 238 0.00736 0.0385 < 1.00x10-17 8.81x10-32 167 0.00834 0.0375 < 1.00x10-17 7.58x10-33 357 0.00746 0.0336 < 1.00x10-17 3.64x10-44 185 0.00913 0.0417 4.78x10-10 5.30x10-319 201 0.01057 0.0416 < 1.00x10-17 8.28x10-421 144 0.01397 0.0362 < 1.00x10-17 1.51x10-527 190 0.01276 0.0406 4.44x10-16 5.55x10-539 210 0.01113 0.0348 < 1.00x10-17 2.56x10-5CONCLUSIONSIn this paper, a new method for temporal data mining is proposed. Using a pattern wavelet transform as a data mining tool has yielded meaningful results. Instead of forcing the wavelet to match everywhere, it matches only when there is a high similarity between the pattern wavelet and the underlying time series. To find such pattern wavelets, a genetic algorithm is used. Even with a complex, non-stationary time series like stock price, the algorithm detected interesting patterns. Across all tested Q the patterns found were statistically significant.The algorithm is flexible in that by using an alternative g , fitness function, different structures can be found. The g used in this research was for positive changes, but just as easily()()g z B B B z t Q Q Qt =−−−−+−1which would find negative changes. Also, a more complicated g could be used that could take into account the standard deviations of the matches.Future research directions will include exploring combinations of patterns, looking for patterns in shorter segments of the time series, and adding additional factor dimensions such as volume.REFERENCESBowerman, B. L., and O'Connell, R. T. (1993). Forecasting and Time Series: An Applied Approach ,Duxbury Press, Belmont, California.Ghoshray, S. (1996). “Hybrid prediction technique by fuzzy inferencing on the chaotic nature of timeseries data.” Artificial Neural Networks in Engineering, Proceedings , 725-730.Goldberg, D. E. (1989). Genetic algorithms in search, optimization, and machine learning , Addison-Wesley Pub. Co., Reading, Mass.Lin, C. T., and Lee, C. S. G. (1996). Neural Fuzzy Systems - A Neuro-Fuzzy Synergism to IntelligentSystems , Prentice-Hall, Upper Saddle River, NJ.Polikar, R. (1996). “The Engineer's Ultimate Guide To Wavelet Analysis - The Wavelet Tutorial.” . Weigend, A. S., and Gershenfeld, N. A. (1994). “Time Series Prediction: Forecasting the Future andUnderstanding the Past.” , Addison-Wesley Pub. Co., Reading, MA.。