基于片段化XML文档结构的内容重组模型的研究
发布时间:2018-10-08 09:31
【摘要】:传统的信息,特别是书籍、报纸等出版印刷领域的信息存储形式一般是把版式信息和信息本身给混合在一起,,这样就导致很难重复利用信息本身。因此需要有一种有效的方式来存储信息,这种信息存储方式能够使存储格式跨平台,内容和版式要分离,存储形式要满足可重用的要求。 基于片段化XML文档结构的内容重组模型的研究目标是寻找准确、高效、能重复利用文本内容的方法。XML是理想的文档编写格式,对于信息开发有以下优势:它强调的是内容的结构,而不是形式;可以更好地保持内容的一致性,并更好地保证内容的表现形式对各种不同输出设备和格式的一致性。通过对国内外内容重组技术的研究,充分地分析了图书、期刊、报纸、标准等各种出版物的结构,设计出了基于片段化XML文档结构的内容重组模型。并对模型的思想,模型的详细描述,模型的实现进行了详细地介绍。 基于片段化XML文档结构的内容重组模型通过内容对象的基础模型到复合文档结构的映射表,将基于片段化XML文档结构的内容对象通过映射重组为具备层级结构的复合文档。在映射重组过程中,根据最终交付文档的语义表现形式,生成面向不同主题的交付文档。基于片段化XML文档结构的内容重组模型将划分成适当颗粒度的内容模块,也就是主题存放在主题库中,通过映射将与创作有关的主题组织和连接在一起。按照所需交付出版物设定相应的样式模板,选择相应的输出类型,通过XSLT技术转换得到最终交付出版物。 基于XML文档结构的内容重组模型能够很好地支撑不同XML文档结构间转换和组合关系。但是其XML结构文档需要合理化,片段化的内容要能很好地独立描述完整的意思,这样重组映射出来的文档才能不利用上下文的关系而很好地重组为最终交付物。
[Abstract]:Traditional information, especially books, newspapers and other fields of information storage in the field of publishing and printing is usually the layout of information and the information itself to mix together, so it is difficult to reuse the information itself. Therefore, there needs to be an effective way to store information, which can make the storage format cross-platform, content and layout to be separated, storage form to meet the requirements of reusable. The research goal of the content recombination model based on segmented XML document structure is to find an accurate and efficient way to reuse the text content. Information development has the following advantages: it emphasizes the structure of the content, not the form; it can better maintain the consistency of the content, and better ensure the consistency of the different output devices and formats. Through the research on the technology of content recombination at home and abroad, the structure of various publications, such as books, periodicals, newspapers, standards and so on, is fully analyzed, and a content reorganization model based on the structure of segmented XML documents is designed. The idea of the model, the detailed description of the model, and the implementation of the model are introduced in detail. The content recombination model based on segmented XML document structure is transformed into a hierarchical composite document by mapping the content object based on the fragment XML document structure into the mapping table of the composite document structure through the basic model of the content object. In the process of mapping reorganization, according to the semantic representation of the final delivery document, the delivery document oriented to different topics is generated. The content reorganization model based on fragment XML document structure will be divided into content modules with appropriate granularity, that is, the topic is stored in the topic library, and the theme related to authoring is organized and connected by mapping. According to the required delivery of publications set the corresponding style template, select the corresponding output type, through XSLT technology conversion to get the final delivery of publications. The content recombination model based on XML document structure can support the transformation and composition relationship between different XML document structures. However, its XML structure document needs to be rationalized, and the fragment content should be able to describe the complete meaning independently, so that the recombined mapped document can be reorganized into the final delivery without using the context relationship.
【学位授予单位】:华中科技大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP333;TP391.1
本文编号:2256282
[Abstract]:Traditional information, especially books, newspapers and other fields of information storage in the field of publishing and printing is usually the layout of information and the information itself to mix together, so it is difficult to reuse the information itself. Therefore, there needs to be an effective way to store information, which can make the storage format cross-platform, content and layout to be separated, storage form to meet the requirements of reusable. The research goal of the content recombination model based on segmented XML document structure is to find an accurate and efficient way to reuse the text content. Information development has the following advantages: it emphasizes the structure of the content, not the form; it can better maintain the consistency of the content, and better ensure the consistency of the different output devices and formats. Through the research on the technology of content recombination at home and abroad, the structure of various publications, such as books, periodicals, newspapers, standards and so on, is fully analyzed, and a content reorganization model based on the structure of segmented XML documents is designed. The idea of the model, the detailed description of the model, and the implementation of the model are introduced in detail. The content recombination model based on segmented XML document structure is transformed into a hierarchical composite document by mapping the content object based on the fragment XML document structure into the mapping table of the composite document structure through the basic model of the content object. In the process of mapping reorganization, according to the semantic representation of the final delivery document, the delivery document oriented to different topics is generated. The content reorganization model based on fragment XML document structure will be divided into content modules with appropriate granularity, that is, the topic is stored in the topic library, and the theme related to authoring is organized and connected by mapping. According to the required delivery of publications set the corresponding style template, select the corresponding output type, through XSLT technology conversion to get the final delivery of publications. The content recombination model based on XML document structure can support the transformation and composition relationship between different XML document structures. However, its XML structure document needs to be rationalized, and the fragment content should be able to describe the complete meaning independently, so that the recombined mapped document can be reorganized into the final delivery without using the context relationship.
【学位授予单位】:华中科技大学
【学位级别】:硕士
【学位授予年份】:2013
【分类号】:TP333;TP391.1
【参考文献】
相关期刊论文 前10条
1 赵相国;王国仁;韩东红;丁大斌;;XML的函数依赖[J];东北大学学报(自然科学版);2008年01期
2 薛万国;XML与电子病历[J];国外医学(医院管理分册);2002年01期
3 高阳,谭力民;基于XML文档的关系数据库与面向对象数据库之间的信息交互[J];计算机工程与应用;2003年03期
4 吴沉寒,朱先忠,孟令奎,邓世军;基于XMLRPC通信的一卡通系统[J];计算机工程与应用;2004年27期
5 王春宇,李建中,何震瀛;基于DTD节点自动机的XML模式验证方法[J];计算机工程与应用;2004年32期
6 聂培尧;安世虎;;XML及语义Web技术[J];计算机科学;2001年05期
7 黄芳;孙建伶;;XML文档顺序的维护[J];计算机科学;2004年08期
8 朱新华;黄立和;罗辉;张显全;;基于XML模式的作业描述语言的设计与处理[J];计算机工程;2007年16期
9 余诗权;谢冬青;;基于关系数据库的XML数据转换架构[J];计算技术与自动化;2006年02期
10 李磊;李一凡;赵怀慈;;一种基于XML的文档处理模型[J];计算机应用与软件;2006年07期
本文编号:2256282
本文链接:https://www.wllwen.com/kejilunwen/jisuanjikexuelunwen/2256282.html