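Pan Xin, Teng Fei. Application of new information and new technologies in the editorial work of traditional sci-tech journals. 编辑学报 (Acta Editologica), 2018, 30(3): 302-303 |
Application of new information and new technologies in the editorial work of traditional sci-tech journals |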
Automatic extraction of metadata from Founder typesetting files and of pages from PDF files for a webzine: the website of Nursing of Integrated Traditional Chinese and Western Medicine as an example |
|
DOI:10.16811/j.cnki.1001-4314.2018.03.027 |
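Chinese keywords: 网刊 (webzine); 元数据 (metadata); 自动提取 (automatic extraction); PDF文件 (PDF file); 自动分割-合并 (automatic split-merge) |
English keywords: webzine; metadata; automatic extraction; PDF file; automatic split-merge |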
Funding: |
|
Abstract views: 1366 |
Full-text downloads: 1099 |
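Chinese abstract (translated): |
Using the Founder BookMaker (方正书版) files that typeset Nursing of Integrated Traditional Chinese and Western Medicine as an example, this paper describes techniques for choosing the "quasi tag pairs" that delimit metadata fields, and methods for handling character compatibility and format equivalence between fbd files and HTML files. On this basis, high-quality webzine metadata can be extracted automatically and efficiently, and PDF files can be split automatically and precisely, with continued pages merged back into their articles. Practice shows that, for a specific journal, this work is easy to carry out in-house. |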
English abstract: |
Taking the Founder typesetting file (fbd file) of Nursing of Integrated Traditional Chinese and Western Medicine as an example, this paper introduces tips for selecting quasi tag pairs that locate the different metadata fields in fbd files, and ways to resolve character compatibility and format equivalence between fbd files and HTML files. With these in place, high-quality metadata can be extracted from Founder typesetting files automatically and efficiently, and pages of a PDF file can be split and merged accurately. Practice has shown that, for a particular journal, all of this work can be completed without difficulty. |
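The tag-pair idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the marker strings (`〖TITLE〗`, `〖AUTHOR〗`, `〖ABS〗`, `〖KEY〗`), the field names, and the sample text are all hypothetical placeholders, and real fbd files would need the journal's own recurring typesetting commands as boundaries. Character compatibility with HTML is approximated here only by escaping the reserved characters `&`, `<`, and `>`.

```python
import html
import re

# Hypothetical "quasi tag pairs": each field is assumed to sit between an
# opening and a closing marker. These markers are invented for illustration
# and are not taken from the paper or from any real fbd file.
TAG_PAIRS = {
    "title": ("〖TITLE〗", "〖AUTHOR〗"),
    "authors": ("〖AUTHOR〗", "〖ABS〗"),
    "abstract": ("〖ABS〗", "〖KEY〗"),
}


def extract_fields(text, tag_pairs):
    """Return {field: HTML-safe text} for every tag pair found in text."""
    fields = {}
    for name, (start, end) in tag_pairs.items():
        # Non-greedy match so the field stops at the first closing marker.
        pattern = re.escape(start) + r"(.*?)" + re.escape(end)
        match = re.search(pattern, text, flags=re.DOTALL)
        if match:
            # Collapse typesetting line breaks into single spaces.
            raw = " ".join(match.group(1).split())
            # Escape characters that are reserved in HTML.
            fields[name] = html.escape(raw)
    return fields


# Invented sample standing in for the text of a typesetting file.
sample = (
    "〖TITLE〗Care of pressure ulcers: A & B compared"
    "〖AUTHOR〗Zhang San, Li Si"
    "〖ABS〗Objective: to compare\ntwo regimens (n<50).〖KEY〗"
)

fields = extract_fields(sample, TAG_PAIRS)
for name, value in fields.items():
    print(f"{name}: {value}")
```

Letting one marker close a field and open the next mirrors the notion of reusing commands that are already present in the typesetting file, rather than adding dedicated tags. Whether adjacent fields share boundaries in this way in practice is an assumption of the sketch.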