To resolve the standards for the word class and the word segmentation which is a premise for this issue,this paper firstly classifies the Tibetan words accoring to requirements of Tibetan information processing,and then provides a systemic and applicable word segmentation scheme.
为了便于计算机对自动分词、词性标注的辨认,该文首先要确定满足藏文信息处理中词类的需求,并根据藏文自身的词汇特点与构词规律,提出了较为系统、适用的分词规范。
For Chinese words segmentation,a module which is based on word library and uses the positive direction maximum matching algorithm was presented.
在该系统模型中,针对中文分词实现了基于词库的采用正向最大匹配算法的中文分词模块;针对多种格式文档的处理采用接口实现的方式和动态实例化的方法,实现了可以有效地处理txt、xml、html、pdf、doc和rtf等常见格式文档。
Chinese WEB documents classification involves to the documents automatic capture, the information processing and the extraction, the automatic sorting and so on, this article realizes a open style Chinese WEB documents automatic sorting system, and has applied several improvement algorithms in the system module, in the main solution present information retrieval involves when Chinese participle .
中文WEB文档的分类涉及到文档的自动抓取、信息加工和提取、自动分类等,本文实现一个开放式的中文WEB文档自动分类系统,并在系统模块中应用了几个改进算法,主要解决目前信息检索中涉及中文分词搜索时所遇到的一些问题。
The computer may very easily understand English word, but Chinese sentence which is composed by the word, which can be understood through Chinese participle technology.
计算机可以很容易地理解英文单词,而对由词组成的中文句子,必须通过中文分词技术才得以理解。
It returned to the first results of the Chinese participle and fully tap its semantic information use CC4 neural networks to judge the he rele-vant web page to re-sort the results,and a good solution integrated search engine does not prevail in the search results accurate information stagnant.
它使用神经网络对检索结果进行优化排序,它先对返回结果进行中文分词,在充分挖掘其语义信息的基础上,利用CC4神经网络对网页的相关性进行判断,对返回结果重新排序,很好地解决了综合性搜索引擎中普遍存在搜索结果不准确、信息滞后等问题。
To improve the preprocessing performance for anti-spam and to search for phrases more efficiently, this paper creatively constructed an indexing dictionary based on hash algorithm, and designed a method of Chinese phrase segmentation based on this indexing dictionary aiming at anti-Chinese-spam.
通过实验,表明该方法提高了传统机械分词法的效率和准确率,改善了邮件预处理阶段的性能,并且可以广泛地应用于中文分词领域。
CRFs-based Chinese word segmentation method with character position probability feature;
基于字位置概率特征的条件随机场中文分词方法
Research of Chinese word segmentation system applies in professional search engine;
一种适用于专业搜索引擎的中文分词系统研究
Chinese Word Segmentation Method Based on Dictionary and Frequency of the Words;
基于词典和词频的中文分词方法
The Research and Implementation of Computability on the Chinese Segmentation Specification;
中文分词规范可计算化的研究与实现
An analysis of phenomena of nonstandard using words in university literature books;
浅析高校文科教材中的用词不规范现象
The usage and normalization of lettered words and phrases in chinese;
汉语文本中字母词语的使用与规范探讨
An Analysis of the Misuse in the Television Medium Language;
电视媒体语言中违背词汇规范现象分析
On the Standardization and Control of Indexing in Using the Keyword Glossary of Chinese Book Catalogue;
使用《中国图书分类主题词表》标引的规范与控制
Standardization of word spacing forms in bibliographic references of sci-tech papers in China
应重视科技论文参考文献表中词间距的规范格式
A Standardize Analysis on Public Interest From the Visual Angle of Constitution Version;
宪法文本中“公共利益”的规范分析
Serene Consideration on Chinese Language Regulation--The usage of letter words in Chinese;
语文规范中的冷静思考——关于汉语中的字母词使用问题
Analysis about the Style Effect and Relative Regulation about Letter Loanwords in Chinese Advertisements;
浅析广告中字母外来词的文体效应及其规范化问题
The Standard Expression of Eglish Scientific & Technical Terminology in Chinese Thesis;
英语科技外来词在汉语科技论文中表达的规范性
On the Loanwords in Logo and the Advertisement Title--from the cultural translation to the language standard
商标、广告标题用语中的外来词研究——由文化翻译到语言规范
On Expression of Such terms as "Code" and "Standard"
汉语“规范”及“标准”等词语的英文表达法
The Pragmatic Principles in Contemporary Chinese Normalization: A Study on Color Lexeme Normalization;
从色彩词的规范谈规范化工作中的语用原则
Analysis of Lexicology Characteristics of English Translation Rules in Traditional Chinese Medical Literature;
中医药学文献英译规律中词汇学特点分析
A Further Consideration on the Conventional Principle for the Standardization of Homophones: On the Three Aspects in the Standardization of Homophones;
再论异形词规范的俗成性原则——谈异形词规范中的三个问题
On Assurance of Correct Errors in Putonghua Proficiency Test(ppt ) --And Analysis on Pronouncing Standardization of Some Words;
普通话水平测试中的正误把握——兼及部分字词读音规范化的探析
Contemporary Chinese language word segmentation specification for information processing
GB/T13715-1992信息处理用现代汉语分词规范
The Motivational Analyse and Norm Guiding of the Applying of Four Classifiers;
规范四个量词用法的理据分析及建议
CopyRight © 2020-2024 优校网[www.youxiaow.com]版权所有 All Rights Reserved. ICP备案号:浙ICP备2024058711号