| COMPUTER-IMPLEMENTED METHOD OF TRAINING NEURAL NETWORK TO DETERMINE GENRE AND SUBGENRE OF TEXT | |
| 2023-11-12 | |
| 专利权人 | AUTHORS PLATFORM LLC (AUTH-Non-standard) |
| 申请日期 | 2023-11-12 |
| 专利号 | RU2831511-C1 |
| 成果简介 | NOVELTY - Present invention relates to determining the genre of text, in particular to training a neural network for determining the genre and subgenre of text, including a large volume and complex semantic structure. According to the proposed method of training a neural network to determine the genre and subgenre of the text at the first stage: providing the availability of texts from the first group relating to one genre and containing a through named entity, and a dictionary containing said named entity and words falling into a predetermined step before and after the end-to-end named entity, training the neural network using the text from the first group, during training, the neural network selects the named entity and words and/or context structures falling into the given step before and after the named entity, they are placed in a list and the list is compared with said dictionary, based on which the neural network outputs the matching result to determine the genre of the text. At the second stage: providing the availability of texts from the second group, relating to the same genre and containing different named entities, training the neural network, trained at the first stage, using the text from the second group, repeating said operations of the first stage, starting with the named entity selection. At the third stage: after training the neural network at least two genres, providing the presence of texts from a third group relating to said trained genres and containing different named entities, and a combined dictionary obtained from augmented dictionaries for said trained genres, and training the neural network using the text from the third group, repeating said operations of the first step, starting with the named entity selection, wherein the merged dictionary is used for the comparison operation, and at the output, the neural network outputs a comparison result to determine the genre and subgenre of the text. USE - Information technology. ADVANTAGE - Proposed method reduces the total amount of training data and time for training a neural network for the task of determining genre and subgenre belonging of large text corpuses with provision of high accuracy of results. 5 cl |
| IPC 分类号 | G06F-017/00 |
| 国家 | 俄罗斯 |
| 专业领域 | 信息技术 |
| 语种 | 英语 |
| 成果类型 | 专利 |
| 文献类型 | 科技成果 |
| 条目标识符 | http://119.78.100.226:8889/handle/3KE4DYBR/19288 |
| 专题 | 中国科学院新疆生态与地理研究所 |
| 作者单位 | AUTHORS PLATFORM LLC (AUTH-Non-standard) |
| 推荐引用方式 GB/T 7714 | ALIEV R I,GRIGORYEV S S,KIPARISOV A S,et al. COMPUTER-IMPLEMENTED METHOD OF TRAINING NEURAL NETWORK TO DETERMINE GENRE AND SUBGENRE OF TEXT. RU2831511-C1[P]. 2023. |
| 条目包含的文件 | 条目无相关文件。 | |||||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论