
JACIII Vol.23 No.2 pp. 268-273
doi: 10.20965/jaciii.2019.p0268


Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model

Haiqun Ma*,** and Tao Zhang***,†

*Center for Russian Language Literature and Culture, Heilongjiang University
Harbin, Heilongjiang 150080, China

**Research Center of Information Resource Management, Heilongjiang University
Harbin, Heilongjiang 150080, China

***Information and Network Center, Heilongjiang University
Harbin, Heilongjiang 150080, China

†Corresponding author

Received: May 31, 2018
Accepted: July 24, 2018
Published: March 20, 2019
Keywords: LDA-Gibbs, topic model, text clustering, weighted algorithm

Policy texts contain large amounts of diverse data and conform strictly to standards and specifications, but traditional text clustering methods cannot cope with their high dimensionality, sparse features, and words with similar meanings. This paper therefore proposes a weighted algorithm based on the LDA-Gibbs model to improve the accuracy of policy text clustering. First, it provides a realistic basis for the assumptions of the LDA-Gibbs topic model and the weighted algorithm. Second, it preprocesses existing simulated policy text data, establishes the LDA-Gibbs model, forms the weighted algorithm, and generates training data to determine the optimal number of topics in the LDA-Gibbs model, then completes the final clustering of the policy texts. Finally, by summarizing, classifying, and drawing conclusions from the experimental data, it demonstrates the objective validity and effectiveness of the method. The authors hope that the overall design can be applied to prospective studies for formulating new policies, to the retrospective evaluation and testing of existing policies, and to the formation of a two-way interactive mechanism.
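
To make the pipeline in the abstract more concrete, the following is a minimal sketch of its core step only: fitting LDA by collapsed Gibbs sampling and clustering each document by its dominant topic. It is not the authors' implementation. The abstract does not specify the weighted algorithm or the procedure for choosing the number of topics, so both are omitted here. The function names (`lda_gibbs`, `cluster_labels`), the hyperparameters `alpha` and `beta`, and the toy word-id corpus are all assumptions made for illustration.

```python
import random


def lda_gibbs(docs, num_topics, vocab_size, alpha=0.1, beta=0.01,
              iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA (illustrative sketch).

    docs is a list of documents, each a list of integer word ids in
    range(vocab_size). Returns the document-topic count table.
    """
    rng = random.Random(seed)
    doc_topic = [[0] * num_topics for _ in docs]                # n_dk
    topic_word = [[0] * vocab_size for _ in range(num_topics)]  # n_kw
    topic_total = [0] * num_topics                              # n_k
    assignments = []

    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_d.append(k)
            doc_topic[d][k] += 1
            topic_word[k][w] += 1
            topic_total[k] += 1
        assignments.append(z_d)

    topics = range(num_topics)
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][w] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional p(z = t | all other assignments).
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][w] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in topics
                ]
                # Resample the topic and restore the counts.
                k = rng.choices(topics, weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][w] += 1
                topic_total[k] += 1
    return doc_topic


def cluster_labels(doc_topic):
    """Assign each document to the topic with the largest count."""
    return [max(range(len(row)), key=row.__getitem__) for row in doc_topic]


# Toy corpus (hypothetical): word ids 0-2 and 3-5 form two vocabularies.
docs = [[0, 1, 2, 0, 1], [1, 2, 0, 2], [3, 4, 5, 3], [4, 5, 3, 5, 4]]
doc_topic = lda_gibbs(docs, num_topics=2, vocab_size=6)
labels = cluster_labels(doc_topic)
print(labels)
```

In this sketch the number of topics is fixed by hand. A fuller version would compare several candidate values on held-out data, which is the role the abstract assigns to the generated training data.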

Cite this article as:
H. Ma and T. Zhang, “Research on Policy Text Clustering Algorithm Based on LDA-Gibbs Model,” J. Adv. Comput. Intell. Intell. Inform., Vol.23 No.2, pp. 268-273, 2019.
