KS_SELECTION - implements the heuristic from [2] for selecting 
  the number of factors taken from each cluster in the 
  Clustered Latent Semantic Indexing (CLSI) method [1].
    N_ST=KS_SELECTION(A, N_COLS, ALPHA_VAL, L) returns N_ST, 
    a vector of integers giving the number of factors selected 
    from each cluster of the term-document matrix (tdm) A; the 
    entries of N_ST sum to L. N_COLS is a vector containing the 
    last column index of each column block (cluster) of A, and 
    ALPHA_VAL is a value in [0, 1]. 
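
    A minimal usage sketch follows, showing how the inputs in the 
    signature above fit together. The toy matrix, the cluster 
    sizes, and the values chosen for ALPHA_VAL and L are 
    hypothetical; the sketch also assumes that the columns of A 
    are already ordered so that each cluster is a contiguous 
    block, and that N_COLS may be passed as a row vector. The 
    call is guarded so that the script still runs when 
    KS_SELECTION is not on the path.

```matlab
% Hypothetical toy tdm: 6 terms by 7 documents, columns grouped by cluster.
M = magic(7);
A = sparse(M(1:6, :));

% Hypothetical cluster sizes; N_COLS holds the last column index of each block.
cluster_sizes = [3 2 2];
n_cols = cumsum(cluster_sizes);            % last column of each block
first_cols = [1, n_cols(1:end-1) + 1];     % first column of each block

% Hypothetical parameter choices: ALPHA_VAL in [0, 1], L total factors.
alpha_val = 0.5;
l = 4;

% Call the routine only if it is available on the path.
if exist('ks_selection', 'file')
    n_st = ks_selection(A, n_cols, alpha_val, l);
    disp(n_st);        % factors selected per cluster
    disp(sum(n_st));   % documented to equal l
end
```

    Here block i of A is A(:, first_cols(i):n_cols(i)), which is 
    the column range that entry i of N_ST refers to.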
 
    REFERENCES: 
    [1] D. Zeimpekis and E. Gallopoulos. CLSI: A Flexible 
    Approximation Scheme from Clustered Term-Document Matrices. 
    In Proc. 5th SIAM International Conference on Data Mining, 
    pages 631–635, Newport Beach, California, 2005.
    [2] D. Zeimpekis and E. Gallopoulos. Linear and Non-Linear 
    Dimensional Reduction via Class Representatives for Text 
    Classification. In Proc. 2006 IEEE International Conference 
    on Data Mining, pages 1172–1177, Hong Kong, December 2006.
 
  Copyright 2011 Dimitrios Zeimpekis, Eugenia Maria Kontopoulou, 
                 Efstratios Gallopoulos
					
				
