From: Martin Sevior (msevior_at_seviorpc.ph.unimelb.edu.au)
Date: Tue Nov 25 2003 - 23:44:27 EST
On Wed, 2003-11-26 at 00:48, Nadav Rotem wrote:
> The Inner product of two text is defined as the number of topics they
> share. One of my professors is doing research in this field and needed a
> matrix of the inner product of cunks of text. Here is a quick example,
> in a Bash script, of how to use OTS to generate this list of topics.
Wow, Linear Algebra meets linguistics!
I never thought there would be an overlap of those two fields...
> Usage of the script:
> [nadav_at_gringo articles]$ ./inner.sh sacbee1.txt sacbee2.txt
> <sacbee1.txt,sacbee2.txt>= 0
> [nadav_at_gringo articles]$ ./inner.sh test1.txt test2.txt
> <test1.txt,test2.txt>= 3
> >From your c Code you can get the list of topics through this call:
> word = ots_word_in_list(Doc->ImpWords,i);
This archive was generated by hypermail 2.1.4 : Tue Nov 25 2003 - 22:45:34 EST