August 21, 2006

Bartell results

First tried it with the target ground truth being the PPK kernel on uspop37. Did
pretty awfully on the R-precision task, about .16 (a random kernel with the correct diagonal gets about .125).

Using the LFM-collection ground truth and uspop1000, the results were much better. Below is a plot of R-precision vs. Bartlell's 'k' parameter, which is the dimensionality of the semantic space.

Also, the LFM ground truth kernel by itself gets very high R-precision: .567, max hubness 325.

Ah ha... But if I normalize the collections before computing the kernel, then the Bartell results are much better:


Posted by madadam at 01:53 PM

Bartell experiment HOWTO

# train the full-cov single gaussian models:
./run-stats.pl -list libraries/uspop_computed_rand1000.list -models uspop1000-fcvg.models -pfile_dir /homes/madadam/projblush/musicsim/uspop/pfiles

# get the collections from LFM by running the scripts in nyab:work/data/lfm/ :
# do-fetch; run-scraper-local; run-filter-lo;
# copy here to 'collections-uspop1000.txt'

In matlab, do_bartell.m

Posted by madadam at 09:14 AM