News
August 2010:
- The Research-ESA Web service is now available.
February 2010:
- The Multipla project will support the CriES Workshop at the CLEF Conference.
- As part of CriES we will organize a Pilot Challenge for multi-lingual Expert Search.
The preprocessing tool performs the following steps on the Yahoo! Answers dataset:
The output of the tool will be:
The preprocessing tool is written in Java and can be downloaded as an executable JAR file: cries_preprocessing.jar. If you are interested in the source code, please contact Philipp Sorg.
Running the tool requires a Java 1.6 runtime environment.
Command to run the tool:
java -jar cries_preprocessing.jar -Dxml_file=<Yahoo! Answers XML file> -Doutput_dir=<output directory>
Comments:
- The preprocessing tool can handle gzipped XML input files (in this case, the file FullOct2007.xml.gz).
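For readers who want to process the dataset with their own code, transparent handling of gzipped input could look roughly like the following. This is a minimal sketch and not the tool's actual implementation: the class name XmlInputOpener and the decision to detect compression by the ".gz" file extension are assumptions made for illustration. It uses try-with-resources and therefore needs Java 7 or later.

```java
import java.io.BufferedInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.zip.GZIPInputStream;

// Hypothetical helper, not part of cries_preprocessing.jar.
public class XmlInputOpener {

    /** Opens a file for reading and decompresses it if its name ends in ".gz". */
    public static InputStream open(File file) throws IOException {
        InputStream raw = new BufferedInputStream(new FileInputStream(file));
        if (!file.getName().endsWith(".gz")) {
            return raw; // plain XML, hand back the buffered stream unchanged
        }
        try {
            return new GZIPInputStream(raw);
        } catch (IOException e) {
            raw.close(); // do not leak the file handle if the gzip header is invalid
            throw e;
        }
    }

    public static void main(String[] args) throws IOException {
        // Usage: java XmlInputOpener FullOct2007.xml.gz
        try (InputStream in = open(new File(args[0]))) {
            byte[] buffer = new byte[200];
            int read = in.read(buffer);
            // Show the first bytes of the (decompressed) XML as a sanity check.
            System.out.println(new String(buffer, 0, Math.max(read, 0), "UTF-8"));
        }
    }
}
```

The returned stream can be passed to any streaming XML parser, so the rest of a processing pipeline does not need to know whether the input was compressed.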
The following output files will be generated:
Please refer to our Evaluation Guidelines for instructions on how to submit your expert search results.
cries_automatic_eval.trec_rel.txt is a TREC-style relevance file that assigns each topic exactly one relevant user, namely the user who wrote the best answer to the topic question. This file can be used for testing and debugging, but it will most probably heavily underestimate the values of evaluation measures.
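To illustrate how such a file could be used for a quick sanity check, the sketch below reads relevance judgments and computes the reciprocal rank of the single relevant user in a ranked list. It assumes the common four-column TREC qrels layout (topic, iteration, id, relevance), with the user ID in the third column; the exact layout of cries_automatic_eval.trec_rel.txt should be checked against the generated file. The class name AutomaticQrels and the topic and user IDs in the sample are made up for the example.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper; assumes lines of the form "topic iteration userId relevance".
public class AutomaticQrels {

    /** Maps each topic to its single relevant user (relevance greater than 0). */
    public static Map<String, String> read(Reader source) throws IOException {
        Map<String, String> relevantUser = new HashMap<String, String>();
        BufferedReader reader = new BufferedReader(source);
        String line;
        while ((line = reader.readLine()) != null) {
            String[] cols = line.trim().split("\\s+");
            if (cols.length != 4) {
                continue; // skip blank or malformed lines
            }
            if (Integer.parseInt(cols[3]) > 0) {
                relevantUser.put(cols[0], cols[2]);
            }
        }
        return relevantUser;
    }

    /** Reciprocal rank of the relevant user in a ranked list, 0.0 if absent. */
    public static double reciprocalRank(List<String> ranking, String relevant) {
        int index = ranking.indexOf(relevant);
        return index < 0 ? 0.0 : 1.0 / (index + 1);
    }

    public static void main(String[] args) throws IOException {
        // Made-up sample data in the assumed format.
        String sample = "topic1 0 userA 1\n" + "topic2 0 userB 1\n";
        Map<String, String> qrels = read(new StringReader(sample));
        List<String> run = Arrays.asList("userC", "userA", "userB");
        System.out.println(reciprocalRank(run, qrels.get("topic1"))); // prints 0.5
    }
}
```

Because only the author of the best answer counts as relevant, a run that ranks other knowledgeable users first is scored as if those users were wrong, which is why the measures computed this way tend to be too low.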