马上开始. 它是免费的哦
注册 使用您的电邮地址
IR 2008 syllabus 作者: Mind Map: IR 2008 syllabus

1. Projects

1.1. Boolean Model and Term Vocabulary

1.1.1. phase 1

1.1.2. phase 2

1.2. Vector Space Model and Query Expansion

1.2.1. phase 1

1.2.2. phase 2

1.3. Spam Filtering (Document Classification)

1.4. Hyperlink Analysis

2. Lectures

2.1. Introduction

2.1.1. Overveiw

2.1.2. boolean retrieval

2.1.2.1. incedence matrix

2.1.2.2. inverted file index

2.1.2.3. query optimization

2.2. Dictionaries?

2.2.1. Vocabulary

2.2.2. Dictionaries

2.2.3. Phrase and wildcard queries

2.3. Index construction & compression

2.3.1. BSBI

2.3.2. SPIMI

2.3.3. Distributed indexing

2.3.4. Dictionary Compression

2.3.5. Posting compression

2.4. Vector space model and Term Frequancy

2.4.1. TF

2.4.2. tf-idf

2.4.3. vector space

2.5. Naive Bayes classifier (document classification?)

2.5.1. Naive Bayes

2.5.2. Evaluation

2.5.3. Assumptions

2.6. Linear classification + relevance feedback

2.6.1. Feature selection

2.6.2. Linear classifier

2.6.3. Relevance feedback

2.6.4. Query expansion

2.7. Document clustering

2.7.1. Clustering

2.7.2. K-means

2.7.3. Evaluation

2.7.4. k selection

2.8. Hyperlink Analysis

2.8.1. Anchors

2.8.2. PageRank

2.8.3. HITS

2.9. Web search

2.9.1. Web IR

2.9.2. Ads & Spam

2.9.3. Size of Web

2.10. Web search & recommender Systems

2.10.1. Size of Web

2.10.2. Duplicate detection

2.10.3. Recommender Systems

3. Tutorials

3.1. Statistics and Machine Learning Basiscs

3.1.1. Probability

3.1.2. Naive Bayes

3.1.3. Linear classifier

3.1.4. Overfitting and VC dimensions

3.2. Evaluation of classifiers

3.2.1. Holdout

3.2.2. Cross validation

3.2.3. ROC

3.3. Clustering

3.3.1. EM

3.3.2. Mean shift

3.3.3. Clustering stability