Some Future Challenges for LSI
Agent-based software for indexing remote/distributed collections
Effective updating with global weighting
Incorporate phrases and proximity
Expand cosine matching to incorporate other similarity-based data (e.g., images)
Optimal number of dimensions