Analyzing and indexing huge reference sequence collections