ID=9283c44ee61dda4b47d4219e7ff31e243935d933

Description

Clustering of 454 Calanus finnmarchicus cDNA data using Newbler, and combining the result with genbank and EST data, using TGICL. Annotation by blasting against UniProt (no guarantees on quality).

Provenance

Same data as CalFin1RNAasm, but renamed sequences to non-generic names. All sequences (exept ones from GenBank) are prefixed with CalFin_rna1_, and suffixed with c#c### - CL#contig# (cluster number and contig number from TGICL) c##### - contig#### (from Newbler assembly) i##### - isotig#### (from Newbler assembly) The following translation was used: sed -e 's/^CL\([0-9]*\)Contig\([0-9]*\)/CalFin_rna1_cl\1c\2/g' -e 's/^contig\([0-9]*\)/CalFin_rna1_c\1/g' -e 's/^isotig\([0-9]*\)/CalFin_rna1_i\1/g'

Files

PathDescriptionTypechecksum

annotations.csv Annotations for index.fasta text/csv 0176f84d84ae7b3fa1fbbb76c5d1dab85d76ef4a
index.fasta Assembled contigs text/x-fasta-rna e2108bde3e05dd046ff8cc6c593ccf10823f4d17