CuGenDBv2

Tan0021620 (gene) of Snake gourd v1 genome

Gene ID: Tan0021620
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Thylakoid soluble phosphoprotein TSP
Genome location: LG05:3152550..3153200
RNA-Seq Expression: Tan0021620
Synteny: Tan0021620
Gene Ontology terms: NA
InterPro domains: IPR021584 - Thylakoid soluble phosphoprotein TSP9
                  IPR037244 - Thylakoid soluble phosphoprotein TSP9 superfamily
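
The genome location string in the record above can be split into its parts programmatically. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `Location` and `parse_location` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as LG05:3152550..3153200."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse 'SEQID:START..END' into a Location."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG05:3152550..3153200")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the interval spans 651 positions.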


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010512.1 hypothetical protein SDJN02_27306, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 6.7e-36; % identity: 87.63)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSLE+DQLLETDPVLQKVEGKD     SN GTV GGRKNSVQ+PPKKNGGFGGLFAK+D
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD

XP_004143212.1 uncharacterized protein LOC101205268 [Cucumis sativus] (e value: 3.7e-34; % identity: 86.73)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKN-GGFGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSL KDQLLETDPVLQKVEGKD   G +  G+VRGGRKNSVQ+PPKKN GGFGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKN-GGFGGLFAKKD

XP_008464052.1 PREDICTED: uncharacterized protein LOC103502033 [Cucumis melo] (e value: 4.1e-33; % identity: 87.88)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSL KDQLLETDPVLQKVEG  KDG + NG GTVR GRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD

XP_022947653.1 uncharacterized protein LOC111451452 [Cucurbita moschata] (e value: 3.3e-35; % identity: 85.57)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
        MASL  AFSPAAGRVFAATAAK AGE KKEKG+LDWILG +EKDQLLETDPVLQKVEG  K+GGA+NGGTVRGG+KNSVQ+PPKKNG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD

XP_023532336.1 uncharacterized protein LOC111794525 [Cucurbita pepo subsp. pepo] (e value: 1.1e-35; % identity: 86.6)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
        MASL  AFSPAAGRVFAATAAK AGESKKEKG+LDWILG +EKDQLLETDPVLQKVEG  K+GGA+NGGTVRGG+KNSVQ+PPKKNG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KC59 Uncharacterized protein (e value: 1.8e-34; % identity: 86.73)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKN-GGFGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSL KDQLLETDPVLQKVEGKD   G +  G+VRGGRKNSVQ+PPKKN GGFGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKN-GGFGGLFAKKD

A0A1S3CKK5 uncharacterized protein LOC103502033 (e value: 2.0e-33; % identity: 87.88)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSL KDQLLETDPVLQKVEG  KDG + NG GTVR GRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD

A0A5A7V5S9 Uncharacterized protein (e value: 2.0e-33; % identity: 87.88)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD
        MASLPIAFSPAAGRVFAATAAK AGESKKEKGLLDWI+GSL KDQLLETDPVLQKVEG  KDG + NG GTVR GRKNSVQ+PPKKNGG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNSVQLPPKKNGG-FGGLFAKKD

A0A6J1G777 uncharacterized protein LOC111451452 (e value: 1.6e-35; % identity: 85.57)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
        MASL  AFSPAAGRVFAATAAK AGE KKEKG+LDWILG +EKDQLLETDPVLQKVEG  K+GGA+NGGTVRGG+KNSVQ+PPKKNG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD

A0A6J1L2X7 uncharacterized protein LOC111499927 (e value: 1.6e-35; % identity: 85.57)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
        MASL  AFSPAAGRVFAATAAK AGE KKEKG+LDWILG +EKDQLLETDPVLQKVEG  K+GGA+NGGTVRGG+KNSVQ+PPKKNG FGGLFAKKD
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G47070.1 LOCATED IN: thylakoid, chloroplast thylakoid membrane, chloroplast, chloroplast envelope; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Thylakoid soluble phosphoprotein TSP9 (InterPro:IPR021584); Has 37 Blast hits to 37 proteins in 10 species: Archae - 0; Bacteria - 0; Metazoa - 0; Fungi - 0; Plants - 37; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink). (e value: 2.7e-11; % identity: 43.16)
Query:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEK-DQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNS-VQLPPKKNGGFGGL
        ++SL ++F+PA  RV+A +    +G  K+EK  +D++LG + K DQ  ET+P+L+KV+  +K+G  + G GTVRGG+ ++   +P K  GGFGGL
Subjt:  MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEK-DQLLETDPVLQKVEGKDKDGGASNG-GTVRGGRKNS-VQLPPKKNGGFGGL
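
The % identity values listed with each hit can be recomputed from the middle line of an alignment. The sketch below assumes the usual BLAST-style midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. Percent identity is then identical columns divided by alignment length. The function name `percent_identity` is a hypothetical helper, and the midline is the one shown for the first GenBank hit (KAG7010512.1), built by concatenation so that the run of five spaces stays explicit.

```python
def percent_identity(midline: str) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Assumption: letters mark identical columns, while '+' and ' '
    mark non-identical columns (substitutions, mismatches, gaps).
    """
    if not midline:
        raise ValueError("empty midline")
    identities = sum(ch.isalpha() for ch in midline)
    return 100.0 * identities / len(midline)


# Midline of the alignment against KAG7010512.1, without the leading padding.
midline = (
    "MASLPIAFSPAAGRVFAATAAK" + " "
    + "AGESKKEKGLLDWI+GSLE+DQLLETDPVLQKVEGKD" + " " * 5
    + "SN" + " " + "GTV" + " " + "GGRKNSVQ+PPKKNGGFGGLFAK+D"
)

print(len(midline), round(percent_identity(midline), 2))
```

For this midline the calculation gives 85 identical columns out of 97, which agrees with the 87.63 reported for that hit.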


Sequences
CDS sequence
ATGGCCTCTCTTCCGATCGCATTCTCTCCGGCCGCCGGACGGGTTTTCGCAGCCACGGCGGCGAAGAGCGCCGGCGAAAGCAAGAAGGAAAAGGGCCTTCTTGATTGGAT
CCTCGGAAGCTTGGAGAAGGATCAGCTTCTCGAAACCGACCCTGTTCTTCAAAAGGTCGAGGGCAAGGACAAGGACGGCGGCGCCAGCAACGGCGGAACCGTCCGCGGCG
GCCGGAAGAATTCCGTCCAACTTCCCCCGAAGAAGAACGGCGGCTTCGGCGGTCTCTTTGCCAAGAAAGACTGA
mRNA sequence
AAAAAAATTAAAAGGCTCGCTTCGCTTCAGGATAAGGCGCCTGTGGTGAGCTCAAGCATTTCAACATCTCATCGTCTCTTATTTTACCCAAATTTCCCTTTCCTTTCATT
TCCATTTCATTTTAATTTTTTCTCCTCTCATAAAAAAAAAAAAAAAACAATGGCCTCTCTTCCGATCGCATTCTCTCCGGCCGCCGGACGGGTTTTCGCAGCCACGGCGG
CGAAGAGCGCCGGCGAAAGCAAGAAGGAAAAGGGCCTTCTTGATTGGATCCTCGGAAGCTTGGAGAAGGATCAGCTTCTCGAAACCGACCCTGTTCTTCAAAAGGTCGAG
GGCAAGGACAAGGACGGCGGCGCCAGCAACGGCGGAACCGTCCGCGGCGGCCGGAAGAATTCCGTCCAACTTCCCCCGAAGAAGAACGGCGGCTTCGGCGGTCTCTTTGC
CAAGAAAGACTGACGGCAACGGCGACGGCGGTTATGCCGTTTCACTCATATATAATTTCATTTGAATCGAAAACTTTTGCAATTTCAGGCCATATCTTGTTGCCTTTATG
TTGCTCTGATTGGTTATTCTCCAATACTATCGTCACATTATTAATTATTGTTGGTAATTGTTCATTTTTCTTTAATATATATATATTTTTTAATTTATCAA
Protein sequence
MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD
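
As a consistency check, the CDS sequence above can be translated and compared with the protein sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written here with the standard library only, not a call into any existing package.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0; stop codons are rendered as '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# CDS and protein sequences copied from the record above.
CDS = (
    "ATGGCCTCTCTTCCGATCGCATTCTCTCCGGCCGCCGGACGGGTTTTCGCAGCCACGGCGGCGAAGAGCGCCGGCGAAAGCAAGAAGGAAAAGGGCCTTCTTGATTGGAT"
    "CCTCGGAAGCTTGGAGAAGGATCAGCTTCTCGAAACCGACCCTGTTCTTCAAAAGGTCGAGGGCAAGGACAAGGACGGCGGCGCCAGCAACGGCGGAACCGTCCGCGGCG"
    "GCCGGAAGAATTCCGTCCAACTTCCCCCGAAGAAGAACGGCGGCTTCGGCGGTCTCTTTGCCAAGAAAGACTGA"
)
PROTEIN = (
    "MASLPIAFSPAAGRVFAATAAKSAGESKKEKGLLDWILGSLEKDQLLETDPVLQKVEGKDKDGGASNGGTVRGGRKNSVQLPPKKNGGFGGLFAKKD"
)

translated = translate(CDS)
# The CDS ends with a stop codon, so strip the trailing '*' before comparing.
print(translated.rstrip("*") == PROTEIN)
```

The 294-nucleotide CDS corresponds to 97 codons plus a terminal stop codon, matching the 97-residue protein sequence.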