; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021350 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021350
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionprotein PHLOEM PROTEIN 2-LIKE A10-like
Genome locationscaffold6:45422387..45454551
RNA-Seq ExpressionSpg021350
SyntenySpg021350
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6604112.1 Protein PHLOEM PROTEIN 2-LIKE A10, partial [Cucurbita argyrosperma subsp. sororia]8.6e-8272.44Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  FSD   CVAT+ +D K+F+HSDSDE+P SLKQISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+M KLCSESGSGFVSVVVGSFARNLVM  FSID++SK  S LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

KAG7034275.1 Protein PHLOEM PROTEIN 2-LIKE A10, partial [Cucurbita argyrosperma subsp. argyrosperma]1.9e-8172.44Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  FSD   CVAT+ +D K+F++SDSDE+P SLKQISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK  S LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

XP_022950345.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Cucurbita moschata]9.5e-8171.56Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  F+D   CVAT+ +D K+F++SDSDE+P SLKQISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK  + LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

XP_022977274.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Cucurbita maxima]5.0e-8272.44Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  FSD   CVAT+ +D K+F+HSD+DE+P SL QISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK RS LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

XP_023543994.1 protein PHLOEM PROTEIN 2-LIKE A10-like [Cucurbita pepo subsp. pepo]2.0e-8373.33Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  FSD   CVAT+ +D K+F+HSDSDE+P SLKQISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK  SCLEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

TrEMBL top hitse value%identityAlignment
A0A1S3B1U4 protein PHLOEM PROTEIN 2-LIKE A10-like8.1e-7070.05Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGY+AY  YHLPSI R+RAKISKFFAALSSAA  FSD  DCVATV +D K+FLHSDSDEIPQSLKQISKL RSDEISDS TRLS+A+TVGVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSR--RQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARW-TSVACDENCRELIGELVRLFVSTTV
        YD+YSR   +      E + +FTD+++ KLCSE G GFVSVVVGSFARNLVMA     + SKS S L   + RW   V  DE  RELIGEL+R+FVS+ +
Subjt:  YDRYSR--RQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARW-TSVACDENCRELIGELVRLFVSTTV

Query:  SIYLEKTMEINTFDQIF
        S+YLEKTMEINTFDQIF
Subjt:  SIYLEKTMEINTFDQIF

A0A5A7SZ84 Protein PHLOEM PROTEIN 2-LIKE A10-like8.1e-7070.05Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGY+AY  YHLPSI R+RAKISKFFAALSSAA  FSD  DCVATV +D K+FLHSDSDEIPQSLKQISKL RSDEISDS TRLS+A+TVGVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSR--RQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARW-TSVACDENCRELIGELVRLFVSTTV
        YD+YSR   +      E + +FTD+++ KLCSE G GFVSVVVGSFARNLVMA     + SKS S L   + RW   V  DE  RELIGEL+R+FVS+ +
Subjt:  YDRYSR--RQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARW-TSVACDENCRELIGELVRLFVSTTV

Query:  SIYLEKTMEINTFDQIF
        S+YLEKTMEINTFDQIF
Subjt:  SIYLEKTMEINTFDQIF

A0A6J1BTU5 protein PHLOEM PROTEIN 2-LIKE A10-like3.6e-7068.22Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGF+GY  YR YH  S+ R+RAKIS+F AALSSAA  FSD  DC ATV RD K+FLHSDSD+IP+S  +I+KL RSDEISDS TR+SQA+T+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+  R +P  GG + + NF D +M K+CSESG GFVS VVGSFARNLVMA+FS+ + SKSRS LED  A+W  VACDE  RELIGEL++LFVS  VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIF
        LEKT EINTFDQIF
Subjt:  LEKTMEINTFDQIF

A0A6J1GFG9 protein PHLOEM PROTEIN 2-LIKE A10-like4.6e-8171.56Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  F+D   CVAT+ +D K+F++SDSDE+P SLKQISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK  + LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

A0A6J1IQY5 protein PHLOEM PROTEIN 2-LIKE A10-like2.4e-8272.44Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MAALGFTGYTAYR YH PSI R+RAKIS+FFAALSSAA  FSD   CVAT+ +D K+F+HSD+DE+P SL QISKL RSDEIS S TRLSQALT+GVLRG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        YD+YSR++   GG E   +FTDK+MTKLCSESGSGFVSVVVGSFARNLVM  FSID++SK RS LEDRL RW  VACDE CRELIGEL+R+FVS+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIFENNFKTETVRE
        LEKTMEIN+FD+IF      +  RE
Subjt:  LEKTMEINTFDQIFENNFKTETVRE

SwissProt top hitse value%identityAlignment
Q9SY57 Protein PHLOEM PROTEIN 2-LIKE A104.0e-5047.47Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MA  G +GY AY+ YHLPS+ R+R ++ K F A+ S A   SD  + ++ V RD KDFL+SDSDEIP SLKQI+K+  S+E +DS +R+SQA+T+G  RG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFS--IDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVS
        Y   S    S     S+ +  D+V+ K+ SE+G+GFVSVVVGSFA+NLV+ F+S  ++   K          RW ++  D+ CREL+ + +  F ST + 
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFS--IDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVS

Query:  IYLEKTMEINTFDQIFE
        +YL+KTM+INT+DQIFE
Subjt:  IYLEKTMEINTFDQIFE

Arabidopsis top hitse value%identityAlignment
AT1G10150.1 Carbohydrate-binding protein2.9e-5147.47Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        MA  G +GY AY+ YHLPS+ R+R ++ K F A+ S A   SD  + ++ V RD KDFL+SDSDEIP SLKQI+K+  S+E +DS +R+SQA+T+G  RG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFS--IDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVS
        Y   S    S     S+ +  D+V+ K+ SE+G+GFVSVVVGSFA+NLV+ F+S  ++   K          RW ++  D+ CREL+ + +  F ST + 
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFS--IDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVS

Query:  IYLEKTMEINTFDQIFE
        +YL+KTM+INT+DQIFE
Subjt:  IYLEKTMEINTFDQIFE

AT1G59510.1 Carbohydrate-binding protein1.9e-3941.12Show/hide
Query:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG
        +A  G +GY  YR Y+   I ++  ++ K F+ + S A    D  + ++ V RD K+FL S+S EIP SLKQ+SK+ +S E +DS  R+S+A+ +GV RG
Subjt:  MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRG

Query:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY
        Y+       S   VE   N +  V+ ++ SE G+GFVSVVVGSFA+NLV+ F+S +    S   L+    RW ++  D+ CREL+ + +  F S+ VS+Y
Subjt:  YDRYSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIY

Query:  LEKTMEINTFDQIF
        ++KT+ +NT+DQIF
Subjt:  LEKTMEINTFDQIF

AT3G49790.1 Carbohydrate-binding protein4.6e-4145.02Show/hide
Query:  LGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRGYDR
        L  +GY A+R YH PSI ++R +ISK F  L +     SD  + V+ + +D  +FL SDSD+IP SLKQISK+ +SDE++ S  R +QA+TVG++RG D 
Subjt:  LGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRGYDR

Query:  YSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIYLEK
                    S   FTD+VM KL ++SGSGF S +VGSFARNLV+A +S      +   L+       +V  D+  R LIG+ V+ FVST VS+YL+K
Subjt:  YSRRQPSSGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIYLEK

Query:  TMEINTFDQIF
        T ++N FD +F
Subjt:  TMEINTFDQIF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTCTTGGATTTACCGGCTACACTGCATACAGAGCCTATCACTTACCTTCTATTGTCCGAAGGAGGGCTAAGATTTCAAAATTCTTCGCCGCTCTGTCTTCTGC
GGCCATAACATTCTCCGATTATGTCGATTGTGTTGCCACTGTTTTTAGAGACTCGAAGGATTTTCTTCATTCCGACTCGGATGAAATCCCTCAAAGTTTGAAACAAATCT
CTAAACTCGTCCGGTCAGACGAGATCTCCGATTCTTTCACTCGTCTCTCTCAGGCGTTAACCGTTGGAGTTTTGCGAGGTTACGATCGATACTCTCGACGACAGCCCAGC
AGTGGCGGTGTTGAGTCGAATGATAATTTCACTGATAAAGTTATGACGAAACTGTGTAGTGAATCTGGATCGGGGTTTGTTTCTGTGGTGGTTGGGAGTTTTGCTAGAAA
CTTAGTGATGGCGTTTTTCTCGATTGATCAGTCGAGTAAATCGAGGAGTTGTTTAGAGGATCGTTTGGCGAGATGGACGAGCGTGGCCTGCGATGAAAATTGCAGGGAAT
TAATCGGGGAGTTAGTTCGATTGTTCGTGAGCACGACAGTCTCTATTTATCTGGAGAAGACGATGGAGATCAACACGTTTGATCAAATATTCGAGAACAATTTCAAGACT
GAAACCGTGAGGGAGCCTCCTCCCGTCGCCGGCGCCGACTCCTCCCTTTCCAAGGCACCGAAGAGGAAGCGATGCGATTTAGCGGTGGGGATGGGGAAGAATAAGGCTTC
TCTGAAAAGGAAAGTCCTCCTATTAGTTCCACAGGAAGCGGTGAGGGCACAATTTTTGTCGAAGAGTGCCGTTCTTGTTGAGTCGGGGTCGTCTGGTGTAAAAAATGAAG
TTGTGATGAGAGAGAGATCGGCTGTTACCCATGGGAGCAAAAATCGTGGAATTAAACCTGCTAATACCCACAAGCTGAAGAGTTGGGCATCGCGGCGGTGTTCATTTGCT
TGCTGGGTTTCTTCGTTCGTGCCTCGTGATAAGAGTGAACCTTGGATGGAGCTATTTGATATTGCTAAGTCTAAGGATCACGCCATGGATTACCTCGAGGCTTGTGGATT
TCCTCTTCAAGACAAAAACCTCAAGGTGCCTTCAACCATTACAGACTTCCGTTACATTGCACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTCTTGGATTTACCGGCTACACTGCATACAGAGCCTATCACTTACCTTCTATTGTCCGAAGGAGGGCTAAGATTTCAAAATTCTTCGCCGCTCTGTCTTCTGC
GGCCATAACATTCTCCGATTATGTCGATTGTGTTGCCACTGTTTTTAGAGACTCGAAGGATTTTCTTCATTCCGACTCGGATGAAATCCCTCAAAGTTTGAAACAAATCT
CTAAACTCGTCCGGTCAGACGAGATCTCCGATTCTTTCACTCGTCTCTCTCAGGCGTTAACCGTTGGAGTTTTGCGAGGTTACGATCGATACTCTCGACGACAGCCCAGC
AGTGGCGGTGTTGAGTCGAATGATAATTTCACTGATAAAGTTATGACGAAACTGTGTAGTGAATCTGGATCGGGGTTTGTTTCTGTGGTGGTTGGGAGTTTTGCTAGAAA
CTTAGTGATGGCGTTTTTCTCGATTGATCAGTCGAGTAAATCGAGGAGTTGTTTAGAGGATCGTTTGGCGAGATGGACGAGCGTGGCCTGCGATGAAAATTGCAGGGAAT
TAATCGGGGAGTTAGTTCGATTGTTCGTGAGCACGACAGTCTCTATTTATCTGGAGAAGACGATGGAGATCAACACGTTTGATCAAATATTCGAGAACAATTTCAAGACT
GAAACCGTGAGGGAGCCTCCTCCCGTCGCCGGCGCCGACTCCTCCCTTTCCAAGGCACCGAAGAGGAAGCGATGCGATTTAGCGGTGGGGATGGGGAAGAATAAGGCTTC
TCTGAAAAGGAAAGTCCTCCTATTAGTTCCACAGGAAGCGGTGAGGGCACAATTTTTGTCGAAGAGTGCCGTTCTTGTTGAGTCGGGGTCGTCTGGTGTAAAAAATGAAG
TTGTGATGAGAGAGAGATCGGCTGTTACCCATGGGAGCAAAAATCGTGGAATTAAACCTGCTAATACCCACAAGCTGAAGAGTTGGGCATCGCGGCGGTGTTCATTTGCT
TGCTGGGTTTCTTCGTTCGTGCCTCGTGATAAGAGTGAACCTTGGATGGAGCTATTTGATATTGCTAAGTCTAAGGATCACGCCATGGATTACCTCGAGGCTTGTGGATT
TCCTCTTCAAGACAAAAACCTCAAGGTGCCTTCAACCATTACAGACTTCCGTTACATTGCACATTAG
Protein sequenceShow/hide protein sequence
MAALGFTGYTAYRAYHLPSIVRRRAKISKFFAALSSAAITFSDYVDCVATVFRDSKDFLHSDSDEIPQSLKQISKLVRSDEISDSFTRLSQALTVGVLRGYDRYSRRQPS
SGGVESNDNFTDKVMTKLCSESGSGFVSVVVGSFARNLVMAFFSIDQSSKSRSCLEDRLARWTSVACDENCRELIGELVRLFVSTTVSIYLEKTMEINTFDQIFENNFKT
ETVREPPPVAGADSSLSKAPKRKRCDLAVGMGKNKASLKRKVLLLVPQEAVRAQFLSKSAVLVESGSSGVKNEVVMRERSAVTHGSKNRGIKPANTHKLKSWASRRCSFA
CWVSSFVPRDKSEPWMELFDIAKSKDHAMDYLEACGFPLQDKNLKVPSTITDFRYIAH