CuGenDBv2

Tan0011884 (gene) of Snake gourd v1 genome

Gene ID: Tan0011884
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transport protein SEC31
Genome location: LG01:1823005..1823661
RNA-Seq Expression: Tan0011884
Synteny: Tan0011884
Gene Ontology terms: NA
InterPro domains: NA
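
The genome location above is written as `sequence:start..end`. The following is a minimal Python sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which the page does not state explicitly; the function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re


def parse_location(location: str):
    """Split a string such as 'LG01:1823005..1823661' into (name, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


name, start, end = parse_location("LG01:1823005..1823661")
print(name, start, end, span_length(start, end))
```

Under that assumption the span is 657 bp, which is a multiple of three (219 codons).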


Homology
GenBank top hits (e value, % identity, alignment)
XP_004140761.1 uncharacterized protein LOC101212865 [Cucumis sativus] (e value: 2.5e-102; % identity: 91.28)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ P+VKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        ND LNPWECP QFDHVM+VTRPG+PVQEFGRCS+EGTGWTSNI+RF SMWHGF NDYWSVSAGPY E +HH LCFEQVDGAVSEISAGIIAAQAVSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
        TVDG+EE++VLNRMRRG+
Subjt:  TVDGEEEIDVLNRMRRGM

XP_022922623.1 uncharacterized protein LOC111430580 [Cucurbita moschata] (e value: 3.4e-104; % identity: 92.2)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPWECP QFDHVM+VTRPGIPVQEFGRCSLEG+ WTSN+NRFGS WHGF NDYWSVSAGPYAE  HHALCFEQVDGAVSEISAGIIAAQ+VSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
         VDG+EEID+L+RM+RGM
Subjt:  TVDGEEEIDVLNRMRRGM

XP_022984279.1 uncharacterized protein LOC111482632 [Cucurbita maxima] (e value: 2.9e-103; % identity: 91.74)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPWECP QFDHVM+VTRPGIPVQEFGRCSLEG+ WTSN+NRFGS WHGF NDYWSVSAGPYAE  HHALCFEQVDGAVSEISAGIIAAQ+VSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
         VD +EEID+L+RM+RGM
Subjt:  TVDGEEEIDVLNRMRRGM

XP_023551819.1 uncharacterized protein LOC111809672 [Cucurbita pepo subsp. pepo] (e value: 1.4e-102; % identity: 90.83)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFV S+GNKGQ PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKI LK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPWECP Q DHVM+VTRPGIPVQEFGRCSLEG+ WTSN+NRFGS WHGF NDYWSVSAGPYAE  HHALCFEQVDGAVSEISAGIIAAQ+VSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
         VDG+EEID+L+RM+RGM
Subjt:  TVDGEEEIDVLNRMRRGM

XP_038877444.1 uncharacterized protein LOC120069727 [Benincasa hispida] (e value: 7.1e-102; % identity: 92.66)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ P+VKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILL 
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        N+GLNPWECP QFDHVM+VTRP IPVQEFGRCSLEGTGWTSNI RFGSMWHGF NDYWSVS GPYAE DHH L +EQVDGAVSEISAGIIAAQAVSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
        TVDG+EEIDVLNRMRRGM
Subjt:  TVDGEEEIDVLNRMRRGM
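
In the alignments above, the unlabeled middle line appears to follow the usual BLAST-style convention: a letter marks a position where query and subject are identical, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The sketch below, in Python, counts identities from such midline blocks; treating the reported % identity as identities divided by aligned length is an assumption, and `midline_identity` is a hypothetical helper name.

```python
def midline_identity(midline_blocks, aligned_length):
    """Count identical positions in BLAST-style midline blocks.

    Letters in the midline are taken as identities; '+' and spaces are not.
    Returns (identical_count, percent_identity).
    """
    identical = sum(
        1 for block in midline_blocks for ch in block if ch.isalpha()
    )
    return identical, 100.0 * identical / aligned_length


# Final midline block of the first GenBank hit, used here as a small example.
count, percent = midline_identity(["TVDG+EE++VLNRMRRG+"], aligned_length=18)
print(count, round(percent, 2))
```

Passing all midline blocks of a hit together with the full aligned length gives a figure that can be compared against the % identity listed for that hit.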

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L6N8 Uncharacterized protein (e value: 1.2e-102; % identity: 91.28)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ P+VKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        ND LNPWECP QFDHVM+VTRPG+PVQEFGRCS+EGTGWTSNI+RF SMWHGF NDYWSVSAGPY E +HH LCFEQVDGAVSEISAGIIAAQAVSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
        TVDG+EE++VLNRMRRG+
Subjt:  TVDGEEEIDVLNRMRRGM

A0A5A7SSX2 Transport protein SEC31 (e value: 3.5e-102; % identity: 91.74)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ P+VKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPW+CP QFDHVM+VTRPGIPVQEFGRCS+EGTGWTSNI+RF SMWHGF NDYWSVSAGPYAE DH  LCFEQVD AVSEISAGIIAAQAVSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
        TVDG+EEI+VLNRMRR +
Subjt:  TVDGEEEIDVLNRMRRGM

A0A5D3DHX4 Transport protein SEC31 (e value: 3.8e-101; % identity: 91.67)
Query:  SFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKND
        SFVAS+GNKGQ P+VKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKND
Subjt:  SFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKND

Query:  GLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITV
        GLNPW+CP QFDHVM+VTRPGIPVQEFGRCS+EGTGWTSNI+RF SMWHGF NDYWSVSAGPYAE DH  LCFEQVD AVSEISAGIIAAQAVSSLQITV
Subjt:  GLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITV

Query:  DGEEEIDVLNRMRRGM
        DG+EEI+VLNRMRR +
Subjt:  DGEEEIDVLNRMRRGM

A0A6J1E7A4 uncharacterized protein LOC111430580 (e value: 1.7e-104; % identity: 92.2)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPWECP QFDHVM+VTRPGIPVQEFGRCSLEG+ WTSN+NRFGS WHGF NDYWSVSAGPYAE  HHALCFEQVDGAVSEISAGIIAAQ+VSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
         VDG+EEID+L+RM+RGM
Subjt:  TVDGEEEIDVLNRMRRGM

A0A6J1JA12 uncharacterized protein LOC111482632 (e value: 1.4e-103; % identity: 91.74)
Query:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
        MASFVAS+GNKGQ PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK
Subjt:  MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLK

Query:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI
        NDGLNPWECP QFDHVM+VTRPGIPVQEFGRCSLEG+ WTSN+NRFGS WHGF NDYWSVSAGPYAE  HHALCFEQVDGAVSEISAGIIAAQ+VSSLQI
Subjt:  NDGLNPWECPAQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQI

Query:  TVDGEEEIDVLNRMRRGM
         VD +EEID+L+RM+RGM
Subjt:  TVDGEEEIDVLNRMRRGM

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G25670.1 unknown protein (e value: 4.4e-33; % identity: 49.75)
Query:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD
        PVV+KAKKK  KDE DR KQAEKKKRRLEKALATSAAI +ELEKKKQ + EEQQRLDEEGAAIAEAVALHVL+GEDSDDS ++          E     D
Subjt:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD

Query:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRM
                 +P Q     +++G G+ SN        +G G+  WS      A D++              ISA +IAAQAVS+LQI+ + +    V N M
Subjt:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRM

Query:  RRG
         RG
Subjt:  RRG

AT4G25670.2 unknown protein (e value: 4.4e-33; % identity: 49.75)
Query:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD
        PVV+KAKKK  KDE DR KQAEKKKRRLEKALATSAAI +ELEKKKQ + EEQQRLDEEGAAIAEAVALHVL+GEDSDDS ++          E     D
Subjt:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD

Query:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRM
                 +P Q     +++G G+ SN        +G G+  WS      A D++              ISA +IAAQAVS+LQI+ + +    V N M
Subjt:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRM

Query:  RRG
         RG
Subjt:  RRG

AT4G25690.1 unknown protein (e value: 1.3e-32; % identity: 49.27)
Query:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD
        PV +K  KK  KDE DR KQAEKKKRRLEKALATSAAI +ELEKKKQ + EEQQRLDEEGAAIAEAVALHVL+GEDSDDS ++          E     D
Subjt:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD

Query:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAED--DHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLN
                 +P Q     +++G G+ SN        +G G+  WSVS  P+ +D  D++ +           ISA +IAAQAVSSLQI+ D +    V  
Subjt:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAED--DHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLN

Query:  RMRRG
         M  G
Subjt:  RMRRG

AT4G25690.2 unknown protein (e value: 1.3e-32; % identity: 49.27)
Query:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD
        PV +K  KK  KDE DR KQAEKKKRRLEKALATSAAI +ELEKKKQ + EEQQRLDEEGAAIAEAVALHVL+GEDSDDS ++          E     D
Subjt:  PVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECPAQFD

Query:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAED--DHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLN
                 +P Q     +++G G+ SN        +G G+  WSVS  P+ +D  D++ +           ISA +IAAQAVSSLQI+ D +    V  
Subjt:  HVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAED--DHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLN

Query:  RMRRG
         M  G
Subjt:  RMRRG

AT5G52550.1 unknown protein (e value: 6.2e-27; % identity: 43.94)
Query:  AKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKND-GLNPWECPAQFDHVMS
        AKKKQ ++EL+R KQAE+KKRR+EK++ATSAAI +ELEKKK  K EEQ+RLDEEGAAIAEAVALHVL+GED DDSY+  L  + G  PW+   + +    
Subjt:  AKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKND-GLNPWECPAQFDHVMS

Query:  VTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRMRR
              P Q     ++     T + N            + SVS  P+A    +       +     ISA +   QAVSSLQI+ + + +  V N M R
Subjt:  VTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRMRR


Sequences
CDS sequence
ATGGCTAGTTTTGTGGCTAGTGTGGGTAATAAAGGACAACCTCCTGTCGTGAAGAAAGCAAAAAAGAAGCAGGTGAAGGATGAGTTGGATCGTCAGAAACAAGCTGAGAA
GAAAAAGAGGCGGTTAGAGAAAGCCCTTGCTACTTCAGCAGCTATTATATCTGAACTTGAAAAGAAAAAACAAATGAAGAAGGAAGAACAGCAAAGGCTAGATGAAGAAG
GTGCTGCTATAGCTGAGGCTGTTGCTCTGCATGTCCTGATTGGTGAAGACTCAGATGACTCATACAAGATTCTTCTCAAAAACGACGGTCTCAACCCTTGGGAGTGCCCT
GCCCAGTTTGACCACGTAATGAGTGTTACAAGACCAGGCATTCCAGTGCAGGAGTTTGGGAGGTGTTCCCTTGAAGGGACTGGTTGGACCTCTAATATCAATAGATTCGG
ATCGATGTGGCACGGTTTTGGAAATGATTACTGGTCAGTCTCTGCTGGACCTTATGCTGAGGATGATCACCATGCATTATGTTTCGAGCAAGTTGATGGTGCTGTAAGTG
AGATTTCTGCTGGTATTATAGCGGCGCAGGCTGTCTCATCACTTCAAATTACGGTAGATGGGGAGGAAGAAATCGACGTCCTCAACAGAATGCGACGGGGTATGTAA
mRNA sequence
ATGGCTAGTTTTGTGGCTAGTGTGGGTAATAAAGGACAACCTCCTGTCGTGAAGAAAGCAAAAAAGAAGCAGGTGAAGGATGAGTTGGATCGTCAGAAACAAGCTGAGAA
GAAAAAGAGGCGGTTAGAGAAAGCCCTTGCTACTTCAGCAGCTATTATATCTGAACTTGAAAAGAAAAAACAAATGAAGAAGGAAGAACAGCAAAGGCTAGATGAAGAAG
GTGCTGCTATAGCTGAGGCTGTTGCTCTGCATGTCCTGATTGGTGAAGACTCAGATGACTCATACAAGATTCTTCTCAAAAACGACGGTCTCAACCCTTGGGAGTGCCCT
GCCCAGTTTGACCACGTAATGAGTGTTACAAGACCAGGCATTCCAGTGCAGGAGTTTGGGAGGTGTTCCCTTGAAGGGACTGGTTGGACCTCTAATATCAATAGATTCGG
ATCGATGTGGCACGGTTTTGGAAATGATTACTGGTCAGTCTCTGCTGGACCTTATGCTGAGGATGATCACCATGCATTATGTTTCGAGCAAGTTGATGGTGCTGTAAGTG
AGATTTCTGCTGGTATTATAGCGGCGCAGGCTGTCTCATCACTTCAAATTACGGTAGATGGGGAGGAAGAAATCGACGTCCTCAACAGAATGCGACGGGGTATGTAA
Protein sequence
MASFVASVGNKGQPPVVKKAKKKQVKDELDRQKQAEKKKRRLEKALATSAAIISELEKKKQMKKEEQQRLDEEGAAIAEAVALHVLIGEDSDDSYKILLKNDGLNPWECP
AQFDHVMSVTRPGIPVQEFGRCSLEGTGWTSNINRFGSMWHGFGNDYWSVSAGPYAEDDHHALCFEQVDGAVSEISAGIIAAQAVSSLQITVDGEEEIDVLNRMRRGM
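
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal Python sketch that assumes the standard nuclear genetic code and builds the codon table by hand rather than relying on an external library; `translate` is a hypothetical helper, not a tool provided by CuGenDB.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base changes slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped sequence
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 42 nucleotides of the CDS listed above.
print(translate("ATGGCTAGTTTTGTGGCTAGTGTGGGTAATAAAGGACAACCT"))  # MASFVASVGNKGQP
```

Running `translate` on the full CDS and comparing the result with the listed protein sequence is a quick consistency check on the record.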