CuGenDBv2

MC01g1010 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC01g1010
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC01:15646488..15652171
RNA-Seq Expression: MC01g1010
Synteny: MC01g1010
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
XP_004143444.1 uncharacterized protein LOC101203646 isoform X1 [Cucumis sativus] (e value: 2.55e-80, % identity: 76.65)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVS SR S WF  GKE+E VAN  +P NS SE G GL+EPESLKFKRVDLPSSSKKV KQKW S+KETRI WEYDFVMVPS GD +QM DS DEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS
        GW EPHGP FQS+DSFAVLVPSYS+RCKE+VE SNVELLAAIK      SPES  YME WLSSLQNS
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS

XP_008440536.1 PREDICTED: uncharacterized protein LOC103484930 isoform X1 [Cucumis melo] (e value: 2.76e-82, % identity: 77.84)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVS SR S WF  GKEKE VAN  +P NS SE G GL+EPESLKFKRVDLPSSSKKV KQKWQS+KETRI+WEYDFVMVPS GD +QM DS DEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS
        GW EPHGP FQS+DSFAVLVPSYS+RCKE+VE SNVELLAAIK      SPES  YME WLSSLQNS
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS

XP_022132824.1 uncharacterized protein LOC111005581 isoform X1 [Momordica charantia] (e value: 4.04e-115, % identity: 100)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES
        GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES

XP_022132826.1 uncharacterized protein LOC111005581 isoform X2 [Momordica charantia] (e value: 8.26e-103, % identity: 99.32)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPE
        GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSP+
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPE

XP_038882086.1 uncharacterized protein LOC120073358 [Benincasa hispida] (e value: 4.50e-79, % identity: 76.79)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDG-IQMPDSADEADWS
        MAVS SR S WF  GKEKE VAN  SP NS SE G GL+EPESLKFKRVDLPSSSKKV KQKWQS+KET I+WEYDFVMVPS GD  +QM DS DEADWS
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDG-IQMPDSADEADWS

Query:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS
        IGW EPHGP FQS+DSFAVLVPSYS+RCKE+VE SNVELLAAIK      SPES  YME WLS LQNS
Subjt:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS

TrEMBL top hits
A0A1S3B0X6 uncharacterized protein LOC103484930 isoform X1 (e value: 1.34e-82, % identity: 77.84)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVS SR S WF  GKEKE VAN  +P NS SE G GL+EPESLKFKRVDLPSSSKKV KQKWQS+KETRI+WEYDFVMVPS GD +QM DS DEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS
        GW EPHGP FQS+DSFAVLVPSYS+RCKE+VE SNVELLAAIK      SPES  YME WLSSLQNS
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSSLQNS

A0A6J1BTD2 uncharacterized protein LOC111005581 isoform X2 (e value: 4.00e-103, % identity: 99.32)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPE
        GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSP+
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPE

A0A6J1BTJ7 uncharacterized protein LOC111005581 isoform X1 (e value: 1.96e-115, % identity: 100)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
        MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSI

Query:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES
        GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES
Subjt:  GWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES

A0A6J1GF21 uncharacterized protein LOC111453604 (e value: 7.52e-78, % identity: 76.22)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKET-RIDWEYDFVMVPSVGDGIQMPDSADEADWS
        MAVSFSR S WF  GKEKE VAN   P NS SE+G GL+EPESLK KRVDL SSSKKVK QKW+S+KET RIDWEYDF++VPS GD +QMPDSADEADWS
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKET-RIDWEYDFVMVPSVGDGIQMPDSADEADWS

Query:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSS
        IGW EPHGP FQS+DSFAVLVPSYS+RC+E+VE SNVELLAAIKN     SPES  YME WLSS
Subjt:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSS

A0A6J1IP42 uncharacterized protein LOC111478669 isoform X1 (e value: 2.15e-77, % identity: 75.61)
Query:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKET-RIDWEYDFVMVPSVGDGIQMPDSADEADWS
        MAVSFSR S WF  GKEKE VAN  +P NS +E+G GL+EPESLK KRVDL S SKKVK QKW+S+KET RIDWEYDF++VPS GD +QMPDSADEADWS
Subjt:  MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKET-RIDWEYDFVMVPSVGDGIQMPDSADEADWS

Query:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSS
        IGW EPHGP FQS+DSFAVLVPSYS+RCKE+VE SNVELLAAIKN     SPES  YME WLSS
Subjt:  IGWFEPHGPNFQSEDSFAVLVPSYSHRCKELVESSNVELLAAIKN-----SPESNNYMEQWLSS

SwissProt top hits
No hits found
Arabidopsis top hits
AT4G38060.1 unknown protein (e value: 2.2e-08, % identity: 37.07)
Query:  KEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSIGWFEPHGPNFQSED-----SFAVLVPSYSHRCKELVES
        KE E+LKF     P+   +V K+K           E +     S G       + D  +WS+GW EPHGP+FQS+D      F VLVP Y    + +VE 
Subjt:  KEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSIGWFEPHGPNFQSED-----SFAVLVPSYSHRCKELVES

Query:  S---NVELLAAIKNSP
        S   N +LL+A+KN P
Subjt:  S---NVELLAAIKNSP

AT4G38060.2 unknown protein (e value: 2.2e-13, % identity: 40)
Query:  KEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSIGWFEPHGPNFQSED-----SFAVLVPSYSHRCKELVES
        KE E+LKF     P+   +V K+K           E +     S G       + D  +WS+GW EPHGP+FQS+D      F VLVP Y    + +VE 
Subjt:  KEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSIGWFEPHGPNFQSED-----SFAVLVPSYSHRCKELVES

Query:  S---NVELLAAIKN-----SPESNNYMEQWLSSLQ
        S   N +LL+A+KN      P+  NYMEQWLSSLQ
Subjt:  S---NVELLAAIKN-----SPESNNYMEQWLSSLQ

AT5G65480.1 unknown protein (e value: 4.2e-12, % identity: 34.94)
Query:  MAVSFSRISWWFRGGKE-KEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWS
        MAVSF R+  W  GG   K+    +  PK S S +   LK+  + K       SSS K  K+ W  R  +    E D +        +  P+  D+ +WS
Subjt:  MAVSFSRISWWFRGGKE-KEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWS

Query:  IGWFEPHGPNFQSED----SFAVLVPSYSHRCKELVESSNVELLAAIKN--SPESNNYMEQWLSSL
        IGW EPHGP F++ED     F VLVP Y    K++++ S  ++     +  +P+  N MEQWLSS+
Subjt:  IGWFEPHGPNFQSED----SFAVLVPSYSHRCKELVESSNVELLAAIKN--SPESNNYMEQWLSSL


Sequences
CDS sequence
ATGGCCGTGTCTTTCAGTCGTATTTCATGGTGGTTTCGGGGTGGGAAGGAAAAAGAGGTTGTTGCAAACCAGTTCTCTCCAAAAAATTCTTGCTCTGAGATTGGTGCTGG
TTTGAAAGAACCGGAGAGTCTCAAGTTCAAGAGGGTCGATTTGCCCTCATCATCAAAGAAGGTGAAGAAGCAGAAATGGCAAAGTAGGAAAGAAACAAGGATTGACTGGG
AATACGATTTTGTGATGGTACCATCCGTTGGTGATGGCATACAAATGCCCGACTCTGCTGATGAGGCCGATTGGTCCATTGGTTGGTTTGAGCCTCACGGTCCTAACTTT
CAAAGTGAAGATAGTTTTGCCGTTCTGGTCCCTTCCTATAGCCATCGTTGCAAGGAGCTGGTGGAGAGTTCAAACGTGGAGCTTTTGGCTGCCATCAAAAATTCACCTGA
GAGCAATAACTATATGGAACAGTGGCTTTCTTCCCTGCAAAACTCAGAATCCTGA
mRNA sequence
GAGAGAGCTGGTTCTCCCTACACGTGGCAGTGAATTAAACACCGCATAAGCAAACGGGTATGTTTTTTTAACGCCACGTCACAAGATTTGGACGGTTCCATAATAATTTA
TCTCCATGATTTGATTTGATTTGATTTTCGTCTGAGTTCCATGATTATGATTTAATTAATGCCAAAATCACAATTTGCTTTTTTAAGTTAAAATATTGTTAATCAAATTA
TCGAACGATTCACAATTGATTGGATTGGATTGGAGGAGCGCTCAGCGGCAGGGGAGCGATATAATTGACCGCCGAATCATCATCGCCACTGCAACTTTGCATCTCGACGG
CTGAGCCTTTTTTTGAATAGAATATTGGAAGCTCCCTATAAATTTAGACGAAAGGAGCTTTGATAATTGCTTCTTCGGCCTTACGGAAGAGGAATGGGGAATAGGAGGGA
GAGAGCGAGATAATTTAGAGGGCGAAAGCCAAATCTGTTCCGATTTCTCTGTTTGGTTCCTAGAGAGATTTCAGGAATCTTTTGATTAGCAGATTTCTCCTGATTACGGT
GAAATAATCGGTGCTTGGCAGACTCCTCGGTTCGTTTTCTCTTGCGTCCATTCTGTGAGAAACGGAGTGGCCTTTTGATCAGTCGACGTCTCGATTCCCCATTTTGTTCT
GAATCGCACTCTAGCAAATAACTGGACGATCTTCGCTTTGTTATCCGATTGAAAGAGGAACGAGTATTCAATCTCGATTATTGGACTCAAATCATTTTTTTTGTTCTTTT
TTGTTGTGGTTCGTGTTTTTTTGGAGCAAACAACTGATCTATTCTGGAGTAAGATTCTGACTGTCTGGGTCTGAAAGTGACGAATATTTACACTTTCCGAGCTTTGGCTA
ATAGTTTAGGCAAACTTTGTTTGGATGCTTTCTGGGTTCAGTATGTTATGAGAAACTTCGTTTTTGTTTTCCTCTAGAAATCGACCGGTTTCCATTCTTAGCGATTTCTT
CGGGACGAGACAATCGGGGAGCACATCCTTGGTTGACTGAAATTGGGTTTGGTAATTGGAGATTTTTGGGGAACTTTCTTCTCTCCGTGACCAATCAGCTTCGGAGTTCT
TAAAGTTCGGGCTTTTTGTTGATCTTTTAAGAATTTGATTAGATTTTTCCTAGTTTTTTAACATGACATTTTTCATTTTCCATATGGAGGTTACAGCATCAAGCAGGTTC
CATCTGCCACTCCTTTCCATTTTGGGATTCTACCTCTAAATTCTTTCGGAAGAACTCTTAGCTATCTCCCTAATTTCCCCCAGAAGTCTTTCACTTGTCCGGAGTAGAAC
CCATGGCCGTGTCTTTCAGTCGTATTTCATGGTGGTTTCGGGGTGGGAAGGAAAAAGAGGTTGTTGCAAACCAGTTCTCTCCAAAAAATTCTTGCTCTGAGATTGGTGCT
GGTTTGAAAGAACCGGAGAGTCTCAAGTTCAAGAGGGTCGATTTGCCCTCATCATCAAAGAAGGTGAAGAAGCAGAAATGGCAAAGTAGGAAAGAAACAAGGATTGACTG
GGAATACGATTTTGTGATGGTACCATCCGTTGGTGATGGCATACAAATGCCCGACTCTGCTGATGAGGCCGATTGGTCCATTGGTTGGTTTGAGCCTCACGGTCCTAACT
TTCAAAGTGAAGATAGTTTTGCCGTTCTGGTCCCTTCCTATAGCCATCGTTGCAAGGAGCTGGTGGAGAGTTCAAACGTGGAGCTTTTGGCTGCCATCAAAAATTCACCT
GAGAGCAATAACTATATGGAACAGTGGCTTTCTTCCCTGCAAAACTCAGAATCCTGAAGCGCGCTGCTGCTTCTTTCTGGTTTCGAGCGATGAAAGGTGTGAAAAAATAC
AGTCCAGGTGCTTCATTTCTGCAAAGATTATCATCATCTTGACCAGCAGTTGATAAGATAGAATAACAAACCCAGTTTTGTGCATATCATGTTAAAGATCTTTCTTTACA
TACTTCTATTTTGCAGATAATTTGATTGAAACTCTAGGAACCATGGGAAAAGAGATTTTCTTTCTTTCTTTTTTTTCGTTTTTTTTTCTGTGTATATGTGGTAAGATACA
GCATCCATAGTCTAACTACTACTGTTTCAATTCTCCAATTTCCTTCAAATATATCTTAATGCAACATTGCCCTTGTGTAAATCTGTAATTCTCTCATATACTTGTTCATT
CTCCTTGTGCCTTAAACTTCAATTGACCCTCTCATAATTAATTTGCAAAGATCAATGGTTGCATGTCCTGCTTCAATGGCTCTCATTGAATTATGGTGGCCATTTATTAG
AAGCTATCAAATGCCTGCAATATATAATACGATATTCTCTCTGAAGCAGAGGTTGATTGATTATCTACAAAGCCAGTTGCAATGTCGAGCAAAACATTTTACATTGGGAA
AGACAGTTTTGAAGGCAGATAGCAAATTGGGTAGTTGATTAAGTGGTTTGGCTCCTCAGTTTGGTTTAATTGAAACCTCCGATACCACCTTGGCTCCTCTTCCCCTGGCC
ATGTACAGGCCTATGTGGTGCATCAGCTCCTGCCACAACCAAAATCAAAATAACCTTTTTCAGTATTGTGCTTATTGGGAATTGGGATCCTTATATATCTGAAAAAAGTA
GAGGAAAATATTTTTACAATGGAGTTCACAGACTTGAAAACTGAAGTAACATTCTTCACCATTTGGAAGAGTTCTTTTTTCCAACTCAGAAATCATCCATTGGAGATGCA
AAATTCAAGAATAATTACTACTTTGAGGCATATATGCTTAACAGTAAGAACGGTGAAATGCATTCTTATTTTTTCATTTCCATTTAACATTTGATGAGTTTGACAATGTC
TGAAAAATAGAAACATGAACTGTCTTGCGAATTATCCCTCTAATTGAATAAGGTTAGCTCTTTTTTTTTTCCTTCTTATTCAGTAGATATTATCTCTTATCCATTCCTAA
AAATCTCTTTGTCCAGTATTTATATTTTTATCCTCTTTTTTATTTATGGCATCAGCACAAGGATGAAGACAAGCATAGCCATGTTTGCAGTGAGGTCCGATCAGCTCCAT
CAACAATAAAAATAAGGTTTGAGCAAATGAATGGACAAAAACGCAATGATAGGCTCTTCAAAGAAATAAAGAATAAGCGAAAAACAAAACCAAACAGTCCTTGTCATGAT
GTTAAGAGACTTATTTTATATGAAAGTAATGGATTAATAATTTAGATACCTGTGGATGAGCAATTGCATCGCCTAAAGTCTTGGCTACAGGAACTGGAAGATCAAAAGTA
GCGCAGCCACCTGAATATACAATCACATCCTCTAATGGCGAAAGGAAACCGCGATTTCTTGCAGATAATGTCGAACATACAAAATCCAGCACGCATATGTCGGTGCAAAT
TCCCAGCACTAAGATCTTTTTTTTTTTTTTTTCAGAGAAGGAAACAGATAAAAAATTACTTCTCTTTAATTGTAAAAAATGAAAAAACGTAGTCAAATCAGAAGAGAAAT
TTGTATCTCCGATCGAAAATTGGGGAATACTCACGACTTTGATCTGATTGTTTTTCACCCAGTCAACGAAAACGTTAGAGGCATCTTTCTCCAAACAACCTAGAAATCCA
TCAATGCAATCTTTGCGTCTCAGCGTTACATTCGCTTCATTTTCTAGCCATTGCAAGGCTGCAAACAAGGAACAGAGATTTTCAAAATTGGCTGTGAATTGATCGCCAAA
TACTAAAGTGAAGATTCGTTTACGGAGAATTTCAAAATTGTACCTGGAACTAGTTTGGACTCGTCGGTTCCAGCAATGCAATGAGGAGGATAGGGGGGCTCGGGAATATC
CGGATGATGGGAATCAATAAAGGCGAATATAGGCCACTTCTTCTCACAGAAAACTCCGGCGAATCTCACCGACTCCTTCACCATTTCGGAAATTTGCTTGTCGTGCTGTT
TCGGAGCCTATAAACAAGTAGAGCCACGATCCAGTCCAACCAACAGAGACATTCAATCGGAAAAAAGAGAAAATCAATAGCGAATCAAAAGTTGAATCAAAAGCCTATTA
ATATCAAGGGCTAATTTGCAGAAGAAGGTGTTTGGATTAGATCGGAAGACAGAGGAGAAATCAGGGAATGGAATTACCAAATTGCCAGCGCCGACGGTGCAGAAGCCATT
AACGACATCGACGAGGACGAGGCCGGTGTTGACGTCCCCGGACAAAACGAAGGACTCTTGGTCCACCGGAAGGTGCTCCTTCAACAATTCTACGACCTCCGAGACCATCT
TCTTCTTCTTCTTCTTCTTCTTCTTCCTGTAAAGGGTAATAGACCAAATGCGAGAGGCGCCATCACGGACCTTGAAATACAGGAAGAAAGAAGGTTGGTCAAAAAAACGA
G
Protein sequence
MAVSFSRISWWFRGGKEKEVVANQFSPKNSCSEIGAGLKEPESLKFKRVDLPSSSKKVKKQKWQSRKETRIDWEYDFVMVPSVGDGIQMPDSADEADWSIGWFEPHGPNF
QSEDSFAVLVPSYSHRCKELVESSNVELLAAIKNSPESNNYMEQWLSSLQNSES