; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0121 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0121
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUnknown protein
Genome locationMC05:851281..856733
RNA-Seq ExpressionMC05g0121
SyntenyMC05g0121
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134039.1 uncharacterized protein LOC101203442 [Cucumis sativus]3.62e-9271.95Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI
        MV+N  PICRISVSST +AVP+KMKDQS  YPKVKVREE++ DD  P VYEQKRSYLLSLKD ESLFL+DSSN+PGK  EHRVS  S A+IPKAC  N I
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI

Query:  EPSPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR
        +PS SE QE   ++  +V+EDN  N RA+SIPMPRAVVSSPEND MIGKKNRKTTEKPSVLKN NSVQSRH+QCKI+A HS NEN I++R+SKD  D+K 
Subjt:  EPSPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR

Query:  RQVGKSGTVNRGSSFMSKKTP
        R VGK+GT  RG SFMSK TP
Subjt:  RQVGKSGTVNRGSSFMSKKTP

XP_008438459.1 PREDICTED: uncharacterized protein LOC103483545 [Cucumis melo]1.39e-8969.96Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI
        MV+N  PICRISVSST +AVP+KMKD+S  YPKVKV+EE++ DD  P VYEQKRSYL SLKD ESLFL+DSSN+PGK  EH VSP   A+IPKAC  N I
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI

Query:  EPSPSESQE--RRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADN
        +PS SE QE  RRCK VD   E+N  N RA+SIPMPRAV+SSPEND MIGKKNRKTT++PSVLKN NSVQSRH+ CKI+ASH GNENPI++R+SKD  D+
Subjt:  EPSPSESQE--RRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADN

Query:  KRRQVGKSGTVNRGSSFMSKKTP
        K R VGK+GT   G SFMSK TP
Subjt:  KRRQVGKSGTVNRGSSFMSKKTP

XP_022157963.1 uncharacterized protein LOC111024560 [Momordica charantia]7.63e-147100Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ
        SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ
Subjt:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ

Query:  VGKSGTVNRGSSFMSKKTP
        VGKSGTVNRGSSFMSKKTP
Subjt:  VGKSGTVNRGSSFMSKKTP

XP_038895103.1 uncharacterized protein LOC120083415 isoform X1 [Benincasa hispida]7.32e-10376.71Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MV+N TPICRISVSST +AVP+KMKDQ+  YP+VKVREEK  DD HPAVYEQKRSYLLSLKD ESL L+DSSNSPGKEH VSPS SA+IPKA IPN I+P
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQ-ERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRR
        S SESQ ERRC+ VD   EDN  N RA+SIPMPRA++SSPEND+MIGKKNRKTTEKPSVLKNHNSVQSRH+QCKI+ASHS NENPI++R+SK+ AD+K R
Subjt:  SPSESQ-ERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRR

Query:  QVGKSGTVNRGSSFMSKKT
         +GK+GT  RGSSFMSK T
Subjt:  QVGKSGTVNRGSSFMSKKT

XP_038895110.1 uncharacterized protein LOC120083415 isoform X2 [Benincasa hispida]1.06e-10477.06Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MV+N TPICRISVSST +AVP+KMKDQ+  YP+VKVREEK  DD HPAVYEQKRSYLLSLKD ESL L+DSSNSPGKEH VSPS SA+IPKA IPN I+P
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ
        S SESQERRC+ VD   EDN  N RA+SIPMPRA++SSPEND+MIGKKNRKTTEKPSVLKNHNSVQSRH+QCKI+ASHS NENPI++R+SK+ AD+K R 
Subjt:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ

Query:  VGKSGTVNRGSSFMSKKT
        +GK+GT  RGSSFMSK T
Subjt:  VGKSGTVNRGSSFMSKKT

TrEMBL top hitse value%identityAlignment
A0A0A0L861 Uncharacterized protein1.75e-9271.95Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI
        MV+N  PICRISVSST +AVP+KMKDQS  YPKVKVREE++ DD  P VYEQKRSYLLSLKD ESLFL+DSSN+PGK  EHRVS  S A+IPKAC  N I
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI

Query:  EPSPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR
        +PS SE QE   ++  +V+EDN  N RA+SIPMPRAVVSSPEND MIGKKNRKTTEKPSVLKN NSVQSRH+QCKI+A HS NEN I++R+SKD  D+K 
Subjt:  EPSPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR

Query:  RQVGKSGTVNRGSSFMSKKTP
        R VGK+GT  RG SFMSK TP
Subjt:  RQVGKSGTVNRGSSFMSKKTP

A0A1S3AW33 uncharacterized protein LOC1034835456.75e-9069.96Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI
        MV+N  PICRISVSST +AVP+KMKD+S  YPKVKV+EE++ DD  P VYEQKRSYL SLKD ESLFL+DSSN+PGK  EH VSP   A+IPKAC  N I
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGK--EHRVSPSSSAQIPKACIPNVI

Query:  EPSPSESQE--RRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADN
        +PS SE QE  RRCK VD   E+N  N RA+SIPMPRAV+SSPEND MIGKKNRKTT++PSVLKN NSVQSRH+ CKI+ASH GNENPI++R+SKD  D+
Subjt:  EPSPSESQE--RRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADN

Query:  KRRQVGKSGTVNRGSSFMSKKTP
        K R VGK+GT   G SFMSK TP
Subjt:  KRRQVGKSGTVNRGSSFMSKKTP

A0A6J1DUS8 uncharacterized protein LOC1110245603.69e-147100Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ
        SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ
Subjt:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQ

Query:  VGKSGTVNRGSSFMSKKTP
        VGKSGTVNRGSSFMSKKTP
Subjt:  VGKSGTVNRGSSFMSKKTP

A0A6J1FAB2 uncharacterized protein LOC111443687 isoform X77.70e-8470.1Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MVQNPTPICRISV       P+ MKDQ  +YPKVKVR+E + DD HPAV EQKRSYLLSLKD ESLFLEDSS+S GKEHRVSPSS+A++PK   PNV++P
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR-R
        S SESQ++RC         N +NTRA+SIPMPRAV+SSPEND+MIGKKNRKTTEKPSVLKNHNSVQSRH+QCK +A HSGNENPI +RKSK+   N + R
Subjt:  SPSESQERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR-R

Query:  QVGK
          GK
Subjt:  QVGK

A0A6J1FGL2 uncharacterized protein LOC111443687 isoform X65.30e-8269.76Show/hide
Query:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP
        MVQNPTPICRISV       P+ MKDQ  +YPKVKVR+E + DD HPAV EQKRSYLLSLKD ESLFLEDSS+S GKEHRVSPSS+A++PK   PNV++P
Subjt:  MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEP

Query:  SPSESQ-ERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR-
        S SESQ ++RC         N +NTRA+SIPMPRAV+SSPEND+MIGKKNRKTTEKPSVLKNHNSVQSRH+QCK +A HSGNENPI +RKSK+   N + 
Subjt:  SPSESQ-ERRCKTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKR-

Query:  RQVGK
        R  GK
Subjt:  RQVGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G21865.1 unknown protein6.6e-0534.48Show/hide
Query:  VYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEPSPSESQERRCKTVDVVEEDNNINTRANSI
        +YPKVK+       D+    Y+QK S L       S +L+   +   +E  +     A+IPK  IP+V+  S SES+E   K +   + +    T+A+  
Subjt:  VYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEPSPSESQERRCKTVDVVEEDNNINTRANSI

Query:  PMPRAVVSSPENDIMIGKKNRKTTEKPSV-LKNHNSVQSRHAQCK
          PRAVVSSP+ND MIG  N     K    LK+ + ++SR +Q K
Subjt:  PMPRAVVSSPENDIMIGKKNRKTTEKPSV-LKNHNSVQSRHAQCK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAAAACCCAACGCCCATATGCCGCATCTCTGTTTCTTCGACGGCCCAAGCTGTTCCCAAGAAGATGAAAGACCAATCCACTGTCTATCCCAAGGTGAAGGTGAG
GGAGGAGAAAGACCCCGATGATGATCACCCTGCTGTATACGAGCAGAAGAGAAGTTATCTGTTGTCTCTGAAAGATTTTGAATCACTCTTCCTTGAAGACTCCTCCAATT
CTCCAGGAAAAGAGCATCGTGTTTCTCCATCATCCAGTGCTCAAATTCCAAAAGCTTGTATTCCCAACGTAATCGAACCATCCCCTTCTGAATCCCAAGAAAGGAGGTGT
AAAACGGTTGATGTTGTTGAGGAGGACAACAACATAAATACTAGAGCTAATTCAATCCCAATGCCACGTGCCGTCGTATCCAGCCCTGAAAATGATATAATGATAGGGAA
GAAAAACAGAAAAACAACAGAAAAACCATCAGTTTTGAAGAACCACAATTCAGTTCAGTCCAGACACGCACAGTGTAAGATCATGGCTAGCCACAGTGGCAATGAAAATC
CAATTACCACGAGGAAGTCCAAGGATGCTGCGGATAATAAACGTCGTCAAGTTGGAAAGAGTGGTACGGTGAACAGGGGCAGCAGTTTCATGTCAAAGAAAACTCCCTAG
mRNA sequenceShow/hide mRNA sequence
TCCCAACTCCAATCACTTTTGACTATCTCAATACTATTAAAAAGCAAACGAAATCAAATACGATTTGCAGATTCAAACATTCCTACAAAAAGGGAAATATCAGTGTCACT
TTTGGGAGAAATCTCGCGAAGATCAAACCCAAATGAACTACCAAACAAGAACGAGAAAAAGAAAAAATAAACCCAATTGGAAATATAGATAAAAAAAAGGTTCAAATAAT
ATTTTGGTTCAGAAAATAAAAGGGGAATGGAGGAGATGGAACGGCATCGTTGACAATACAAGAACTTCTCTTCTGAACTCAGCAGCAGCAAAAACTAAAAGAAAAGCATG
GCTAAATGCTAATGGGGTCCGCCACTGCGAAGGTTTCAGGAAAATGCGCCCCTTGGCGTCGGTTGACCCAACGATCGCTTACACGTTCGATTTGAATCATTTCATTTTCT
GAAATCCCAAAAGCCATTTCTCCCAAAACTTCGCTCACTGATCATTCTCTTCTCTTTGAAGACACGCATTGAGCACGTCATTTCTGGGTATCTTCTTCTACTGATTCGTT
CTGATGGTTCAAAACCCAACGCCCATATGCCGCATCTCTGTTTCTTCGACGGCCCAAGCTGTTCCCAAGAAGATGAAAGACCAATCCACTGTCTATCCCAAGGTGAAGGT
GAGGGAGGAGAAAGACCCCGATGATGATCACCCTGCTGTATACGAGCAGAAGAGAAGTTATCTGTTGTCTCTGAAAGATTTTGAATCACTCTTCCTTGAAGACTCCTCCA
ATTCTCCAGGAAAAGAGCATCGTGTTTCTCCATCATCCAGTGCTCAAATTCCAAAAGCTTGTATTCCCAACGTAATCGAACCATCCCCTTCTGAATCCCAAGAAAGGAGG
TGTAAAACGGTTGATGTTGTTGAGGAGGACAACAACATAAATACTAGAGCTAATTCAATCCCAATGCCACGTGCCGTCGTATCCAGCCCTGAAAATGATATAATGATAGG
GAAGAAAAACAGAAAAACAACAGAAAAACCATCAGTTTTGAAGAACCACAATTCAGTTCAGTCCAGACACGCACAGTGTAAGATCATGGCTAGCCACAGTGGCAATGAAA
ATCCAATTACCACGAGGAAGTCCAAGGATGCTGCGGATAATAAACGTCGTCAAGTTGGAAAGAGTGGTACGGTGAACAGGGGCAGCAGTTTCATGTCAAAGAAAACTCCC
TAGAAACAAGAGACCAATCTCTCGGGTGATATCTGGCATGTCATAAACCTTCCAGTAGGCCAAATATAAGCCTTGAGTCGATGCTTGCATTGGATAGAAGTATCGTTGTG
GTGCTGTGTCAAATTGCTCATCAATGCAAATGGGTTACTTGTTCATACTATAGCTTGGAATCAACTTCCTCGCTAACATGAATTACTTTATATCCATGTAGCCTGAGATG
TTTTGAACTTGAAAACTTGAGACCAATGAAGTTCTAGTAACAGCCAGCAACACCTTTTTCACTTATGTTGCTGTGAAACCAATCTAGACATGCACTATTAAACATTATGC
TGTTTGGAACTCGGCTTCTCTCATTTTAGAATTTCATTTTCCATCTCCAGATTCAATAACTTTACAACACAACAGATTCTATCAAAATGTGGATTTGAACGATCTCCAGA
AAGAAAGACATGCCCACTATTACTTATCTCAATCCAGCTACACCCTGGTTGTTTCTTAACTCCATTAATCTTCATTCTTGCCAATATCCTCTCAGCATCTTCTCTTTTCC
CACCTCTCAAGTACATTTCTGCTAAAATCAAATACACTCCAGAGTTATGAGGCTCCTTCTCCAGGACCTTCTCACCTGCAATCACACCTACATCATAATTCTTGTGGATT
CTGCAATCTCCAAGCAAAGCCCCCCAAACGCTTGAAGGAATTTCAATTCCTTCTGCTTTCATTTCGACTAGAAAACTCAATGCCTCATTGATAAGCCCAAATCTCCCAAA
CAAGTCAACTAAGCACGTGTAATGCTCACTTAATGGCTGAATAGAACACATGTTTTTCATGAAATTGAAGTAGTGCCTACCCTGGTCCACCAGACCGTTATGGCTACAGG
CAGACAGAACACCAATAAAGGTGATATGATTGGGTTCTACGTTAGCCAATCTCATTTTTTCAAACATCTCCAGAGCTTCGTTACCATTTCCATGTTGAGCAAACCCACAG
ATTATAGAATTCCAAGAAATCACATCCCTGGTTGACATGGAAGAGAACTCTATCAAAGCACAGTCCATGTTTCCACATCTTGCGTACATAGTAACCATAGCATTCGAGAC
TGCAACAAAGCCATTGAATCCCGCTTTTAGAACAAGTGCATGTGTTTGTTTACCAAGCTGCAAAGTTTCCAGGCCGGAACAGATTGTTAGAACACTCGTAAATGTAGCTT
TATCAGGATGTGGACCTAACCTTATCATTCTAGTGAAAAGCTTCAAACCTTCCTCACCTTGATCGTTTTCTCCTAATCCAAATATTGTAGCATTCCACACAGTTGCATCC
TTGTACAGAATCGATTCAAAAATTTTCACAGCCATCCCAACTTCACCAATCCCAAAATATCCCACGATCAGATTAGTCCAGGAAACTATATTACCATATGGGGTTTTCTC
AAGAAATGCATGAGTTTCTTTTACCAGTCCCTTTCTTATGTACGCCAGCACAATACTGTTCCATGTTTTTTGGCATTTCTCTGGCATATCCATAAACAGCCTCCTAGCAT
CATCAACCCTTTGGCTTCCAACCAACCCATTCACTAAATCGTTCCAAGAATCAAAGTTCCTTTCAGGCATTTTCTCGAACAACTCCTCTGCCTCTTTAATCTGGTCATTT
TCTATATACCCGGCCATCATCGCATTCCAAGCTCGAGCATCTTTCACGGGCATGTTCTCAAATAGCTCTCTCGCTTCATCTATACGTCCTGCACGCGCAAGCCCCGAGAT
CATGATGGTCCACGACACAAGATCCCTCCTGCTCATTCTATTGAAATACTCTTCAGCCAAATCAAGTCGGCCGCAATTTACAAGCCCGCCAATTATCAAGTTCCAAGAAA
TTACATCTCGCAAAGGCATTTTATCGAACAGCTGAACCGCTGCCTCGAGTAACTCATTCCGAATATATCCCGCAATCATTGAATTCCAACTCACAACATCTCCGAACGGC
ATTACAGCAAAAACATCTTTTGCACCATCCACATCCCCGCATTGCATCAGCCCAGCAATCACGGTGTTATAAGAGAAAACATCACGCTCGGGCATTCGACGAAATAAGCT
AATCGCATCACCACGCCGCCCGTTCAAGAAGTACCCTCGAATCATAGCATTCCAAGTCACAACATTTCTTTGAGCCATTTCGTCGAACAGTTTCTGGGCTTCTTCTACTA
AACCATTTCTCATACAGTTTGAGATCTCGGAATTGAGAGGCTTTAGATTTAGAAACGACGCCGGTGTATGACGGGATTGTAAGCAGGAGGTCGAAGTTGCGAGAGAGATT
GATGTCCTGAGATGTAGAGAGAAACTCCATGTGGGTCTGAGAACAAGTGAGGGCCGAGCTACAAAGATCATTCTCCATTAGTTTAGAAACAATAGCACATGGGAAGCAGA
AGAACAGAGTCGGTCCCGTAATATATAGTTTTCGCCTGAAGTTTCACCGGAGCCACTTCCAAGCAAGTCAAGCTATGAACAAAATACAGAAGAACAGGTTCGGATGTTGT
TCATAGCTCAAATACAGTTGCCACTTCCAAGCAAGTCAATCTATGAACAAAATTGCCCATTGCCTTGTGGGAAACCTTTCGGCCGGGAAGACGAAAATATTTGCATCCAC
AAAAATGTCCAGAAAATCATTAAGAAAAAATCAACTCGTCTTTAATTTTTGGGTCGATTTTTTTTTAATTTCTTTTTTGGCCAAACTGAGCTAAGTCACTGCGACTTGGA
Protein sequenceShow/hide protein sequence
MVQNPTPICRISVSSTAQAVPKKMKDQSTVYPKVKVREEKDPDDDHPAVYEQKRSYLLSLKDFESLFLEDSSNSPGKEHRVSPSSSAQIPKACIPNVIEPSPSESQERRC
KTVDVVEEDNNINTRANSIPMPRAVVSSPENDIMIGKKNRKTTEKPSVLKNHNSVQSRHAQCKIMASHSGNENPITTRKSKDAADNKRRQVGKSGTVNRGSSFMSKKTP