CuGenDBv2

Moc08g30770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g30770
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr8:22064896..22074866
RNA-Seq Expression: Moc08g30770
Synteny: Moc08g30770
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] | e value: 5.7e-100 | identity: 58.89%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+TSI+QLL S+KLNGDNY  WKSNLNTILV+DDLRFVLTEECP APA NANRTVR+AYDRWVKAN+KA VYILAS++DVL+KKH+ +A A+ IMDSL+ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQPS S+ H+AIK++Y  +MKEG+SVREHVL+MM+HFNI EVNG  ++E +QV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI
         E E NVA T K+KF +GSS  +K GPS   K   KKK   GKGKAP  +K K+   KGKCFHCN++GHWKRNCPKYLAEK+AEK  QGK+DLLV+ETC+
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI

Query:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        +E D ST+ILDSGATNH+C SFQETS W++L+E E TL +  G
Subjt:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.4e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 3.2e-95 | identity: 55.59%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M++ I+QLL S+KLN DNY  WKSNLNTILV+DDLRFVLTEECP  PA NANRT R+AYDRW+KANEKA VYILAS+SDVL+KKHE LA A+EIMDSL+ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQP  S+ H AIKY+Y  +MKEG+S+REHVL MM+HFNI EVNG  ++E +QV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI
         E E N+ATT K KF +GSS  SK GPS   + I+KK    GKGK P   KGK+   KGKC+HC ENGH   NCPKYL +K+AEKE Q K+DLLV+ETC+
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI

Query:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQGSSSQQK
        +EN++ST+ILDSGATNH+C SFQE S W+ L E + TL +  G     K
Subjt:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQGSSSQQK

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.4e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 4.4e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 2.1e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

A0A5A7TWB9 Gag/pol protein | e value: 2.1e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

A0A5A7TWX1 Gag/pol protein | e value: 1.6e-95 | identity: 55.59%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M++ I+QLL S+KLN DNY  WKSNLNTILV+DDLRFVLTEECP  PA NANRT R+AYDRW+KANEKA VYILAS+SDVL+KKHE LA A+EIMDSL+ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQP  S+ H AIKY+Y  +MKEG+S+REHVL MM+HFNI EVNG  ++E +QV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI
         E E N+ATT K KF +GSS  SK GPS   + I+KK    GKGK P   KGK+   KGKC+HC ENGH   NCPKYL +K+AEKE Q K+DLLV+ETC+
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI

Query:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQGSSSQQK
        +EN++ST+ILDSGATNH+C SFQE S W+ L E + TL +  G     K
Subjt:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQGSSSQQK

A0A5D3CPJ6 Gag/pol protein | e value: 2.1e-92 | identity: 54.34%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+++ + +L +DKLNG+NY  WK+ +NT+L+IDDLRFVL EECP  PA NA RTVR+ Y+RW KANEKA  YILAS+S+VL+KKHE +  AREIMDSLQ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQ S  I HDA+KY+YN +M EG+SVREHVLNMMVHFN+ E+NG +++E SQV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE
         + E NVA TS +KFH+GS+ G+KS PS       KKKK     KA  A   T  K K AKG CFHCN+ GHWKRNCPKYLAEK+  K KQGK+DLLV+E
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTA---TKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIE

Query:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        TC++ENDDS +I+DSGATNHVCSSFQ  S WRQLE  E T+ +  G
Subjt:  TCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

E2GK51 Gag/pol protein (Fragment) | e value: 2.8e-100 | identity: 58.89%
Query:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA
        M+TSI+QLL S+KLNGDNY  WKSNLNTILV+DDLRFVLTEECP APA NANRTVR+AYDRWVKAN+KA VYILAS++DVL+KKH+ +A A+ IMDSL+ 
Subjt:  MSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSKKHERLAIAREIMDSLQA

Query:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------
        +FGQPS S+ H+AIK++Y  +MKEG+SVREHVL+MM+HFNI EVNG  ++E +QV                                             
Subjt:  LFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQV---------------------------------------------

Query:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI
         E E NVA T K+KF +GSS  +K GPS   K   KKK   GKGKAP  +K K+   KGKCFHCN++GHWKRNCPKYLAEK+AEK  QGK+DLLV+ETC+
Subjt:  LEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKGIQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCI

Query:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG
        +E D ST+ILDSGATNH+C SFQETS W++L+E E TL +  G
Subjt:  LENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTLWLDQG

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCCATTCTTCAATCAATTCAAGGTATGGTGAAAATGATGAGGGAAGATAGGCACGAAAGAAGGGCGCAACAACAAAGAGAAGAACGAGCCTTACAGGAAGAT
GAAGGTCATCAGGCCGGATCCAGTCGAGCCAAGTCTGTCAAGAACAACAATACCTCCAATACCCATCGTATCTCCATAATGTCCCATGTACGCGCCATACGAGTG
TCACGTACGCCTTTGTCTTCTAGCTCCAACGATATCTACACACAGAAATCTAGCACGTGGTGCTCTACTACTGATAGCACTCTAATGGAAAAGTTAGCACTTAAG
GTTGGAACATGCAAGTTTGAAGTAGAGAGAGAGAAAGCGATTCGGTGGTACGTGGAGACCACCCATGTTCTCATTCCCTCTGTTCCCAATAGAGATGCCCCATCT
TTGGAGCGTGTCAGGATTCAGACACAGGCTACAAACGGTTCTTCCACTGGCTGCAACCTTGGGTCTGTTGTGCTCAGTTTTAACCCTAGATTGAGTAAGCTCATC
AGCGCTGCTCAATATGCCTCCCATTTCAGGGATAAGACTGGATATATAACTGGGAACATAGAATTAGCAAGACGAAATTCACTCCTACCCGATTTAGGGATAGTA
GAGAGGAGTAGTCGACTGCCCTTCGGTGGGGGCTATTCTGACTCACTGAAGTATCGTTGCAAAACAATGTCTACTTCAATTATACAACTGCTAGTTTCCGATAAA
CTAAACGGAGACAATTATGAAATATGGAAATCAAATTTAAACACAATACTAGTTATTGACGATCTAAGGTTCGTTTTAACGGAGGAGTGTCCTCCAGCCCCTGCC
CCTAATGCAAACCGAACAGTTCGGGATGCATATGATAGATGGGTTAAAGCTAATGAAAAAGCTTGTGTCTACATTTTAGCCAGTATATCAGATGTATTGTCTAAA
AAGCACGAAAGATTAGCTATTGCAAGAGAGATCATGGACTCTCTACAAGCCCTGTTTGGACAACCATCAACATCTATCATGCATGATGCGATTAAGTATGTTTAC
AACTGCAAAATGAAGGAAGGATCTTCTGTAAGGGAGCATGTTTTAAACATGATGGTTCACTTCAATATTGTAGAAGTGAACGGCCCAATCATGAACGAAATAAGT
CAGGTTCTTGAGGCTGAGGTAAATGTTGCTACTACCTCAAAGAAGAAATTTCACAAGGGATCTTCCTTTGGGAGTAAATCTGGACCTTCTTATCAGAAGAAAGGA
ATTCAAAAGAAGAAGAAGGACAATGGGAAGGGGAAGGCTCCGACTGCGACAAAAGGCAAGGAAAAGATTGCAAAAGGAAAATGTTTCCATTGCAATGAAAATGGG
CACTGGAAAAGAAATTGCCCCAAATACCTCGCCGAGAAAAGAGCTGAGAAGGAAAAGCAAGGTAAATTCGATTTACTAGTCATTGAAACATGTATACTGGAGAAT
GATGATTCTACCTATATACTAGATTCAGGAGCCACTAATCATGTTTGTTCTTCTTTTCAGGAAACTAGTTTCTGGAGACAGCTTGAAGAAGATGAGTTCACTCTC
TGGTTGGATCAGGGGAGCTCATCTCAGCAAAAGCAATGGGAGGAATTAAGTTGA
mRNA sequence
ATGGCCATTCTTCAATCAATTCAAGGTATGGTGAAAATGATGAGGGAAGATAGGCACGAAAGAAGGGCGCAACAACAAAGAGAAGAACGAGCCTTACAGGAAGAT
GAAGGTCATCAGGCCGGATCCAGTCGAGCCAAGTCTGTCAAGAACAACAATACCTCCAATACCCATCGTATCTCCATAATGTCCCATGTACGCGCCATACGAGTG
TCACGTACGCCTTTGTCTTCTAGCTCCAACGATATCTACACACAGAAATCTAGCACGTGGTGCTCTACTACTGATAGCACTCTAATGGAAAAGTTAGCACTTAAG
GTTGGAACATGCAAGTTTGAAGTAGAGAGAGAGAAAGCGATTCGGTGGTACGTGGAGACCACCCATGTTCTCATTCCCTCTGTTCCCAATAGAGATGCCCCATCT
TTGGAGCGTGTCAGGATTCAGACACAGGCTACAAACGGTTCTTCCACTGGCTGCAACCTTGGGTCTGTTGTGCTCAGTTTTAACCCTAGATTGAGTAAGCTCATC
AGCGCTGCTCAATATGCCTCCCATTTCAGGGATAAGACTGGATATATAACTGGGAACATAGAATTAGCAAGACGAAATTCACTCCTACCCGATTTAGGGATAGTA
GAGAGGAGTAGTCGACTGCCCTTCGGTGGGGGCTATTCTGACTCACTGAAGTATCGTTGCAAAACAATGTCTACTTCAATTATACAACTGCTAGTTTCCGATAAA
CTAAACGGAGACAATTATGAAATATGGAAATCAAATTTAAACACAATACTAGTTATTGACGATCTAAGGTTCGTTTTAACGGAGGAGTGTCCTCCAGCCCCTGCC
CCTAATGCAAACCGAACAGTTCGGGATGCATATGATAGATGGGTTAAAGCTAATGAAAAAGCTTGTGTCTACATTTTAGCCAGTATATCAGATGTATTGTCTAAA
AAGCACGAAAGATTAGCTATTGCAAGAGAGATCATGGACTCTCTACAAGCCCTGTTTGGACAACCATCAACATCTATCATGCATGATGCGATTAAGTATGTTTAC
AACTGCAAAATGAAGGAAGGATCTTCTGTAAGGGAGCATGTTTTAAACATGATGGTTCACTTCAATATTGTAGAAGTGAACGGCCCAATCATGAACGAAATAAGT
CAGGTTCTTGAGGCTGAGGTAAATGTTGCTACTACCTCAAAGAAGAAATTTCACAAGGGATCTTCCTTTGGGAGTAAATCTGGACCTTCTTATCAGAAGAAAGGA
ATTCAAAAGAAGAAGAAGGACAATGGGAAGGGGAAGGCTCCGACTGCGACAAAAGGCAAGGAAAAGATTGCAAAAGGAAAATGTTTCCATTGCAATGAAAATGGG
CACTGGAAAAGAAATTGCCCCAAATACCTCGCCGAGAAAAGAGCTGAGAAGGAAAAGCAAGGTAAATTCGATTTACTAGTCATTGAAACATGTATACTGGAGAAT
GATGATTCTACCTATATACTAGATTCAGGAGCCACTAATCATGTTTGTTCTTCTTTTCAGGAAACTAGTTTCTGGAGACAGCTTGAAGAAGATGAGTTCACTCTC
TGGTTGGATCAGGGGAGCTCATCTCAGCAAAAGCAATGGGAGGAATTAAGTTGA
Protein sequence
MAILQSIQGMVKMMREDRHERRAQQQREERALQEDEGHQAGSSRAKSVKNNNTSNTHRISIMSHVRAIRVSRTPLSSSSNDIYTQKSSTWCSTTDSTLMEKLALK
VGTCKFEVEREKAIRWYVETTHVLIPSVPNRDAPSLERVRIQTQATNGSSTGCNLGSVVLSFNPRLSKLISAAQYASHFRDKTGYITGNIELARRNSLLPDLGIV
ERSSRLPFGGGYSDSLKYRCKTMSTSIIQLLVSDKLNGDNYEIWKSNLNTILVIDDLRFVLTEECPPAPAPNANRTVRDAYDRWVKANEKACVYILASISDVLSK
KHERLAIAREIMDSLQALFGQPSTSIMHDAIKYVYNCKMKEGSSVREHVLNMMVHFNIVEVNGPIMNEISQVLEAEVNVATTSKKKFHKGSSFGSKSGPSYQKKG
IQKKKKDNGKGKAPTATKGKEKIAKGKCFHCNENGHWKRNCPKYLAEKRAEKEKQGKFDLLVIETCILENDDSTYILDSGATNHVCSSFQETSFWRQLEEDEFTL
WLDQGSSSQQKQWEELS