CuGenDBv2

Moc04g24290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g24290
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr4:17506919..17510134
RNA-Seq Expression: Moc04g24290
Synteny: Moc04g24290
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
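
The genome location in this record is given as a `chrom:start..end` string. The following is a minimal Python sketch for splitting such a string and computing the span it covers. The function names `parse_location` and `span_length` are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption about the database's convention, not something this page states.

```python
import re
from typing import Tuple

def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive on both ends.
    return end - start + 1

chrom, start, end = parse_location("chr4:17506919..17510134")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span printed for this gene is 3216 bp.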


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.3e-98, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.3e-98, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.3e-98, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.3e-98, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e-value 1.3e-98, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e-value 6.1e-99, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

A0A5A7TU93 Gag/pol protein (e-value 6.1e-99, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

A0A5A7TWB9 Gag/pol protein (e-value 6.1e-99, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

A0A5D3CPJ6 Gag/pol protein (e-value 6.1e-99, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

A0A5D3CSZ6 Gag/pol protein (e-value 6.1e-99, identity 72.26%)
Query:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL
        RFVL E+CPQ PA NAT  VR  Y+RW     KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+
Subjt:  RFVLQEDCPQAPAPNATVAVRNAYDRWV----KAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNL

Query:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK
        M+HFNVAE NGAVIDE SQVSFILESLP+SFL F SNAVMNK+ YTLTTLLNELQ ++SLMK KGQ+GEANVATS ++F+RGS+SGT+S PSSSG+K +K
Subjt:  MIHFNVAESNGAVIDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATS-KRFNRGSSSGTRSAPSSSGSKTFK

Query:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
        KKK  G+G+K N A A    K K A KG CFHCN +GHWKRNC KYLAEKKKA +GKYDLLVLETCLVENDDSA
Subjt:  KKKAAGKGSKPNSAVAAQKGKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGAAAAAGAAGGTGTGCTACCACAAGCAGAATCCCGAGACCCAAGAGGATAGCAAGGAAGACGTAATGGTGGTGTTCGTGGGAAACCGTAGAAGAGAAGTTCTT
CAAAGTCGTTCGTGGCATCGGATTGGGCAAAAGTTGCAGAAAACAGAGAAGAAGACGAAGCAGACTGCGCAAACAGCCCCATGGCTCTGCGGGACAGCACATAGC
GCCACGGCACTGCACTGTAGCGTCGCGGCGCTGCCCTTAGGCACCGAGGTGCTTTCCCGGGTGCTTTCAACGCGGTTCCGAGGCTCCGGTTCGCGGTTCGTCTTA
CAAGAGGATTGTCCACAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGGTCAAGGCCAAGGTCTACATCTTGGCGAGCATATCT
GATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCTTCACAGGCTCGACATGAAGCC
CTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGATCCACTTCAACGTGGCTGAGTCAAACGGGGCCGTC
ATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCTGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACC
ACGCTCCTAAACGAGCTGCAGATCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCATCC
TCTGGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAAGCTGCTGGTAAGGGGTCTAAACCTAACTCAGCTGTTGCTGCCCAGAAA
GGCAAGGTCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCACAAAGTACTTGGCCGAAAAGAAGAAAGCCAAC
GAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGA
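
As a sanity check on the CDS above, the sketch below translates a nucleotide string with the standard genetic code and stops at the first stop codon. It is an illustration only: the names `CODON_TABLE` and `translate` are hypothetical, the use of the standard (not organellar) code is an assumption, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGAAAAAGAAGGTGTGCTACCACAAGCAG"))  # prints MKKKVCYHKQ
```

The output agrees with the first ten residues of the protein sequence listed on this page. Passing the full CDS (with its line breaks) should work the same way, since whitespace is removed before translation.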
mRNA sequence
ATGAAAAAGAAGGTGTGCTACCACAAGCAGAATCCCGAGACCCAAGAGGATAGCAAGGAAGACGTAATGGTGGTGTTCGTGGGAAACCGTAGAAGAGAAGTTCTT
CAAAGTCGTTCGTGGCATCGGATTGGGCAAAAGTTGCAGAAAACAGAGAAGAAGACGAAGCAGACTGCGCAAACAGCCCCATGGCTCTGCGGGACAGCACATAGC
GCCACGGCACTGCACTGTAGCGTCGCGGCGCTGCCCTTAGGCACCGAGGTGCTTTCCCGGGTGCTTTCAACGCGGTTCCGAGGCTCCGGTTCGCGGTTCGTCTTA
CAAGAGGATTGTCCACAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTATGACAGGTGGGTCAAGGCCAAGGTCTACATCTTGGCGAGCATATCT
GATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCTTCACAGGCTCGACATGAAGCC
CTTAAGTTCATTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGAGAACACGTTCTCAACCTGATGATCCACTTCAACGTGGCTGAGTCAAACGGGGCCGTC
ATAGACGAGCAGAGTCAGGTCAGCTTTATTCTGGAATCTCTTCCGAAGAGTTTCCTTCCATTCTGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACC
ACGCTCCTAAACGAGCTGCAGATCTACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCATCC
TCTGGAACCAGGTCTGCGCCCTCTTCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAAGCTGCTGGTAAGGGGTCTAAACCTAACTCAGCTGTTGCTGCCCAGAAA
GGCAAGGTCAAGGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATATGGACGGGCATTGGAAGCGCAACTGCACAAAGTACTTGGCCGAAAAGAAGAAAGCCAAC
GAAGGTAAATATGATTTACTTGTATTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGA
Protein sequence
MKKKVCYHKQNPETQEDSKEDVMVVFVGNRRREVLQSRSWHRIGQKLQKTEKKTKQTAQTAPWLCGTAHSATALHCSVAALPLGTEVLSRVLSTRFRGSGSRFVL
QEDCPQAPAPNATVAVRNAYDRWVKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMIHFNVAESNGAV
IDEQSQVSFILESLPKSFLPFCSNAVMNKLEYTLTTLLNELQIYQSLMKCKGQEGEANVATSKRFNRGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPNSAVAAQK
GKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA
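
The record annotates this protein with a CCHC-type zinc finger (IPR001878). A rough way to look for such a motif is a regular expression for the commonly cited zinc knuckle consensus C-X2-C-X4-H-X4-C. The sketch below applies it to the last line of the protein sequence above. The pattern is a simplification, and whether this exact region is the one behind the InterPro annotation is an assumption; the name `find_cchc` is hypothetical.

```python
import re
from typing import List, Tuple

# Simplified CCHC zinc knuckle consensus: C-X2-C-X4-H-X4-C.
CCHC_PATTERN = re.compile(r"C.{2}C.{4}H.{4}C")

def find_cchc(protein: str) -> List[Tuple[int, str]]:
    """Return (0-based start, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in CCHC_PATTERN.finditer(protein)]

# C-terminal segment of the protein (last line of the sequence above).
c_terminal = "GKVKVAEKGKCFHCNMDGHWKRNCTKYLAEKKKANEGKYDLLVLETCLVENDDSA"
print(find_cchc(c_terminal))  # prints [(10, 'CFHCNMDGHWKRNC')]
```

A regex hit of this kind is only suggestive; domain databases such as InterPro rely on profile-based models rather than a fixed pattern.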