CuGenDBv2

Moc01g10540 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10540
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr1:6516494..6520326
RNA-Seq Expression: Moc01g10540
Synteny: Moc01g10540
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
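
The Genome location field packs a chromosome name and a coordinate range into one string. The following is a minimal sketch of how it might be split and the gene span computed. The function name `parse_location` is hypothetical, and the span assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into its parts.

    Returns (chrom, start, end, span). The span is end - start + 1,
    which ASSUMES 1-based inclusive coordinates.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("chr1:6516494..6520326"))
# ('chr1', 6516494, 6520326, 3833)
```

Under that assumption the gene spans 3833 bp on chr1.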


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 7.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 1.6e-89; % identity: 73.97
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSW+QL+ GEMT++VGT  VV+A+AVG L+L+  K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.0e-89; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 7.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 7.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.0e-89; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 7.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.0e-89; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 9.9e-90; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

A0A5A7SMH8 Gag/pol protein; e value: 3.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

A0A5A7TU93 Gag/pol protein; e value: 7.6e-90; % identity: 73.97
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSW+QL+ GEMT++VGT  VV+A+AVG L+L+  K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

A0A5A7TWB9 Gag/pol protein; e value: 3.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

A0A5A7V4M1 Gag/pol protein; e value: 3.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

A0A5A7V4M1 Gag/pol protein; e value: 9.9e-90; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

A0A5D3CPJ6 Gag/pol protein; e value: 3.7e-04; % identity: 82.76
Query:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV
        HG+ LSKEQCPKTPQ+VE+MR IPYASAV
Subjt:  HGIHLSKEQCPKTPQEVEDMRRIPYASAV

A0A5D3CPJ6 Gag/pol protein; e value: 9.9e-90; % identity: 74.38
Query:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG
        M +G+SVREHVLN+MVHFNVAE NGAVID+ SQVSFI ESLP+SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KGQ+GEANVATS ++F+RGS SG
Subjt:  MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATS-KRFNRGSFSG

Query:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC
        T+S PSSSG+K +KKKK  G+G+K + AAA    K K A KG CFH N + HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAWI+DSGATNHVC
Subjt:  TRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVC

Query:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN
        SSFQGISSWRQL+ GEMT++VGT  VV+A+AVG L+L   K+
Subjt:  SSFQGISSWRQLDAGEMTLKVGTEEVVTAVAVGELKLFTNKN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found
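
The % identity figures listed with the hits can be cross-checked against the match line printed between each Query and Subjt row. In that line a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes that convention and that identity is computed over the full alignment length. The function name `percent_identity` is hypothetical and not part of any database tooling.

```python
def percent_identity(query, midline):
    """Percent identity from a BLAST-style match line.

    ASSUMES letters in the match line mark identical residues and that
    the denominator is the alignment length (gap columns included).
    """
    if len(query) != len(midline):
        raise ValueError("query row and match line must have equal length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# The short 29-column alignment reported for several hits in this record.
query = "HGIHLSKEQCPKTPQEVEDMRRIPYASAV"
match = "HG+ LSKEQCPKTPQ+VE+MR IPYASAV"
print(percent_identity(query, match))  # 82.76
```

Here 24 of the 29 columns are identical, which reproduces the 82.76 reported for the short alignments.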

Sequences
CDS sequence
ATGAAGAAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCCGTCATAGACAAGCAGAGTCAGGTTAGC
TTCATTACGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTTCTTAACGAGCTGCAGACC
TACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCGTTCTCTGGAACCAGGTCTGCGCCCTCT
TCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGTTGCAGAGAAA
GGAAAGTGTTTCCACTGGAACATGGATCGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAGAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTA
TTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAG
CTTGACGCAGGAGAGATGACTCTCAAGGTCGGAACGGAAGAGGTCGTCACAGCTGTGGCGGTAGGGGAGCTCAAGTTGTTTACAAACAAGAATATGCATGGAATT
CATCTGTCTAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATACCTTATGCTTCAGCTGTAACGACCCAAAATTTCCCACGCACTTCT
AGCTATCTCCTTCGAGTAGAGTCGGCCCCTGACCATTTATACCCAGAGCCCCGACTCTGCTGCATCGGCCCTCGACCATTTATACTCGAGGCCCCGATGACTTCA
CCATTTATTGGCCTTTGA
mRNA sequence
ATGAAGAAGGGTTCATCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGGCCGTCATAGACAAGCAGAGTCAGGTTAGC
TTCATTACGGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTTCTTAACGAGCTGCAGACC
TACCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAGAGGATCGTTCTCTGGAACCAGGTCTGCGCCCTCT
TCTTCTGGAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGTTGCAGAGAAA
GGAAAGTGTTTCCACTGGAACATGGATCGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAGAAGAAGAAAGCCAACGAAGGTAAATATGATTTACTTGTA
TTGGAAACATGTTTAGTGGAGAATGATGACTCCGCCTGGATACTGGATTCAGGAGCCACTAATCACGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGAGGCAG
CTTGACGCAGGAGAGATGACTCTCAAGGTCGGAACGGAAGAGGTCGTCACAGCTGTGGCGGTAGGGGAGCTCAAGTTGTTTACAAACAAGAATATGCATGGAATT
CATCTGTCTAAGGAACAATGTCCTAAGACACCTCAAGAAGTTGAGGATATGAGACGTATACCTTATGCTTCAGCTGTAACGACCCAAAATTTCCCACGCACTTCT
AGCTATCTCCTTCGAGTAGAGTCGGCCCCTGACCATTTATACCCAGAGCCCCGACTCTGCTGCATCGGCCCTCGACCATTTATACTCGAGGCCCCGATGACTTCA
CCATTTATTGGCCTTTGA
Protein sequence
MKKGSSVREHVLNLMVHFNVAESNGAVIDKQSQVSFITESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNRGSFSGTRSAPS
SSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKVAEKGKCFHWNMDRHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAWILDSGATNHVCSSFQGISSWRQ
LDAGEMTLKVGTEEVVTAVAVGELKLFTNKNMHGIHLSKEQCPKTPQEVEDMRRIPYASAVTTQNFPRTSSYLLRVESAPDHLYPEPRLCCIGPRPFILEAPMTS
PFIGL
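
The protein sequence is what translating the CDS should give. The following is a minimal sketch for checking that, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First ten codons and the final line of the CDS listed in this record.
print(translate("ATGAAGAAGGGTTCATCAGTGCGAGAACAC"))  # MKKGSSVREH
print(translate("CCATTTATTGGCCTTTGA"))              # PFIGL
```

Both fragments agree with the start and end of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` allows the same comparison over the whole sequence.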