CuGenDBv2

Moc04g12060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g12060
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr4:9079762..9095492
RNA-Seq Expression: Moc04g12060
Synteny: Moc04g12060
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
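
The Genome location field uses a `seqid:start..end` notation. The following is a minimal sketch of how such a string could be parsed and the locus span computed. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; `Region` and `parse_region` are illustrative names, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_region(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as 'chr4:9079762..9095492'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Region(seqid, start, end)


region = parse_region("chr4:9079762..9095492")
print(region.seqid, region.length)  # chr4 15731
```

Under that assumption the locus spans 15,731 bp on chr4, considerably longer than the CDS listed for it further down the record.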


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]  e value: 2.3e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]  e value: 2.3e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]  e value: 2.3e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]  e value: 2.3e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]  e value: 2.3e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein  e value: 1.1e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

A0A5A7TWB9 Gag/pol protein  e value: 1.1e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

A0A5A7UGV2 Gag/pol protein  e value: 1.1e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

A0A5D3CPJ6 Gag/pol protein  e value: 1.1e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

A0A5D3CSZ6 Gag/pol protein  e value: 1.1e-91  % identity: 56.53
Query:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS
        M+++ + +LAA +LNG NY  WK+ +NT+L+IDDLRFVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +T +EIMDSLQ 
Subjt:  MSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQS

Query:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG
        MFGQ S Q +H+ALK+IYN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILESL +SFL FRSNAVMNK+ YTLTTLLNELQT++SLMK KG
Subjt:  MFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKG

Query:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------
        Q+GEANVATS ++F++GS+SGT+S PSSSG+K +KKKK  G+G+K + AAA                              A+K K K            
Subjt:  QEGEANVATS-KRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAA------------------------------AQKGKVK------------

Query:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV
                                GISSWRQL+ GEMT++V TG VVSA+AV
Subjt:  ------------------------GISSWRQLDAGEMTLKVRTGEVVSAVAV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCATCTCACACAATTTCACAAGCCTCCTACTCGGGCTTTACCCTTAGGATTAGATCGGAGACTGAGTAGTGTGACCTTCGTGGAATTTCGATTCCCGTGCAGGTGCAT
GAAATTCTCCACTCAAGTCGACCATACTGCGTACATAAAACAACATAACATGGAAGAATTTAATGTTGAGTTAAAGAAAATTAGAATTGAAATCCATGAACGGGTCATGG
AAGAATTGAATACTGAGCTAAAGAAAATAAGAGAAGAGATTCAAGAACAGGTCATGGAGGTGCAGAATGCAGAGCTAAAGAAGACGAGACAAGACATCGATAAGATTCAG
GGGCAGGTCACGAAAATTTTCAATGTCTTAAAGAATATTGTTACAGTTAAGGACATTTCTCAACCTAACAATTTAGTCGAACCTCATAATAATTCATCACCTAGGATGAA
CCCACCTCTTTGTCAATCAATAGATGGCTTCCCCCAACACAGTTCGTCACATGACGCTCAGCTTTCTAAGAAAAGCCAACAAGCTCCTACAATTTGTCAGGTTAAATTCG
CATTCCAAGATGGACAAACATCAAAGACCAATTTAAAAGGTAAAGAAAAAGAAAATGTTACCAATGAAAATGTTGAAGGAATTGCAGGGAAAGAATATGCTCAAAGATGG
GGAGAAGCAACAGAACAACTGCCAGCGCCTTTAATTAATAAAGAGTCACATCTTGAAACGAATAGAAGCTCTTTTAAAAGAAAAAATAAAGTTCAGGCAGTAATCTTGAA
GCGCAAGTGGAGGCACCAACAAGGATCTACGTCGGAGACAACTAAAGCGAGAGAGCTTTCATCCAATCCCAATATCCTATACCGAGTTTTAGCCCAATTATTTCAAGGTA
ATCCATTAGCTCCCGTACCCGTAGAGCCATTACAACCACCTTATCCAAAGTGGTATGACCCGCATGTTCGTTGTGATTACCATGCAGGAGCTGTAGTCCGGGTCACTCGA
AAGTCGTTCGTGGCGTCGGATTGGGCGAAAATTGCAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGACAGCGCCATGGCGCTGCGAGACAGCACACAGCGCCGTAA
CGCTGTCCTTAGGCGCCGAGGCGCTGTCCCGGGTGTTTTCGACGCGGTTTCGAGGCTCCAGTTCGCGGTTCGAGGGCAGATGCACATGTCTACTTCTATTATTGCACTCT
TAGCCGCTCAAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTACAAGAGGATTGTCCTCAA
GCTCCTGCGCCTAACGCCACTATGGCGGTGCGCAACGCCTATGACAGGTGGGTTAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGC
CAAGAAGCACGAGGACACGGTCACCGTTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACA
ACTCCCGCATGAAAGAGGGCTCTTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTC
AGCTTTATTCTGGAATCTCTTCTGAAAAGTTTTCTTCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTTCAGACCTA
CCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAAAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTCTG
GAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAAGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGGAATTAGTTCCTGGAGGCAGCTT
GACGCCGGAGAGATGACTCTCAAGGTCAGAACGGGAGAGGTCGTCTCAGCTGTGGCGGTCTCCTGCTCAACACGTTTCAGCTCACCTCAAGGTCCTCCTTCCGTCCGAGC
CAAGCTCGCAAGCTTGGTCATTTTTGTGATTAAACATCTTATTCGGTTAAATCGAACTGGATCCAAGTCGAGGTCGAAGCCTGTTGTTAGAGTCAGGTCCATTGCATTTT
TATCATTCAGTTAA
mRNA sequence
ATGCATCTCACACAATTTCACAAGCCTCCTACTCGGGCTTTACCCTTAGGATTAGATCGGAGACTGAGTAGTGTGACCTTCGTGGAATTTCGATTCCCGTGCAGGTGCAT
GAAATTCTCCACTCAAGTCGACCATACTGCGTACATAAAACAACATAACATGGAAGAATTTAATGTTGAGTTAAAGAAAATTAGAATTGAAATCCATGAACGGGTCATGG
AAGAATTGAATACTGAGCTAAAGAAAATAAGAGAAGAGATTCAAGAACAGGTCATGGAGGTGCAGAATGCAGAGCTAAAGAAGACGAGACAAGACATCGATAAGATTCAG
GGGCAGGTCACGAAAATTTTCAATGTCTTAAAGAATATTGTTACAGTTAAGGACATTTCTCAACCTAACAATTTAGTCGAACCTCATAATAATTCATCACCTAGGATGAA
CCCACCTCTTTGTCAATCAATAGATGGCTTCCCCCAACACAGTTCGTCACATGACGCTCAGCTTTCTAAGAAAAGCCAACAAGCTCCTACAATTTGTCAGGTTAAATTCG
CATTCCAAGATGGACAAACATCAAAGACCAATTTAAAAGGTAAAGAAAAAGAAAATGTTACCAATGAAAATGTTGAAGGAATTGCAGGGAAAGAATATGCTCAAAGATGG
GGAGAAGCAACAGAACAACTGCCAGCGCCTTTAATTAATAAAGAGTCACATCTTGAAACGAATAGAAGCTCTTTTAAAAGAAAAAATAAAGTTCAGGCAGTAATCTTGAA
GCGCAAGTGGAGGCACCAACAAGGATCTACGTCGGAGACAACTAAAGCGAGAGAGCTTTCATCCAATCCCAATATCCTATACCGAGTTTTAGCCCAATTATTTCAAGGTA
ATCCATTAGCTCCCGTACCCGTAGAGCCATTACAACCACCTTATCCAAAGTGGTATGACCCGCATGTTCGTTGTGATTACCATGCAGGAGCTGTAGTCCGGGTCACTCGA
AAGTCGTTCGTGGCGTCGGATTGGGCGAAAATTGCAGAAAACAGCGAAGAAGACGAAGCAGACTGCGCAGACAGCGCCATGGCGCTGCGAGACAGCACACAGCGCCGTAA
CGCTGTCCTTAGGCGCCGAGGCGCTGTCCCGGGTGTTTTCGACGCGGTTTCGAGGCTCCAGTTCGCGGTTCGAGGGCAGATGCACATGTCTACTTCTATTATTGCACTCT
TAGCCGCTCAAAGACTTAATGGCGAAAATTACAAACAATGGAAGTCAAACCTAAACACTATTCTCGTGATAGATGATCTTAGGTTCGTCTTACAAGAGGATTGTCCTCAA
GCTCCTGCGCCTAACGCCACTATGGCGGTGCGCAACGCCTATGACAGGTGGGTTAAGGCCAATGACAAGGCCAAGGTCTACATCTTGGCGAGCATATCTGATGTGCTTGC
CAAGAAGCACGAGGACACGGTCACCGTTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCATTTACA
ACTCCCGCATGAAAGAGGGCTCTTCAGTGCGAGAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCTGAGTCGAACGGGGCCGTCATAGACGAGCAGAGTCAGGTC
AGCTTTATTCTGGAATCTCTTCTGAAAAGTTTTCTTCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACCACGCTCCTAAACGAGCTTCAGACCTA
CCAGTCTCTTATGAAATGTAAGGGACAAGAAGGGGAGGCAAATGTTGCCACCTCAAAGAGGTTCAACAAAGGATCGTCCTCTGGAACCAGGTCTGCGCCCTCTTCTTCTG
GAAGTAAGACTTTTAAGAAGAAGAAGGCTGCTGGTAAAGGGTCTAAACCTGACTCAGCTGCTGCTGCCCAGAAAGGCAAGGTCAAGGGAATTAGTTCCTGGAGGCAGCTT
GACGCCGGAGAGATGACTCTCAAGGTCAGAACGGGAGAGGTCGTCTCAGCTGTGGCGGTCTCCTGCTCAACACGTTTCAGCTCACCTCAAGGTCCTCCTTCCGTCCGAGC
CAAGCTCGCAAGCTTGGTCATTTTTGTGATTAAACATCTTATTCGGTTAAATCGAACTGGATCCAAGTCGAGGTCGAAGCCTGTTGTTAGAGTCAGGTCCATTGCATTTT
TATCATTCAGTTAA
Protein sequence
MHLTQFHKPPTRALPLGLDRRLSSVTFVEFRFPCRCMKFSTQVDHTAYIKQHNMEEFNVELKKIRIEIHERVMEELNTELKKIREEIQEQVMEVQNAELKKTRQDIDKIQ
GQVTKIFNVLKNIVTVKDISQPNNLVEPHNNSSPRMNPPLCQSIDGFPQHSSSHDAQLSKKSQQAPTICQVKFAFQDGQTSKTNLKGKEKENVTNENVEGIAGKEYAQRW
GEATEQLPAPLINKESHLETNRSSFKRKNKVQAVILKRKWRHQQGSTSETTKARELSSNPNILYRVLAQLFQGNPLAPVPVEPLQPPYPKWYDPHVRCDYHAGAVVRVTR
KSFVASDWAKIAENSEEDEADCADSAMALRDSTQRRNAVLRRRGAVPGVFDAVSRLQFAVRGQMHMSTSIIALLAAQRLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQ
APAPNATMAVRNAYDRWVKANDKAKVYILASISDVLAKKHEDTVTVKEIMDSLQSMFGQPSSQARHEALKFIYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQV
SFILESLLKSFLPFRSNAVMNKLEYTLTTLLNELQTYQSLMKCKGQEGEANVATSKRFNKGSSSGTRSAPSSSGSKTFKKKKAAGKGSKPDSAAAAQKGKVKGISSWRQL
DAGEMTLKVRTGEVVSAVAVSCSTRFSSPQGPPSVRAKLASLVIFVIKHLIRLNRTGSKSRSKPVVRVRSIAFLSFS
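
The CDS and protein sequences listed above are related by codon translation, which can be checked with a short script. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base; `CODON_TABLE` and `translate` are illustrative names and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping before the first stop codon.

    Codons containing characters other than A, C, G, T become 'X';
    a trailing partial codon is ignored.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 9 bases of the CDS shown in this record.
print(translate("ATGCATCTCACACAATTTCAC"))  # MHLTQFH
print(translate("TTCAGTTAA"))              # FS
```

Pasting the full CDS into `translate` and comparing the result with the Protein sequence block is a quick way to confirm that the two fields of the record are consistent.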