; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g21960 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g21960
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr4:15976911..15978492
RNA-Seq ExpressionMoc04g21960
SyntenyMoc04g21960
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.7e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]8.0e-10363.32Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K +   AAAK  K     KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.3e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

A0A5A7TU93 Gag/pol protein2.3e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

A0A5A7TWB9 Gag/pol protein2.3e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

A0A5A7UGV2 Gag/pol protein2.3e-10363.95Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K + AAA   K KAK   KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

A0A5D3CPJ6 Gag/pol protein3.9e-10363.32Show/hide
Query:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR
        D  NG+ NY  WK+ +NT+++IDDLR VL E+C Q P  N T TV   Y+RW     KA+ YILA +S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +
Subjt:  DPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNTTVTVHNVYDRWI----KAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQAR

Query:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---
        H+ALK++YN+RM EG+SV+EHVLN+MVHFNVAE NG VIDE SQVSFILESLP+SFL FRSNA+MNK+ YT TTLLNELQT++SLMK KGQ+GEAN    
Subjt:  HEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILESLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANT---

Query:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW
                               +KKK  G+G+K +   AAAK  K     KG CFHCN   HWKRNCPKYLAEKKKA +GKYDLLVLETCLVENDDSAW
Subjt:  -----------------------FKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEGKYDLLVLETCLVENDDSAW

Query:  ILDSGATNHICSSFQGISS
        I+DSGATNH+CSSFQGISS
Subjt:  ILDSGATNHICSSFQGISS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGTGTAAACAAGTCTGGTATTGTTAAACTATCTAAAGATAGAGATGTGATAAGCGTTCTAACGGTTCTGTCGAATGAGTTAAGGAGTGATTGTTGCATC
GTTGTCGATATTGCTTCTATAGATACCAAAATTGACTGGAGGAATGGGACTGCGACACCAGTGCATACATTTGTAGATACCGATATTGACCCTAGAAATGGAAGC
GAGAATTACAAACAATGGAAATCAAATCTAAACACTATTATCGTGATAGATGATCTTAGGGTCGTCTTGCAAGAGGATTGTCTTCAAGCTCCTGAGCCTAACACC
ACTGTGACGGTGCACAACGTCTATGACAGGTGGATCAAGGCCAAGGTCTACATCTTGGCGGGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACC
GCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGT
TCCTCAGTGCAAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGACCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTTGAA
TCTCTTCCGAAGAGTTTCCTTCCCTTCCGTAGCAATGCGATTATGAATAAGCTGGAGTACACTTTTACCACGCTCTTAAATGAGCTGCAGACCTACCAGTCTCTT
ATGAAGAGTAAGGGACAAGAAGGGGAGGCAAATACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAG
GCCAAGGTTACAGAGAAAGGAAAGTGTTTCCACTGCAATATGAACGAGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAACGAAGGT
AAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTCTGCCTGGATACTGGATTCAGGAGCCACTAATCACATTTGTTCTTCATTTCAGGGA
ATTAGTTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGTGTAAACAAGTCTGGTATTGTTAAACTATCTAAAGATAGAGATGTGATAAGCGTTCTAACGGTTCTGTCGAATGAGTTAAGGAGTGATTGTTGCATC
GTTGTCGATATTGCTTCTATAGATACCAAAATTGACTGGAGGAATGGGACTGCGACACCAGTGCATACATTTGTAGATACCGATATTGACCCTAGAAATGGAAGC
GAGAATTACAAACAATGGAAATCAAATCTAAACACTATTATCGTGATAGATGATCTTAGGGTCGTCTTGCAAGAGGATTGTCTTCAAGCTCCTGAGCCTAACACC
ACTGTGACGGTGCACAACGTCTATGACAGGTGGATCAAGGCCAAGGTCTACATCTTGGCGGGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACC
GCTAAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGT
TCCTCAGTGCAAGAACACGTTCTCAACCTGATGGTCCACTTCAACGTGGCTGAGTCGAACGGGACCGTCATAGACGAGCAGAGTCAGGTCAGCTTCATTCTTGAA
TCTCTTCCGAAGAGTTTCCTTCCCTTCCGTAGCAATGCGATTATGAATAAGCTGGAGTACACTTTTACCACGCTCTTAAATGAGCTGCAGACCTACCAGTCTCTT
ATGAAGAGTAAGGGACAAGAAGGGGAGGCAAATACTTTTAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGACTCCGCTGCTGCCGCTGCCAAGAAAGGCAAG
GCCAAGGTTACAGAGAAAGGAAAGTGTTTCCACTGCAATATGAACGAGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAACGAAGGT
AAATATGATTTACTTGTTTTGGAAACATGTTTAGTGGAGAACGATGACTCTGCCTGGATACTGGATTCAGGAGCCACTAATCACATTTGTTCTTCATTTCAGGGA
ATTAGTTCCTGA
Protein sequenceShow/hide protein sequence
MNSVNKSGIVKLSKDRDVISVLTVLSNELRSDCCIVVDIASIDTKIDWRNGTATPVHTFVDTDIDPRNGSENYKQWKSNLNTIIVIDDLRVVLQEDCLQAPEPNT
TVTVHNVYDRWIKAKVYILAGISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHEALKFVYNSRMKEGSSVQEHVLNLMVHFNVAESNGTVIDEQSQVSFILE
SLPKSFLPFRSNAIMNKLEYTFTTLLNELQTYQSLMKSKGQEGEANTFKKKAAGKGSKPDSAAAAAKKGKAKVTEKGKCFHCNMNEHWKRNCPKYLAEKKKANEG
KYDLLVLETCLVENDDSAWILDSGATNHICSSFQGISS