CuGenDBv2

Moc08g26250 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g26250
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Gag/pol protein
Genome location: chr8:18812045..18815243
RNA-Seq Expression: Moc08g26250
Synteny: Moc08g26250
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily
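
The genome location in this record follows a chromosome:start..end pattern. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. The function name parse_location is hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which this record does not state.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, length)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
    return chrom, start, end, end - start + 1


print(parse_location("chr8:18812045..18815243"))
# ('chr8', 18812045, 18815243, 3199)
```

If the coordinates were 0-based or half-open instead, the length would differ by one.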


Homology
GenBank top hits (columns: hit, e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 3.7e-92; % identity: 59.94
Query:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE
        KLNG NY  WK+ +NT+L+IDDL+FVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+
Subjt:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE

Query:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---
        ALK++YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILES P SFL F  NAVMNK+ YTLTTLLN+LQT++SLMK KGQ+GEAN      
Subjt:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---

Query:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------
                              KKK  G+G+K  + +AAAK  K   A KG CFH N +GHWKRNCPKYLAEKKKA +                      
Subjt:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------

Query:  --GATNHVCSSFQGISS
          GATNHVCSSFQGISS
Subjt:  --GATNHVCSSFQGISS

KAA0046182.1 zf-CCHC domain-containing protein/UBN2 domain-containing protein [Cucumis melo var. makuwa]; e value: 1.4e-99; % identity: 61.3
Query:  KKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYKNFIKRKKQFKTH
        +K+L  LPK W++KVT+IQEAKDL  + ++E+IGSLMTHEI +++++EDE K K+KSIAL  I+LE+  + E+ LDEDD+ Y SRKYKNFIKRKK FK +
Subjt:  KKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYKNFIKRKKQFKTH

Query:  FSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFEN
         S QK S  E SKKDEVICYECKK  HIRTDCP LKSSKKSK+KAMKATWDDS E  SESE EE AN   M  SDKEDE DDEV L+P S +ELFE FEN
Subjt:  FSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFEN

Query:  MQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK---KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC------------------------
        +QNDLEKL SKY +LKKKYN L SENKSLLD IAC K+   K+ + +NVS DKH+ DC+EK+ALLDK+RFLEHD C                        
Subjt:  MQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK---KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC------------------------

Query:  ---------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK
                 AQRLDKIIEVGK Y DKR  GY+DE ST  SS+T FVK SP +PK
Subjt:  ---------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK

XP_022156978.1 uncharacterized protein LOC111023806 [Momordica charantia]; e value: 2.9e-113; % identity: 66.85
Query:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK
        GL KEYSNL KVKKLLWSLPK+WE KVT IQEAKDLKT+SMDE+IGSLMTHEIKIKKNMEDEKK K+KSIALKAITLEV  +GEN LDEDDVAYLSRK  
Subjt:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK

Query:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
                                                 TDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
Subjt:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP

Query:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNAL--LDKIR-FLEHDGC-AQRLDKIIEVG
        LSYDELFEAFENMQN+LEKLGSKY MLK K N   SENKSL DDIACLKK ++DV N+     +L  +E +AL  LDK + F++     AQRLDKIIE G
Subjt:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNAL--LDKIR-FLEHDGC-AQRLDKIIEVG

Query:  KPYSDKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPVKKLNGKNYKQWKSNLNTILVIDDLKFV
        KPY DKRG GY++EC+TPSSS+TIFVK SPNMPKLVAPK   K + K+     S  +  +  D  KFV
Subjt:  KPYSDKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPVKKLNGKNYKQWKSNLNTILVIDDLKFV

XP_022158792.1 uncharacterized protein LOC111025259 [Momordica charantia]; e value: 7.0e-99; % identity: 59.73
Query:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK
        GLGKEYSNL KVKKLLWSLPKQWE KVT+IQEAKDLKT+SMDE+I                                     GENALDEDDVAYLSRKYK
Subjt:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK

Query:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
        NFIKRKKQFK +FSN K+  +E SKKDEVICYECKKPGHIRTDCP LKSSKKSKKKAMKATWDDSDESG+ESENEEVANFCFMAHSDKEDE+DDE+ LDP
Subjt:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP

Query:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGCAQRLDKIIEVGKPYS
        LSYDELFEAFENMQNDLEKL                                            ++ D+    + K+        AQRLDKIIE+GKPY 
Subjt:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGCAQRLDKIIEVGKPYS

Query:  DKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPV-KKLNGKNYKQWKSNLNTILVIDDLKFV
        DKRG GY+DECSTPSSS+ IFVK SPNMPKLVAPK V  K + K      S  +  +  D  KFV
Subjt:  DKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPV-KKLNGKNYKQWKSNLNTILVIDDLKFV

XP_031741720.1 uncharacterized protein LOC116403915 [Cucumis sativus]; e value: 1.1e-112; % identity: 64.31
Query:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK
        GLGK Y+    V+K+L SLPK WE+KVT+IQEAKDL  + ++E+IGSLMTHEI +K+++EDE K K+KSIALK I+LEV  + E+ LDEDD+AY SRKYK
Subjt:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK

Query:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
        NFIKRKK FK H S QK+S  E SKKDEVICYECK+ GHIRTDCPLLKSSKKSKKKAMKATWDDS E  SESE EE+AN   MAHSDK+DE DD+V L+P
Subjt:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP

Query:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK----KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC-----------
        LS DELFE FE+MQNDLEKL SKY +LKKKYN L+SENKSLLD IAC K+    ++ + +NVS DKHV  C EK+ALLDK+RFLEHD C           
Subjt:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK----KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC-----------

Query:  ----------------------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK
                              AQRLDKIIEVGK Y DKRG GY+DE STPSSS+T FVK SP +PK
Subjt:  ----------------------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK

TrEMBL top hits (columns: hit, e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value: 1.8e-92; % identity: 59.94
Query:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE
        KLNG NY  WK+ +NT+L+IDDL+FVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+
Subjt:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE

Query:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---
        ALK++YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILES P SFL F  NAVMNK+ YTLTTLLN+LQT++SLMK KGQ+GEAN      
Subjt:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---

Query:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------
                              KKK  G+G+K  + +AAAK  K   A KG CFH N +GHWKRNCPKYLAEKKKA +                      
Subjt:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------

Query:  --GATNHVCSSFQGISS
          GATNHVCSSFQGISS
Subjt:  --GATNHVCSSFQGISS

A0A5A7TRZ7 Zf-CCHC domain-containing protein/UBN2 domain-containing protein; e value: 6.9e-100; % identity: 61.3
Query:  KKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYKNFIKRKKQFKTH
        +K+L  LPK W++KVT+IQEAKDL  + ++E+IGSLMTHEI +++++EDE K K+KSIAL  I+LE+  + E+ LDEDD+ Y SRKYKNFIKRKK FK +
Subjt:  KKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYKNFIKRKKQFKTH

Query:  FSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFEN
         S QK S  E SKKDEVICYECKK  HIRTDCP LKSSKKSK+KAMKATWDDS E  SESE EE AN   M  SDKEDE DDEV L+P S +ELFE FEN
Subjt:  FSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAFEN

Query:  MQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK---KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC------------------------
        +QNDLEKL SKY +LKKKYN L SENKSLLD IAC K+   K+ + +NVS DKH+ DC+EK+ALLDK+RFLEHD C                        
Subjt:  MQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKK---KKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGC------------------------

Query:  ---------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK
                 AQRLDKIIEVGK Y DKR  GY+DE ST  SS+T FVK SP +PK
Subjt:  ---------AQRLDKIIEVGKPYSDKRGSGYLDECSTPSSSETIFVKESPNMPK

A0A5D3CPJ6 Gag/pol protein; e value: 1.8e-92; % identity: 59.94
Query:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE
        KLNG NY  WK+ +NT+L+IDDL+FVL E+CPQ PA NAT  VR  Y+RW KAN+KA+ YILAS+S+VLAKKHE  +TA+EIMDSLQ MFGQ S Q +H+
Subjt:  KLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEIMDSLQSMFGQPSSQARHE

Query:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---
        ALK++YN+RM EG+SVREHVLN+MVHFNVAE NGAVIDE SQVSFILES P SFL F  NAVMNK+ YTLTTLLN+LQT++SLMK KGQ+GEAN      
Subjt:  ALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPF-CNAVMNKLEYTLTTLLNKLQTYQSLMKSKGQEGEANTFK---

Query:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------
                              KKK  G+G+K  + +AAAK  K   A KG CFH N +GHWKRNCPKYLAEKKKA +                      
Subjt:  ----------------------KKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANE----------------------

Query:  --GATNHVCSSFQGISS
          GATNHVCSSFQGISS
Subjt:  --GATNHVCSSFQGISS

A0A6J1DS74 uncharacterized protein LOC111023806; e value: 1.4e-113; % identity: 66.85
Query:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK
        GL KEYSNL KVKKLLWSLPK+WE KVT IQEAKDLKT+SMDE+IGSLMTHEIKIKKNMEDEKK K+KSIALKAITLEV  +GEN LDEDDVAYLSRK  
Subjt:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK

Query:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
                                                 TDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
Subjt:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP

Query:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNAL--LDKIR-FLEHDGC-AQRLDKIIEVG
        LSYDELFEAFENMQN+LEKLGSKY MLK K N   SENKSL DDIACLKK ++DV N+     +L  +E +AL  LDK + F++     AQRLDKIIE G
Subjt:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNAL--LDKIR-FLEHDGC-AQRLDKIIEVG

Query:  KPYSDKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPVKKLNGKNYKQWKSNLNTILVIDDLKFV
        KPY DKRG GY++EC+TPSSS+TIFVK SPNMPKLVAPK   K + K+     S  +  +  D  KFV
Subjt:  KPYSDKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPVKKLNGKNYKQWKSNLNTILVIDDLKFV

A0A6J1DY46 uncharacterized protein LOC111025259; e value: 3.4e-99; % identity: 59.73
Query:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK
        GLGKEYSNL KVKKLLWSLPKQWE KVT+IQEAKDLKT+SMDE+I                                     GENALDEDDVAYLSRKYK
Subjt:  GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYK

Query:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP
        NFIKRKKQFK +FSN K+  +E SKKDEVICYECKKPGHIRTDCP LKSSKKSKKKAMKATWDDSDESG+ESENEEVANFCFMAHSDKEDE+DDE+ LDP
Subjt:  NFIKRKKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDP

Query:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGCAQRLDKIIEVGKPYS
        LSYDELFEAFENMQNDLEKL                                            ++ D+    + K+        AQRLDKIIE+GKPY 
Subjt:  LSYDELFEAFENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGCAQRLDKIIEVGKPYS

Query:  DKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPV-KKLNGKNYKQWKSNLNTILVIDDLKFV
        DKRG GY+DECSTPSSS+ IFVK SPNMPKLVAPK V  K + K      S  +  +  D  KFV
Subjt:  DKRGSGYLDECSTPSSSETIFVKESPNMPKLVAPKPV-KKLNGKNYKQWKSNLNTILVIDDLKFV

SwissProt top hits (columns: hit, e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: hit, e value, % identity, alignment)
No hits found

Sequences
CDS sequence
GGACTTGGAAAAGAATACTCAAATCTTGTGAAGGTAAAGAAGCTCTTATGGTCCTTGCCTAAACAATGGGAGTCTAAAGTCACCTCCATTCAAGAGGCAAAGGAT
CTCAAGACTATCTCCATGGATGAAATTATTGGTTCGTTGATGACGCATGAGATAAAGATCAAGAAAAACATGGAGGATGAGAAGAAAAATAAAGAGAAGAGCATA
GCATTAAAGGCCATCACCTTGGAAGTTGTCTCCAAAGGTGAGAATGCCCTTGATGAAGATGATGTGGCATATCTCTCACGTAAGTATAAAAATTTCATCAAGAGG
AAGAAACAATTCAAGACGCATTTCTCCAACCAAAAAAAGTCAAACAATGAAATGAGCAAAAAGGATGAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATT
AGAACTGATTGTCCTCTTCTTAAATCATCTAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATGATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAA
GAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGATGATGAGGTAAATCTTGACCCCCTTTCTTACGATGAGTTGTTTGAAGCTTTT
GAGAATATGCAAAATGATTTAGAAAAACTTGGTTCTAAATATGCTATGCTTAAAAAGAAATACAATGCCTTAGTTAGTGAAAATAAGTCTTTACTTGATGATATT
GCTTGCTTAAAGAAGAAAAAGTATGATGTTGTGAATGTCTCTTGTGATAAGCATGTTCTTGATTGTGATGAGAAAAATGCCTTACTTGATAAAATTAGATTTCTT
GAGCATGATGGTTGTGCTCAAAGGTTGGACAAGATAATTGAAGTAGGTAAGCCTTATAGTGATAAAAGAGGTTCAGGCTATCTTGATGAATGCTCTACTCCCTCA
AGTTCTGAAACTATTTTTGTTAAAGAATCTCCTAATATGCCTAAGCTTGTTGCTCCTAAGCCGGTCAAAAAACTTAACGGCAAGAATTACAAACAATGGAAATCG
AATCTAAACACTATTCTCGTGATAGATGATCTTAAGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTAT
GACAGGTGGATCAAGGCTAATGACAAGGCCAAGGTCTACATCCTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATC
ATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGA
GAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCTGAGTCGAACGGGGCTGTCATAGATGAGCAGAGTCAGGTAAGCTTTATTCTGGAATCTTTTCCGAAC
AGTTTCCTTCCATTTTGCAATGCGGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCTTAAACAAGTTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGA
CAAGAAGGGGAGGCAAATACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGAATCCACTATTGCGGCTGCCAAGAATGACAAGGCCAAGGTTGCA
GAGAAAGGGAAGTGTTTCCACTACAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAACGAAGGAGCCACTAATCAC
GTTTGTTCTTCATTTCAGGGAATTAGTTCCTAG
mRNA sequence
GGACTTGGAAAAGAATACTCAAATCTTGTGAAGGTAAAGAAGCTCTTATGGTCCTTGCCTAAACAATGGGAGTCTAAAGTCACCTCCATTCAAGAGGCAAAGGAT
CTCAAGACTATCTCCATGGATGAAATTATTGGTTCGTTGATGACGCATGAGATAAAGATCAAGAAAAACATGGAGGATGAGAAGAAAAATAAAGAGAAGAGCATA
GCATTAAAGGCCATCACCTTGGAAGTTGTCTCCAAAGGTGAGAATGCCCTTGATGAAGATGATGTGGCATATCTCTCACGTAAGTATAAAAATTTCATCAAGAGG
AAGAAACAATTCAAGACGCATTTCTCCAACCAAAAAAAGTCAAACAATGAAATGAGCAAAAAGGATGAGGTAATTTGTTATGAATGCAAAAAACCGGGTCATATT
AGAACTGATTGTCCTCTTCTTAAATCATCTAAGAAATCCAAGAAGAAAGCAATGAAGGCTACTTGGGATGATAGTGATGAAAGTGGAAGTGAAAGTGAGAATGAA
GAAGTGGCCAACTTTTGCTTCATGGCTCATAGTGACAAAGAGGATGAACAAGATGATGAGGTAAATCTTGACCCCCTTTCTTACGATGAGTTGTTTGAAGCTTTT
GAGAATATGCAAAATGATTTAGAAAAACTTGGTTCTAAATATGCTATGCTTAAAAAGAAATACAATGCCTTAGTTAGTGAAAATAAGTCTTTACTTGATGATATT
GCTTGCTTAAAGAAGAAAAAGTATGATGTTGTGAATGTCTCTTGTGATAAGCATGTTCTTGATTGTGATGAGAAAAATGCCTTACTTGATAAAATTAGATTTCTT
GAGCATGATGGTTGTGCTCAAAGGTTGGACAAGATAATTGAAGTAGGTAAGCCTTATAGTGATAAAAGAGGTTCAGGCTATCTTGATGAATGCTCTACTCCCTCA
AGTTCTGAAACTATTTTTGTTAAAGAATCTCCTAATATGCCTAAGCTTGTTGCTCCTAAGCCGGTCAAAAAACTTAACGGCAAGAATTACAAACAATGGAAATCG
AATCTAAACACTATTCTCGTGATAGATGATCTTAAGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTGCGCCTAACGCCACTGTGGCGGTGCGCAACGCCTAT
GACAGGTGGATCAAGGCTAATGACAAGGCCAAGGTCTACATCCTGGCGAGCATATCTGATGTGCTTGCCAAGAAGCACGAGGACACGGTCACCGCTAAGGAGATC
ATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAGTTCGTTTACAACTCCCGCATGAAGGAGGGCTCCTCAGTGCGA
GAACACGTTCTCAACCTGATGGTCCACTTCAATGTGGCTGAGTCGAACGGGGCTGTCATAGATGAGCAGAGTCAGGTAAGCTTTATTCTGGAATCTTTTCCGAAC
AGTTTCCTTCCATTTTGCAATGCGGTTATGAATAAGTTGGAGTACACTCTTACCACGCTCTTAAACAAGTTGCAGACCTACCAGTCTCTTATGAAGAGTAAGGGA
CAAGAAGGGGAGGCAAATACTTTTAAGAAGAAGAAGGCTGCTGGTAAGGGGTCTAAACCTGAATCCACTATTGCGGCTGCCAAGAATGACAAGGCCAAGGTTGCA
GAGAAAGGGAAGTGTTTCCACTACAATATGGACGGGCATTGGAAGCGCAACTGCCCAAAGTACTTGGCCGAAAAGAAGAAAGCCAACGAAGGAGCCACTAATCAC
GTTTGTTCTTCATTTCAGGGAATTAGTTCCTAG
Protein sequence
GLGKEYSNLVKVKKLLWSLPKQWESKVTSIQEAKDLKTISMDEIIGSLMTHEIKIKKNMEDEKKNKEKSIALKAITLEVVSKGENALDEDDVAYLSRKYKNFIKR
KKQFKTHFSNQKKSNNEMSKKDEVICYECKKPGHIRTDCPLLKSSKKSKKKAMKATWDDSDESGSESENEEVANFCFMAHSDKEDEQDDEVNLDPLSYDELFEAF
ENMQNDLEKLGSKYAMLKKKYNALVSENKSLLDDIACLKKKKYDVVNVSCDKHVLDCDEKNALLDKIRFLEHDGCAQRLDKIIEVGKPYSDKRGSGYLDECSTPS
SSETIFVKESPNMPKLVAPKPVKKLNGKNYKQWKSNLNTILVIDDLKFVLQEDCPQAPAPNATVAVRNAYDRWIKANDKAKVYILASISDVLAKKHEDTVTAKEI
MDSLQSMFGQPSSQARHEALKFVYNSRMKEGSSVREHVLNLMVHFNVAESNGAVIDEQSQVSFILESFPNSFLPFCNAVMNKLEYTLTTLLNKLQTYQSLMKSKG
QEGEANTFKKKKAAGKGSKPESTIAAAKNDKAKVAEKGKCFHYNMDGHWKRNCPKYLAEKKKANEGATNHVCSSFQGISS
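
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is one way to do that, assuming the standard genetic code (NCBI translation table 1). The names CODON_TABLE and translate are hypothetical helpers and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First six codons of the CDS listed in this record.
print(translate("GGACTTGGAAAAGAATAC"))  # GLGKEY
```

Passing the full CDS, with its line breaks, to translate and comparing the result with the listed protein sequence is a quick consistency check. Codons containing ambiguous bases such as N would raise a KeyError in this sketch.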