; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g02290 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g02290
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionGag/pol protein
Genome locationchr1:1501124..1522316
RNA-Seq ExpressionMoc01g02290
SyntenyMoc01g02290
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-5942.86Show/hide
Query:  LVLCSFIGIRIAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSL
        L++ S     +A  KLNG NY  WK+ +NT+L+IDDLRFVL ++CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSL
Subjt:  LVLCSFIGIRIAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSL

Query:  QSMFGQPSSQARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH-----
        Q MFGQ S Q +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+      
Subjt:  QSMFGQPSSQARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH-----

Query:  -------------KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE----------
                     + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +          
Subjt:  -------------KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE----------

Query:  --------------GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
                      GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  --------------GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.3e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

A0A5A7TWB9 Gag/pol protein1.3e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

A0A5A7V4M1 Gag/pol protein1.3e-5942.86Show/hide
Query:  LVLCSFIGIRIAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSL
        L++ S     +A  KLNG NY  WK+ +NT+L+IDDLRFVL ++CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSL
Subjt:  LVLCSFIGIRIAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSL

Query:  QSMFGQPSSQARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH-----
        Q MFGQ S Q +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+      
Subjt:  QSMFGQPSSQARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH-----

Query:  -------------KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE----------
                     + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +          
Subjt:  -------------KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE----------

Query:  --------------GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
                      GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  --------------GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

A0A5D3CPJ6 Gag/pol protein1.3e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

A0A5D3CSZ6 Gag/pol protein1.3e-5943.8Show/hide
Query:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ
        +A  KLNG NY  WK+ +NT+L+IDDLRFVL E+CPQ P  N    VR  Y++W KAN     YILAS+ +VLAKKHE  +T +EIMDSLQ MFGQ S Q
Subjt:  IAVKKLNGENYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKAN----VYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQ

Query:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------
         +H+ALK                            +   VIDE SQVSFILESLP+SFL FRSNAVMNK+ YTLTTLLNELQT+                
Subjt:  ARHEALK----------------------------VERVVIDEQSQVSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYH---------------

Query:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------
           + F +   +G +S P S+                 +AAAK  K     KG  FHCN +GHWK  C KY AEK +A +                    
Subjt:  ---KTFKKKKAAGKRSKPDST-----------------VAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANE--------------------

Query:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR
            GA N+VCSS QGISSWRQLE  EM  +     +  A+ V  LR
Subjt:  ----GAINYVCSSVQGISSWRQLEAFEMKWK---RRICLALEVDALR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTTGCAATTTCGTACTTCAAGTATCTGGCTTCAAAACTTCCTTCCTTGGCTTCTTCGTCTGGGAGGGAGTGGGAGCTCTCTCTCTCAATTCCCTTGGCT
TTCAAGATCGTTGCTCCCACGAGCACATTCTCGTACCCAGAGAATAGCAAGGAAGATAATTTGGTGGTGTCAATTAGAATAAAGATCGTCGTTTCCGCTGCGGGA
TTTACGTCCATCAATTGGTATCAGAGCAATGTTTACGTAGAGGGCGGCGCCGCGACTTCGAAACTCCCTTCGGCAGTCGTCGACGGCAGCAGCGGCACGTTTCCA
GCCAATTCCAGCGTCTCCCATGGTGTCTTGAACCCAAACCCACGGCGGTTTAGGCGTTTTTGCAGTGACAGTGGCGCTTCTGCGACATCGGCGTCCCGGCGACGA
CCCTTGGCCGTGTGTACTCCGACAACCATAAACAACTCCGATTTGTGGCCTTTGTTCCCCTCTGACGACGTTAGACACTTACCCACACCTAACCAGAGTTCGATT
CGTGGTACCCACCCCTATTTGGACTCAAACCAGCATTACCCATTGCTTTTTGACATCGGACCAGCAAGCCTAAAGCCATTCAAACTCGGTTTTGGGAGCCCGAAC
CTCCTCGGTGTTGACTCACTCCTTTTCCAAAAGGTTTTAGTGGAATCCTTCGGCGAGCTTAGTAGCGTGTTAGGACTTTGTTGGGCCAATGCACAGTTCGAAGCC
TTGGGGATATATGTCAAGGTCGAACGCCAAGCTTTTGTAAAGGAGTGTCAAGCCTTGGAGATAAATGGCAAAGGCCGATACGATTCGAAGAGTGAGGATTCAGGT
TACATTCATGGTGATGGTGTGGAGGAGACTCGAGAAGAAAGAACATGGCTAGTTTTGTGTAGTTTCATTGGGATTAGAATAGCTGTAAAAAAGCTTAACGGCGAA
AACTACAAACAATGGAAATCAAACCTAAACACTATATTAGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTACGCTTAATACCATT
GTTGCTGTACGCAACGTCTATGACCAATGGATTAAGGCCAATGTCTACATCTTGGCGAGCATATTTGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACCACT
AAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAAGTTGAACGGGTCGTCATAGACGAGCAGAGTCAG
GTAAGCTTCATTTTAGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACTACGCTCCTAAACGAGCTG
CAGACCTACCATAAGACTTTCAAGAAGAAGAAGGCTGCTGGTAAGAGGTCTAAACCCGACTCCACTGTTGCCGCTGCCAAGAAAGGCAAGGCCAAGGCTACAGAC
AAAGGAAAATATTTCCACTGCAACGTGGACGGGCATTGGAAGTGCACTTGCTCGAAATACCCGGCCGAGAAGAATAGAGCCAATGAAGGAGCCATTAACTATGTT
TGTTCTTCAGTTCAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCATTTGAAATGAAATGGAAAAGACGGATATGCTTAGCACTTGAAGTGGATGCTCTTCGTAAA
GAAATATTGAACTCGAAAAAACGAAAGCAAGACATAATGTCATCGGACAGCGAAAGTGAGGATACCATTGCTGGAATCACACGAGCTCGGGGGAAACTCCTGGCG
AATTTTTGTTACGAGTCCGCCATAAGGATTATTGTGAGCAAGTATGGGTCACACCCGTTCGATTGGGTCTCGACAGGTGGGTCTAAGGTGAACAATAACAACCCT
TGGAAGGGTATTGCTTTGTGTTTCCCCATTTTTTCTCAATCTCTTTGTTCTATTGTGGAGGATGGTCGTAACACTTATTTCTGGGAGGATTCTTGGGAGGGGGTG
GGTCCTTCCATTTCCTTGGGTCTTTCTCATTCTTTGACGAATCGAGAAGCTTTAGAGGTTTCTGCTTTACTTGCTTTGTTGTCTGAGGTGCCCTCCTATTCTAGG
AGGGAAGACGTGAGGATGTGGACCACAACCCTTCAAAAGTTCGAGAAGAGGACGGGTGGAAAGGGGAATAAAGTTGTCCCTGTTGCTGCCCAAAAGAGCAAGGAA
GCCAAGGTTTCAGACAAAGGAAAGTGTTTTCACTACAACGTTGACGAGCATTGGAAAACAAACTGCTCGAAGTACTTGGCTGAGAAGAAGAAAGCCAAAGAAGGC
ATGGAGTTGCTTTGTCTAAGGGAACAATGTCTTAAGGTACCTCAAGAAGTCGAGGATACGAGATGTATTCCCTATGTGTCAGCTATGTCCAGCCTTATGAAGGAA
AAAAGCAGCAATGCACCTGCTGCAGATGACACAGAGGCTATCTTAAAGCTAGCAGCGATGCGTCTGGCGCATCGCCAACACTCCAGAAACTTCAGCAATGCGTCT
GGCGCATTCTCCACCTACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTTGCAATTTCGTACTTCAAGTATCTGGCTTCAAAACTTCCTTCCTTGGCTTCTTCGTCTGGGAGGGAGTGGGAGCTCTCTCTCTCAATTCCCTTGGCT
TTCAAGATCGTTGCTCCCACGAGCACATTCTCGTACCCAGAGAATAGCAAGGAAGATAATTTGGTGGTGTCAATTAGAATAAAGATCGTCGTTTCCGCTGCGGGA
TTTACGTCCATCAATTGGTATCAGAGCAATGTTTACGTAGAGGGCGGCGCCGCGACTTCGAAACTCCCTTCGGCAGTCGTCGACGGCAGCAGCGGCACGTTTCCA
GCCAATTCCAGCGTCTCCCATGGTGTCTTGAACCCAAACCCACGGCGGTTTAGGCGTTTTTGCAGTGACAGTGGCGCTTCTGCGACATCGGCGTCCCGGCGACGA
CCCTTGGCCGTGTGTACTCCGACAACCATAAACAACTCCGATTTGTGGCCTTTGTTCCCCTCTGACGACGTTAGACACTTACCCACACCTAACCAGAGTTCGATT
CGTGGTACCCACCCCTATTTGGACTCAAACCAGCATTACCCATTGCTTTTTGACATCGGACCAGCAAGCCTAAAGCCATTCAAACTCGGTTTTGGGAGCCCGAAC
CTCCTCGGTGTTGACTCACTCCTTTTCCAAAAGGTTTTAGTGGAATCCTTCGGCGAGCTTAGTAGCGTGTTAGGACTTTGTTGGGCCAATGCACAGTTCGAAGCC
TTGGGGATATATGTCAAGGTCGAACGCCAAGCTTTTGTAAAGGAGTGTCAAGCCTTGGAGATAAATGGCAAAGGCCGATACGATTCGAAGAGTGAGGATTCAGGT
TACATTCATGGTGATGGTGTGGAGGAGACTCGAGAAGAAAGAACATGGCTAGTTTTGTGTAGTTTCATTGGGATTAGAATAGCTGTAAAAAAGCTTAACGGCGAA
AACTACAAACAATGGAAATCAAACCTAAACACTATATTAGTGATAGATGATCTTAGGTTCGTCTTGCAAGAGGATTGTCCTCAAGCTCCTACGCTTAATACCATT
GTTGCTGTACGCAACGTCTATGACCAATGGATTAAGGCCAATGTCTACATCTTGGCGAGCATATTTGATGTGCTTGCTAAGAAGCACGAGGACACGGTCACCACT
AAGGAGATCATGGACTCGCTGCAGAGCATGTTTGGACAACCGTCCTCACAGGCTCGACATGAAGCCCTTAAAGTTGAACGGGTCGTCATAGACGAGCAGAGTCAG
GTAAGCTTCATTTTAGAATCTCTTCCGAAGAGTTTCCTGCCATTCCGCAGCAATGCGGTTATGAATAAGCTGGAGTACACTCTTACTACGCTCCTAAACGAGCTG
CAGACCTACCATAAGACTTTCAAGAAGAAGAAGGCTGCTGGTAAGAGGTCTAAACCCGACTCCACTGTTGCCGCTGCCAAGAAAGGCAAGGCCAAGGCTACAGAC
AAAGGAAAATATTTCCACTGCAACGTGGACGGGCATTGGAAGTGCACTTGCTCGAAATACCCGGCCGAGAAGAATAGAGCCAATGAAGGAGCCATTAACTATGTT
TGTTCTTCAGTTCAGGGAATTAGTTCCTGGAGGCAGCTTGAAGCATTTGAAATGAAATGGAAAAGACGGATATGCTTAGCACTTGAAGTGGATGCTCTTCGTAAA
GAAATATTGAACTCGAAAAAACGAAAGCAAGACATAATGTCATCGGACAGCGAAAGTGAGGATACCATTGCTGGAATCACACGAGCTCGGGGGAAACTCCTGGCG
AATTTTTGTTACGAGTCCGCCATAAGGATTATTGTGAGCAAGTATGGGTCACACCCGTTCGATTGGGTCTCGACAGGTGGGTCTAAGGTGAACAATAACAACCCT
TGGAAGGGTATTGCTTTGTGTTTCCCCATTTTTTCTCAATCTCTTTGTTCTATTGTGGAGGATGGTCGTAACACTTATTTCTGGGAGGATTCTTGGGAGGGGGTG
GGTCCTTCCATTTCCTTGGGTCTTTCTCATTCTTTGACGAATCGAGAAGCTTTAGAGGTTTCTGCTTTACTTGCTTTGTTGTCTGAGGTGCCCTCCTATTCTAGG
AGGGAAGACGTGAGGATGTGGACCACAACCCTTCAAAAGTTCGAGAAGAGGACGGGTGGAAAGGGGAATAAAGTTGTCCCTGTTGCTGCCCAAAAGAGCAAGGAA
GCCAAGGTTTCAGACAAAGGAAAGTGTTTTCACTACAACGTTGACGAGCATTGGAAAACAAACTGCTCGAAGTACTTGGCTGAGAAGAAGAAAGCCAAAGAAGGC
ATGGAGTTGCTTTGTCTAAGGGAACAATGTCTTAAGGTACCTCAAGAAGTCGAGGATACGAGATGTATTCCCTATGTGTCAGCTATGTCCAGCCTTATGAAGGAA
AAAAGCAGCAATGCACCTGCTGCAGATGACACAGAGGCTATCTTAAAGCTAGCAGCGATGCGTCTGGCGCATCGCCAACACTCCAGAAACTTCAGCAATGCGTCT
GGCGCATTCTCCACCTACAAATAA
Protein sequenceShow/hide protein sequence
MFFAISYFKYLASKLPSLASSSGREWELSLSIPLAFKIVAPTSTFSYPENSKEDNLVVSIRIKIVVSAAGFTSINWYQSNVYVEGGAATSKLPSAVVDGSSGTFP
ANSSVSHGVLNPNPRRFRRFCSDSGASATSASRRRPLAVCTPTTINNSDLWPLFPSDDVRHLPTPNQSSIRGTHPYLDSNQHYPLLFDIGPASLKPFKLGFGSPN
LLGVDSLLFQKVLVESFGELSSVLGLCWANAQFEALGIYVKVERQAFVKECQALEINGKGRYDSKSEDSGYIHGDGVEETREERTWLVLCSFIGIRIAVKKLNGE
NYKQWKSNLNTILVIDDLRFVLQEDCPQAPTLNTIVAVRNVYDQWIKANVYILASIFDVLAKKHEDTVTTKEIMDSLQSMFGQPSSQARHEALKVERVVIDEQSQ
VSFILESLPKSFLPFRSNAVMNKLEYTLTTLLNELQTYHKTFKKKKAAGKRSKPDSTVAAAKKGKAKATDKGKYFHCNVDGHWKCTCSKYPAEKNRANEGAINYV
CSSVQGISSWRQLEAFEMKWKRRICLALEVDALRKEILNSKKRKQDIMSSDSESEDTIAGITRARGKLLANFCYESAIRIIVSKYGSHPFDWVSTGGSKVNNNNP
WKGIALCFPIFSQSLCSIVEDGRNTYFWEDSWEGVGPSISLGLSHSLTNREALEVSALLALLSEVPSYSRREDVRMWTTTLQKFEKRTGGKGNKVVPVAAQKSKE
AKVSDKGKCFHYNVDEHWKTNCSKYLAEKKKAKEGMELLCLREQCLKVPQEVEDTRCIPYVSAMSSLMKEKSSNAPAADDTEAILKLAAMRLAHRQHSRNFSNAS
GAFSTYK