CuGenDBv2

ClCG05G011900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G011900
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ty3/gypsy retrotransposon protein
Genome location: CG_Chr05:14816473..14821075
RNA-Seq Expression: ClCG05G011900
Synteny: ClCG05G011900
Gene Ontology terms: GO:0009987 - cellular process (biological process)
InterPro domains:
IPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR023780 - Chromo domain
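
The Genome location field in this record follows a `seqid:start..end` pattern. A minimal sketch of parsing it is shown here; the names `Location` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # With inclusive coordinates both endpoints count toward the span.
        return self.end - self.start + 1


# Hypothetical pattern for strings of the form "<seqid>:<start>..<end>".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Split a location string into sequence ID, start and end."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


loc = parse_location("CG_Chr05:14816473..14821075")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 4,603 bp on CG_Chr05.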


Homology
GenBank top hits | e value | % identity | Alignment
KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 2.8e-34 | 47.95
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELPNNA+IH VFHVSQLK+A+GNI  I+   P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV+WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

KAA0057731.1 dual specificity protein phosphatase PHS1 isoform X1 [Cucumis melo var. makuwa] | 1.6e-37 | 20.91
Query:  MVQRRPEERIEESECTMSKVREEVAKFPEIDRAMRELTQKCGENCTPNERP------------------------------DEIKDRRIKRCSMRSPPWP
        MVQ R EER+E  +  ++ +++E++K P I+ ++ E+ +        +E+                               +++K +     S ++    
Subjt:  MVQRRPEERIEESECTMSKVREEVAKFPEIDRAMRELTQKCGENCTPNERP------------------------------DEIKDRRIKRCSMRSPPWP

Query:  RNQG-GSLNKRTKAKAGYADQSNFKK-------------------PYFDIHQLTDLKKVTVTKISFSGAALDWGMERLETAD------------VGSFPA
        RN G    ++R     G ++++ F K                    YF IH+LTD +K+ V+ +SF G AL+ G    E  D            +  F +
Subjt:  RNQG-GSLNKRTKAKAGYADQSNFKK-------------------PYFDIHQLTDLKKVTVTKISFSGAALDWGMERLETAD------------VGSFPA

Query:  SARGVHLSNFMQSSQTSTLPDFIERFEAMPVPLPHLTDEV---------------EVICFEPMGLAQIMKAAQRIEDKDMAIQNIIGPPHSKAQKSFHPS
        +  G     F++  Q  T+ ++I  F+ M  P+  L + V               EV       LA++M+ AQ +E++++A        +S  + + H S
Subjt:  SARGVHLSNFMQSSQTSTLPDFIERFEAMPVPLPHLTDEV---------------EVICFEPMGLAQIMKAAQRIEDKDMAIQNIIGPPHSKAQKSFHPS

Query:  L-------NLKPFNKPSEPNPVRTITLTSNQPT-NKKEVPYRHLSDTKLKARKEQGLYFRCDENYFAS----ARQQR----------HDESGLTIENGEG
        +        +   NK +   P+RTITL S+ P  N++E  Y+ L D + +ARKE+GL FRC+E Y A      R+QR           DE  +  E  E 
Subjt:  L-------NLKPFNKPSEPNPVRTITLTSNQPT-NKKEVPYRHLSDTKLKARKEQGLYFRCDENYFAS----ARQQR----------HDESGLTIENGEG

Query:  LN------EEPIEEMAELSLNTVVRISNLHTIKIKGKFENEEVMVLVDS---------------KLPVFDITNHRMIVGTGDTV-------------CSL
                +E I  + ELS+N+VV +++  T+K++GK   EEV++L+D                 LP+ + +++ +I+G+G  V              S 
Subjt:  LN------EEPIEEMAELSLNTVVRISNLHTIKIKGKFENEEVMVLVDS---------------KLPVFDITNHRMIVGTGDTV-------------CSL

Query:  TVTMYFLLLELGGI---------YTWELQSIHW----------------------------------NKEIKDSLWSYE---------------------
         +   FL LELGG+         Y+  + ++ W                                  N E KD+ +  E                     
Subjt:  TVTMYFLLLELGGI---------YTWELQSIHW----------------------------------NKEIKDSLWSYE---------------------

Query:  -----------------PLLLRWMKLMLPLQLWFHLYH--------------------------------------------------KRKDGSGRFCVD
                         P +  W + + P +   H  H                                                  K+KDGS RFCVD
Subjt:  -----------------PLLLRWMKLMLPLQLWFHLYH--------------------------------------------------KRKDGSGRFCVD

Query:  C----EATIPYKFPIPIIEELLGKLHGA------------------------------------------------------------------------
              ATIP KF I ++EEL  +L GA                                                                        
Subjt:  C----EATIPYKFPIPIIEELLGKLHGA------------------------------------------------------------------------

Query:  --------------KGTLKFLGLTSYYRSFVLNYDAFTFCL-RSSEKGHGH------------------TTYVSFAQFLHSFHC---ASETGLGV-----
                      K    FLGL  YYR FV NY      L +  +KG  H                     ++   F   F     AS  G+GV     
Subjt:  --------------KGTLKFLGLTSYYRSFVLNYDAFTFCL-RSSEKGHGH------------------TTYVSFAQFLHSFHC---ASETGLGV-----

Query:  -----------------------------------------------------------ESLPPQYQNWEEKLLGYDFEIQY------------------
                                                                     + PQYQ W  KLLGY FE+ Y                  
Subjt:  -----------------------------------------------------------ESLPPQYQNWEEKLLGYDFEIQY------------------

Query:  -----CSAPTILEVDIV-KEVSQDEFFPKLYFELKKDPTISSGHPSM--------------------------------FVIIEGHPS------MFVIID
              S P  + +D++ KEV QD  + K+  ++++   ++  + SM                                   +EG P       + V++D
Subjt:  -----CSAPTILEVDIV-KEVSQDEFFPKLYFELKKDPTISSGHPSM--------------------------------FVIIEGHPS------MFVIID

Query:  QLSK---------------LWPSFNPEDQECHSRPTLI----------------------------------KGGNDSTITIPPYS--------------
        +LSK               +   F  E    H  P  I                                   G  + +I + P+               
Subjt:  QLSK---------------LWPSFNPEDQECHSRPTLI----------------------------------KGGNDSTITIPPYS--------------

Query:  -----------------------------ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNEK---------------------LELPNNA
                                     A  +MK + D   R +E S G++VFL++ PYRQ  V  R+NEK                     L+LP ++
Subjt:  -----------------------------ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNEK---------------------LELPNNA

Query:  SIHSVFHVSQLKQALGNIMTIEPTLPCLTNKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
         IH VFHVSQL++ +G     +PT+  +   ++W + PEE + Y  K     WEVLV WKGL +HEA+WE  +E+  ++P F LEDK +L   + VKP I
Subjt:  SIHSVFHVSQLKQALGNIMTIEPTLPCLTNKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

Query:  R
        +
Subjt:  R

KAA0063300.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 2.8e-34 | 47.37
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELP+NA+IH VFHVSQLK+A+GNI  ++P  P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

TYJ98416.1 dual specificity protein phosphatase PHS1 isoform X2 [Cucumis melo var. makuwa] | 1.6e-37 | 20.91
Query:  MVQRRPEERIEESECTMSKVREEVAKFPEIDRAMRELTQKCGENCTPNERP------------------------------DEIKDRRIKRCSMRSPPWP
        MVQ R EER+E  +  ++ +++E++K P I+ ++ E+ +        +E+                               +++K +     S ++    
Subjt:  MVQRRPEERIEESECTMSKVREEVAKFPEIDRAMRELTQKCGENCTPNERP------------------------------DEIKDRRIKRCSMRSPPWP

Query:  RNQG-GSLNKRTKAKAGYADQSNFKK-------------------PYFDIHQLTDLKKVTVTKISFSGAALDWGMERLETAD------------VGSFPA
        RN G    ++R     G ++++ F K                    YF IH+LTD +K+ V+ +SF G AL+ G    E  D            +  F +
Subjt:  RNQG-GSLNKRTKAKAGYADQSNFKK-------------------PYFDIHQLTDLKKVTVTKISFSGAALDWGMERLETAD------------VGSFPA

Query:  SARGVHLSNFMQSSQTSTLPDFIERFEAMPVPLPHLTDEV---------------EVICFEPMGLAQIMKAAQRIEDKDMAIQNIIGPPHSKAQKSFHPS
        +  G     F++  Q  T+ ++I  F+ M  P+  L + V               EV       LA++M+ AQ +E++++A        +S  + + H S
Subjt:  SARGVHLSNFMQSSQTSTLPDFIERFEAMPVPLPHLTDEV---------------EVICFEPMGLAQIMKAAQRIEDKDMAIQNIIGPPHSKAQKSFHPS

Query:  L-------NLKPFNKPSEPNPVRTITLTSNQPT-NKKEVPYRHLSDTKLKARKEQGLYFRCDENYFAS----ARQQR----------HDESGLTIENGEG
        +        +   NK +   P+RTITL S+ P  N++E  Y+ L D + +ARKE+GL FRC+E Y A      R+QR           DE  +  E  E 
Subjt:  L-------NLKPFNKPSEPNPVRTITLTSNQPT-NKKEVPYRHLSDTKLKARKEQGLYFRCDENYFAS----ARQQR----------HDESGLTIENGEG

Query:  LN------EEPIEEMAELSLNTVVRISNLHTIKIKGKFENEEVMVLVDS---------------KLPVFDITNHRMIVGTGDTV-------------CSL
                +E I  + ELS+N+VV +++  T+K++GK   EEV++L+D                 LP+ + +++ +I+G+G  V              S 
Subjt:  LN------EEPIEEMAELSLNTVVRISNLHTIKIKGKFENEEVMVLVDS---------------KLPVFDITNHRMIVGTGDTV-------------CSL

Query:  TVTMYFLLLELGGI---------YTWELQSIHW----------------------------------NKEIKDSLWSYE---------------------
         +   FL LELGG+         Y+  + ++ W                                  N E KD+ +  E                     
Subjt:  TVTMYFLLLELGGI---------YTWELQSIHW----------------------------------NKEIKDSLWSYE---------------------

Query:  -----------------PLLLRWMKLMLPLQLWFHLYH--------------------------------------------------KRKDGSGRFCVD
                         P +  W + + P +   H  H                                                  K+KDGS RFCVD
Subjt:  -----------------PLLLRWMKLMLPLQLWFHLYH--------------------------------------------------KRKDGSGRFCVD

Query:  C----EATIPYKFPIPIIEELLGKLHGA------------------------------------------------------------------------
              ATIP KF I ++EEL  +L GA                                                                        
Subjt:  C----EATIPYKFPIPIIEELLGKLHGA------------------------------------------------------------------------

Query:  --------------KGTLKFLGLTSYYRSFVLNYDAFTFCL-RSSEKGHGH------------------TTYVSFAQFLHSFHC---ASETGLGV-----
                      K    FLGL  YYR FV NY      L +  +KG  H                     ++   F   F     AS  G+GV     
Subjt:  --------------KGTLKFLGLTSYYRSFVLNYDAFTFCL-RSSEKGHGH------------------TTYVSFAQFLHSFHC---ASETGLGV-----

Query:  -----------------------------------------------------------ESLPPQYQNWEEKLLGYDFEIQY------------------
                                                                     + PQYQ W  KLLGY FE+ Y                  
Subjt:  -----------------------------------------------------------ESLPPQYQNWEEKLLGYDFEIQY------------------

Query:  -----CSAPTILEVDIV-KEVSQDEFFPKLYFELKKDPTISSGHPSM--------------------------------FVIIEGHPS------MFVIID
              S P  + +D++ KEV QD  + K+  ++++   ++  + SM                                   +EG P       + V++D
Subjt:  -----CSAPTILEVDIV-KEVSQDEFFPKLYFELKKDPTISSGHPSM--------------------------------FVIIEGHPS------MFVIID

Query:  QLSK---------------LWPSFNPEDQECHSRPTLI----------------------------------KGGNDSTITIPPYS--------------
        +LSK               +   F  E    H  P  I                                   G  + +I + P+               
Subjt:  QLSK---------------LWPSFNPEDQECHSRPTLI----------------------------------KGGNDSTITIPPYS--------------

Query:  -----------------------------ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNEK---------------------LELPNNA
                                     A  +MK + D   R +E S G++VFL++ PYRQ  V  R+NEK                     L+LP ++
Subjt:  -----------------------------ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNEK---------------------LELPNNA

Query:  SIHSVFHVSQLKQALGNIMTIEPTLPCLTNKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
         IH VFHVSQL++ +G     +PT+  +   ++W + PEE + Y  K     WEVLV WKGL +HEA+WE  +E+  ++P F LEDK +L   + VKP I
Subjt:  SIHSVFHVSQLKQALGNIMTIEPTLPCLTNKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

Query:  R
        +
Subjt:  R

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | 2.8e-34 | 47.95
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELPNNA+IH VFHVSQLK+A+GNI  I+   P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV+WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TDM4 Ty3/gypsy retrotransposon protein | 5.1e-34 | 46.78
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELP+NA+IH VFHVSQLK+A+G+I  ++P  P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

A0A5A7U2S1 Ty3/gypsy retrotransposon protein | 1.4e-34 | 47.95
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELPNNA+IH VFHVSQLK+A+GNI  I+   P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV+WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

A0A5A7V7S9 Ty3/gypsy retrotransposon protein | 1.4e-34 | 47.37
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELP+NA+IH VFHVSQLK+A+GNI  ++P  P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

A0A5D3CXB1 Ty3/gypsy retrotransposon protein | 1.4e-34 | 47.95
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELPNNA+IH VFHVSQLK+A+GNI  I+   P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV+WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

A0A5D3DI73 Ty3/gypsy retrotransposon protein | 5.1e-34 | 47.37
Query:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT
        A  RMKKF D   R++E   GD+VFLKL PYRQ  + K++NE                     KLELP+NA+IH VFHVSQLK+A+GNI  I+   P + 
Subjt:  ALNRMKKFVDGHCRELELSFGDWVFLKLHPYRQAIVAKRKNE---------------------KLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLT

Query:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI
            W+T PEE+  Y   + T +WE LV+WKGL  HEATWE   +L  QFP F LEDK DL  ++  +PPI
Subjt:  NKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLEDKADLHPQAFVKPPI

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGTACAAAGACGACCTGAGGAAAGAATAGAGGAGTCAGAATGTACGATGAGCAAAGTTAGAGAAGAGGTGGCAAAGTTTCCCGAGATCGATAGGGCAATGAGAGAGTT
AACACAAAAATGTGGAGAAAACTGCACACCAAATGAAAGACCAGACGAAATCAAAGACAGGCGAATAAAGAGATGCAGCATGCGATCTCCGCCTTGGCCAAGGAATCAAG
GTGGGAGTTTGAATAAGAGAACAAAAGCAAAGGCAGGATATGCGGATCAAAGCAATTTCAAGAAACCATATTTTGACATTCATCAGCTGACAGATTTGAAAAAGGTAACT
GTGACCAAAATTAGTTTCTCTGGTGCAGCGTTAGACTGGGGAATGGAAAGACTTGAAACGGCGGATGTTGGATCGTTTCCAGCCAGCGCAAGAGGGGTCCATCTGAGCAA
TTTCATGCAATCAAGCCAAACATCAACTCTACCTGATTTCATAGAGAGATTTGAGGCTATGCCAGTGCCTCTTCCTCACCTAACAGATGAGGTTGAAGTCATCTGTTTTG
AACCAATGGGCTTGGCCCAAATTATGAAAGCAGCCCAAAGAATAGAAGACAAAGATATGGCCATCCAAAACATAATAGGCCCACCTCATTCCAAGGCCCAGAAGTCGTTC
CATCCTTCTTTGAACCTAAAGCCTTTTAATAAACCCAGTGAGCCCAACCCTGTTCGTACTATTACTCTCACTAGCAACCAACCCACCAACAAGAAAGAAGTTCCATACAG
ACATCTCTCAGACACAAAGCTCAAAGCGAGAAAAGAACAAGGCCTCTATTTTCGATGTGATGAAAATTACTTTGCTTCTGCTCGTCAACAACGACATGATGAAAGTGGAT
TGACAATTGAAAATGGGGAGGGTTTGAATGAGGAACCAATTGAAGAAATGGCTGAGCTATCCTTAAACACTGTGGTCAGAATTTCAAACCTTCATACCATCAAAATAAAG
GGAAAATTTGAAAACGAAGAGGTTATGGTTTTGGTTGATTCTAAACTTCCTGTTTTCGATATTACCAACCATAGGATGATTGTCGGAACAGGGGACACAGTTTGTAGTCT
CACCGTTACAATGTACTTCCTATTGCTTGAATTAGGAGGTATTTATACTTGGGAGTTACAGAGCATTCACTGGAACAAGGAGATCAAGGATTCCTTATGGAGCTACGAGC
CATTACTGCTTAGGTGGATGAAGTTAATGCTCCCATTGCAGCTCTGGTTTCATCTCTACCATAAGAGGAAAGATGGTAGTGGGAGATTTTGTGTGGATTGTGAAGCTACC
ATCCCATACAAGTTTCCCATTCCTATCATTGAAGAACTCCTCGGTAAACTTCATGGGGCAAAGGGAACTTTAAAGTTTCTGGGCCTTACCAGCTACTATAGGAGTTTTGT
CTTGAATTATGACGCGTTTACTTTCTGCCTTCGAAGCTCTGAAAAGGGCCATGGTCACACTACCTATGTTAGCTTTGCCCAATTTCTCCATTCCTTTCATTGTGCATCTG
AAACGGGACTTGGGGTTGAGTCCCTTCCTCCCCAGTATCAAAATTGGGAGGAAAAACTTTTGGGATATGATTTTGAGATCCAATATTGCTCTGCTCCGACCATCCTTGAA
GTGGATATTGTGAAAGAAGTGAGTCAAGATGAATTTTTCCCAAAACTGTACTTTGAATTGAAGAAGGATCCAACTATCAGCTCTGGCCATCCTTCAATGTTTGTGATTAT
TGAGGGCCATCCTTCAATGTTTGTGATTATTGATCAACTAAGTAAGCTCTGGCCATCCTTCAATCCTGAAGACCAAGAATGTCACTCTCGACCAACACTAATAAAGGGAG
GAAATGATTCCACTATTACAATTCCACCCTATTCAGCCCTGAATCGAATGAAGAAATTTGTGGATGGACATTGCAGAGAGCTGGAACTTAGTTTTGGTGATTGGGTATTT
CTCAAGCTACATCCGTATAGGCAAGCCATTGTGGCCAAAAGGAAGAATGAGAAGCTGGAGTTACCTAACAATGCATCCATTCATTCAGTATTCCATGTTTCACAACTTAA
ACAAGCTTTGGGCAATATTATGACCATAGAGCCCACTCTTCCGTGCTTGACAAATAAGTTTCTATGGCTTACCATTCCAGAAGAAGTGCTAACTTATCATTTGAAGGAGG
GAACGAATGATTGGGAAGTGCTTGTGAAATGGAAGGGCCTTCTCGAGCATGAAGCCACATGGGAAATTAATGAAGAGTTACACAATCAGTTTCCTGTTTTTCCTCTTGAG
GACAAGGCGGATCTCCACCCCCAGGCATTTGTCAAGCCCCCAATACGTATAAAACGTATGTTAGAAGGGGTAGAAATGAGAATTCCCCCGTTATTGGCTGGTAGTGTTTT
TGGAGGTTAG
mRNA sequence
ATGGTACAAAGACGACCTGAGGAAAGAATAGAGGAGTCAGAATGTACGATGAGCAAAGTTAGAGAAGAGGTGGCAAAGTTTCCCGAGATCGATAGGGCAATGAGAGAGTT
AACACAAAAATGTGGAGAAAACTGCACACCAAATGAAAGACCAGACGAAATCAAAGACAGGCGAATAAAGAGATGCAGCATGCGATCTCCGCCTTGGCCAAGGAATCAAG
GTGGGAGTTTGAATAAGAGAACAAAAGCAAAGGCAGGATATGCGGATCAAAGCAATTTCAAGAAACCATATTTTGACATTCATCAGCTGACAGATTTGAAAAAGGTAACT
GTGACCAAAATTAGTTTCTCTGGTGCAGCGTTAGACTGGGGAATGGAAAGACTTGAAACGGCGGATGTTGGATCGTTTCCAGCCAGCGCAAGAGGGGTCCATCTGAGCAA
TTTCATGCAATCAAGCCAAACATCAACTCTACCTGATTTCATAGAGAGATTTGAGGCTATGCCAGTGCCTCTTCCTCACCTAACAGATGAGGTTGAAGTCATCTGTTTTG
AACCAATGGGCTTGGCCCAAATTATGAAAGCAGCCCAAAGAATAGAAGACAAAGATATGGCCATCCAAAACATAATAGGCCCACCTCATTCCAAGGCCCAGAAGTCGTTC
CATCCTTCTTTGAACCTAAAGCCTTTTAATAAACCCAGTGAGCCCAACCCTGTTCGTACTATTACTCTCACTAGCAACCAACCCACCAACAAGAAAGAAGTTCCATACAG
ACATCTCTCAGACACAAAGCTCAAAGCGAGAAAAGAACAAGGCCTCTATTTTCGATGTGATGAAAATTACTTTGCTTCTGCTCGTCAACAACGACATGATGAAAGTGGAT
TGACAATTGAAAATGGGGAGGGTTTGAATGAGGAACCAATTGAAGAAATGGCTGAGCTATCCTTAAACACTGTGGTCAGAATTTCAAACCTTCATACCATCAAAATAAAG
GGAAAATTTGAAAACGAAGAGGTTATGGTTTTGGTTGATTCTAAACTTCCTGTTTTCGATATTACCAACCATAGGATGATTGTCGGAACAGGGGACACAGTTTGTAGTCT
CACCGTTACAATGTACTTCCTATTGCTTGAATTAGGAGGTATTTATACTTGGGAGTTACAGAGCATTCACTGGAACAAGGAGATCAAGGATTCCTTATGGAGCTACGAGC
CATTACTGCTTAGGTGGATGAAGTTAATGCTCCCATTGCAGCTCTGGTTTCATCTCTACCATAAGAGGAAAGATGGTAGTGGGAGATTTTGTGTGGATTGTGAAGCTACC
ATCCCATACAAGTTTCCCATTCCTATCATTGAAGAACTCCTCGGTAAACTTCATGGGGCAAAGGGAACTTTAAAGTTTCTGGGCCTTACCAGCTACTATAGGAGTTTTGT
CTTGAATTATGACGCGTTTACTTTCTGCCTTCGAAGCTCTGAAAAGGGCCATGGTCACACTACCTATGTTAGCTTTGCCCAATTTCTCCATTCCTTTCATTGTGCATCTG
AAACGGGACTTGGGGTTGAGTCCCTTCCTCCCCAGTATCAAAATTGGGAGGAAAAACTTTTGGGATATGATTTTGAGATCCAATATTGCTCTGCTCCGACCATCCTTGAA
GTGGATATTGTGAAAGAAGTGAGTCAAGATGAATTTTTCCCAAAACTGTACTTTGAATTGAAGAAGGATCCAACTATCAGCTCTGGCCATCCTTCAATGTTTGTGATTAT
TGAGGGCCATCCTTCAATGTTTGTGATTATTGATCAACTAAGTAAGCTCTGGCCATCCTTCAATCCTGAAGACCAAGAATGTCACTCTCGACCAACACTAATAAAGGGAG
GAAATGATTCCACTATTACAATTCCACCCTATTCAGCCCTGAATCGAATGAAGAAATTTGTGGATGGACATTGCAGAGAGCTGGAACTTAGTTTTGGTGATTGGGTATTT
CTCAAGCTACATCCGTATAGGCAAGCCATTGTGGCCAAAAGGAAGAATGAGAAGCTGGAGTTACCTAACAATGCATCCATTCATTCAGTATTCCATGTTTCACAACTTAA
ACAAGCTTTGGGCAATATTATGACCATAGAGCCCACTCTTCCGTGCTTGACAAATAAGTTTCTATGGCTTACCATTCCAGAAGAAGTGCTAACTTATCATTTGAAGGAGG
GAACGAATGATTGGGAAGTGCTTGTGAAATGGAAGGGCCTTCTCGAGCATGAAGCCACATGGGAAATTAATGAAGAGTTACACAATCAGTTTCCTGTTTTTCCTCTTGAG
GACAAGGCGGATCTCCACCCCCAGGCATTTGTCAAGCCCCCAATACGTATAAAACGTATGTTAGAAGGGGTAGAAATGAGAATTCCCCCGTTATTGGCTGGTAGTGTTTT
TGGAGGTTAG
Protein sequence
MVQRRPEERIEESECTMSKVREEVAKFPEIDRAMRELTQKCGENCTPNERPDEIKDRRIKRCSMRSPPWPRNQGGSLNKRTKAKAGYADQSNFKKPYFDIHQLTDLKKVT
VTKISFSGAALDWGMERLETADVGSFPASARGVHLSNFMQSSQTSTLPDFIERFEAMPVPLPHLTDEVEVICFEPMGLAQIMKAAQRIEDKDMAIQNIIGPPHSKAQKSF
HPSLNLKPFNKPSEPNPVRTITLTSNQPTNKKEVPYRHLSDTKLKARKEQGLYFRCDENYFASARQQRHDESGLTIENGEGLNEEPIEEMAELSLNTVVRISNLHTIKIK
GKFENEEVMVLVDSKLPVFDITNHRMIVGTGDTVCSLTVTMYFLLLELGGIYTWELQSIHWNKEIKDSLWSYEPLLLRWMKLMLPLQLWFHLYHKRKDGSGRFCVDCEAT
IPYKFPIPIIEELLGKLHGAKGTLKFLGLTSYYRSFVLNYDAFTFCLRSSEKGHGHTTYVSFAQFLHSFHCASETGLGVESLPPQYQNWEEKLLGYDFEIQYCSAPTILE
VDIVKEVSQDEFFPKLYFELKKDPTISSGHPSMFVIIEGHPSMFVIIDQLSKLWPSFNPEDQECHSRPTLIKGGNDSTITIPPYSALNRMKKFVDGHCRELELSFGDWVF
LKLHPYRQAIVAKRKNEKLELPNNASIHSVFHVSQLKQALGNIMTIEPTLPCLTNKFLWLTIPEEVLTYHLKEGTNDWEVLVKWKGLLEHEATWEINEELHNQFPVFPLE
DKADLHPQAFVKPPIRIKRMLEGVEMRIPPLLAGSVFGG
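
The CDS and protein sequences listed in this record can be cross-checked against each other by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); the record does not say which table was used for annotation, and the function name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons ordered
# T, C, A, G at each position and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so a sequence wrapped
    across several lines (as in this record) can be passed in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unexpected codon {codon!r} at position {i}")
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon: end of the reading frame
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 11 codons of the CDS above, and the last five (including the stop).
print(translate("ATGGTACAAAGACGACCTGAGGAAAGAATAGAG"))
print(translate("GTTTTTGGAGGTTAG"))
```

Translating the start and end of the CDS this way yields `MVQRRPEERIE` and `VFGG`, which agree with the first and last residues of the protein sequence listed above.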