; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC08G149090 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC08G149090
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrotransposon protein
Genome locationCmU531Chr08:18471405..18472332
RNA-Seq ExpressionCmUC08G149090
SyntenyCmUC08G149090
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035621.1 retrotransposon protein [Cucumis melo var. makuwa]1.2e-4036.01Show/hide
Query:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        LVECLV+ V +G WR+DNG F+PG++  + +               I+SR++ +KR ++A+ EM    CSGFGWN E KCI  E E+FD W  SHP+AKG
Subjt:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG
        L +KSF  YD+L  VFGKDRATG RA +  ++GS   P  D    D +     DF   Y P          E     +  R + SS   R R   ++   
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG

Query:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL
        +  D+VR   +   + +  IA+W I+    A + R+E+   L++IP L++ D   + R L+ +   +  F++ P   KY YC  +L
Subjt:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL

KAA0038122.1 retrotransposon protein [Cucumis melo var. makuwa]1.2e-4036.01Show/hide
Query:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        LVECLV+ V +G WR+DNG F+PG++  + +               I+SR++ +KR ++A+ EM    CSGFGWN E KCI  E E+FD W  SHP+AKG
Subjt:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG
        L +KSF  YD+L  VFGKDRATG RA +  ++GS   P  D    D +     DF   Y P          E     +  R + SS   R R   ++   
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG

Query:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL
        +  D+VR   +   + +  IA+W I+    A + R+E+   L++IP L++ D   + R L+ +   +  F++ P   KY YC  +L
Subjt:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL

KAA0057083.1 retrotransposon protein [Cucumis melo var. makuwa]1.8e-4136.36Show/hide
Query:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        LVECLV+ V +G WR+DNG F+PG++  + +               I+SR++ +KR ++A+ EM    CSGFGWN E KCI  E E+FD W  SHP+AKG
Subjt:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG
        L +KSF  YD+L  VFGKDRATG RA +  ++GS   P  D E  D +     DF   Y P          E     +  R + SS   R R   ++   
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG

Query:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL
        +  D+VR   +   + +  IA+W I+    A + R+E+   L++IP L++ D   + R L+ +   +  F++ P   KY YC  +L
Subjt:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]6.3e-5043.46Show/hide
Query:  ILVECLVQCVQSGHWRADNGIFQPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVI
        +LVECL+Q V+ G WRADNG F+ G++              +QY AI EM+   CSGFGWN   KCI+ E  +FD WVK HP+A+GL +K FP++ DL +
Subjt:  ILVECLVQCVQSGHWRADNGIFQPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVI

Query:  VFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSASSMPSRSRRSQSSLIGEYSDVVREGFQLLT
        VFG+DRATG R  T VE+ S+   D E +D+  N     E+F IP+P  +  P+ ED P+T       + SS PS+ RRS S   G+  D  R   +  +
Subjt:  VFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSASSMPSRSRRSQSSLIGEYSDVVREGFQLLT

Query:  KSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFP
        K I  IA W     ++     + LYAELQ+IPG+ V D L V  SLL DP +L  F+D+P
Subjt:  KSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFP

XP_008441954.1 PREDICTED: uncharacterized protein LOC103485953 [Cucumis melo]3.1e-4138.25Show/hide
Query:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        VECLV+ V SG WR+DNG FQPG++A + +                I+  V++LK+ Y+AI EM    CSGFGWN E +CI  E ++FD+W+KSHP+AKG
Subjt:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS
        L HKSFP+YDDL  VFGKDRATG+R+ T   VGS  V +  N+ I    S D       D P + S     +P  + G   G AS   +  S S+R + S
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS

Query:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC
           E  +V+R   +   + +  IA W      +    R ++  +LQ IP L  QD   + + L      +  F+  P + K +YC
Subjt:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC

TrEMBL top hitse value%identityAlignment
A0A1S3B4L3 uncharacterized protein LOC1034859531.5e-4138.25Show/hide
Query:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        VECLV+ V SG WR+DNG FQPG++A + +                I+  V++LK+ Y+AI EM    CSGFGWN E +CI  E ++FD+W+KSHP+AKG
Subjt:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS
        L HKSFP+YDDL  VFGKDRATG+R+ T   VGS  V +  N+ I    S D       D P + S     +P  + G   G AS   +  S S+R + S
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS

Query:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC
           E  +V+R   +   + +  IA W      +    R ++  +LQ IP L  QD   + + L      +  F+  P + K +YC
Subjt:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC

A0A5A7U0H7 Retrotransposon protein1.5e-4138.25Show/hide
Query:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        VECLV+ V SG WR+DNG FQPG++A + +                I+  V++LK+ Y+AI EM    CSGFGWN E +CI  E ++FD+W+KSHP+AKG
Subjt:  VECLVQCVQSGHWRADNGIFQPGFVANILQ----------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS
        L HKSFP+YDDL  VFGKDRATG+R+ T   VGS  V +  N+ I    S D       D P + S     +P  + G   G AS   +  S S+R + S
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSAS---SMPSRSRRSQSS

Query:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC
           E  +V+R   +   + +  IA W      +    R ++  +LQ IP L  QD   + + L      +  F+  P + K +YC
Subjt:  LIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYC

A0A5A7UME4 Retrotransposon protein8.9e-4236.36Show/hide
Query:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        LVECLV+ V +G WR+DNG F+PG++  + +               I+SR++ +KR ++A+ EM    CSGFGWN E KCI  E E+FD W  SHP+AKG
Subjt:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG
        L +KSF  YD+L  VFGKDRATG RA +  ++GS   P  D E  D +     DF   Y P          E     +  R + SS   R R   ++   
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG

Query:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL
        +  D+VR   +   + +  IA+W I+    A + R+E+   L++IP L++ D   + R L+ +   +  F++ P   KY YC  +L
Subjt:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL

A0A5D3C7T4 Uncharacterized protein3.0e-5043.46Show/hide
Query:  ILVECLVQCVQSGHWRADNGIFQPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVI
        +LVECL+Q V+ G WRADNG F+ G++              +QY AI EM+   CSGFGWN   KCI+ E  +FD WVK HP+A+GL +K FP++ DL +
Subjt:  ILVECLVQCVQSGHWRADNGIFQPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVI

Query:  VFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSASSMPSRSRRSQSSLIGEYSDVVREGFQLLT
        VFG+DRATG R  T VE+ S+   D E +D+  N     E+F IP+P  +  P+ ED P+T       + SS PS+ RRS S   G+  D  R   +  +
Subjt:  VFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGG--RGSASSMPSRSRRSQSSLIGEYSDVVREGFQLLT

Query:  KSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFP
        K I  IA W     ++     + LYAELQ+IPG+ V D L V  SLL DP +L  F+D+P
Subjt:  KSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFP

A0A5D3DPR5 Retrotransposon protein5.8e-4136.01Show/hide
Query:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG
        LVECLV+ V +G WR+DNG F+PG++  + +               I+SR++ +KR ++A+ EM    CSGFGWN E KCI  E E+FD W  SHP+AKG
Subjt:  LVECLVQCVQSGHWRADNGIFQPGFVANILQ---------------IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKG

Query:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG
        L +KSF  YD+L  VFGKDRATG RA +  ++GS   P  D    D +     DF   Y P          E     +  R + SS   R R   ++   
Subjt:  LRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSE--PVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIG

Query:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL
        +  D+VR   +   + +  IA+W I+    A + R+E+   L++IP L++ D   + R L+ +   +  F++ P   KY YC  +L
Subjt:  EYSDVVREGFQLLTKSIDGIAQWSIVNEDLARRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G30140.1 unknown protein2.0e-0930Show/hide
Query:  KTRILVECLVQCVQSGHWRADNGIF---------------QPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHP
        +T +L+E + Q     +WR  +GI                + G   N     SR++ LK  Y + ++ L    SGFGW+ E K      E++  ++K+HP
Subjt:  KTRILVECLVQCVQSGHWRADNGIF---------------QPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHP

Query:  SAKGLRHKSFPFYDDLVIVFGKDRATGSRA
        + K ++ +S   ++DL I+FG   ATGS A
Subjt:  SAKGLRHKSFPFYDDLVIVFGKDRATGSRA

AT2G24960.2 unknown protein2.2e-0831.17Show/hide
Query:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATG
        +++R + L+R YN I  +L    +GF W+A    +  + +I++ ++++HP A+  R K+ P Y +L  +FGK+ + G
Subjt:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATG

AT4G02210.1 unknown protein1.7e-0523.71Show/hide
Query:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENE
        +++R +TL+  + ++  +L     GF W+   + +  +  ++D ++K HP ++  R KS P Y DL +V+  D  +  +A  ++  G    + +E++
Subjt:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENE

AT4G02210.2 unknown protein1.7e-0523.71Show/hide
Query:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENE
        +++R +TL+  + ++  +L     GF W+   + +  +  ++D ++K HP ++  R KS P Y DL +V+  D  +  +A  ++  G    + +E++
Subjt:  IESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENE

AT5G27260.1 unknown protein1.7e-1328.5Show/hide
Query:  KTRILVECLVQCVQSGHWRADNGI---------FQPGF------VANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHP
        +T++LV+ LV+ + + +WR  NG          F P          N     SR++ LK QY + ++ L    SGFGW+   K      E++  ++K+HP
Subjt:  KTRILVECLVQCVQSGHWRADNGI---------FQPGF------VANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHP

Query:  SAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSR-RSQSS
        + K LR+ +F F+D+L I+FG+  ATG  A    +  ++ +     E+       DF+N Y  D    +  +    P    G   +  +P R R RS+ S
Subjt:  SAKGLRHKSFPFYDDLVIVFGKDRATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSR-RSQSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGACCCGAATCCTCGTGGAGTGTTTGGTCCAATGCGTTCAATCCGGGCACTGGCGAGCTGATAACGGGATTTTTCAACCTGGATTCGTAGCGAACATACTGCA
GATAGAGTCAAGGGTTAGGACGTTGAAGAGACAGTACAACGCGATTGTTGAAATGTTGAGCACAGGATGTAGTGGGTTTGGTTGGAATGCGGAGTGCAAATGTATTGACT
GTGAGGCGGAGATATTTGACGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGACATAAGTCATTTCCGTTCTATGACGACTTAGTCATTGTATTCGGCAAAGAT
AGAGCCACAGGAAGTCGTGCAACTACCACTGTAGAGGTCGGATCTGAACCTGTTGTGGATGAGGAGAACGAGGACATCTTGAATAACCAGTCCCCGGACTTTGAGAACTT
CTATATTCCTGATCCACCTTTTGTCAACTCTCCCACATCAGAGGACACTCCAACTACCCTCGGCGGTAGAGGATCTGCAAGTAGCATGCCATCAAGAAGTAGGAGGTCCC
AAAGTTCCTTGATTGGAGAGTACAGCGACGTGGTTCGAGAGGGATTCCAGCTTCTGACGAAGTCCATTGACGGCATTGCACAGTGGTCTATCGTGAACGAAGACCTGGCA
AGGCGTCGTCGTCGAGAACTTTACGCCGAGCTGCAATCAATTCCTGGTCTGTCGGTGCAAGATGACTTGACTGTTACACGCTCATTGCTTGCAGATCCGATGCTGTTAAG
CCACTTCGTGGACTTCCCACCACAGTGGAAGTACGACTATTGCATGCAAGTTCTTGGGCGACAACGGGATCCAGCCCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGACCCGAATCCTCGTGGAGTGTTTGGTCCAATGCGTTCAATCCGGGCACTGGCGAGCTGATAACGGGATTTTTCAACCTGGATTCGTAGCGAACATACTGCA
GATAGAGTCAAGGGTTAGGACGTTGAAGAGACAGTACAACGCGATTGTTGAAATGTTGAGCACAGGATGTAGTGGGTTTGGTTGGAATGCGGAGTGCAAATGTATTGACT
GTGAGGCGGAGATATTTGACGCGTGGGTCAAGAGTCATCCGAGTGCAAAAGGACTGCGACATAAGTCATTTCCGTTCTATGACGACTTAGTCATTGTATTCGGCAAAGAT
AGAGCCACAGGAAGTCGTGCAACTACCACTGTAGAGGTCGGATCTGAACCTGTTGTGGATGAGGAGAACGAGGACATCTTGAATAACCAGTCCCCGGACTTTGAGAACTT
CTATATTCCTGATCCACCTTTTGTCAACTCTCCCACATCAGAGGACACTCCAACTACCCTCGGCGGTAGAGGATCTGCAAGTAGCATGCCATCAAGAAGTAGGAGGTCCC
AAAGTTCCTTGATTGGAGAGTACAGCGACGTGGTTCGAGAGGGATTCCAGCTTCTGACGAAGTCCATTGACGGCATTGCACAGTGGTCTATCGTGAACGAAGACCTGGCA
AGGCGTCGTCGTCGAGAACTTTACGCCGAGCTGCAATCAATTCCTGGTCTGTCGGTGCAAGATGACTTGACTGTTACACGCTCATTGCTTGCAGATCCGATGCTGTTAAG
CCACTTCGTGGACTTCCCACCACAGTGGAAGTACGACTATTGCATGCAAGTTCTTGGGCGACAACGGGATCCAGCCCCATGA
Protein sequenceShow/hide protein sequence
MRKTRILVECLVQCVQSGHWRADNGIFQPGFVANILQIESRVRTLKRQYNAIVEMLSTGCSGFGWNAECKCIDCEAEIFDAWVKSHPSAKGLRHKSFPFYDDLVIVFGKD
RATGSRATTTVEVGSEPVVDEENEDILNNQSPDFENFYIPDPPFVNSPTSEDTPTTLGGRGSASSMPSRSRRSQSSLIGEYSDVVREGFQLLTKSIDGIAQWSIVNEDLA
RRRRRELYAELQSIPGLSVQDDLTVTRSLLADPMLLSHFVDFPPQWKYDYCMQVLGRQRDPAP