; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C10G190710 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C10G190710
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionRetrotransposon protein
Genome locationCla97Chr10:9111535..9114077
RNA-Seq ExpressionCla97C10G190710
SyntenyCla97C10G190710
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]1.2e-3926.52Show/hide
Query:  LTAVCAATIAMVNTITTLLQLEDNRERSSPLI----RHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT
        L ++  A IA    +  +L+L  N  +    I    RH+I++L +F MI+  D  CR++TRMDRR F ILC LLRT  GL  T+ VD+EEMVA+FLH++ 
Subjt:  LTAVCAATIAMVNTITTLLQLEDNRERSSPLI----RHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT

Query:  HD-------------------------------------KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCE
        HD                                     K +PV   CTD RWRW +    A+                   RKGE+ATNVL V +   +
Subjt:  HD-------------------------------------KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCE

Query:  FIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------------------------PDCANKFSFRWKHPDHAQELQNIFGRMR----
        F++V  GWEGSA DSR+LRDA+S+P  LK                                       P  + +F F  KH      ++  FG ++    
Subjt:  FIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------------------------PDCANKFSFRWKHPDHAQELQNIFGRMR----

Query:  --RTKSSWSVWTSVYSL-----------------DISELIMEL-SDLDSGKHTTDDAAEV--------------TGVFHTG-------------------
          R KS   V    +++                 DI + I+ + S     KHT     E                G F  G                   
Subjt:  --RTKSSWSVWTSVYSL-----------------DISELIMEL-SDLDSGKHTTDDAAEV--------------TGVFHTG-------------------

Query:  -------------------------CSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDI
                                 CSGFGWN ++K I  E E+FD W          KSF  YD+L+ +FGKDRATG  A + A++ S           
Subjt:  -------------------------CSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDI

Query:  LNNQSPDYENF---YIPDPPFASSPM----------------------------------------SEDIPTTLVVESLGVACHQ--EWPVVNEDLESRR
          N  P Y+ F    +PD  F  SPM                                        S DI  T  +E      H+  EWP++    ++ +
Subjt:  LNNQSPDYENF---YIPDPPFASSPM----------------------------------------SEDIPTTLVVESLGVACHQ--EWPVVNEDLESRR

Query:  RRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP
         R+E+  +L++IP L++ D   + R L+ +   +  F++ P
Subjt:  RRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP

KAA0036995.1 retrotransposon protein [Cucumis melo var. makuwa]2.9e-3544.39Show/hide
Query:  LIAILTAVCAATIAMVNTITTLLQLEDNRERSSPLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT
        L+ +LT    A   M+ T+  L+       ++    RH+I++L +F MI+E D  CRE+TRMDRR F ILC LLRT  GL  T+ VD+EEMV +FLH++ 
Subjt:  LIAILTAVCAATIAMVNTITTLLQLEDNRERSSPLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT

Query:  HDKL----EPVTENCTDDRWRWLQ-----------LIDHAIGLR------KGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK
        HD+L     PVT NC D RW++ +            ++ + G R      KGEIATNVL VC+   +F++V AGWEGSA DSR+LRDA+S+   L+
Subjt:  HDKL----EPVTENCTDDRWRWLQ-----------LIDHAIGLR------KGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK

KAA0051849.1 retrotransposon protein [Cucumis melo var. makuwa]3.1e-3729.05Show/hide
Query:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD-------------------------------
        P  RH+I++L +F MI+E D  CR++TRMDRRTF ILC LLR   GL  T+ VD+EEMVA+FL ++ HD                               
Subjt:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD-------------------------------

Query:  ------KLEPVTENCTDDRWRWLQLIDHAIGLRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------
              +  PVT NC D RW+  + I   +  RKGEIA NVL VC+   +F++V AGWEGSA DSR+LRDA+S+   L+                     
Subjt:  ------KLEPVTENCTDDRWRWLQLIDHAIGLRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------

Query:  ------------------PDCANKFSFRWKHPDHAQELQNIFGRMR------RTKSSWSVWTSVYSLD----------ISELIMELSDLDSGKHTTDDAA
                          P  A ++ F  KH      ++  FG ++      R KS  +    V   D           SE I  +  ++      D+ A
Subjt:  ------------------PDCANKFSFRWKHPDHAQELQNIFGRMR------RTKSSWSVWTSVYSLD----------ISELIMELSDLDSGKHTTDDAA

Query:  E---VTGVFHTGCSGFGWNA--------KRKGIDCETEIFDVWVKSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILNNQSPDYENFYIP
        E        H G    G+ A        K  G      +  +  K FP+YD+L  +F +DRAT   A T A+V S       ++  + + + D+   Y  
Subjt:  E---VTGVFHTGCSGFGWNA--------KRKGIDCETEIFDVWVKSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILNNQSPDYENFYIP

Query:  DPPFASSPMSEDIPTTLVVESLGVACHQEWPVVNEDLESRRRRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP
            +   + E I   L   +  +    EWP  N        R E +  L+ +P L+  D   + R LL+    L  FV  P
Subjt:  DPPFASSPMSEDIPTTLVVESLGVACHQEWPVVNEDLESRRRRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP

KAA0052026.1 retrotransposon protein [Cucumis melo var. makuwa]2.7e-3640.22Show/hide
Query:  MDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD------KLEPVTENCTDDRWRWLQLI--DHA-IGLRKGEIATNVLAVCNQSCEFIFVFA
        MDRR F ILC LLRT  GL  T+ VD+EEMVA+FLH++THD      + E +     D  +  + +I  D A     KGE+ATNVL VC+   +FI+V A
Subjt:  MDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD------KLEPVTENCTDDRWRWLQLI--DHA-IGLRKGEIATNVLAVCNQSCEFIFVFA

Query:  GWEGSAVDSRLLRDAVSKPTELK---PDCANKFSFRWKHPDHA------QELQNIFGRMRRTKSSWSVWTS----VYSLDISELIMELSDLDSG----KH
        GWEGSA DSR+LRDA+S+P  LK      +      W   + A       EL N+ G     ++ W  + +    + +L I    +  S +DS     K 
Subjt:  GWEGSAVDSRLLRDAVSKPTELK---PDCANKFSFRWKHPDHA------QELQNIFGRMRRTKSSWSVWTS----VYSLDISELIMELSDLDSG----KH

Query:  TTDDAAEVTGVFHTGCSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHA
             AE+ G     CSGFGWN ++K I  E E FD W          KSFP YD+L+ +FGKDRA G  A
Subjt:  TTDDAAEVTGVFHTGCSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHA

KAG6469112.1 hypothetical protein ZIOFF_073810 [Zingiber officinale]3.8e-3529.49Show/hide
Query:  LIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD--------------------------------
        L R+ I+ +N   M +  D  C +N RMDRR+ + LC LL T G L+  + + + E+V  FLH++ H+                                
Subjt:  LIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD--------------------------------

Query:  -----KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKPDCAN
             K EP+ ENCTDDRW+W +    A+                   RKGEIATNVL VC  + +F +V  GWEGSA D R+LRDA+S+   LK     
Subjt:  -----KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKPDCAN

Query:  KFSFRWKHPDHAQELQNIFGRM-----RRTKSSWSVWTSVYSLDISELIMEL------------------------------SDLDSGKHTTDDAAEVTG
            + K    +  ++N+   M     R+T+S+  +WT      + + ++EL                              S L +  H       +  
Subjt:  KFSFRWKHPDHAQELQNIFGRM-----RRTKSSWSVWTSVYSLDISELIMEL------------------------------SDLDSGKHTTDDAAEVTG

Query:  VFH------TGCSGFGWNAKRKGIDCETEIFDVWVKS-----------FPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILN
         FH         SGFGWN   K I    ++FD WVKS           FP  DDL  ++GKD ATG++A T A+   E  + +E  D  N
Subjt:  VFH------TGCSGFGWNAKRKGIDCETEIFDVWVKS-----------FPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILN

TrEMBL top hitse value%identityAlignment
A0A5A7SWD8 Retrotransposon protein5.6e-4026.52Show/hide
Query:  LTAVCAATIAMVNTITTLLQLEDNRERSSPLI----RHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT
        L ++  A IA    +  +L+L  N  +    I    RH+I++L +F MI+  D  CR++TRMDRR F ILC LLRT  GL  T+ VD+EEMVA+FLH++ 
Subjt:  LTAVCAATIAMVNTITTLLQLEDNRERSSPLI----RHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT

Query:  HD-------------------------------------KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCE
        HD                                     K +PV   CTD RWRW +    A+                   RKGE+ATNVL V +   +
Subjt:  HD-------------------------------------KLEPVTENCTDDRWRWLQLIDHAIG-----------------LRKGEIATNVLAVCNQSCE

Query:  FIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------------------------PDCANKFSFRWKHPDHAQELQNIFGRMR----
        F++V  GWEGSA DSR+LRDA+S+P  LK                                       P  + +F F  KH      ++  FG ++    
Subjt:  FIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------------------------PDCANKFSFRWKHPDHAQELQNIFGRMR----

Query:  --RTKSSWSVWTSVYSL-----------------DISELIMEL-SDLDSGKHTTDDAAEV--------------TGVFHTG-------------------
          R KS   V    +++                 DI + I+ + S     KHT     E                G F  G                   
Subjt:  --RTKSSWSVWTSVYSL-----------------DISELIMEL-SDLDSGKHTTDDAAEV--------------TGVFHTG-------------------

Query:  -------------------------CSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDI
                                 CSGFGWN ++K I  E E+FD W          KSF  YD+L+ +FGKDRATG  A + A++ S           
Subjt:  -------------------------CSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDI

Query:  LNNQSPDYENF---YIPDPPFASSPM----------------------------------------SEDIPTTLVVESLGVACHQ--EWPVVNEDLESRR
          N  P Y+ F    +PD  F  SPM                                        S DI  T  +E      H+  EWP++    ++ +
Subjt:  LNNQSPDYENF---YIPDPPFASSPM----------------------------------------SEDIPTTLVVESLGVACHQ--EWPVVNEDLESRR

Query:  RRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP
         R+E+  +L++IP L++ D   + R L+ +   +  F++ P
Subjt:  RRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP

A0A5A7T686 Retrotransposon protein1.4e-3544.39Show/hide
Query:  LIAILTAVCAATIAMVNTITTLLQLEDNRERSSPLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT
        L+ +LT    A   M+ T+  L+       ++    RH+I++L +F MI+E D  CRE+TRMDRR F ILC LLRT  GL  T+ VD+EEMV +FLH++ 
Subjt:  LIAILTAVCAATIAMVNTITTLLQLEDNRERSSPLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT

Query:  HDKL----EPVTENCTDDRWRWLQ-----------LIDHAIGLR------KGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK
        HD+L     PVT NC D RW++ +            ++ + G R      KGEIATNVL VC+   +F++V AGWEGSA DSR+LRDA+S+   L+
Subjt:  HDKL----EPVTENCTDDRWRWLQ-----------LIDHAIGLR------KGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK

A0A5A7UCU1 Retrotransposon protein1.3e-3640.22Show/hide
Query:  MDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD------KLEPVTENCTDDRWRWLQLI--DHA-IGLRKGEIATNVLAVCNQSCEFIFVFA
        MDRR F ILC LLRT  GL  T+ VD+EEMVA+FLH++THD      + E +     D  +  + +I  D A     KGE+ATNVL VC+   +FI+V A
Subjt:  MDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD------KLEPVTENCTDDRWRWLQLI--DHA-IGLRKGEIATNVLAVCNQSCEFIFVFA

Query:  GWEGSAVDSRLLRDAVSKPTELK---PDCANKFSFRWKHPDHA------QELQNIFGRMRRTKSSWSVWTS----VYSLDISELIMELSDLDSG----KH
        GWEGSA DSR+LRDA+S+P  LK      +      W   + A       EL N+ G     ++ W  + +    + +L I    +  S +DS     K 
Subjt:  GWEGSAVDSRLLRDAVSKPTELK---PDCANKFSFRWKHPDHA------QELQNIFGRMRRTKSSWSVWTS----VYSLDISELIMELSDLDSG----KH

Query:  TTDDAAEVTGVFHTGCSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHA
             AE+ G     CSGFGWN ++K I  E E FD W          KSFP YD+L+ +FGKDRA G  A
Subjt:  TTDDAAEVTGVFHTGCSGFGWNAKRKGIDCETEIFDVWV---------KSFPFYDDLAIMFGKDRATGSHA

A0A5A7UEC4 Retrotransposon protein1.5e-3729.05Show/hide
Query:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD-------------------------------
        P  RH+I++L +F MI+E D  CR++TRMDRRTF ILC LLR   GL  T+ VD+EEMVA+FL ++ HD                               
Subjt:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHD-------------------------------

Query:  ------KLEPVTENCTDDRWRWLQLIDHAIGLRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------
              +  PVT NC D RW+  + I   +  RKGEIA NVL VC+   +F++V AGWEGSA DSR+LRDA+S+   L+                     
Subjt:  ------KLEPVTENCTDDRWRWLQLIDHAIGLRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK---------------------

Query:  ------------------PDCANKFSFRWKHPDHAQELQNIFGRMR------RTKSSWSVWTSVYSLD----------ISELIMELSDLDSGKHTTDDAA
                          P  A ++ F  KH      ++  FG ++      R KS  +    V   D           SE I  +  ++      D+ A
Subjt:  ------------------PDCANKFSFRWKHPDHAQELQNIFGRMR------RTKSSWSVWTSVYSLD----------ISELIMELSDLDSGKHTTDDAA

Query:  E---VTGVFHTGCSGFGWNA--------KRKGIDCETEIFDVWVKSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILNNQSPDYENFYIP
        E        H G    G+ A        K  G      +  +  K FP+YD+L  +F +DRAT   A T A+V S       ++  + + + D+   Y  
Subjt:  E---VTGVFHTGCSGFGWNA--------KRKGIDCETEIFDVWVKSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILNNQSPDYENFYIP

Query:  DPPFASSPMSEDIPTTLVVESLGVACHQEWPVVNEDLESRRRRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP
            +   + E I   L   +  +    EWP  N        R E +  L+ +P L+  D   + R LL+    L  FV  P
Subjt:  DPPFASSPMSEDIPTTLVVESLGVACHQEWPVVNEDLESRRRRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFP

A0A5A7VPL4 Retrotransposon protein2.4e-3549.08Show/hide
Query:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHDKL----EPVTENCTDDRWRWLQLIDHAIG--
        P  RH+I++L +F MI+E D  CR++TRMDRRTF ILC LLR   GL  T+ VD+EEMVA+FLH++ HD+L     PVT NC D RW+  +    A+   
Subjt:  PLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTHDKL----EPVTENCTDDRWRWLQLIDHAIG--

Query:  ---------------LRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK
                        RKGE+ATNVL VC+   +F++V AGWEGSA DSR+LRDA+S+   L+
Subjt:  ---------------LRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G28950.1 unknown protein1.5e-0850Show/hide
Query:  RKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKP
        RKG+I+ N+LA CN   EF++V +GWEGSA DS++L DA+++ +   P
Subjt:  RKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKP

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.9e-0825.38Show/hide
Query:  FHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTH-------------------------------------------D
        + ++   +  C EN RMD+  F  LC LL+T G LR T  + +E  +AIFL ++ H                                           D
Subjt:  FHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVTH-------------------------------------------D

Query:  KLE---PVTENCTD--DRWRWLQL--IDHAIGLRKGE--IATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKPDCANKFSFRWKHPD
         LE   P  ++C    D +    +  +D     R G   +  NVLA  +    F +V AGWEGSA D ++L  A+++  +L+      +    K+P+
Subjt:  KLE---PVTENCTD--DRWRWLQL--IDHAIGLRKGE--IATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKPDCANKFSFRWKHPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACATGAACATTCAACCTGACGAGTTAATCGCAATACTGACTGCTGTATGTGCGGCCACCATTGCAATGGTGAATACTATCACCACTTTGCTACAATTGGAGGA
CAATCGGGAGCGATCGTCTCCACTTATTAGACATCAAATTCAAAAATTGAACTTCTTTCATATGATTTACGAAGATGATCATACGTGTCGTGAGAACACTCGTATGGATA
GGAGAACGTTCACGATTCTGTGTCAGTTACTTAGGACGACGGGTGGGCTAAGACCAACAAAGTATGTAGATATGGAGGAAATGGTGGCCATCTTCCTGCACCTAGTCACA
CACGATAAACTAGAGCCAGTCACTGAGAATTGTACAGATGACAGATGGCGCTGGCTTCAGCTAATCGATCATGCTATAGGACTTAGAAAGGGCGAGATCGCAACGAATGT
TCTTGCTGTATGCAATCAAAGTTGTGAGTTCATATTCGTTTTCGCTGGATGGGAAGGATCAGCTGTTGACTCGAGGCTTCTGCGAGATGCAGTGTCCAAGCCAACTGAAT
TGAAACCTGATTGTGCTAACAAGTTTAGTTTTAGATGGAAGCATCCGGATCACGCGCAAGAGCTGCAAAACATATTTGGACGGATGAGGAGGACAAAATCCTCGTGGAGT
GTTTGGACCAGTGTGTACAGTCTGGACATTAGCGAGCTGATAATGGAACTTTCCGACCTAGATTCTGGCAAACATACTACGGATGATGCAGCAGAGGTTACCGGGGTGTT
CCATACAGGATGTAGTGGGTTTGGTTGGAATGCGAAGCGCAAGGGTATTGACTGTGAGACAGAGATATTTGATGTGTGGGTCAAATCATTTCCGTTCTATGACGACTTGG
CCATTATGTTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGATTCTGAACCTGTTGTGGAAGAGGAGAACAAGGACATCTTGAATAACCAG
TCCCCAGACTATGAGAATTTCTATATTCCTGATCCACCGTTTGCCAGCTCGCCCATGTCAGAGGACATTCCAACTACCCTAGTGGTAGAGAGTCTGGGAGTAGCATGCCA
TCAAGAATGGCCTGTCGTGAACGAGGACTTGGAAAGCCGTCGCCGTCGTCGAGAACTGTACGCCGAGCTGCAATCCATTCCTGGTCTGTCGGTGCAGGATGGCTTGACTG
TTGCACGGTCATTGCTTGCAGATCCGATGCTGTTAAGCCACTTTGTGGACTTCCCACCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATGACATGAACATTCAACCTGACGAGTTAATCGCAATACTGACTGCTGTATGTGCGGCCACCATTGCAATGGTGAATACTATCACCACTTTGCTACAATTGGAGGA
CAATCGGGAGCGATCGTCTCCACTTATTAGACATCAAATTCAAAAATTGAACTTCTTTCATATGATTTACGAAGATGATCATACGTGTCGTGAGAACACTCGTATGGATA
GGAGAACGTTCACGATTCTGTGTCAGTTACTTAGGACGACGGGTGGGCTAAGACCAACAAAGTATGTAGATATGGAGGAAATGGTGGCCATCTTCCTGCACCTAGTCACA
CACGATAAACTAGAGCCAGTCACTGAGAATTGTACAGATGACAGATGGCGCTGGCTTCAGCTAATCGATCATGCTATAGGACTTAGAAAGGGCGAGATCGCAACGAATGT
TCTTGCTGTATGCAATCAAAGTTGTGAGTTCATATTCGTTTTCGCTGGATGGGAAGGATCAGCTGTTGACTCGAGGCTTCTGCGAGATGCAGTGTCCAAGCCAACTGAAT
TGAAACCTGATTGTGCTAACAAGTTTAGTTTTAGATGGAAGCATCCGGATCACGCGCAAGAGCTGCAAAACATATTTGGACGGATGAGGAGGACAAAATCCTCGTGGAGT
GTTTGGACCAGTGTGTACAGTCTGGACATTAGCGAGCTGATAATGGAACTTTCCGACCTAGATTCTGGCAAACATACTACGGATGATGCAGCAGAGGTTACCGGGGTGTT
CCATACAGGATGTAGTGGGTTTGGTTGGAATGCGAAGCGCAAGGGTATTGACTGTGAGACAGAGATATTTGATGTGTGGGTCAAATCATTTCCGTTCTATGACGACTTGG
CCATTATGTTCGGCAAAGACAGAGCCACAGGGAGTCATGCAACCACCACTGCAGAGGTCGATTCTGAACCTGTTGTGGAAGAGGAGAACAAGGACATCTTGAATAACCAG
TCCCCAGACTATGAGAATTTCTATATTCCTGATCCACCGTTTGCCAGCTCGCCCATGTCAGAGGACATTCCAACTACCCTAGTGGTAGAGAGTCTGGGAGTAGCATGCCA
TCAAGAATGGCCTGTCGTGAACGAGGACTTGGAAAGCCGTCGCCGTCGTCGAGAACTGTACGCCGAGCTGCAATCCATTCCTGGTCTGTCGGTGCAGGATGGCTTGACTG
TTGCACGGTCATTGCTTGCAGATCCGATGCTGTTAAGCCACTTTGTGGACTTCCCACCGTAG
Protein sequenceShow/hide protein sequence
MDDMNIQPDELIAILTAVCAATIAMVNTITTLLQLEDNRERSSPLIRHQIQKLNFFHMIYEDDHTCRENTRMDRRTFTILCQLLRTTGGLRPTKYVDMEEMVAIFLHLVT
HDKLEPVTENCTDDRWRWLQLIDHAIGLRKGEIATNVLAVCNQSCEFIFVFAGWEGSAVDSRLLRDAVSKPTELKPDCANKFSFRWKHPDHAQELQNIFGRMRRTKSSWS
VWTSVYSLDISELIMELSDLDSGKHTTDDAAEVTGVFHTGCSGFGWNAKRKGIDCETEIFDVWVKSFPFYDDLAIMFGKDRATGSHATTTAEVDSEPVVEEENKDILNNQ
SPDYENFYIPDPPFASSPMSEDIPTTLVVESLGVACHQEWPVVNEDLESRRRRRELYAELQSIPGLSVQDGLTVARSLLADPMLLSHFVDFPP