CuGenDBv2

Clc08G04570 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G04570
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: MuDRA-like transposase
Genome location: ClcChr08:13832017..13835477
RNA-Seq Expression: Clc08G04570
Synteny: Clc08G04570
Gene Ontology terms:
  GO:0006412 - translation (biological process)
  GO:0005737 - cytoplasm (cellular component)
  GO:0005840 - ribosome (cellular component)
  GO:0003735 - structural constituent of ribosome (molecular function)
  GO:0008097 - 5S rRNA binding (molecular function)
InterPro domains:
  IPR005484 - Ribosomal protein L18
  IPR018289 - MULE transposase domain
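
The genome location string follows a chromosome:start..end pattern. As a minimal sketch, the Python below parses that string and computes the length of the span. It assumes the coordinates are 1-based and inclusive, which this record does not state, and the function name parse_location is hypothetical rather than part of any database API.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("ClcChr08:13832017..13835477")

# Assumption: 1-based inclusive coordinates, so the length is end - start + 1.
span_length = end - start + 1
print(chrom, start, end, span_length)
```

Under that assumption the gene spans 3461 bp on ClcChr08. With a 0-based, half-open convention the length would instead be end - start.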


Homology
GenBank top hits (e value, % identity, alignment)
XP_038875106.1 uncharacterized protein LOC120067636 [Benincasa hispida] (e value: 3.9e-65; % identity: 61.06)
Query:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF
        KESY  LHAY EALK KN  TVFEI  EES+Y KYMF+ L  SLR F+S +P+IIVDGTHLK  YKG +++GVA+D NNQLYPL YAIVD+ENDR+  WF
Subjt:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF

Query:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW
          +LK VI EC NL+FVS+R Q+I N V+ VFP  +H +CTY+LKRNVEKYFKDES++KF +   ++      K    QIVNYN+G LA+YL+DA + RW
Subjt:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW

Query:  ARCCQYGN
        A C Q GN
Subjt:  ARCCQYGN

XP_038885908.1 uncharacterized protein LOC120076214 [Benincasa hispida] (e value: 1.1e-62; % identity: 58.05)
Query:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF
        +ESY VLHAYGE LK +NL T FEIELE   ++KYMFM LGP +R F S RPVIIVDG+HLK  YKG +++GV+MDGNNQ+Y L YAIVD+E DR+ KWF
Subjt:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF

Query:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW
        M++LK  I E PNLVFV +R  SI N + AVFPT FHGLCTYHL+ N+   FKD ++   F DA RA + SEF+++  Q+    NG +++YL+D G+ RW
Subjt:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW

Query:  ARCCQ
        A   Q
Subjt:  ARCCQ

XP_038892626.1 uncharacterized protein LOC120081651 [Benincasa hispida] (e value: 6.0e-82; % identity: 72.12)
Query:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF
        +ESY VLHAY EALK +N  TVFEIELEES+Y+KY+FM+LG SLRGF+S RPVII+DGTHLK  YKG II+GVA+DGNNQLYPL YAIVDSENDR+ KWF
Subjt:  KESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWF

Query:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW
        MT+LK  I ECPNLVF+S+R QSI NV+D VF  ++H LCT+HLKRNVE YFKD+ VRK F+DA RAY+ESEFK +  QIVNY NGSLA YLEDA +  W
Subjt:  MTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRW

Query:  ARCCQYGN
        ARC Q+GN
Subjt:  ARCCQYGN

XP_038895847.1 uncharacterized protein LOC120084017 [Benincasa hispida] (e value: 2.6e-61; % identity: 56.19)
Query:  RSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRS
        R  P+ESY +LHAYGE LK +N  T FEIEL+   ++KYMFM LGP +RGF S RPVIIV G+HLK  +KG +++GV+MDGNNQ+YPL YAIVD+E DR+
Subjt:  RSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRS

Query:  CKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAG
         KWFM++LK  I   PNL+FVS+R  SI N + AVF T FHGLCTYHLK N+   FKD ++   F DA RA++ SEF+ +  Q+    N  +++YLED G
Subjt:  CKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAG

Query:  VHRWARCCQY
        + RWA  C Y
Subjt:  VHRWARCCQY

XP_038896229.1 uncharacterized protein LOC120084506 [Benincasa hispida] (e value: 9.7e-64; % identity: 51.33)
Query:  GDVIMQPHSSSGLEDGIKVGQIFICTNDVKIRLAMLAIKNNFELWVKKSNKRSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLR
        GD+IMQP SS GL D I+  ++     D+                          +  YGV  +Y +A   + +A                        R
Subjt:  GDVIMQPHSSSGLEDGIKVGQIFICTNDVKIRLAMLAIKNNFELWVKKSNKRSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLR

Query:  GFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLK
        G Q      I++GTHLK  YKG +I+G+ MDGNNQLYPL Y IVDSENDR+ KWFMT LK  I ECPNLVFVSN  QSI N++D VFP ++HGLCT+HLK
Subjt:  GFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLK

Query:  RNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWARCCQYGN
        RNVEKYFKDESVRK F+DACRAY+E EFK +  QIVNYNNGSLA+YLEDA + RWARC Q GN
Subjt:  RNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWARCCQYGN
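
The % identity values reported for each hit can be related to the alignment blocks themselves. The sketch below assumes the usual BLAST-style midline convention, where a letter marks an identical residue, a plus sign marks a conservative substitution, and a space marks a mismatch or gap. It also assumes the alignment length equals the length of the query row, gaps included. The function name percent_identity is hypothetical, and the example covers only one short block, whereas the reported figures are computed over the full alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline shows an identical residue."""
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often lost in copied text,
    # so pad it back to the length of the query row.
    padded = midline.ljust(len(query))[: len(query)]
    # Letters are identities; '+' and ' ' are not counted.
    identical = sum(1 for symbol in padded if symbol.isalpha())
    return 100.0 * identical / len(query)


# Last block of the first GenBank hit (XP_038875106.1).
print(percent_identity("ARCCQYGN", "A C Q GN"))  # 62.5
```

To recompute the identity of a whole hit, concatenate the query rows and the midline rows of all its blocks before calling the function.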

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KE39 Uncharacterized protein (e value: 1.8e-60; % identity: 97.62)
Query:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM
        MSINQNHLLRLVLSCRKITAQVTNPATSSI+AMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRP+YYRKM
Subjt:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM

Query:  VLPLFDSVQRSGVAVDGAEKLGTGTI
        VLPLFDSVQRSGVAVDGAEKLGTG+I
Subjt:  VLPLFDSVQRSGVAVDGAEKLGTGTI

A0A1S4DVU4 uncharacterized protein LOC103488672 (e value: 2.9e-58; % identity: 94.44)
Query:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM
        MSINQNHLLRLVLSCRKITAQVTNPATSSI+AMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVG KLGHRLKEIGVSDVRIDLAEELSRP+YYRKM
Subjt:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM

Query:  VLPLFDSVQRSGVAVDGAEKLGTGTI
        VLPLFDSVQRSG+AVDG EKLG G+I
Subjt:  VLPLFDSVQRSGVAVDGAEKLGTGTI

A0A5D3BWW4 Uncharacterized protein (e value: 2.9e-58; % identity: 94.44)
Query:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM
        MSINQNHLLRLVLSCRKITAQVTNPATSSI+AMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVG KLGHRLKEIGVSDVRIDLAEELSRP+YYRKM
Subjt:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM

Query:  VLPLFDSVQRSGVAVDGAEKLGTGTI
        VLPLFDSVQRSG+AVDG EKLG G+I
Subjt:  VLPLFDSVQRSGVAVDGAEKLGTGTI

A0A6J1DE35 protein FAR-RED ELONGATED HYPOCOTYL 3-like (e value: 2.4e-60; % identity: 54.72)
Query:  RSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSY-RPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDR
        R  P++SY  LHAYGEA+K +N  T+F +E E S  +KYMFM LG S++GFQS  R VI+VDGTHLK  +KG +++GVAMDGNNQ+YPL +AIVD+E+D 
Subjt:  RSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSY-RPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDR

Query:  SCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDA
        S KWFM +LK +I +   LVF+S+R  SITN    +FP AFH +CTYHL  N++  FK+ +  K + DA  AY++S F  Y +QI++  +GSLAKYL++ 
Subjt:  SCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDA

Query:  GVHRWARCCQYG
        GV RWARC Q G
Subjt:  GVHRWARCCQYG

A0A6J1JZ79 uncharacterized protein LOC111488745 (e value: 6.1e-56; % identity: 91.27)
Query:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM
        MSINQNHLLRLVLSCRKITAQVTNP TS+I+AMASSSEQEFVAYY SKLHRFPRSNNFWDSK+ASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYR  
Subjt:  MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKM

Query:  VLPLFDSVQRSGVAVDGAEKLGTGTI
        VLPLFDSV+RSGVAV+GAEKLG+G I
Subjt:  VLPLFDSVQRSGVAVDGAEKLGTGTI

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value: 1.2e-16; % identity: 28.73)
Query:  ESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVV
        E   ++ +F     S++GFQ  RP+I+VD  +L   YK  ++I  A D  NQ +PL +A+    +  S +WF+T ++  + +   +  +S+    I  V+
Subjt:  ESQYYKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVV

Query:  DA-----VFPTAFHGLCTYHLKR---NVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA
        +        P A+H  C YHL     +V   F D ++    ++A  + Q+ EF SY+ +I    N    K+L+    H+WA
Subjt:  DA-----VFPTAFHGLCTYHLKR---NVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA

AT1G64255.1 MuDR family transposase (e value: 3.1e-12; % identity: 25.73)
Query:  MFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDA-----
        +F     S+ GFQ  RP+I+VD  +L   Y+  ++I   +D  N+ +PL +A+    +    +WF+T ++  + +   L  +S+    I  VV+      
Subjt:  MFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDA-----

Query:  VFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA
          P A+H     H      + F    +      A    Q+ EF SY++ I    N    K+L+    +RWA
Subjt:  VFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA

AT1G64260.1 MuDR family transposase (e value: 1.2e-16; % identity: 27.59)
Query:  YKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDA--
        ++ +F     S+ GFQ  RP+I+VD   L   Y+  ++I   +D  N+ +PL +A+    +  S +WF T ++  + +  +L  +S+  + I  VV+   
Subjt:  YKYMFMILGPSLRGFQSYRPVIIVDGTHLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDA--

Query:  ---VFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA
             P A H  C  HL+      F+D ++      A    Q+ EF SY++ I    N    K+L+    H+WA
Subjt:  ---VFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAYQESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWA

AT2G43310.1 Ribosomal L18p/L5e family protein (e value: 1.2e-40; % identity: 68.64)
Query:  NQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKMVLP
        +++HLL+LVLSCRKITAQVT P +S+I+AMASSSEQEF+A  R+ L+RFP SN+FWDSK ASRVGEKLG RL+E+GV  V ID  EE+SRP+++RK VLP
Subjt:  NQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKMVLP

Query:  LFDSVQRSGVAVDGAEKL
        LFDSV+R+G+ VDG E+L
Subjt:  LFDSVQRSGVAVDGAEKL


Sequences
CDS sequence
ATGTCCATAAACCAGAACCACCTCCTACGCCTGGTTCTTTCCTGCCGGAAAATCACCGCCCAGGTTACCAATCCAGCCACCTCCTCCATCGTTGCCATGGCATCCTCCTC
CGAGCAGGAATTCGTCGCCTATTACCGCTCCAAACTCCACCGCTTTCCTCGATCCAACAACTTCTGGGACTCCAAAGTCGCCTCTCGCGTCGGCGAAAAGCTTGGCCATC
GTTTGAAGGAGATCGGCGTGTCCGATGTTCGGATTGATCTTGCTGAAGAGCTCTCCCGCCCTGTTTACTATCGGAAGATGGTTCTGCCGCTCTTCGATTCTGTTCAGCGC
TCCGGCGTCGCCGTCGATGGCGCTGAGAAGCTCGGAACAGGAACCATCCATTGCTCGATGCCCAATGACTCGATTACTCGATACACGCTAACTCGCTCCTCGAAAGTCAG
AAACTCGATGGATAAGAAGCATTTGTTCATTCGTTTTGGTGGTACATGGGACAACACAAAAGAACGATATACTGGTGGTCATTTGAAGAGTATCCTAGTTTCTTTAAGCA
CATTGACATTTTTAGATTTGGTGAATCTAGCATATGAACTCACCCAATATTCTTTAATGGACGATCCTGATGTCCAATTTCTCCTCTTGGAGTACGATCATAGTAGGCCT
CAGATATTTATAAGTCTTGAGGAAAGGCATCAAGAGGTTGTGAATGGTATCAGATTTTCTACTTGTGTTAATGAGATAGAAGACGAACCCATGTGTGGTGCAGATGTCAA
TGAGCATCATATTTTCTCATTGGTTGTCTCACTAGAAAACGACAACATCCATGACAATATAACGAACGCAATTCTGGTTGTTTTAGGTCGTGGTCCTGGTGTAAATGAAG
AACAATCAACTACAATTCTGTTAAATGATTTTGGGCATTTTGTTGGGAATATGCCTCAAATACATGAAGCCAACATACCATCTAGTACTAATGATAGAACGGGGGACGTA
ATCATGCAACCACATTCTTCATCAGGGTTAGAGGATGGTATTAAAGTAGGCCAGATTTTCATCTGCACGAATGATGTGAAGATAAGATTAGCAATGTTGGCTATTAAGAA
CAATTTTGAGCTGTGGGTTAAGAAATCAAACAAAAGGAGTATACCAAAGGAATCGTATGGAGTACTACATGCTTATGGAGAGGCTTTAAAGACGAAAAATCTTGCAACTG
TATTTGAGATTGAGCTTGAAGAATCACAATACTATAAATACATGTTTATGATATTAGGTCCAAGTTTAAGAGGTTTTCAGAGTTATCGACCTGTAATAATCGTAGATGGT
ACTCATCTAAAGAGTAACTACAAGGGGGCGATAATCATTGGTGTTGCCATGGATGGCAATAACCAACTCTACCCTTTGGTATATGCAATTGTCGATAGTGAAAATGATCG
ATCGTGTAAGTGGTTCATGACTCACCTAAAAGTAGTGATTGCAGAATGCCCCAACCTCGTGTTTGTCTCAAATCGTAGACAAAGTATAACCAATGTCGTTGATGCCGTAT
TCCCCACCGCCTTCCACGGACTTTGTACTTACCATTTGAAAAGGAACGTGGAGAAATACTTCAAAGATGAGTCCGTGAGAAAATTTTTTAACGATGCTTGTAGAGCTTAT
CAGGAATCAGAATTCAAGAGCTATTTGAGCCAAATAGTGAACTACAACAACGGTTCCCTGGCGAAATATTTGGAGGATGCTGGTGTTCATCGATGGGCACGATGTTGCCA
ATATGGCAACCCTGTAGTTCCTACTACATACCCCAAAACAAGCAAGGCAATTGAGCATAGCTCATCATAG
mRNA sequence
ATGTCCATAAACCAGAACCACCTCCTACGCCTGGTTCTTTCCTGCCGGAAAATCACCGCCCAGGTTACCAATCCAGCCACCTCCTCCATCGTTGCCATGGCATCCTCCTC
CGAGCAGGAATTCGTCGCCTATTACCGCTCCAAACTCCACCGCTTTCCTCGATCCAACAACTTCTGGGACTCCAAAGTCGCCTCTCGCGTCGGCGAAAAGCTTGGCCATC
GTTTGAAGGAGATCGGCGTGTCCGATGTTCGGATTGATCTTGCTGAAGAGCTCTCCCGCCCTGTTTACTATCGGAAGATGGTTCTGCCGCTCTTCGATTCTGTTCAGCGC
TCCGGCGTCGCCGTCGATGGCGCTGAGAAGCTCGGAACAGGAACCATCCATTGCTCGATGCCCAATGACTCGATTACTCGATACACGCTAACTCGCTCCTCGAAAGTCAG
AAACTCGATGGATAAGAAGCATTTGTTCATTCGTTTTGGTGGTACATGGGACAACACAAAAGAACGATATACTGGTGGTCATTTGAAGAGTATCCTAGTTTCTTTAAGCA
CATTGACATTTTTAGATTTGGTGAATCTAGCATATGAACTCACCCAATATTCTTTAATGGACGATCCTGATGTCCAATTTCTCCTCTTGGAGTACGATCATAGTAGGCCT
CAGATATTTATAAGTCTTGAGGAAAGGCATCAAGAGGTTGTGAATGGTATCAGATTTTCTACTTGTGTTAATGAGATAGAAGACGAACCCATGTGTGGTGCAGATGTCAA
TGAGCATCATATTTTCTCATTGGTTGTCTCACTAGAAAACGACAACATCCATGACAATATAACGAACGCAATTCTGGTTGTTTTAGGTCGTGGTCCTGGTGTAAATGAAG
AACAATCAACTACAATTCTGTTAAATGATTTTGGGCATTTTGTTGGGAATATGCCTCAAATACATGAAGCCAACATACCATCTAGTACTAATGATAGAACGGGGGACGTA
ATCATGCAACCACATTCTTCATCAGGGTTAGAGGATGGTATTAAAGTAGGCCAGATTTTCATCTGCACGAATGATGTGAAGATAAGATTAGCAATGTTGGCTATTAAGAA
CAATTTTGAGCTGTGGGTTAAGAAATCAAACAAAAGGAGTATACCAAAGGAATCGTATGGAGTACTACATGCTTATGGAGAGGCTTTAAAGACGAAAAATCTTGCAACTG
TATTTGAGATTGAGCTTGAAGAATCACAATACTATAAATACATGTTTATGATATTAGGTCCAAGTTTAAGAGGTTTTCAGAGTTATCGACCTGTAATAATCGTAGATGGT
ACTCATCTAAAGAGTAACTACAAGGGGGCGATAATCATTGGTGTTGCCATGGATGGCAATAACCAACTCTACCCTTTGGTATATGCAATTGTCGATAGTGAAAATGATCG
ATCGTGTAAGTGGTTCATGACTCACCTAAAAGTAGTGATTGCAGAATGCCCCAACCTCGTGTTTGTCTCAAATCGTAGACAAAGTATAACCAATGTCGTTGATGCCGTAT
TCCCCACCGCCTTCCACGGACTTTGTACTTACCATTTGAAAAGGAACGTGGAGAAATACTTCAAAGATGAGTCCGTGAGAAAATTTTTTAACGATGCTTGTAGAGCTTAT
CAGGAATCAGAATTCAAGAGCTATTTGAGCCAAATAGTGAACTACAACAACGGTTCCCTGGCGAAATATTTGGAGGATGCTGGTGTTCATCGATGGGCACGATGTTGCCA
ATATGGCAACCCTGTAGTTCCTACTACATACCCCAAAACAAGCAAGGCAATTGAGCATAGCTCATCATAG
Protein sequence
MSINQNHLLRLVLSCRKITAQVTNPATSSIVAMASSSEQEFVAYYRSKLHRFPRSNNFWDSKVASRVGEKLGHRLKEIGVSDVRIDLAEELSRPVYYRKMVLPLFDSVQR
SGVAVDGAEKLGTGTIHCSMPNDSITRYTLTRSSKVRNSMDKKHLFIRFGGTWDNTKERYTGGHLKSILVSLSTLTFLDLVNLAYELTQYSLMDDPDVQFLLLEYDHSRP
QIFISLEERHQEVVNGIRFSTCVNEIEDEPMCGADVNEHHIFSLVVSLENDNIHDNITNAILVVLGRGPGVNEEQSTTILLNDFGHFVGNMPQIHEANIPSSTNDRTGDV
IMQPHSSSGLEDGIKVGQIFICTNDVKIRLAMLAIKNNFELWVKKSNKRSIPKESYGVLHAYGEALKTKNLATVFEIELEESQYYKYMFMILGPSLRGFQSYRPVIIVDG
THLKSNYKGAIIIGVAMDGNNQLYPLVYAIVDSENDRSCKWFMTHLKVVIAECPNLVFVSNRRQSITNVVDAVFPTAFHGLCTYHLKRNVEKYFKDESVRKFFNDACRAY
QESEFKSYLSQIVNYNNGSLAKYLEDAGVHRWARCCQYGNPVVPTTYPKTSKAIEHSSS
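
The protein sequence is expected to be the translation of the CDS. As a minimal sketch for checking that, the Python below translates a coding sequence with the standard genetic code. The standard code is an assumption here, since the record does not name a translation table, and the function name translate is hypothetical. Ambiguous bases such as N are not handled and would raise a KeyError.

```python
from itertools import product

# Standard genetic code in TCAG order: the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; stop codons are written as '*'."""
    sequence = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(sequence) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE[sequence[i:i + 3]] for i in range(0, len(sequence), 3)
    )


# First seven codons and last eight codons of the CDS listed above.
print(translate("ATGTCCATAAACCAGAACCAC"))     # MSINQNH
print(translate("GCAATTGAGCATAGCTCATCATAG"))  # AIEHSSS*
```

Passing the full CDS and removing the trailing stop symbol should reproduce the protein sequence if the two fields of the record are consistent.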