; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017607 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017607
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr5:5911042..5913542
RNA-Seq ExpressionLag0017607
SyntenyLag0017607
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI10421.1 hypothetical protein PRUPE_4G046500 [Prunus persica]2.7e-10431.27Show/hide
Query:  VFLPSLNYIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG---------------------IVPESRLIHIYVEHDPNI-EVL-DLTEPV
        V  P  +YI G       +D DR+S +E++ M+ +  V N + FYY  PG                      +P +R + +YVEHD ++ EV+ DLT+P 
Subjt:  VFLPSLNYIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG---------------------IVPESRLIHIYVEHDPNI-EVL-DLTEPV

Query:  AMERPI--------------------------------FLEWRPGAQRNPVQ---------NQVELDDFVEVEVSG---ESSLSDSDSASSSLSRESIAS
          +  I                                F E R        Q         N+ E D   + +  G   E + +D +++ S    E   +
Subjt:  AMERPI--------------------------------FLEWRPGAQRNPVQ---------NQVELDDFVEVEVSG---ESSLSDSDSASSSLSRESIAS

Query:  L--------------------------NINVNVAEEDVDSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN-------------------------
        +                          N   ++  ED++ D   Y +  LH+   SDEE   R   E  ++TDM+                         
Subjt:  L--------------------------NINVNVAEEDVDSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN-------------------------

Query:  ---------QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQA
                 +NDK R++A C   CKWRL+AS+++ +NT QIK+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++    +S  Q 
Subjt:  ---------QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQA

Query:  YRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIA
        Y+A RKA   I G+  +QY+KLW+Y  EIR++N GS++ ++                               ACHLKG  QGQL+ ++G+D +D++YPIA
Subjt:  YRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIA

Query:  WAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRL
        +A+ E E+KDSW WFL LL  D+G  +  G+TFISDQQKGL   FE   P  +HR+CV+HLYGNF+++FKG ALK   W  A  +N  DFN  M ++  L
Subjt:  WAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRL

Query:  SVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPL
           AY +L       W+RH F    KCD LLNNL E FN+ +L AR+KPI+ + E IR  +M R+  K +  ++ E  I  +I  KLEKN   S    P+
Subjt:  SVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPL

Query:  WAGNGKFEV-ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVD--------------------------DTTYEPILPPVIRRPPG
        + GNG F+V ++   Q+ V++  RTC+C  W+++GIP  HAV  I++ +++ E+FVD                               P+LPP++++ PG
Subjt:  WAGNGKFEV-ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVD--------------------------DTTYEPILPPVIRRPPG

Query:  RPRKQR--KRDDDEAGTSSTKMHRR
        RP+K+R  K  + E     TK+ ++
Subjt:  RPRKQR--KRDDDEAGTSSTKMHRR

VVA36347.1 PREDICTED: transposon [Prunus dulcis]2.6e-9937.7Show/hide
Query:  NINVNVAEEDVDSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWR
        N   N+  ED++ D   Y +  LH+   S EE   R   EF ++TDM+                                  +NDK R++A C   CKWR
Subjt:  NINVNVAEEDVDSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWR

Query:  LHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAA
        L+AS+++ +NT QIK+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++    +S  Q Y+A RKA   I G+  +QY+KLW+Y  
Subjt:  LHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAA

Query:  EIRRSNVGSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFT
        EIR++N GS++ ++                               ACHLKG  QGQL+ A+G+D +D++YPIA+A+ E E+KDSW WFL LL  D+G  +
Subjt:  EIRRSNVGSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFT

Query:  SKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKC
          G+TFISDQQKGL   FE   P  +HR+CV+HLYGNF+++FKG ALK   W  A  +N  DFN  M ++  L   AY +L       W+RH F    KC
Subjt:  SKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKC

Query:  DFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEV-ESGTTQYVVDIEQRTCT
        D LLNNL E FN+ +L AR+KPI+ + E IR  +M R+  K +  ++ E  I  +I  KLEKN   S    P++ GNG F+V ++   Q+ V++  RTC+
Subjt:  DFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEV-ESGTTQYVVDIEQRTCT

Query:  CGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPILPPV
        C  W+++GIPC HAV  I++ +++ E+FVD           Y P++ PV
Subjt:  CGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPILPPV

XP_020080379.1 uncharacterized protein LOC109704055 [Ananas comosus]1.0e-9836.62Show/hide
Query:  EEDVDSDIGSYLNLHTPIGSDEEEDV---RGPEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIE
        + + D+D      LH+   S E++ V   R PEFR ETDM                                   +N+K +V AVC   C W ++AS I 
Subjt:  EEDVDSDIGSYLNLHTPIGSDEEEDV---RGPEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIE

Query:  NDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNV
        +  T+Q+K Y  +H C + F N+ + S+W+AK Y++RFR+ P W L    E V  D  V +SR++ YRA RKA R I GS  EQY  LWDYA E+++SN 
Subjt:  NDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNV

Query:  GSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFI
        GST  I+                                C LK    GQL+AA+GVD +D ++PIA+A+V+ E+  SW WFL  L  D+    S  +TF+
Subjt:  GSTVKIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFI

Query:  SDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNL
        SD+QKGL+   E+  P+  HRFCV+HLY NFQ  +KG  LK   W  AR + +  F   M  I      AY++L+  P + WSR  F+ + KCD LLNN+
Subjt:  SDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNL

Query:  SESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSG
         E FN  +L AR KPI+ + E IR  +M R+  K E   +    I  +I+KK EK    S    P  AG+GKF+V S   Q+VVD+ Q+TC+C  W ++G
Subjt:  SESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSG

Query:  IPCPHAVQCIYYVKKKPEEFVDD--------------------------TTYEPILPPVIRRPPGRPRKQRKRDDDEAGTSSTKMHR---RFRCVDVG
        +PCPHA+  + +   +PE++V D                          T  EPI PP  ++ PGRP+  RK+   E  TS+ K+ R   R  C   G
Subjt:  IPCPHAVQCIYYVKKKPEEFVDD--------------------------TTYEPILPPVIRRPPGRPRKQRKRDDDEAGTSSTKMHR---RFRCVDVG

XP_030955781.1 uncharacterized protein LOC115977893 [Quercus lobata]3.1e-10039.92Show/hide
Query:  DMNQNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRK
        +   NDK RVTA+C  +C W++HAS  +  +  QIK++   HTC ++  N  ++S W+A  YL  FR    W+   +   V +D++V ++  Q Y+A R 
Subjt:  DMNQNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRK

Query:  ANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQYA------------------------------CHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEA
        A + I+G+  +QY++LWDYAA IR+ NVGSTVKIQ A                              CHLKG   GQL+ A+G DG+D+++PIA+AIVE 
Subjt:  ANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQYA------------------------------CHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEA

Query:  ETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQ
        E K SWTWFL  L  D+G      + FISD+QKGLV TF++  PN  HRFCV+HL+ NF+K+F G  LK   W  AR      F+  M E+ +L V AY+
Subjt:  ETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQ

Query:  YLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGK
        +L  +  + WSRHAF    K D L+NN++ESFNA +L AR+KP++ + E IR ++M R+  K +   R E  +  RI KKLE+        I  W G  K
Subjt:  YLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGK

Query:  FEVE-SGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVD--------------------------DTTYEPILPPVIRRPPGRPRKQR
        +EVE     +YVVD+ +RTC CG W +SGIPC HA   I    ++ E +VD                           T Y+PI PP   +  G+ +K R
Subjt:  FEVE-SGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVD--------------------------DTTYEPILPPVIRRPPGRPRKQR

Query:  KRDDDE
        KR+  E
Subjt:  KRDDDE

XP_034229423.1 uncharacterized protein LOC117638399 [Prunus dulcis]1.3e-10633.33Show/hide
Query:  VFLPSLNYIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG---------------------IVPESRLIHIYVEHDPNI-EVL-DLTEPV
        V  P  +YI G       +D DR+S +E++ M+ +L V N + FYY  PG                      +P +R + +YVEHD ++ EV+ DLT+P 
Subjt:  VFLPSLNYIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG---------------------IVPESRLIHIYVEHDPNI-EVL-DLTEPV

Query:  AMERPIFLEWRPGAQRNPVQNQVELDDFVEVEVSGESSLSDSDSA--------------------SSSLSRESIASL------------NINVNVAEEDV
          +  I          + + +QV+ + +    +S +    +  SA                      +   +++ +L            N   N+  ED+
Subjt:  AMERPIFLEWRPGAQRNPVQNQVELDDFVEVEVSGESSLSDSDSA--------------------SSSLSRESIASL------------NINVNVAEEDV

Query:  DSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIENDNT
        + D   Y +  LH+   S EE   R   EF ++TDM+                                  +NDK R++A C   CKWRL+AS+++ +NT
Subjt:  DSDIGSYLN--LHTPIGSDEEEDVRG-PEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIENDNT

Query:  IQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTV
         QIK+Y   H+CS+ F N+N+ S ++++ Y+ R +  P      +  EV ++    +S  Q Y+A RKA   I G+  +QY+KLW+Y  EIR++N GS++
Subjt:  IQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTV

Query:  KIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQ
         ++                               ACHLKG  QGQL+ A+G+D +D++YPIA+A+ E E+KDSW WFL LL  D+G  +  G+TFISDQQ
Subjt:  KIQY------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQ

Query:  KGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESF
        KGL   FE   P  +HR+CV+HLYGNF+++FKG ALK   W  A  +N  DFN  M ++  L   AY +L       W+RH F    KCD LLNNL E F
Subjt:  KGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESF

Query:  NALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEV-ESGTTQYVVDIEQRTCTCGLWQVSGIPC
        N+ +L AR+KPI+ + E IR  +M R+  K +  ++ E  I  +I  KLEKN   S    P++ GNG F+V ++   Q+ V++  RTC+C  W+++GIPC
Subjt:  NALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEV-ESGTTQYVVDIEQRTCTCGLWQVSGIPC

Query:  PHAVQCIYYVKKKPEEFVDD--------TTYEPILPPV
         HAV  I++ +++ E+FVD           Y P++ PV
Subjt:  PHAVQCIYYVKKKPEEFVDD--------TTYEPILPPV

TrEMBL top hitse value%identityAlignment
A0A2N9EMB4 SWIM-type domain-containing protein2.6e-10539.62Show/hide
Query:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR
        +NDKVRV AVC   CKW  +A  +  + T Q++T++ +HTCSR + N    S W+ K    R   QP+   + I E+V K + V ISRS+AYRA + A  
Subjt:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR

Query:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD
        +I GS  EQY ++ DY  E+ R+N GS+V ++                                              ACHLKGP  GQL+AAI  D ++
Subjt:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD

Query:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM
          +P+A+A+VEAETKDSWTWFL+ L  DV   + +  TFISD+QKGLVPTFEE F  + HR CV+H+Y NF+K+F G+ LK LFWR+A     K +   M
Subjt:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM

Query:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS
         E+  +   A++++   P + W +H F+ + KCD L+NN+ ESFN ++L AR KPII L E IR  +M R     E   ++E ++   I KKL +   +S
Subjt:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS

Query:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI
         +     AGNG FEV  G  QY V+++++ CTC  W +SG+PC HA+ CI+Y ++  E++VD           Y+PI+                  PP I
Subjt:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI

Query:  RRPPGRPRKQRKRDDDEAGTSSTKM
        RRPPGRP+K R+R+ DE    S K+
Subjt:  RRPPGRPRKQRKRDDDEAGTSSTKM

A0A2N9FLE9 Uncharacterized protein7.0e-10639.58Show/hide
Query:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR
        +NDKVRV AVC   CKW  +A  +  + T Q++T++ +HTCSR + N    S W+ K    R   QP+   + I E+V K + V ISRS+AYRA + A  
Subjt:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR

Query:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD
        +I GS  EQY ++ DY  E+ R+N GS+V ++                                              ACHLKGP  GQL+AAI  D ++
Subjt:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD

Query:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM
          +P+A+A+VEAETKDSWTWFL+ L  DV   + +  TFISD+QKGLVPTFEE F  + HR CV+H+Y NF+K+F G+ LK LFWR+A     K +   M
Subjt:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM

Query:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS
         E+  +   A++++   P + W +H F+ + KCD L+NN+ ESFN ++L AR KPII L E IR  +M R     E   ++E ++   I KKL +   +S
Subjt:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS

Query:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI
         +     AGNG FEV  G  QY V+++++ CTC  W +SG+PC HA+ CI+Y ++  E++VD           Y+PI+                  PP I
Subjt:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI

Query:  RRPPGRPRKQRKRDDDEAGTSSTKMHRR
        RRPPGRP+K R+R+ DE    S K+ +R
Subjt:  RRPPGRPRKQRKRDDDEAGTSSTKMHRR

A0A2N9GX21 SWIM-type domain-containing protein5.9e-10534.31Show/hide
Query:  FLPSLNYIGGRQAVIRGIDCDRVSLIE--LRMMLTDLDVFNVEEFYYVRPGIVPESRLIHIYVEHDPNIEVLDLTEPVAMERPIFLEWRP-------GAQ
        F+P ++Y+GG+  + +GID D +   E  + +  TD DV  +          V +     IY+EH   ++ LD ++   ++ P+ L   P         Q
Subjt:  FLPSLNYIGGRQAVIRGIDCDRVSLIE--LRMMLTDLDVFNVEEFYYVRPGIVPESRLIHIYVEHDPNIEVLDLTEPVAMERPIFLEWRP-------GAQ

Query:  RNP----VQNQVELDDFVEVEVSGESS---LSD------------------SDSASSSLSRESIASLNINVNVA-EEDVDSDIGSYLNLHTP-IGSDEEE
         +P    V  ++E  D+ E   SG S    L+D                  +D   S +  E   S    + V  + + DSD      L  P  GSD+E 
Subjt:  RNP----VQNQVELDDFVEVEVSGESS---LSD------------------SDSASSSLSRESIASLNINVNVA-EEDVDSDIGSYLNLHTP-IGSDEEE

Query:  DVRGPEFRVETDMN-----------------------------------QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMN
        + R  EF  E DM                                    QN+K RVTA C  +C WR+HAS+ +  +  +IKT+  +H C + + NR + 
Subjt:  DVRGPEFRVETDMN-----------------------------------QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMN

Query:  SNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY------------------
        S+W+A  Y E  R    W+   + E +++DY V +S+ + YRA   A + I+GS  EQ+KKLW Y A IR+   GST+++                    
Subjt:  SNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY------------------

Query:  ------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQH
                     CHLKG   GQL+ AIG D +D+++PIA+A+VE E KDSWTWFL  L  D+G    K  TFISDQQKG+V T +   P   HRFCV+H
Subjt:  ------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQH

Query:  LYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKL
        LY NF K +KG  LK   W+ AR    K+F + M EI  L   A+ +L  + +  WS+HA    +KCD LLNNL+E+FN+ +L AR+KPII + E IR+ 
Subjt:  LYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKL

Query:  VMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVE-SGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--
        +M R   K ++  + E  I  RI  KLEK+  ++R  I    G   FEV+    +++VVD+  ++CTC  W +SGIPC HAV CIY   K PEE+VD   
Subjt:  VMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVE-SGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--

Query:  ------------------------TTYEPILPPVIRRPPGRPRKQRKRDDDE
                                T Y+PI PPV+R   GRP+K R++++ E
Subjt:  ------------------------TTYEPILPPVIRRPPGRPRKQRKRDDDE

A0A2N9HT52 SWIM-type domain-containing protein1.2e-10532.2Show/hide
Query:  YIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG----------------------IVPESRLIHIYVEHD-PNIEVLDLTEPVAMERPIF
        YIGG   V   ++ D  S+ E+  M  +L V+N  +FYYV PG                      + P +  + +YVEH+ PN        P++M++PI 
Subjt:  YIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPG----------------------IVPESRLIHIYVEHD-PNIEVLDLTEPVAMERPIF

Query:  LEWRPGAQRNPVQNQVELDDFVEVEVSGESS-----------LSDSDSASSSLSRES---IASLNINVNVAEED--------------------------
        +  R  + R+        DD V + ++ + +           L D   +  ++S  S   +AS + ++ +++                            
Subjt:  LEWRPGAQRNPVQNQVELDDFVEVEVSGESS-----------LSDSDSASSSLSRES---IASLNINVNVAEED--------------------------

Query:  ----VDSDIGSYLNLHTPIGSDEEEDVRGPEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIEND
            + SD GS   L++P  SD+E     P+FR  TD++                                  +N++ +V   C   C W+L A   ++ 
Subjt:  ----VDSDIGSYLNLHTPIGSDEEEDVRGPEFRVETDMN----------------------------------QNDKVRVTAVCSGQCKWRLHASVIEND

Query:  NTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGS
         + QI ++   HTCSR F +R ++S WVAK Y++ FR  P+  L E++ +V KD++V +SRS+AYRA + A   I GS  EQY+++ DY  E+ R+N G+
Subjt:  NTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGS

Query:  TVKIQY--------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFI
        T K+                                  ACHLKG   GQL+AA+GVDG++ +YPIA+A+ EAE+ ++WTWFL  L  D+G     G+ F+
Subjt:  TVKIQY--------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFI

Query:  SDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNL
        SDQQKGLVP  ++      HRFCV+HL+ NF+K  KG  LK L W  AR + +++F+  M E+  +S  A+  L G     W+RHAF  Y KCD LLNNL
Subjt:  SDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNL

Query:  SESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSG
         E+FN+ ++ AR KPII + ETIR+ +M RI K  +   + +  I  RI +KL+K    S+   P W G  ++EV S   +Y+VDI +++C C  W ++G
Subjt:  SESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSG

Query:  IPCPHAVQCIYYVKKKPEEFVDD----TTY----------------------EPILPPVIRRPPGRPRKQRKRDDDEAGTSSTKMHRRFRCVDVG
        IPC HAV+ I Y     E++VDD     TY                      +P+LPP+ RR PGRP++  +R D +   +S K+ R    +  G
Subjt:  IPCPHAVQCIYYVKKKPEEFVDD----TTY----------------------EPILPPVIRRPPGRPRKQRKRDDDEAGTSSTKMHRRFRCVDVG

A0A2N9HUS0 Uncharacterized protein7.0e-10639.58Show/hide
Query:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR
        +NDKVRV AVC   CKW  +A  +  + T Q++T++ +HTCSR + N    S W+ K    R   QP+   + I E+V K + V ISRS+AYRA + A  
Subjt:  QNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANR

Query:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD
        +I GS  EQY ++ DY  E+ R+N GS+V ++                                              ACHLKGP  GQL+AAI  D ++
Subjt:  RINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQY---------------------------------------------ACHLKGPCQGQLMAAIGVDGDD

Query:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM
          +P+A+A+VEAETKDSWTWFL+ L  DV   + +  TFISD+QKGLVPTFEE F  + HR CV+H+Y NF+K+F G+ LK LFWR+A     K +   M
Subjt:  SLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVM

Query:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS
         E+  +   A++++   P + W +H F+ + KCD L+NN+ ESFN ++L AR KPII L E IR  +M R     E   ++E ++   I KKL +   +S
Subjt:  TEIGRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSS

Query:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI
         +     AGNG FEV  G  QY V+++++ CTC  W +SG+PC HA+ CI+Y ++  E++VD           Y+PI+                  PP I
Subjt:  RSTIPLWAGNGKFEVESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYEPIL------------------PPVI

Query:  RRPPGRPRKQRKRDDDEAGTSSTKMHRR
        RRPPGRP+K R+R+ DE    S K+ +R
Subjt:  RRPPGRPRKQRKRDDDEAGTSSTKMHRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase4.4e-1220.78Show/hide
Query:  CKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRI-------SRSQAYRAWRKANRRINGSLN
        CKW + AS  E D   +I    G H C  E  N + ++  +        R QP  S AE+ +  +K +   +       S      A  KA +R  G  +
Subjt:  CKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRI-------SRSQAYRAWRKANRRINGSLN

Query:  EQYKKLWDYAAEIRRSNVGSTVKIQY------------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKD
        + ++ +    + +  SN G  V  QY                                      +L G  + +LM A   D  +  +P+A+A+ +  + D
Subjt:  EQYKKLWDYAAEIRRSNVGSTVKIQY------------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKD

Query:  SWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEE-----FFPNVAHRFCVQHLYGNFQKEFKGLALKTLFW--RVARCNNMKDFNDVMTEIGRLSVG
        SW WFL  ++  V     +G   IS     ++    E       P   HRFC+ HL         G      F        +  ++F+  M EI   +  
Subjt:  SWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEE-----FFPNVAHRFCVQHLYGNFQKEFKGLALKTLFW--RVARCNNMKDFNDVMTEIGRLSVG

Query:  AYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMV-----------RITKKHEKCNRMEKEIGN----RILKKLE
        A+++L   P   W+            L ++    +   ++    + +  + +  RK+ M                +  K +R   + G+     +++KLE
Subjt:  AYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMV-----------RITKKHEKCNRMEKEIGN----RILKKLE

Query:  KNIRSSRSTI----PL--------WAGNGKFEV---ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYE----
        +    S + +    PL         A   K  +    + +T  +V +   TCTCG +Q +  PC HA+     +K  P ++VDD         TY     
Subjt:  KNIRSSRSTI----PL--------WAGNGKFEV---ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD--------TTYE----

Query:  ---------------PILPPVIRRPPGRPRKQRKRDDDE
                        ++PPVI  PP +   + K  D E
Subjt:  ---------------PILPPVIRRPPGRPRKQRKRDDDE

AT1G64255.1 MuDR family transposase2.6e-2021.69Show/hide
Query:  QCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKL
        +CKW L A+ ++    ++I  Y G HTC       +  S +         R  P  +++E+ +  KK     +  S    A  KA +R+ G  ++ ++  
Subjt:  QCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKL

Query:  WDYAAEIRRSNVGSTVKIQY-----------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLV
            + +  SN G  V  +Y                                     +L    Q +LM A GVD  +  +P+A+A+ +  + D W WFL 
Subjt:  WDYAAEIRRSNVGSTVKIQY-----------------------------------ACHLKGPCQGQLMAAIGVDGDDSLYPIAWAIVEAETKDSWTWFLV

Query:  LLQTDVGCFTSKGYTFISDQQKGLVPTFEEF-----FPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIP
         ++  V     KG   IS     ++    E       P   HRF + H Y  F + F    L     R    +   +F   M +I   +  A ++L   P
Subjt:  LLQTDVGCFTSKGYTFISDQQKGLVPTFEEF-----FPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGRLSVGAYQYLSGIP

Query:  SKYWS-------RHA-FQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWA--
           W+       R+   +   K  F + N  E    +V  +    ++ LF+ +R         K   C+R     G+   + +   +   R+T   ++  
Subjt:  SKYWS-------RHA-FQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWA--

Query:  ----GNGKFEVESGTT--QYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD
             N  F+V +     + +V +   +CTCG +Q    PC HA+     +K  P ++VDD
Subjt:  ----GNGKFEVESGTT--QYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD

AT1G64260.1 MuDR family transposase4.6e-2523.77Show/hide
Query:  MNQNDKVRVTAVC-SGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVE--EVKKDYQVRISR-------
        + + +K   T  C   +CKW L A+ +E    ++I  Y G HTCS E+ N +  S + A       R QP  S+AE+ +  + K  Y+++ S+       
Subjt:  MNQNDKVRVTAVC-SGQCKWRLHASVIENDNTIQIKTYMGDHTCSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVE--EVKKDYQVRISR-------

Query:  ---------SQAYRAWRK---ANRRINGSLNE-QYK------------KLWDYAAEIRRSNVGSTVKIQYACHLKGPCQGQLMAAIGVDGDDSLYPIAWA
                  Q++R   K   A    NG L + QY               W ++  I        + +     L G  Q +LM A GVD  +  +P+A+A
Subjt:  ---------SQAYRAWRK---ANRRINGSLNE-QYK------------KLWDYAAEIRRSNVGSTVKIQYACHLKGPCQGQLMAAIGVDGDDSLYPIAWA

Query:  IVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEE-----FFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEI
        + +  + DSW WF   ++  V     K    IS   + +V    E       P   H+FC+ HL   F   F+   L++L  +    N  ++F+  M +I
Subjt:  IVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEE-----FFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEI

Query:  GRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNA-----LVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIR
           +  A+++L  IP   W+  A    L+   +  +    F           A    ++ +F+ +R      ++  +   NR         + KLE+ + 
Subjt:  GRLSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNA-----LVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIR

Query:  SSRSTIPLWAGNGKFEV--ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD
         S   +        F+V   S   +++V +   TCTC  +Q    PC HA+     +K  P ++VD+
Subjt:  SSRSTIPLWAGNGKFEV--ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTATTCCTCCCTAGTCTGAACTACATTGGTGGAAGGCAAGCCGTTATTCGAGGGATAGATTGTGATAGGGTCTCGTTGATAGAGTTACGCATGATGTTGACTGATCT
TGATGTATTCAATGTTGAGGAGTTTTACTATGTTAGGCCTGGTATAGTTCCTGAAAGTAGACTTATCCACATTTATGTTGAACATGATCCGAACATTGAAGTTTTAGACC
TAACTGAACCTGTTGCAATGGAAAGACCAATTTTTTTGGAGTGGCGACCTGGTGCTCAACGAAACCCCGTTCAAAACCAAGTAGAATTGGATGACTTTGTTGAGGTAGAA
GTGTCTGGAGAGTCTTCTTTAAGTGATAGTGATAGTGCAAGTTCTTCTCTGTCTAGAGAGTCCATTGCTTCGCTGAATATCAATGTCAATGTGGCTGAAGAGGATGTAGA
TTCAGATATTGGGTCATATTTGAATCTCCATACACCTATAGGGTCTGACGAAGAAGAGGATGTTAGGGGTCCTGAATTTAGGGTTGAGACGGATATGAATCAGAATGATA
AGGTGAGGGTCACGGCGGTTTGTTCGGGACAGTGTAAATGGAGGTTGCATGCTAGTGTTATTGAAAACGATAACACTATTCAAATTAAGACATACATGGGTGATCATACA
TGCAGTAGGGAATTTGCAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACGGTTTAGGAGTCAACCTAATTGGTCTCTTGCTGAAATTGTTGAGGA
GGTGAAGAAGGACTATCAGGTAAGAATTTCTCGAAGTCAAGCCTACAGGGCATGGCGTAAGGCAAACAGAAGGATCAATGGTAGTCTTAATGAACAATATAAGAAATTGT
GGGATTATGCTGCAGAGATTAGACGGTCCAATGTTGGATCAACGGTGAAAATTCAATATGCCTGCCATCTAAAAGGTCCTTGTCAAGGGCAATTAATGGCTGCAATTGGT
GTTGATGGTGATGACAGTTTGTACCCTATTGCGTGGGCCATAGTTGAGGCAGAAACGAAGGATAGCTGGACATGGTTCCTCGTTTTGCTACAGACAGACGTTGGATGTTT
CACATCTAAAGGATACACATTCATCTCAGATCAACAAAAGGGTTTAGTTCCGACTTTTGAGGAGTTTTTTCCAAATGTGGCTCATCGTTTCTGTGTGCAACATTTGTATG
GCAATTTTCAAAAAGAATTTAAAGGGTTGGCATTAAAGACTTTGTTTTGGAGGGTTGCTAGGTGTAACAACATGAAGGATTTTAATGATGTTATGACTGAGATTGGTAGG
TTAAGTGTGGGGGCTTACCAATATTTGAGTGGGATTCCTAGCAAGTATTGGTCTAGGCATGCCTTTCAATGTTATCTGAAGTGTGATTTCTTGCTTAATAATTTATCGGA
GTCATTTAATGCACTTGTACTGCCAGCAAGAAACAAGCCCATAATTCAACTATTTGAAACCATTAGGAAGTTGGTAATGGTTAGGATAACCAAGAAGCATGAAAAATGTA
ATAGGATGGAAAAGGAAATTGGAAATAGAATTCTGAAGAAACTTGAAAAGAATATTAGAAGCTCTAGGAGTACAATACCTTTGTGGGCTGGGAATGGGAAATTTGAAGTG
GAGTCAGGTACAACTCAATATGTTGTAGACATTGAACAACGAACTTGTACGTGTGGGTTGTGGCAAGTAAGTGGGATTCCATGTCCACATGCTGTGCAGTGCATATATTA
TGTGAAAAAGAAGCCCGAAGAGTTTGTTGATGATACGACGTATGAGCCCATTCTCCCACCAGTTATCCGAAGACCACCTGGCCGACCAAGAAAACAAAGGAAACGAGATG
ATGATGAAGCTGGGACTTCAAGCACCAAAATGCATAGGAGGTTTAGATGCGTAGATGTAGGTTGCCCGGACACAACTCTAGATCTTGCAAAGGTCCAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTATTCCTCCCTAGTCTGAACTACATTGGTGGAAGGCAAGCCGTTATTCGAGGGATAGATTGTGATAGGGTCTCGTTGATAGAGTTACGCATGATGTTGACTGATCT
TGATGTATTCAATGTTGAGGAGTTTTACTATGTTAGGCCTGGTATAGTTCCTGAAAGTAGACTTATCCACATTTATGTTGAACATGATCCGAACATTGAAGTTTTAGACC
TAACTGAACCTGTTGCAATGGAAAGACCAATTTTTTTGGAGTGGCGACCTGGTGCTCAACGAAACCCCGTTCAAAACCAAGTAGAATTGGATGACTTTGTTGAGGTAGAA
GTGTCTGGAGAGTCTTCTTTAAGTGATAGTGATAGTGCAAGTTCTTCTCTGTCTAGAGAGTCCATTGCTTCGCTGAATATCAATGTCAATGTGGCTGAAGAGGATGTAGA
TTCAGATATTGGGTCATATTTGAATCTCCATACACCTATAGGGTCTGACGAAGAAGAGGATGTTAGGGGTCCTGAATTTAGGGTTGAGACGGATATGAATCAGAATGATA
AGGTGAGGGTCACGGCGGTTTGTTCGGGACAGTGTAAATGGAGGTTGCATGCTAGTGTTATTGAAAACGATAACACTATTCAAATTAAGACATACATGGGTGATCATACA
TGCAGTAGGGAATTTGCAAACAGGAACATGAACTCCAATTGGGTTGCCAAGATGTATTTGGAACGGTTTAGGAGTCAACCTAATTGGTCTCTTGCTGAAATTGTTGAGGA
GGTGAAGAAGGACTATCAGGTAAGAATTTCTCGAAGTCAAGCCTACAGGGCATGGCGTAAGGCAAACAGAAGGATCAATGGTAGTCTTAATGAACAATATAAGAAATTGT
GGGATTATGCTGCAGAGATTAGACGGTCCAATGTTGGATCAACGGTGAAAATTCAATATGCCTGCCATCTAAAAGGTCCTTGTCAAGGGCAATTAATGGCTGCAATTGGT
GTTGATGGTGATGACAGTTTGTACCCTATTGCGTGGGCCATAGTTGAGGCAGAAACGAAGGATAGCTGGACATGGTTCCTCGTTTTGCTACAGACAGACGTTGGATGTTT
CACATCTAAAGGATACACATTCATCTCAGATCAACAAAAGGGTTTAGTTCCGACTTTTGAGGAGTTTTTTCCAAATGTGGCTCATCGTTTCTGTGTGCAACATTTGTATG
GCAATTTTCAAAAAGAATTTAAAGGGTTGGCATTAAAGACTTTGTTTTGGAGGGTTGCTAGGTGTAACAACATGAAGGATTTTAATGATGTTATGACTGAGATTGGTAGG
TTAAGTGTGGGGGCTTACCAATATTTGAGTGGGATTCCTAGCAAGTATTGGTCTAGGCATGCCTTTCAATGTTATCTGAAGTGTGATTTCTTGCTTAATAATTTATCGGA
GTCATTTAATGCACTTGTACTGCCAGCAAGAAACAAGCCCATAATTCAACTATTTGAAACCATTAGGAAGTTGGTAATGGTTAGGATAACCAAGAAGCATGAAAAATGTA
ATAGGATGGAAAAGGAAATTGGAAATAGAATTCTGAAGAAACTTGAAAAGAATATTAGAAGCTCTAGGAGTACAATACCTTTGTGGGCTGGGAATGGGAAATTTGAAGTG
GAGTCAGGTACAACTCAATATGTTGTAGACATTGAACAACGAACTTGTACGTGTGGGTTGTGGCAAGTAAGTGGGATTCCATGTCCACATGCTGTGCAGTGCATATATTA
TGTGAAAAAGAAGCCCGAAGAGTTTGTTGATGATACGACGTATGAGCCCATTCTCCCACCAGTTATCCGAAGACCACCTGGCCGACCAAGAAAACAAAGGAAACGAGATG
ATGATGAAGCTGGGACTTCAAGCACCAAAATGCATAGGAGGTTTAGATGCGTAGATGTAGGTTGCCCGGACACAACTCTAGATCTTGCAAAGGTCCAGTGA
Protein sequenceShow/hide protein sequence
MVFLPSLNYIGGRQAVIRGIDCDRVSLIELRMMLTDLDVFNVEEFYYVRPGIVPESRLIHIYVEHDPNIEVLDLTEPVAMERPIFLEWRPGAQRNPVQNQVELDDFVEVE
VSGESSLSDSDSASSSLSRESIASLNINVNVAEEDVDSDIGSYLNLHTPIGSDEEEDVRGPEFRVETDMNQNDKVRVTAVCSGQCKWRLHASVIENDNTIQIKTYMGDHT
CSREFANRNMNSNWVAKMYLERFRSQPNWSLAEIVEEVKKDYQVRISRSQAYRAWRKANRRINGSLNEQYKKLWDYAAEIRRSNVGSTVKIQYACHLKGPCQGQLMAAIG
VDGDDSLYPIAWAIVEAETKDSWTWFLVLLQTDVGCFTSKGYTFISDQQKGLVPTFEEFFPNVAHRFCVQHLYGNFQKEFKGLALKTLFWRVARCNNMKDFNDVMTEIGR
LSVGAYQYLSGIPSKYWSRHAFQCYLKCDFLLNNLSESFNALVLPARNKPIIQLFETIRKLVMVRITKKHEKCNRMEKEIGNRILKKLEKNIRSSRSTIPLWAGNGKFEV
ESGTTQYVVDIEQRTCTCGLWQVSGIPCPHAVQCIYYVKKKPEEFVDDTTYEPILPPVIRRPPGRPRKQRKRDDDEAGTSSTKMHRRFRCVDVGCPDTTLDLAKVQ