; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011638 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011638
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSWIM-type domain-containing protein
Genome locationchr1:29715083..29718358
RNA-Seq ExpressionLag0011638
SyntenyLag0011638
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR004332 - Transposase, MuDR, plant
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI10421.1 hypothetical protein PRUPE_4G046500 [Prunus persica]1.4e-10533.05Show/hide
Query:  GYKAILTNDDILDMVEMLPEDRMIHMYVEHNQN-REII-DFTVPIAEVKPMFLEWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVE
        G  A+  ++ IL  V  +P +R + +YVEH+ +  E+I D T P     PM       +A+++     G    EI+   + ++ ++  G+     D++ +
Subjt:  GYKAILTNDDILDMVEMLPEDRMIHMYVEHNQN-REII-DFTVPIAEVKPMFLEWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVE

Query:  KGMEKHVDGSDKEEIASMEGTDQNHLEV-VEEDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDE----G
                   +E  A   G  Q+  +V +  +++EY D + D D+++   +++ND D G  S +E  +E+     +   D  + Y+   D+ +     G
Subjt:  KGMEKHVDGSDKEEIASMEGTDQNHLEV-VEEDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDE----G

Query:  SVEANEPFDAHISI-DADAKSDYRSSSDLNFPVNS----SGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTA
          + N  ++  I + D +   +  +S +L+   NS    + RR     E   DT M+  +F +GMKF + KVLK AI+ Y     Y  +++KNDK R +A
Subjt:  SVEANEPFDAHISI-DADAKSDYRSSSDLNFPVNS----SGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTA

Query:  TCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWR------------------------------LNEGSSVK-
         C  GC WRL+AS+ +GE T+Q+K+Y  +HS S+ F N+N+ S+ +++RY+SR +  P  +                              L EG+  + 
Subjt:  TCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWR------------------------------LNEGSSVK-

Query:  -----ILCDRVHEDNN----------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRL
               C+ + + NN          P ++RLY+C   CK GF+ GCRP I +DACHL G  QG L+  VG D ND++YPIA+A+ E E+K SW WFL+L
Subjt:  -----ILCDRVHEDNN----------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRL

Query:  LESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWAR
        L  D+G  S  G+TF+SDQQKGL   F  V     HR+CVRHLY NF+++FKG  LK+  W AA ++ + +F+  M++L  LD  A+ ++KN     WAR
Subjt:  LESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWAR

Query:  HSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEV-EVGVAQFV
        H F    K D+LLNN  E FN++I   RDKPI+ MLE+I   +M R+  K ++ S+ EG + P I+ KLEK K       P++ GN LF+V +  V Q  
Subjt:  HSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEV-EVGVAQFV

Query:  VDLEKRICSCRKCDLTGIPCVHQSN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQ-RRDASEPRPSQ
        V+L  R CSCR+ +L GIP +H  +                         YS  ++PV     W      P+ PP++K+  GRP++++  +D+   +   
Subjt:  VDLEKRICSCRKCDLTGIPCVHQSN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQ-RRDASEPRPSQ

Query:  PVR--RQTSTVTCSKCKNVGHNARSCKGE
        P +  ++  T TC+KC   GHN  +CK +
Subjt:  PVR--RQTSTVTCSKCKNVGHNARSCKGE

XP_019077061.1 PREDICTED: uncharacterized protein LOC104879996 [Vitis vinifera]2.0e-10435.09Show/hide
Query:  DSKDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVNSSGRRVNVDP---EFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGY
        +S DD  +     D+G   A          + +  SD+  S +L     SS    N  P   EF +   ME ++ V   KF +  + K+A+KE+ ++  +
Subjt:  DSKDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVNSSGRRVNVDP---EFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGY

Query:  NIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDW--------------------------
        +     NDK R TA C   C W++HAS  +    FQ+K++KS H+  ++  N  + S  +A +YL  FR    W                          
Subjt:  NIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDW--------------------------

Query:  --------------------------RLNEGSSVKILCDRVHEDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYP
                                  + N GS+VKI      +  N ++ R+YIC   CK GFL GCRP I +D CHL G   G L+  VG DGND+I+P
Subjt:  --------------------------RLNEGSSVKILCDRVHEDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYP

Query:  IAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELG
        IA+AIVE E K+SWTWFL+ L  DIG     G+ F+SD+QKGLV TF ++    +HRFCVRHL+ANF+K F G  LK+  W AA+ATTK+ FD  MDEL 
Subjt:  IAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELG

Query:  KLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAI
        KLDV A++++  +  + W+RH+F    KSD L+NN +ESFNA+I + RDKP++ M+EII  ++M R+  K +   R E  + P I  KLE+ K   G+ I
Subjt:  KLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAI

Query:  PVWSGNLLFEVE-VGVAQFVVDLEKRICSCRKCDLTGIPCVHQS------------------------NTYSSFLKPVNGSNLWEKTTADPLQPPVLKRP
          W+G   +EVE +   ++VVDL +R C C +  L+GI C H +                        + Y   + P+     W KT  D ++PP   + 
Subjt:  PVWSGNLLFEVE-VGVAQFVVDLEKRICSCRKCDLTGIPCVHQS------------------------NTYSSFLKPVNGSNLWEKTTADPLQPPVLKRP

Query:  LGRPRRQQRRDASEPRPSQPVRRQTSTVTCSKCKNVGHNARSCK
        +G+ ++ ++R+A EP  +  V ++ + + C  C   GHN R+CK
Subjt:  LGRPRRQQRRDASEPRPSQPVRRQTSTVTCSKCKNVGHNARSCK

XP_023873370.1 uncharacterized protein LOC111985960 [Quercus suber]1.8e-10835.49Show/hide
Query:  ISIDADAKS--DYRSSSDLNFP---VNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLH
        +S+D  + S  D   SSD + P   V++S RR    P FR  T +E++ F   M F ++K  KDAI +Y + GG++I+ +KND  R  A C  GC +  +
Subjt:  ISIDADAKS--DYRSSSDLNFP---VNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLH

Query:  ASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRL---------------NEGSSVK---------------------------
         +    E +F++KT   EH+ SR + N +  +  I ++ + R R+QPD +L               +EG + +                           
Subjt:  ASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRL---------------NEGSSVK---------------------------

Query:  -------ILCDRVHEDNN-------------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWT
               I+  +VH  N+             P + R+YIC +GCK GFL GCRP I LDACHL     G LM  VG D ND+ +P A+A+VEAETK +WT
Subjt:  -------ILCDRVHEDNN-------------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWT

Query:  WFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPS
        WFL LL +DIG  +   + F+SDQQKGLV TF + + + +HR C RHLY N +K   G+ +++ FWKAA+AT +  ++ AM+EL ++D +AF ++++  +
Subjt:  WFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPS

Query:  KYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGV
          W RH F    +SD++LNN  ESFN+ I + R KPII MLE I   +MTR     E+  ++E  L P I+ +L KEK      I  W+G   FEV+ G+
Subjt:  KYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGV

Query:  AQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPR
          F+VDLE++ CSCRK D+ GIPC H                          + Y   ++P+NG N+W+ +   P+QPP+ +RP GRP++++  +  EPR
Subjt:  AQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPR

Query:  PSQPVRRQTSTVTCSKCKNVGHNARSCKGEVARQRKKRRTTTFMGFNIPSAEEESLQEEEPMEVEVLWSQPGSSTQTSS
          +  R    +  C  C  +GHN RSCKGEV       R+ +            + ++       V  +QPGS+ Q SS
Subjt:  PSQPVRRQTSTVTCSKCKNVGHNARSCKGEVARQRKKRRTTTFMGFNIPSAEEESLQEEEPMEVEVLWSQPGSSTQTSS

XP_023914573.1 uncharacterized protein LOC112026126 [Quercus suber]9.7e-10732.01Show/hide
Query:  YKAILTNDDILDMVEMLPEDRMIHMYVEHNQNREIIDFTVPIAEVKPMFL--EWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVEK
        +  ++ +D  + M +++     IH++VEH  +  +    VP +E++ + +  E+ P  +     D      +E+  +   ++  +     +     +V+ 
Subjt:  YKAILTNDDILDMVEMLPEDRMIHMYVEHNQNREIIDFTVPIAEVKPMFL--EWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVEK

Query:  GMEKHVDGSDKEEIASMEGTDQNHLEVVEEDQFEYCDEEWDADESEDDGHDDNDKD---IGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDEGSVE
         +E+  +GS        EG++    ++ EE + +  +   D+ +S D+  +D++ +   +GG   N   E  E+ S +    S +D   G+DSSD+   E
Subjt:  GMEKHVDGSDKEEIASMEGTDQNHLEVVEEDQFEYCDEEWDADESEDDGHDDNDKD---IGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDEGSVE

Query:  ANEPFDAHISIDADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTW
        A                           V++S R+    P FR     E + F   M F ++K  KDAI +YAV GG+ I+ +KND  R  A C  GC +
Subjt:  ANEPFDAHISIDADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTW

Query:  RLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLNE---------------GSSVKI---------------------LC
          + +    E +F++KT   EH+ SR + N    +S I ++   R R+QPD +L +               G + +                       C
Subjt:  RLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLNE---------------GSSVKI---------------------LC

Query:  D-------------RVHEDNN-------------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKT
        D             +VH  N+             P + R+YIC +GCK GFL GCRP I LDACHL     G LM  VG D ND+ +P+A+A+VEAETK 
Subjt:  D-------------RVHEDNN-------------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKT

Query:  SWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKN
        +WTWFL LL +DIG    K + F+SDQQKGLV TF + + + +HR C RHLY N +K   G+ +++ FWKAA+AT +  F+ AM+EL ++D +AF+++++
Subjt:  SWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKN

Query:  IPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVE
          +  WARH F +  +SD +LNN  ESFN  + + R KPII MLE I   +MTR     E+  ++E  L P I+ +L KEK      I  W+G   FEV+
Subjt:  IPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVE

Query:  VGVAQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDAS
         G+  F+VDLE++ CSCRK D+ G+PC H                          + Y   ++P+NG N+W  +   P+QPP+ +RP GRP++++  +  
Subjt:  VGVAQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDAS

Query:  EPRPSQPVRRQTSTVTCSKCKNVGHNARSCKGEV
        EPR  +  R    +  C  C  +GHN RSCKGEV
Subjt:  EPRPSQPVRRQTSTVTCSKCKNVGHNARSCKGEV

XP_030936410.1 uncharacterized protein LOC115961597 [Quercus lobata]1.1e-10534.83Show/hide
Query:  NVECDSKDDYEKGEDSSD-------EGSVEANEPFDAHISIDADAKS----DYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLK
        N+  DS D ++   ++++        G + ++   +  +S+D  + S    D  S  D+      +  R +  P FR     E + F   M F  +K  K
Subjt:  NVECDSKDDYEKGEDSSD-------EGSVEANEPFDAHISIDADAKS----DYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLK

Query:  DAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLNE----------
        DAI +YAV GG+ I+ +KND  R  A C  GC +  + +    E +F++KT   EH+ +R + N    +S I ++ + R R+QP  +L +          
Subjt:  DAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLNE----------

Query:  ------------------------------------------GSSVKILCDRVHEDNN-----------PIYRRLYICFKGCKDGFLVGCRPFISLDACH
                                                  GS++ ++    +ED +           P + R+YIC +GCK GFL GCRP I LDACH
Subjt:  ------------------------------------------GSSVKILCDRVHEDNN-----------PIYRRLYICFKGCKDGFLVGCRPFISLDACH

Query:  LNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLK
        L     G LM  VG D ND+ +  A+A+VEAETK SWTWFL LL +DIG    K + F+SDQQKGLV TF + + + +HR C RHLY N +K   G+ ++
Subjt:  LNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLK

Query:  NWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRL
        + FWKAA+AT +  F+ AM+EL ++D +AF+++++  +  WARH F +  +SD +LNN  ESFN+ I + R KPII MLE I   +MTR     E+  ++
Subjt:  NWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRL

Query:  EGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPV
        E  L P I+ +L KEK      I  W+G + FEV+ G+  F+VDLE++ CSCRK D+ GIPC H                          + Y   ++P+
Subjt:  EGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVH------------------------QSNTYSSFLKPV

Query:  NGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRPSQPVRRQTSTVTCSKCKNVGHNARSCKGEV
        NG N+W  +   P+QPP+ +RP GRP++++  +  EPR  +  R    +  C  C  +GHN RSCKGEV
Subjt:  NGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRPSQPVRRQTSTVTCSKCKNVGHNARSCKGEV

TrEMBL top hitse value%identityAlignment
A0A2N9FLE9 Uncharacterized protein2.5e-10834.75Show/hide
Query:  EDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDS--KDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVN
        +   E+ D E   + S+DDGH +          NE C    V  G++E D   +D    G+   ++  ++            +D    Y S   L+   +
Subjt:  EDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDS--KDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVN

Query:  SSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTN
            +    P ++     + I+F IGMKFN+ +  K+A+ +YAV GG+ IR  KNDK R  A C  GC W  +A    GE T Q++T+ +EH+ SR + N
Subjt:  SSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTN

Query:  RNLKSSCIARRYLSRFRQQPDW----------------------------------------------------RLNEGSSVKILCD----------RVH
            S  + ++  +R  +QPD                                                     R N GSSV +  D          R  
Subjt:  RNLKSSCIARRYLSRFRQQPDW----------------------------------------------------RLNEGSSVKILCD----------RVH

Query:  EDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKG
           NP++ RLY+C   CK GF V CRPFI +DACHL GP  G L++ +  D N+  +P+A+A+VEAETK SWTWFL  L  D+ + S +  TF+SD+QKG
Subjt:  EDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKG

Query:  LVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNA
        LVPTF  VF  ++HR CVRH+Y NF+K+F G+ LK+ FW+ A AT + +++ AM EL ++D  AF++V++ P + W +H F    K D L+NN  ESFN 
Subjt:  LVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNA

Query:  FITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVHQ
         I + R KPII ++E I   +M R     E   ++EG+L PNI+ KL +EK   GN     +GN  FEV  G  Q+ V+L+++ C+C++ DL+G+PC H 
Subjt:  FITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVHQ

Query:  SN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRP-SQPVRRQTSTVTCSKCKNVGHNARS
         +                         Y   + P+NG ++WE T   P++PP ++RP GRP++ +RR+  EPRP S  + ++   + C KC   GHN R+
Subjt:  SN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRP-SQPVRRQTSTVTCSKCKNVGHNARS

Query:  CKGEV
        CKG+V
Subjt:  CKGEV

A0A2N9FZE3 SWIM-type domain-containing protein4.3e-10837.56Show/hide
Query:  DADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEA
        D    SDY SS  LN P NS     ++ P+FR  T ++   F +GM F  +  LK+A+  Y ++ G+ +R  KN++ +    C  GC W+L A  G    
Subjt:  DADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEA

Query:  TFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLN------------------------------EGS------SVKILCDRVHEDN----
        +FQ+ +++S H+ SR F +R + S  +A++Y+  FR  PD  L+                              EGS       V+  C+ +   N    
Subjt:  TFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLN------------------------------EGS------SVKILCDRVHEDN----

Query:  --------NPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMS
                   ++RLY+C  GCK GFL GCRP I LDACHL G   G L++ VG DGN+ +YPIA+A+ EAE+  +WTWFL  L  DIG+    G+ F+S
Subjt:  --------NPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMS

Query:  DQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNS
        DQQKGLVP    V Q   HRFCVRHL+ANF+K  KG  LK+  W AA+A+T  EFD  M E+  +   A   ++      WARH+F    K D+LLNN  
Subjt:  DQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNS

Query:  ESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGI
        E+FN+ I   R KPII MLE I + +MTRI K  +   + +G + P I+ KL+K K       P W G   +EV     +++VD+ K+ C+C K DLTGI
Subjt:  ESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGI

Query:  PCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRR-DASEPRPSQPVRRQTSTVTCSKCKNVG
        PC H                            YS  ++P NG + W     DP+ PP+ +R  GRP+R  RR D  E + S  ++R  +++ C +C  VG
Subjt:  PCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRR-DASEPRPSQPVRRQTSTVTCSKCKNVG

Query:  HNARSCK
        HN RSCK
Subjt:  HNARSCK

A0A2N9HT52 SWIM-type domain-containing protein2.5e-10837.56Show/hide
Query:  DADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEA
        D    SDY SS  LN P NS     ++ P+FR  T ++   F +GM F  +  LK+A+  Y ++ G+ +R  KN++ +    C  GC W+L A  G    
Subjt:  DADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEA

Query:  TFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLN------------------------------EGS------SVKILCDRVHEDN----
        +FQ+ +++S H+ SR F +R + S  +A++Y+  FR  PD  L+                              EGS       V+  C+ +   N    
Subjt:  TFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLN------------------------------EGS------SVKILCDRVHEDN----

Query:  --------NPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMS
                   ++RLY+C  GCK GFL GCRP I LDACHL G   G L++ VG DGN+ +YPIA+A+ EAE+  +WTWFL  L  DIG+    G+ F+S
Subjt:  --------NPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMS

Query:  DQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNS
        DQQKGLVP    V Q   HRFCVRHL+ANF+K  KG  LK+  W AA+A+T  EFD  M E+  +   A   ++      WARH+F    K D+LLNN  
Subjt:  DQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNS

Query:  ESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGI
        E+FN+ I   R KPII MLE I + +MTRI K  +   + +G + P I+ KL+K K       P W G   +EV     +++VD+ K+ C+C K DLTGI
Subjt:  ESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGI

Query:  PCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRR-DASEPRPSQPVRRQTSTVTCSKCKNVG
        PC H                            YS  ++P NG + W     DP+ PP+ +R  GRP+R  RR D  E + S  ++R  +++ C +C  VG
Subjt:  PCVH------------------------QSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRR-DASEPRPSQPVRRQTSTVTCSKCKNVG

Query:  HNARSCK
        HN RSCK
Subjt:  HNARSCK

A0A2N9HUS0 Uncharacterized protein2.5e-10834.75Show/hide
Query:  EDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDS--KDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVN
        +   E+ D E   + S+DDGH +          NE C    V  G++E D   +D    G+   ++  ++            +D    Y S   L+   +
Subjt:  EDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDS--KDDYEKGEDSSDEGSVEANEPFDAHISIDADAKSDYRSSSDLNFPVN

Query:  SSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTN
            +    P ++     + I+F IGMKFN+ +  K+A+ +YAV GG+ IR  KNDK R  A C  GC W  +A    GE T Q++T+ +EH+ SR + N
Subjt:  SSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTN

Query:  RNLKSSCIARRYLSRFRQQPDW----------------------------------------------------RLNEGSSVKILCD----------RVH
            S  + ++  +R  +QPD                                                     R N GSSV +  D          R  
Subjt:  RNLKSSCIARRYLSRFRQQPDW----------------------------------------------------RLNEGSSVKILCD----------RVH

Query:  EDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKG
           NP++ RLY+C   CK GF V CRPFI +DACHL GP  G L++ +  D N+  +P+A+A+VEAETK SWTWFL  L  D+ + S +  TF+SD+QKG
Subjt:  EDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKG

Query:  LVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNA
        LVPTF  VF  ++HR CVRH+Y NF+K+F G+ LK+ FW+ A AT + +++ AM EL ++D  AF++V++ P + W +H F    K D L+NN  ESFN 
Subjt:  LVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNA

Query:  FITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVHQ
         I + R KPII ++E I   +M R     E   ++EG+L PNI+ KL +EK   GN     +GN  FEV  G  Q+ V+L+++ C+C++ DL+G+PC H 
Subjt:  FITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQFVVDLEKRICSCRKCDLTGIPCVHQ

Query:  SN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRP-SQPVRRQTSTVTCSKCKNVGHNARS
         +                         Y   + P+NG ++WE T   P++PP ++RP GRP++ +RR+  EPRP S  + ++   + C KC   GHN R+
Subjt:  SN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRP-SQPVRRQTSTVTCSKCKNVGHNARS

Query:  CKGEV
        CKG+V
Subjt:  CKGEV

M5X0G1 ZnF_PMZ domain-containing protein (Fragment)6.8e-10633.05Show/hide
Query:  GYKAILTNDDILDMVEMLPEDRMIHMYVEHNQN-REII-DFTVPIAEVKPMFLEWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVE
        G  A+  ++ IL  V  +P +R + +YVEH+ +  E+I D T P     PM       +A+++     G    EI+   + ++ ++  G+     D++ +
Subjt:  GYKAILTNDDILDMVEMLPEDRMIHMYVEHNQN-REII-DFTVPIAEVKPMFLEWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGKEEMDYMVE

Query:  KGMEKHVDGSDKEEIASMEGTDQNHLEV-VEEDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDE----G
                   +E  A   G  Q+  +V +  +++EY D + D D+++   +++ND D G  S +E  +E+     +   D  + Y+   D+ +     G
Subjt:  KGMEKHVDGSDKEEIASMEGTDQNHLEV-VEEDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDE----G

Query:  SVEANEPFDAHISI-DADAKSDYRSSSDLNFPVNS----SGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTA
          + N  ++  I + D +   +  +S +L+   NS    + RR     E   DT M+  +F +GMKF + KVLK AI+ Y     Y  +++KNDK R +A
Subjt:  SVEANEPFDAHISI-DADAKSDYRSSSDLNFPVNS----SGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTA

Query:  TCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWR------------------------------LNEGSSVK-
         C  GC WRL+AS+ +GE T+Q+K+Y  +HS S+ F N+N+ S+ +++RY+SR +  P  +                              L EG+  + 
Subjt:  TCDGGCTWRLHASVGKGEATFQVKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWR------------------------------LNEGSSVK-

Query:  -----ILCDRVHEDNN----------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRL
               C+ + + NN          P ++RLY+C   CK GF+ GCRP I +DACHL G  QG L+  VG D ND++YPIA+A+ E E+K SW WFL+L
Subjt:  -----ILCDRVHEDNN----------PIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRL

Query:  LESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWAR
        L  D+G  S  G+TF+SDQQKGL   F  V     HR+CVRHLY NF+++FKG  LK+  W AA ++ + +F+  M++L  LD  A+ ++KN     WAR
Subjt:  LESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWAR

Query:  HSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEV-EVGVAQFV
        H F    K D+LLNN  E FN++I   RDKPI+ MLE+I   +M R+  K ++ S+ EG + P I+ KLEK K       P++ GN LF+V +  V Q  
Subjt:  HSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEV-EVGVAQFV

Query:  VDLEKRICSCRKCDLTGIPCVHQSN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQ-RRDASEPRPSQ
        V+L  R CSCR+ +L GIP +H  +                         YS  ++PV     W      P+ PP++K+  GRP++++  +D+   +   
Subjt:  VDLEKRICSCRKCDLTGIPCVHQSN------------------------TYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQ-RRDASEPRPSQ

Query:  PVR--RQTSTVTCSKCKNVGHNARSCKGE
        P +  ++  T TC+KC   GHN  +CK +
Subjt:  PVR--RQTSTVTCSKCKNVGHNARSCKGE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64260.1 MuDR family transposase2.4e-1527.27Show/hide
Query:  YRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFN
        +R ++  F    +GF   CRP I +D   LNG  Q  LM   G D  +  +P+A+A+ +  +  SW WF   +   +     K    +S   + +V   N
Subjt:  YRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDIYPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFN

Query:  ---NVFQR--VDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWA
           +++Q     H+FC+ HL + F   F+   L++   +A     K EFD  M+++ + +  A++++  IP   WA
Subjt:  ---NVFQR--VDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQYVKNIPSKYWA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGAATACAATATAGTCCTAGGGTTGAGTATTTTGGGGGAAAGAACATTGAGTACAGGATATAAAGCGATTCTGACCAATGACGATATTTTGGATATGGTTGAGAT
GCTCCCAGAGGATAGGATGATCCATATGTATGTGGAGCACAACCAGAATAGAGAAATTATAGATTTTACGGTACCTATCGCTGAGGTTAAACCAATGTTTTTGGAGTGGT
ACCCTGAGGAAGCGAGTGTAGAGTCCCTTGATAAGGAAGGAACTGACATTGAAGAAATAAATTCTATTTCTTATAGGGAAATGACAAAAGATGTGGAAGGAACTGGCAAG
GAAGAAATGGATTATATGGTTGAGAAGGGAATGGAAAAACATGTGGATGGAAGTGATAAGGAAGAAATAGCATCTATGGAAGGCACTGACCAAAACCATCTGGAAGTTGT
AGAGGAAGATCAATTTGAGTATTGTGATGAAGAATGGGATGCAGATGAGTCAGAAGATGACGGTCACGATGATAACGACAAAGATATCGGGGGTACAAGTACAAATGAGA
CTTGTGAAGAGAATGAAGTTCAATCTGGCAATGTCGAATGTGACTCTAAAGATGATTATGAAAAAGGAGAAGATTCTAGCGATGAGGGTTCTGTGGAGGCGAATGAACCA
TTCGATGCACACATTTCTATTGACGCAGATGCGAAATCGGATTATCGATCATCGAGTGATTTGAATTTTCCGGTAAATTCAAGTGGAAGAAGAGTGAATGTTGATCCTGA
GTTTAGAGAAGATACATATATGGAAAGGATCGAATTTGTTATTGGAATGAAGTTCAACACTTCTAAGGTACTGAAAGATGCTATAAAAGAGTATGCAGTCAGAGGTGGGT
ACAATATTCGATTGATAAAGAATGACAAGCAACGGTGCACAGCTACTTGTGATGGAGGATGTACTTGGAGACTACATGCTAGTGTGGGTAAGGGGGAGGCCACTTTTCAG
GTAAAGACCTACAAAAGCGAGCATAGTTATAGTAGGGAGTTCACCAACCGAAACTTGAAATCTTCATGTATTGCTCGAAGATACCTATCAAGGTTTAGACAACAACCGGA
TTGGAGGTTAAATGAAGGTTCATCTGTGAAAATTTTGTGTGATCGGGTGCATGAGGACAATAATCCTATATATAGACGCCTTTACATTTGCTTTAAGGGGTGCAAGGATG
GGTTTCTAGTCGGATGTAGACCATTCATTTCATTGGATGCCTGTCACTTGAACGGTCCATGTCAAGGACATCTTATGTCTGTAGTTGGAACGGATGGGAATGACGACATC
TACCCTATAGCATGGGCTATAGTTGAAGCAGAAACAAAGACTAGTTGGACATGGTTTCTTCGTCTGCTCGAGAGTGACATAGGATCCTTTTCTGTGAAAGGATATACATT
CATGTCTGACCAACAAAAGGGATTGGTACCCACCTTCAATAATGTGTTTCAAAGGGTTGATCATCGATTTTGTGTAAGACATTTATATGCCAATTTCCAGAAGCAATTCA
AAGGATTGACATTGAAGAATTGGTTTTGGAAAGCTGCCCAAGCAACAACAAAATCTGAGTTCGATGGAGCTATGGACGAGTTGGGAAAACTTGATGTCAATGCCTTTCAA
TATGTCAAGAACATACCTTCCAAGTATTGGGCACGACACTCTTTCCTAACTAGCTGTAAGTCTGACCTCCTTCTAAACAACAATAGTGAGTCATTTAATGCTTTTATCAC
TCAAATGCGAGACAAGCCTATAATACAAATGTTAGAAATAATTTGGAAGCTTGTAATGACTAGGATCACAAAGAAGTGTGAAATTTACAGTAGGCTAGAAGGAAATTTAG
GGCCGAACATTAAGAGTAAGCTAGAAAAGGAAAAGAAACGTTATGGCAATGCAATACCTGTATGGTCTGGAAACCTGTTGTTTGAGGTTGAGGTGGGGGTTGCTCAATTT
GTAGTAGATTTGGAGAAGAGAATATGTAGTTGTCGAAAGTGTGACTTGACTGGAATCCCATGTGTCCATCAATCCAATACATACTCGAGCTTCCTCAAACCTGTGAATGG
GTCAAATCTTTGGGAAAAAACTACGGCGGACCCGCTACAACCACCTGTACTTAAACGACCACTAGGACGACCAAGAAGGCAACAAAGGAGAGATGCTAGTGAACCTAGAC
CTTCACAACCTGTGAGAAGACAAACATCTACGGTAACATGTTCAAAATGTAAAAATGTTGGCCATAATGCACGGTCGTGTAAAGGAGAAGTGGCAAGACAAAGAAAGAAG
AGGAGAACTACAACCTTTATGGGATTTAACATTCCATCTGCTGAGGAAGAGTCGTTGCAAGAGGAGGAGCCAATGGAAGTGGAAGTTCTTTGGTCTCAACCTGGATCATC
GACTCAAACATCTAGTACTCCAGCTAGAAGGACATATTTCACAAGATCTCATGATGAAGTCGTCAACCCAGTTGGAGATGGGGAGAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGAATACAATATAGTCCTAGGGTTGAGTATTTTGGGGGAAAGAACATTGAGTACAGGATATAAAGCGATTCTGACCAATGACGATATTTTGGATATGGTTGAGAT
GCTCCCAGAGGATAGGATGATCCATATGTATGTGGAGCACAACCAGAATAGAGAAATTATAGATTTTACGGTACCTATCGCTGAGGTTAAACCAATGTTTTTGGAGTGGT
ACCCTGAGGAAGCGAGTGTAGAGTCCCTTGATAAGGAAGGAACTGACATTGAAGAAATAAATTCTATTTCTTATAGGGAAATGACAAAAGATGTGGAAGGAACTGGCAAG
GAAGAAATGGATTATATGGTTGAGAAGGGAATGGAAAAACATGTGGATGGAAGTGATAAGGAAGAAATAGCATCTATGGAAGGCACTGACCAAAACCATCTGGAAGTTGT
AGAGGAAGATCAATTTGAGTATTGTGATGAAGAATGGGATGCAGATGAGTCAGAAGATGACGGTCACGATGATAACGACAAAGATATCGGGGGTACAAGTACAAATGAGA
CTTGTGAAGAGAATGAAGTTCAATCTGGCAATGTCGAATGTGACTCTAAAGATGATTATGAAAAAGGAGAAGATTCTAGCGATGAGGGTTCTGTGGAGGCGAATGAACCA
TTCGATGCACACATTTCTATTGACGCAGATGCGAAATCGGATTATCGATCATCGAGTGATTTGAATTTTCCGGTAAATTCAAGTGGAAGAAGAGTGAATGTTGATCCTGA
GTTTAGAGAAGATACATATATGGAAAGGATCGAATTTGTTATTGGAATGAAGTTCAACACTTCTAAGGTACTGAAAGATGCTATAAAAGAGTATGCAGTCAGAGGTGGGT
ACAATATTCGATTGATAAAGAATGACAAGCAACGGTGCACAGCTACTTGTGATGGAGGATGTACTTGGAGACTACATGCTAGTGTGGGTAAGGGGGAGGCCACTTTTCAG
GTAAAGACCTACAAAAGCGAGCATAGTTATAGTAGGGAGTTCACCAACCGAAACTTGAAATCTTCATGTATTGCTCGAAGATACCTATCAAGGTTTAGACAACAACCGGA
TTGGAGGTTAAATGAAGGTTCATCTGTGAAAATTTTGTGTGATCGGGTGCATGAGGACAATAATCCTATATATAGACGCCTTTACATTTGCTTTAAGGGGTGCAAGGATG
GGTTTCTAGTCGGATGTAGACCATTCATTTCATTGGATGCCTGTCACTTGAACGGTCCATGTCAAGGACATCTTATGTCTGTAGTTGGAACGGATGGGAATGACGACATC
TACCCTATAGCATGGGCTATAGTTGAAGCAGAAACAAAGACTAGTTGGACATGGTTTCTTCGTCTGCTCGAGAGTGACATAGGATCCTTTTCTGTGAAAGGATATACATT
CATGTCTGACCAACAAAAGGGATTGGTACCCACCTTCAATAATGTGTTTCAAAGGGTTGATCATCGATTTTGTGTAAGACATTTATATGCCAATTTCCAGAAGCAATTCA
AAGGATTGACATTGAAGAATTGGTTTTGGAAAGCTGCCCAAGCAACAACAAAATCTGAGTTCGATGGAGCTATGGACGAGTTGGGAAAACTTGATGTCAATGCCTTTCAA
TATGTCAAGAACATACCTTCCAAGTATTGGGCACGACACTCTTTCCTAACTAGCTGTAAGTCTGACCTCCTTCTAAACAACAATAGTGAGTCATTTAATGCTTTTATCAC
TCAAATGCGAGACAAGCCTATAATACAAATGTTAGAAATAATTTGGAAGCTTGTAATGACTAGGATCACAAAGAAGTGTGAAATTTACAGTAGGCTAGAAGGAAATTTAG
GGCCGAACATTAAGAGTAAGCTAGAAAAGGAAAAGAAACGTTATGGCAATGCAATACCTGTATGGTCTGGAAACCTGTTGTTTGAGGTTGAGGTGGGGGTTGCTCAATTT
GTAGTAGATTTGGAGAAGAGAATATGTAGTTGTCGAAAGTGTGACTTGACTGGAATCCCATGTGTCCATCAATCCAATACATACTCGAGCTTCCTCAAACCTGTGAATGG
GTCAAATCTTTGGGAAAAAACTACGGCGGACCCGCTACAACCACCTGTACTTAAACGACCACTAGGACGACCAAGAAGGCAACAAAGGAGAGATGCTAGTGAACCTAGAC
CTTCACAACCTGTGAGAAGACAAACATCTACGGTAACATGTTCAAAATGTAAAAATGTTGGCCATAATGCACGGTCGTGTAAAGGAGAAGTGGCAAGACAAAGAAAGAAG
AGGAGAACTACAACCTTTATGGGATTTAACATTCCATCTGCTGAGGAAGAGTCGTTGCAAGAGGAGGAGCCAATGGAAGTGGAAGTTCTTTGGTCTCAACCTGGATCATC
GACTCAAACATCTAGTACTCCAGCTAGAAGGACATATTTCACAAGATCTCATGATGAAGTCGTCAACCCAGTTGGAGATGGGGAGAAGTAA
Protein sequenceShow/hide protein sequence
MGEYNIVLGLSILGERTLSTGYKAILTNDDILDMVEMLPEDRMIHMYVEHNQNREIIDFTVPIAEVKPMFLEWYPEEASVESLDKEGTDIEEINSISYREMTKDVEGTGK
EEMDYMVEKGMEKHVDGSDKEEIASMEGTDQNHLEVVEEDQFEYCDEEWDADESEDDGHDDNDKDIGGTSTNETCEENEVQSGNVECDSKDDYEKGEDSSDEGSVEANEP
FDAHISIDADAKSDYRSSSDLNFPVNSSGRRVNVDPEFREDTYMERIEFVIGMKFNTSKVLKDAIKEYAVRGGYNIRLIKNDKQRCTATCDGGCTWRLHASVGKGEATFQ
VKTYKSEHSYSREFTNRNLKSSCIARRYLSRFRQQPDWRLNEGSSVKILCDRVHEDNNPIYRRLYICFKGCKDGFLVGCRPFISLDACHLNGPCQGHLMSVVGTDGNDDI
YPIAWAIVEAETKTSWTWFLRLLESDIGSFSVKGYTFMSDQQKGLVPTFNNVFQRVDHRFCVRHLYANFQKQFKGLTLKNWFWKAAQATTKSEFDGAMDELGKLDVNAFQ
YVKNIPSKYWARHSFLTSCKSDLLLNNNSESFNAFITQMRDKPIIQMLEIIWKLVMTRITKKCEIYSRLEGNLGPNIKSKLEKEKKRYGNAIPVWSGNLLFEVEVGVAQF
VVDLEKRICSCRKCDLTGIPCVHQSNTYSSFLKPVNGSNLWEKTTADPLQPPVLKRPLGRPRRQQRRDASEPRPSQPVRRQTSTVTCSKCKNVGHNARSCKGEVARQRKK
RRTTTFMGFNIPSAEEESLQEEEPMEVEVLWSQPGSSTQTSSTPARRTYFTRSHDEVVNPVGDGEK