; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy01g005390 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy01g005390
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionMuDRA-like transposase
Genome locationChr01:7506923..7513461
RNA-Seq ExpressionLcy01g005390
SyntenyLcy01g005390
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]8.9e-11646.71Show/hide
Query:  NRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMA
        +RQAKSWV+G+L+++KF DV RTYRPKDI+ D+R+++GVNLSYD+AWR+ EE   L RG    SY LLP +GEALKI  PGT+F+LEL+   +FK+VFMA
Subjt:  NRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMA

Query:  LGASIGGFKSSIRPVLVMEHA------------------------------------------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHC
        LG SI GF + IRPVLV++ A                                                 G V   V VS RH +I KA+  VFP AFHC
Subjt:  LGASIGGFKSSIRPVLVMEHA------------------------------------------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHC

Query:  VCIRHLRENLKTHFK--SGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMN
         CI+H++ NL   FK  +    ++F  AAKA+RE  FN  W QL     VR YL  +G ERWAR +Q    Y QMT+N AE  N++ + AR+ P+T  ++
Subjt:  VCIRHLRENLKTHFK--SGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMN

Query:  HIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLY
        HIRG  Q  +Y RR   S+  +  S +A   L +  + A+RHVV  I Q+  +V DG+L G V+ ++RTC CREF+YF++PCSHAIA    RNI+ +TL 
Subjt:  HIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLY

Query:  SRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLNNS
           Y  NS + AYA+PI P+GHVS W  SP FV+  V  P  VPRVGRR+ +RIPS GEV+   KC RC   GHN KTC +PLN +
Subjt:  SRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLNNS

XP_022154803.1 uncharacterized protein LOC111021969 [Momordica charantia]2.1e-12034.06Show/hide
Query:  GLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTW-KSVSEVPIYISTIPKCPSNQAS
        G WNE    Y+GG + G ++D GI Y D +S ++R+T I+ + F++ ++C+ + + +   P F I +D  L  +L      S+VP+Y+S +PK      S
Subjt:  GLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTW-KSVSEVPIYISTIPKCPSNQAS

Query:  RPFPIPC-----AMPNNPYEVEQNLSVPP----ISASVLNNIPSSFTSF---------PCN----ESKNIPVY-NLGDDEDHDSEFYG--RQDWGDY---
            +P         + P++VEQN+  P     + +S+    P S   F         PCN    E ++   + ++GDDEDH+    G   +D  DY   
Subjt:  RPFPIPC-----AMPNNPYEVEQNLSVPP----ISASVLNNIPSSFTSF---------PCN----ESKNIPVY-NLGDDEDHDSEFYG--RQDWGDY---

Query:  ---GDNE----------------------------------------------DTDAKVDVAVGD-----------------------------------
            DNE                                              D D   D+A+G                                    
Subjt:  ---GDNE----------------------------------------------DTDAKVDVAVGD-----------------------------------

Query:  ---------------------------DEEEIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFIL
                                    E+ IH   TLR     ++QA SWV+G LIK+  +D+ R YRP+DI+ D+R+++GVN  Y++ WRARE    L
Subjt:  ---------------------------DEEEIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFIL

Query:  ARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVMEHA-----------------------------------
          GS +ESY  L ++G ALK    GTVF ++LEDD +FK+ FMALG SI GF S IR VLV++ A                                   
Subjt:  ARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVMEHA-----------------------------------

Query:  -------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVG
                     +G V G V VS RH +I  +V T+F +A H  C+ H+   L   F++G   ++F  AAKAF+  +F  YW QL+    V  YL+ +G
Subjt:  -------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVG

Query:  IERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGH
        +++WAR YQ G  YNQMT+NLAE  N++L  AR+ PIT    + R   Q  +Y RR   S+  +  +++A  IL++  E+A+ H VRPI ++EFEV DG 
Subjt:  IERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGH

Query:  LGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIG
           RVN++++TC C++F Y+E+PCSHAIA    RNI  HTL S  Y +++LI AY +P+ PLG   +W     +V   + PPR VPRVGR Q  RIPS+G
Subjt:  LGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIG

Query:  EVQHVHKCSRCRNMGHNIKTCRQPLNNS
        EV+ VHKC RC N+GHN KTCRQPL  +
Subjt:  EVQHVHKCSRCRNMGHNIKTCRQPLNNS

XP_022155431.1 uncharacterized protein LOC111022579 isoform X1 [Momordica charantia]2.0e-11534.84Show/hide
Query:  RVWVCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTWKSVSEVPIYISTIPKC
        R++V Y G+WN+++ EY+GG L+G  +   I Y D L  LY LT    +E D++++C+ ++K+R   P F I ND DLN +L  +  SEVP+YIS  P+ 
Subjt:  RVWVCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTWKSVSEVPIYISTIPKC

Query:  -------------------PSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNIPSSFTSFPCNESKNIP--VYNLG----DDEDHDSEFYGRQDW
                                 R   + C  P  P E +    VP +       +  ++ S    + K +   V++ G    D+ D    F+G++D 
Subjt:  -------------------PSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNIPSSFTSFPCNESKNIP--VYNLG----DDEDHDSEFYGRQDW

Query:  G------------DYGDNEDTDAKVDVAV---------------GDDEEEIHAY-----STLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVND
                     ++   + T   + +                 G D   I  Y      +L  + R +RQAK+WV+G LIKSKF  V   Y+P++I+ D
Subjt:  G------------DYGDNEDTDAKVDVAV---------------GDDEEEIHAY-----STLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVND

Query:  IRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLV---------------
        +RQD+G+N+SY++AWRARE  ++  +GSSEESY LL R+GEALK+  PGT ++++LED   FK++FMALG  I GF + IRPV+V               
Subjt:  IRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLV---------------

Query:  ---------------------------------MEHAVGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFRE
                                         ++ A+G+V   + VS R+ SI K++  VFP AFH +CI HL++NL   F +     +F  AAKA+RE
Subjt:  ---------------------------------MEHAVGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFRE

Query:  VEFNEYWQQLS---ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILE
         +F E W+++S     V  YL+ VG+ RW R Y  G  YN MTTN+AE  N++LK+ARE  +   + H+R   Q  +  RR+  S   S  +K+A  I+ 
Subjt:  VEFNEYWQQLS---ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILE

Query:  KAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFV
        K ++++    V  I  ++F V D      VNL+ R C C EF++ +LPC+HAI A   +NID ++L +  Y    L+ AYA  I  +G  SEWK    +V
Subjt:  KAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFV

Query:  NVEVLPPRRVPRVGRRQALRIPSIGEVQ--HVHKCSRCRNMGHNIKTCRQPL
         + VLPP+ V +VGRRQ+ +I S GE +   +HKC RC   GHN  TCR P+
Subjt:  NVEVLPPRRVPRVGRRQALRIPSIGEVQ--HVHKCSRCRNMGHNIKTCRQPL

XP_031736136.1 uncharacterized protein LOC116401763 [Cucumis sativus]4.9e-11433.04Show/hide
Query:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----
        V Y G+W+ER ++Y+GG L+G  +   I + D  + LY L  +   +FD+ +RC+ ++K    AP F + ND DL  +L  ++  +VP+Y+S  PK    
Subjt:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----

Query:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----
                     SNQA    P P  + +   E E  + V  +   + +N+                   +FT  P    +ES +IP +  G  +D    
Subjt:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----

Query:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA
               S    + D  D+ +   T  + DV                          V    +E                                 +H+
Subjt:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA

Query:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED
         S L  L R +RQAKSWV+G LIKSKF+ VGR Y+P+DI+ D+RQD+G+N+SY++AWRARE  +   RG  EESY LL R+GEALK+   GT+F +ELED
Subjt:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED

Query:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV
        +  FK++FMA+G  + GF + IRPV+VM+                                                 A+G+VP    V+ R     K +
Subjt:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV

Query:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR
         +VFP+AFH +C++HL +NL   +K+     +F  A++ +RE  F+E W+ +     +   YL  VGI RW+R +  G  YN MTTN+AE  NS+LK+ R
Subjt:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR

Query:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY
        + PI  F+ H+R   Q  ++ RR       S  +K+A  +L+K +E+A    V PI  Y+F V D      +NLH + C C+EF+  +LPC+HAIA    
Subjt:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY

Query:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN
        RNI+ ++L +  Y    L+ AY++ + P+G+ SEWK + ++V++ VLPP+ V RVGR +  RIPS+GE   +HKC RC+  GHN  TC  P++
Subjt:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN

XP_031740993.1 uncharacterized protein LOC116403682 [Cucumis sativus]4.9e-11433.04Show/hide
Query:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----
        V Y G+W+ER ++Y+GG L+G  +   I + D  + LY L  +   +FD+ +RC+ ++K    AP F + ND DL  +L  ++  +VP+Y+S  PK    
Subjt:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----

Query:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----
                     SNQA    P P  + +   E E  + V  +   + +N+                   +FT  P    +ES +IP +  G  +D    
Subjt:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----

Query:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA
               S    + D  D+ +   T  + DV                          V    +E                                 +H+
Subjt:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA

Query:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED
         S L  L R +RQAKSWV+G LIKSKF+ VGR Y+P+DI+ D+RQD+G+N+SY++AWRARE  +   RG  EESY LL R+GEALK+   GT+F +ELED
Subjt:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED

Query:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV
        +  FK++FMA+G  + GF + IRPV+VM+                                                 A+G+VP    V+ R     K +
Subjt:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV

Query:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR
         +VFP+AFH +C++HL +NL   +K+     +F  A++ +RE  F+E W+ +     +   YL  VGI RW+R +  G  YN MTTN+AE  NS+LK+ R
Subjt:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR

Query:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY
        + PI  F+ H+R   Q  ++ RR       S  +K+A  +L+K +E+A    V PI  Y+F V D      +NLH + C C+EF+  +LPC+HAIA    
Subjt:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY

Query:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN
        RNI+ ++L +  Y    L+ AY++ + P+G+ SEWK + ++V++ VLPP+ V RVGR +  RIPS+GE   +HKC RC+  GHN  TC  P++
Subjt:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN

TrEMBL top hitse value%identityAlignment
A0A2I7YUI4 MuDRA-like transposase2.4e-11433.04Show/hide
Query:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----
        V Y G+W+ER ++Y+GG L+G  +   I + D  + LY L  +   +FD+ +RC+ ++K    AP F + ND DL  +L  ++  +VP+Y+S  PK    
Subjt:  VCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPK----

Query:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----
                     SNQA    P P  + +   E E  + V  +   + +N+                   +FT  P    +ES +IP +  G  +D    
Subjt:  -----------CPSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNI-----------------PSSFTSFPC---NESKNIPVYNLGDDED----

Query:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA
               S    + D  D+ +   T  + DV                          V    +E                                 +H+
Subjt:  -----HDSEFYGRQDWGDYGDNEDTDAKVDV-------------------------AVGDDEEE---------------------------------IHA

Query:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED
         S L  L R +RQAKSWV+G LIKSKF+ VGR Y+P+DI+ D+RQD+G+N+SY++AWRARE  +   RG  EESY LL R+GEALK+   GT+F +ELED
Subjt:  YSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELED

Query:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV
        +  FK++FMA+G  + GF + IRPV+VM+                                                 A+G+VP    V+ R     K +
Subjt:  DGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSGRHASIGKAV

Query:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR
         +VFP+AFH +C++HL +NL   +K+     +F  A++ +RE  F+E W+ +     +   YL  VGI RW+R +  G  YN MTTN+AE  NS+LK+ R
Subjt:  KTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDAR

Query:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY
        + PI  F+ H+R   Q  ++ RR       S  +K+A  +L+K +E+A    V PI  Y+F V D      +NLH + C C+EF+  +LPC+HAIA    
Subjt:  ERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTY

Query:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN
        RNI+ ++L +  Y    L+ AY++ + P+G+ SEWK + ++V++ VLPP+ V RVGR +  RIPS+GE   +HKC RC+  GHN  TC  P++
Subjt:  RNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLN

A0A5A7UB55 MuDRA-like transposase3.4e-11332.67Show/hide
Query:  MPRVWVC--YDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYIST
        M  VW+   + G W+E +++Y+GG L+G  +   I + D  S LY L  +   +FD+++RC+ ++K    AP F + ND DL  ++  ++  EVP+Y+S 
Subjt:  MPRVWVC--YDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLK--SRAPAFVIRNDGDLNTFLTWKSVSEVPIYIST

Query:  IPKCPSN-----------------QASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNN-------IPSSFTSFPCNES----KNIPVYN-------LGD
         P    +                 Q   P P P  M       E  + +  +   + +N       I  S+ S+   +     +++ +YN         D
Subjt:  IPKCPSN-----------------QASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNN-------IPSSFTSFPCNES----KNIPVYN-------LGD

Query:  DEDHDSEFYGRQDWGDY---------GDNEDTDAKVDVAVGD--------------------------------------DEE-----------------
            D +  G+ D+  Y         G +E++    ++ VG                                       D +                 
Subjt:  DEDHDSEFYGRQDWGDY---------GDNEDTDAKVDVAVGD--------------------------------------DEE-----------------

Query:  -----EIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPG
             ++H+ S L FL R +RQAKSWV+G LIKSKF+  GR Y+P+DI+ D+RQD+G+N+SY++AWRARE  +   RGS EESY LL R+GEALK    G
Subjt:  -----EIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPG

Query:  TVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSG
        T+F +ELEDD  FK++FMA+GA + GF + IRPV+VM+                                                 A+G+VP    V+ 
Subjt:  TVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVME------------------------------------------------HAVGQVPGPVIVSG

Query:  RHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAEC
        R     K + +VFP+AFH +C++HL +NL   +K+     +F  A++ +RE  F E W+ L         YL  VGI RW+R +  G  YN MTTN+AE 
Subjt:  RHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQL---SENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAEC

Query:  TNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPC
         NS+LK+ R+ PI  F+ ++R   Q  ++ RR       S  +K+A  +++K +E A    V PI  Y+F V D      VNL  + C C+EF+  +LPC
Subjt:  TNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPC

Query:  SHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQP
        SHAIAA   RNI+ ++L +  Y    L+ AYA+ + P+G+ S+WK S  +V++ VLPP+ V RVGR +  RIPS+GE   +HKC RC+ +GHN  TC  P
Subjt:  SHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQP

Query:  LN
        ++
Subjt:  LN

A0A6J1DJT1 uncharacterized protein LOC1110207154.3e-11646.71Show/hide
Query:  NRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMA
        +RQAKSWV+G+L+++KF DV RTYRPKDI+ D+R+++GVNLSYD+AWR+ EE   L RG    SY LLP +GEALKI  PGT+F+LEL+   +FK+VFMA
Subjt:  NRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMA

Query:  LGASIGGFKSSIRPVLVMEHA------------------------------------------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHC
        LG SI GF + IRPVLV++ A                                                 G V   V VS RH +I KA+  VFP AFHC
Subjt:  LGASIGGFKSSIRPVLVMEHA------------------------------------------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHC

Query:  VCIRHLRENLKTHFK--SGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMN
         CI+H++ NL   FK  +    ++F  AAKA+RE  FN  W QL     VR YL  +G ERWAR +Q    Y QMT+N AE  N++ + AR+ P+T  ++
Subjt:  VCIRHLRENLKTHFK--SGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMN

Query:  HIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLY
        HIRG  Q  +Y RR   S+  +  S +A   L +  + A+RHVV  I Q+  +V DG+L G V+ ++RTC CREF+YF++PCSHAIA    RNI+ +TL 
Subjt:  HIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLY

Query:  SRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLNNS
           Y  NS + AYA+PI P+GHVS W  SP FV+  V  P  VPRVGRR+ +RIPS GEV+   KC RC   GHN KTC +PLN +
Subjt:  SRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLNNS

A0A6J1DLB0 uncharacterized protein LOC1110219699.9e-12134.06Show/hide
Query:  GLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTW-KSVSEVPIYISTIPKCPSNQAS
        G WNE    Y+GG + G ++D GI Y D +S ++R+T I+ + F++ ++C+ + + +   P F I +D  L  +L      S+VP+Y+S +PK      S
Subjt:  GLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTW-KSVSEVPIYISTIPKCPSNQAS

Query:  RPFPIPC-----AMPNNPYEVEQNLSVPP----ISASVLNNIPSSFTSF---------PCN----ESKNIPVY-NLGDDEDHDSEFYG--RQDWGDY---
            +P         + P++VEQN+  P     + +S+    P S   F         PCN    E ++   + ++GDDEDH+    G   +D  DY   
Subjt:  RPFPIPC-----AMPNNPYEVEQNLSVPP----ISASVLNNIPSSFTSF---------PCN----ESKNIPVY-NLGDDEDHDSEFYG--RQDWGDY---

Query:  ---GDNE----------------------------------------------DTDAKVDVAVGD-----------------------------------
            DNE                                              D D   D+A+G                                    
Subjt:  ---GDNE----------------------------------------------DTDAKVDVAVGD-----------------------------------

Query:  ---------------------------DEEEIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFIL
                                    E+ IH   TLR     ++QA SWV+G LIK+  +D+ R YRP+DI+ D+R+++GVN  Y++ WRARE    L
Subjt:  ---------------------------DEEEIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVNDIRQDFGVNLSYDEAWRAREETFIL

Query:  ARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVMEHA-----------------------------------
          GS +ESY  L ++G ALK    GTVF ++LEDD +FK+ FMALG SI GF S IR VLV++ A                                   
Subjt:  ARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVMEHA-----------------------------------

Query:  -------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVG
                     +G V G V VS RH +I  +V T+F +A H  C+ H+   L   F++G   ++F  AAKAF+  +F  YW QL+    V  YL+ +G
Subjt:  -------------VGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLS--ENVRTYLQGVG

Query:  IERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGH
        +++WAR YQ G  YNQMT+NLAE  N++L  AR+ PIT    + R   Q  +Y RR   S+  +  +++A  IL++  E+A+ H VRPI ++EFEV DG 
Subjt:  IERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGH

Query:  LGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIG
           RVN++++TC C++F Y+E+PCSHAIA    RNI  HTL S  Y +++LI AY +P+ PLG   +W     +V   + PPR VPRVGR Q  RIPS+G
Subjt:  LGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIG

Query:  EVQHVHKCSRCRNMGHNIKTCRQPLNNS
        EV+ VHKC RC N+GHN KTCRQPL  +
Subjt:  EVQHVHKCSRCRNMGHNIKTCRQPLNNS

A0A6J1DRN0 uncharacterized protein LOC111022579 isoform X19.6e-11634.84Show/hide
Query:  RVWVCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTWKSVSEVPIYISTIPKC
        R++V Y G+WN+++ EY+GG L+G  +   I Y D L  LY LT    +E D++++C+ ++K+R   P F I ND DLN +L  +  SEVP+YIS  P+ 
Subjt:  RVWVCYDGLWNEREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSR--APAFVIRNDGDLNTFLTWKSVSEVPIYISTIPKC

Query:  -------------------PSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNIPSSFTSFPCNESKNIP--VYNLG----DDEDHDSEFYGRQDW
                                 R   + C  P  P E +    VP +       +  ++ S    + K +   V++ G    D+ D    F+G++D 
Subjt:  -------------------PSNQASRPFPIPCAMPNNPYEVEQNLSVPPISASVLNNIPSSFTSFPCNESKNIP--VYNLG----DDEDHDSEFYGRQDW

Query:  G------------DYGDNEDTDAKVDVAV---------------GDDEEEIHAY-----STLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVND
                     ++   + T   + +                 G D   I  Y      +L  + R +RQAK+WV+G LIKSKF  V   Y+P++I+ D
Subjt:  G------------DYGDNEDTDAKVDVAV---------------GDDEEEIHAY-----STLRFLLRLNRQAKSWVIGNLIKSKFQDVGRTYRPKDIVND

Query:  IRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLV---------------
        +RQD+G+N+SY++AWRARE  ++  +GSSEESY LL R+GEALK+  PGT ++++LED   FK++FMALG  I GF + IRPV+V               
Subjt:  IRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLV---------------

Query:  ---------------------------------MEHAVGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFRE
                                         ++ A+G+V   + VS R+ SI K++  VFP AFH +CI HL++NL   F +     +F  AAKA+RE
Subjt:  ---------------------------------MEHAVGQVPGPVIVSGRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFRE

Query:  VEFNEYWQQLS---ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILE
         +F E W+++S     V  YL+ VG+ RW R Y  G  YN MTTN+AE  N++LK+ARE  +   + H+R   Q  +  RR+  S   S  +K+A  I+ 
Subjt:  VEFNEYWQQLS---ENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILE

Query:  KAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFV
        K ++++    V  I  ++F V D      VNL+ R C C EF++ +LPC+HAI A   +NID ++L +  Y    L+ AYA  I  +G  SEWK    +V
Subjt:  KAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFV

Query:  NVEVLPPRRVPRVGRRQALRIPSIGEVQ--HVHKCSRCRNMGHNIKTCRQPL
         + VLPP+ V +VGRRQ+ +I S GE +   +HKC RC   GHN  TCR P+
Subjt:  NVEVLPPRRVPRVGRRQALRIPSIGEVQ--HVHKCSRCRNMGHNIKTCRQPL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64255.1 MuDR family transposase1.1e-0521.46Show/hide
Query:  PNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLSE---NVRTYLQGVGIERWARAYQCGHWYNQMTTN----LAECTN-SMLKDA
        P A+H   + H        F S         A    ++ EF  Y   + E     R +L      RWA A+  G  Y  M  N     A C         
Subjt:  PNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLSE---NVRTYLQGVGIERWARAYQCGHWYNQMTTN----LAECTN-SMLKDA

Query:  RERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEE-----KAKRHVVRPIAQYEFEVDDGHLGGR--VNLHARTCICREFEYFELPCS
            + +  + +R  F   +   R++ +        +   +++K EE         ++V P+    F+V      G   V L   +C C +F+ ++ PC 
Subjt:  RERPITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEE-----KAKRHVVRPIAQYEFEVDDGHLGGR--VNLHARTCICREFEYFELPCS

Query:  HAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVP
        HA+A C     +        Y +  L   YA     +  +S W   P+   V  L P  +P
Subjt:  HAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVP

AT1G64260.1 MuDR family transposase5.9e-0922.96Show/hide
Query:  PNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLSE---NVRTYLQGVGIERWARAYQCGHWYNQMTTN----LAECTN-SMLKDA
        P A H  C+ HLR      F+  N   + E A    ++ EF+ Y   + E       +L  +   +WA A+  G  Y  +  +     A C        A
Subjt:  PNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLSE---NVRTYLQGVGIERWARAYQCGHWYNQMTTN----LAECTN-SMLKDA

Query:  RERPITIFMNHIRGWFQAKY-----YIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGR--VNLHARTCICREFEYFELPCS
            + +  + +R  F          + R    T P       M  LE+    +  +V+  + +  F+V +        V L+  TC CR+F+ ++ PC 
Subjt:  RERPITIFMNHIRGWFQAKY-----YIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGR--VNLHARTCICREFEYFELPCS

Query:  HAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPP
        HA+A      I+        Y V      YA    P+  V+ W   P+   V  L P
Subjt:  HAIAACTYRNIDYHTLYSRVYHVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTCTTTCTGCTTGATAAAGTGGATTCTTTGGTGGAGGCGACCTTCCTACTCTCCCAAGTTTTGGAAGATGCTCTGTTGAAATTCAATCGTGGGAAGCTATCAAT
AACAGTCTCATACAATGTTGTGGTTGCCTCCTTTGAGATCCCAAGGTCGCAATTTATATATTATGATGCGAACAAAAATTATGTTGGAAAAATTTCTTTGAAATCTTTCC
ACGATGCGATGGCCAATAAGAAAGGTCCATTGGCGGTCTACCTTATTGAAGAATCAAGAAATGTCCTGATAGTAAGAATGGAGGGATCGGGGTCTACATTATCGGTGCCG
TTCTCTCCGGCTGAAGAACAAGATTTTGGACACATTATGTACGATAATGTGTTTGGTCTTGGTTCTCAAGAATTAAGATGGATCATAAAACAATTGCCTCTCTTTGAGTA
CTCAGTTATTGTTACCGTCCAGAGTTCACAAATCAAATTTTTCGTTGCATCATCAAATGCTATCATTCTTACTAAAGAGAGTGGACGCTATACAATTCAAGGTTATGAAG
GAGATGTTGAAGCTCAACTCCATGTTGTTCTTCGTCCCATCGTGTTTTTCCTAGATTTGACACATAAAATAATGCCTCGTGTGTGGGTTTGTTATGATGGTTTGTGGAAT
GAGAGGGAAAAGGAATATAAGGGTGGAGAGTTGAGGGGTTTCGACATAGACATTGGTATTGCATATAGTGACTTCTTAAGTCGATTATATAGATTAACCAGTATAAGTTC
AGAGGAATTTGACCTTAGAGTAAGATGTCTACTCCAATTGAAGTCCAGGGCACCCGCATTTGTTATTAGAAACGATGGAGACCTTAACACCTTCTTGACATGGAAGTCAG
TCTCTGAAGTACCCATCTACATATCCACAATCCCAAAGTGTCCAAGTAACCAAGCCTCGCGACCATTTCCCATCCCATGCGCCATGCCAAATAATCCGTATGAGGTGGAG
CAGAATTTATCAGTTCCACCGATATCTGCATCAGTCTTAAACAATATACCATCGTCATTTACATCCTTCCCATGCAATGAATCCAAAAATATTCCTGTGTATAACTTGGG
GGATGATGAAGACCATGATAGTGAATTTTATGGTCGCCAAGATTGGGGGGACTACGGAGATAATGAGGACACAGACGCAAAGGTTGATGTAGCTGTGGGAGATGATGAGG
AAGAGATTCACGCGTACAGTACACTGAGGTTCCTTCTGCGCCTGAACAGGCAGGCCAAGAGTTGGGTTATTGGAAATTTGATCAAGTCGAAGTTCCAAGATGTCGGTCGT
ACTTATAGGCCAAAGGATATTGTGAATGATATTAGACAAGATTTTGGTGTGAATTTAAGCTACGACGAGGCTTGGCGGGCTAGAGAAGAAACTTTCATTCTTGCTAGAGG
ATCTTCAGAAGAATCTTACAAACTGTTACCGAGATTTGGTGAAGCATTGAAAATTGAAATTCCTGGTACAGTGTTCGACTTAGAACTTGAAGACGATGGACACTTTAAGC
ATGTGTTTATGGCACTAGGTGCTTCTATTGGAGGGTTCAAGAGCTCTATTCGTCCAGTACTAGTGATGGAACATGCAGTTGGACAAGTTCCTGGTCCGGTCATTGTGTCT
GGCAGACATGCCAGCATCGGTAAGGCAGTCAAAACTGTATTCCCTAATGCATTTCACTGTGTGTGTATCCGGCACCTAAGGGAGAACCTAAAGACACATTTTAAGTCCGG
GAACTTCATTAAAGTATTTGAGGGAGCTGCAAAGGCATTTCGCGAGGTCGAATTCAATGAGTACTGGCAGCAACTCTCGGAAAATGTTAGGACATACTTACAAGGGGTCG
GAATTGAGAGGTGGGCTCGAGCATACCAGTGTGGTCATTGGTATAACCAAATGACCACCAATCTTGCGGAGTGCACTAACTCAATGTTGAAAGATGCACGAGAGCGACCA
ATCACTATATTTATGAATCATATAAGAGGGTGGTTCCAAGCAAAATACTACATACGCAGGAATACACACTCCACTAACCCGAGCCAGTGGTCAAAGTTTGCAATGACCAT
TTTGGAAAAAGCAGAGGAGAAGGCAAAGAGGCACGTGGTCAGACCTATTGCTCAATATGAGTTTGAGGTCGACGATGGACATTTGGGTGGACGTGTCAACCTTCACGCTA
GAACATGCATCTGTCGTGAATTTGAATATTTCGAACTTCCATGCTCCCATGCGATTGCTGCATGCACTTATCGCAACATCGACTATCATACCCTTTATTCCAGAGTCTAC
CATGTGAACTCCTTGATAGATGCATATGCACAGCCCATACAACCCCTTGGGCATGTGTCGGAGTGGAAGAGGTCACCTCAATTCGTCAACGTGGAAGTCTTGCCCCCTCG
AAGAGTGCCTAGGGTTGGTCGTCGTCAGGCGTTGAGAATACCGTCTATTGGAGAAGTGCAACATGTCCACAAATGCAGCCGGTGTAGGAATATGGGTCATAACATAAAAA
CATGCAGACAACCACTGAACAATTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTCTTTCTGCTTGATAAAGTGGATTCTTTGGTGGAGGCGACCTTCCTACTCTCCCAAGTTTTGGAAGATGCTCTGTTGAAATTCAATCGTGGGAAGCTATCAAT
AACAGTCTCATACAATGTTGTGGTTGCCTCCTTTGAGATCCCAAGGTCGCAATTTATATATTATGATGCGAACAAAAATTATGTTGGAAAAATTTCTTTGAAATCTTTCC
ACGATGCGATGGCCAATAAGAAAGGTCCATTGGCGGTCTACCTTATTGAAGAATCAAGAAATGTCCTGATAGTAAGAATGGAGGGATCGGGGTCTACATTATCGGTGCCG
TTCTCTCCGGCTGAAGAACAAGATTTTGGACACATTATGTACGATAATGTGTTTGGTCTTGGTTCTCAAGAATTAAGATGGATCATAAAACAATTGCCTCTCTTTGAGTA
CTCAGTTATTGTTACCGTCCAGAGTTCACAAATCAAATTTTTCGTTGCATCATCAAATGCTATCATTCTTACTAAAGAGAGTGGACGCTATACAATTCAAGGTTATGAAG
GAGATGTTGAAGCTCAACTCCATGTTGTTCTTCGTCCCATCGTGTTTTTCCTAGATTTGACACATAAAATAATGCCTCGTGTGTGGGTTTGTTATGATGGTTTGTGGAAT
GAGAGGGAAAAGGAATATAAGGGTGGAGAGTTGAGGGGTTTCGACATAGACATTGGTATTGCATATAGTGACTTCTTAAGTCGATTATATAGATTAACCAGTATAAGTTC
AGAGGAATTTGACCTTAGAGTAAGATGTCTACTCCAATTGAAGTCCAGGGCACCCGCATTTGTTATTAGAAACGATGGAGACCTTAACACCTTCTTGACATGGAAGTCAG
TCTCTGAAGTACCCATCTACATATCCACAATCCCAAAGTGTCCAAGTAACCAAGCCTCGCGACCATTTCCCATCCCATGCGCCATGCCAAATAATCCGTATGAGGTGGAG
CAGAATTTATCAGTTCCACCGATATCTGCATCAGTCTTAAACAATATACCATCGTCATTTACATCCTTCCCATGCAATGAATCCAAAAATATTCCTGTGTATAACTTGGG
GGATGATGAAGACCATGATAGTGAATTTTATGGTCGCCAAGATTGGGGGGACTACGGAGATAATGAGGACACAGACGCAAAGGTTGATGTAGCTGTGGGAGATGATGAGG
AAGAGATTCACGCGTACAGTACACTGAGGTTCCTTCTGCGCCTGAACAGGCAGGCCAAGAGTTGGGTTATTGGAAATTTGATCAAGTCGAAGTTCCAAGATGTCGGTCGT
ACTTATAGGCCAAAGGATATTGTGAATGATATTAGACAAGATTTTGGTGTGAATTTAAGCTACGACGAGGCTTGGCGGGCTAGAGAAGAAACTTTCATTCTTGCTAGAGG
ATCTTCAGAAGAATCTTACAAACTGTTACCGAGATTTGGTGAAGCATTGAAAATTGAAATTCCTGGTACAGTGTTCGACTTAGAACTTGAAGACGATGGACACTTTAAGC
ATGTGTTTATGGCACTAGGTGCTTCTATTGGAGGGTTCAAGAGCTCTATTCGTCCAGTACTAGTGATGGAACATGCAGTTGGACAAGTTCCTGGTCCGGTCATTGTGTCT
GGCAGACATGCCAGCATCGGTAAGGCAGTCAAAACTGTATTCCCTAATGCATTTCACTGTGTGTGTATCCGGCACCTAAGGGAGAACCTAAAGACACATTTTAAGTCCGG
GAACTTCATTAAAGTATTTGAGGGAGCTGCAAAGGCATTTCGCGAGGTCGAATTCAATGAGTACTGGCAGCAACTCTCGGAAAATGTTAGGACATACTTACAAGGGGTCG
GAATTGAGAGGTGGGCTCGAGCATACCAGTGTGGTCATTGGTATAACCAAATGACCACCAATCTTGCGGAGTGCACTAACTCAATGTTGAAAGATGCACGAGAGCGACCA
ATCACTATATTTATGAATCATATAAGAGGGTGGTTCCAAGCAAAATACTACATACGCAGGAATACACACTCCACTAACCCGAGCCAGTGGTCAAAGTTTGCAATGACCAT
TTTGGAAAAAGCAGAGGAGAAGGCAAAGAGGCACGTGGTCAGACCTATTGCTCAATATGAGTTTGAGGTCGACGATGGACATTTGGGTGGACGTGTCAACCTTCACGCTA
GAACATGCATCTGTCGTGAATTTGAATATTTCGAACTTCCATGCTCCCATGCGATTGCTGCATGCACTTATCGCAACATCGACTATCATACCCTTTATTCCAGAGTCTAC
CATGTGAACTCCTTGATAGATGCATATGCACAGCCCATACAACCCCTTGGGCATGTGTCGGAGTGGAAGAGGTCACCTCAATTCGTCAACGTGGAAGTCTTGCCCCCTCG
AAGAGTGCCTAGGGTTGGTCGTCGTCAGGCGTTGAGAATACCGTCTATTGGAGAAGTGCAACATGTCCACAAATGCAGCCGGTGTAGGAATATGGGTCATAACATAAAAA
CATGCAGACAACCACTGAACAATTCTTGA
Protein sequenceShow/hide protein sequence
MAFFLLDKVDSLVEATFLLSQVLEDALLKFNRGKLSITVSYNVVVASFEIPRSQFIYYDANKNYVGKISLKSFHDAMANKKGPLAVYLIEESRNVLIVRMEGSGSTLSVP
FSPAEEQDFGHIMYDNVFGLGSQELRWIIKQLPLFEYSVIVTVQSSQIKFFVASSNAIILTKESGRYTIQGYEGDVEAQLHVVLRPIVFFLDLTHKIMPRVWVCYDGLWN
EREKEYKGGELRGFDIDIGIAYSDFLSRLYRLTSISSEEFDLRVRCLLQLKSRAPAFVIRNDGDLNTFLTWKSVSEVPIYISTIPKCPSNQASRPFPIPCAMPNNPYEVE
QNLSVPPISASVLNNIPSSFTSFPCNESKNIPVYNLGDDEDHDSEFYGRQDWGDYGDNEDTDAKVDVAVGDDEEEIHAYSTLRFLLRLNRQAKSWVIGNLIKSKFQDVGR
TYRPKDIVNDIRQDFGVNLSYDEAWRAREETFILARGSSEESYKLLPRFGEALKIEIPGTVFDLELEDDGHFKHVFMALGASIGGFKSSIRPVLVMEHAVGQVPGPVIVS
GRHASIGKAVKTVFPNAFHCVCIRHLRENLKTHFKSGNFIKVFEGAAKAFREVEFNEYWQQLSENVRTYLQGVGIERWARAYQCGHWYNQMTTNLAECTNSMLKDARERP
ITIFMNHIRGWFQAKYYIRRNTHSTNPSQWSKFAMTILEKAEEKAKRHVVRPIAQYEFEVDDGHLGGRVNLHARTCICREFEYFELPCSHAIAACTYRNIDYHTLYSRVY
HVNSLIDAYAQPIQPLGHVSEWKRSPQFVNVEVLPPRRVPRVGRRQALRIPSIGEVQHVHKCSRCRNMGHNIKTCRQPLNNS