; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G34430 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G34430
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposase
Genome locationChr3:30195419..30204563
RNA-Seq ExpressionCSPI03G34430
SyntenyCSPI03G34430
Gene Ontology termsNA
InterPro domainsIPR004242 - Transposon, En/Spm-like
IPR025452 - Domain of unknown function DUF4218
IPR029480 - Transposase-associated domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051686.1 transposon protein, putative, CACTA, En/Spm sub-class [Cucumis melo var. makuwa]7.3e-22339.85Show/hide
Query:  MVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV----------------------------------------------------
        M+QS  E+SCN SNTFDTMFDDAKKPL+PGCKKFTKLSALVRLYNLKV                                                    
Subjt:  MVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV----------------------------------------------------

Query:  -----RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKNL----------CPKQPRYDINTYLTPLIDDLQMLWTDG
             RKEY N TKCP CGL RWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKNL           PKQP YDINTYL PLIDDL++LW +G
Subjt:  -----RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKNL----------CPKQPRYDINTYLTPLIDDLQMLWTDG

Query:  VRCFDAYKKE----SAILLWTINDFPAY-------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSEIKVPEGYSANI
        V CFDAYK+E     A+LLWTINDFPAY       GKSKDGMN+RLD  EM IRPEL    T                                      
Subjt:  VRCFDAYKKE----SAILLWTINDFPAY-------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSEIKVPEGYSANI

Query:  SSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPI
                                                                                                            
Subjt:  SSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPI

Query:  GLGAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWEVNY
                        GKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD++TVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WEV+Y
Subjt:  GLGAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWEVNY

Query:  NTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-TCDF
        NTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDF YKC +DD+GDMLPHYPP+S ++ T D 
Subjt:  NTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-TCDF

Query:  DEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYS-----DDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGPNATK
        DE G+ Y R DCEG W+   +RY+ +      F V   S     D + VA P +  E+S+QK+KRGPT MK+ITR  SEG+K+VIQY+EL Q IG NA K
Subjt:  DEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYS-----DDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGPNATK

Query:  LKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDEFVAS
        LKSFIG TV                                                     RL TTYVL F DDVEKLK P  +YSFIDQ HW EFVAS
Subjt:  LKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDEFVAS

Query:  RLKEDFE-------------KYNHRASHKGYANLVEEL-------------------------------------DNLLGNQKTRSLSDYDDVLTQALGG
        RLKEDF+             KYNHR S KGYANL+EEL                                     D L+  Q T +    +D+LT+ALGG
Subjt:  RLKEDFE-------------KYNHRASHKGYANLVEEL-------------------------------------DNLLGNQKTRSLSDYDDVLTQALGG

Query:  KDRQGILRGVG--------ITPSIEGDSNNETEDVAS----DSVPKKATDVVTE----------------------------------------------
        KDR GILR VG           + +  +N + +D A+    D + K+  ++  E                                              
Subjt:  KDRQGILRGVG--------ITPSIEGDSNNETEDVAS----DSVPKKATDVVTE----------------------------------------------

Query:  ----VKTPCMLAFEIEDTIIAHGTIFNAEGDSENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYL
            V TPC LAFE +D +IA GTI +++ + +N+KV++DVV+D  CAI IP ++G+ K++QEVGSH++WPR +V+T N K D   F+ D STF+     
Subjt:  ----VKTPCMLAFEIEDTIIAHGTIFNAEGDSENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYL

Query:  RALVALRCLSLRDFLSMKPICTSCLDAYMMYLHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYD---------------------QGI
         A VAL  L LR            +  +M YL+T M   R+L+L+KF+D  SIS  S+K+  AQLL ARLL  +YD                     +G 
Subjt:  RALVALRCLSLRDFLSMKPICTSCLDAYMMYLHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYD---------------------QGI

Query:  EYWIDPLKNRIDGDMIEVVQMSFNISKKKKPNWKVVK--------VCGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK
         +WIDPLKNRID D+ EVV+ SFNI  KKKP W+VVK         CG       + +I       +  MK  P  Y QD ID +R E +EF+ K
Subjt:  EYWIDPLKNRIDGDMIEVVQMSFNISKKKKPNWKVVK--------VCGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK

TYK04901.1 hypothetical protein E5676_scaffold143G001050 [Cucumis melo var. makuwa]6.8e-22943.04Show/hide
Query:  LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAYGKSKDGMNTRLDFFEMKIRPE------LTQTFTDNRTY------
        L PKQP YDINTYL PLIDDL++LW +GVRCFDAYK+E     A+LLWTINDFPAY  +   M   +     K  P         + F D + +      
Subjt:  LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAYGKSKDGMNTRLDFFEMKIRPE------LTQTFTDNRTY------

Query:  -IPTACYTLSREEKYRFCKTL----------------SEIKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVR----EVKL--
         +  A Y   +E  +   K                  S  K+P  Y  N+  R  LD +    ++ + C  ++  LL I+ +S    N R    E+ +  
Subjt:  -IPTACYTLSREEKYRFCKTL----------------SEIKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVR----EVKL--

Query:  -CGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGL-----------------------------------------
           P+ +      E  +  L N  +N  RP+GCIAEA +CEEAV  C DFLS LDPIGL                                         
Subjt:  -CGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGL-----------------------------------------

Query:  ------------GAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYD
                    GAW RD+VM EL EGKV+SNTIRWLAH P+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DM FY 
Subjt:  ------------GAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYD

Query:  IIQEVWEVNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYAR----WSIVVMPPEKD-------FDAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPL
        II+++WEV+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        ++   K         D +WSVV+MPL+KDF YKC +DD+GDMLPHYPP+
Subjt:  IIQEVWEVNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYAR----WSIVVMPPEKD-------FDAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPL

Query:  SSYH-TCDFDEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYSDDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGP
        S ++ T D DE  +TY R DCEG W+ + ++Y+    Y Y+  +    DD  VATP +  E+S+QK+K GPT MK+ITR  SEG+K+VIQY+EL Q IG 
Subjt:  SSYH-TCDFDEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYSDDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGP

Query:  NATKLKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDE
        NATKLKSFIG TVRF VP TYS W  +PKE+K+KI+ELIEA F++D +SK++I+QNAGVC+  FK RLTTTYVL F DDVEKLK    +YSFIDQ HW E
Subjt:  NATKLKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDE

Query:  FVASRLKEDFE-------------KYNHRASHKGYANLVEELDNLLGNQKTRSLSDYDDVLTQALGGKDRQGILRGVGITPSIEGDSNNETEDVASDSVP
        FVASRLKEDF+             KYNHR S KGYANL+EEL+ L+  Q T +     D++T+ALGGKDR GILRGVG   + +   +  T+  A+    
Subjt:  FVASRLKEDFE-------------KYNHRASHKGYANLVEELDNLLGNQKTRSLSDYDDVLTQALGGKDRQGILRGVGITPSIEGDSNNETEDVASDSVP

Query:  KKAT-------------------------DVVTEVK---------------------------------------TPCMLAFEIEDTIIAHGTIFNAEGD
        +KAT                         D V ++K                                       TPC LAFE +D ++A GTI ++  +
Subjt:  KKAT-------------------------DVVTEVK---------------------------------------TPCMLAFEIEDTIIAHGTIFNAEGD

Query:  SENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYLRALVALRCLSLRDFLSMKPICTSCLDAYMMY
         +N+KV+VDVV+DG CAI IP ++G+ KM+QEVGSH++WPR +V+T N K D   F+ D STF+      A VAL                        Y
Subjt:  SENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYLRALVALRCLSLRDFLSMKPICTSCLDAYMMY

Query:  LHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYDQ---------------------GIEYWIDPLKNRIDGDMIEVVQMSFNISKKKKP
        L+T M   R+L+L+KF+D  SIS  S K+  AQLL ARLL  +YDQ                     G  +WIDPLKN ID D+ EVV+ SFN+  KK P
Subjt:  LHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYDQ---------------------GIEYWIDPLKNRIDGDMIEVVQMSFNISKKKKP

Query:  NWKVVKV--------CGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK
        NW+V+K         CG       + +I       +  MK  P  Y QD ID +R E +EF+ K
Subjt:  NWKVVKV--------CGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK

TYK08420.1 transposase [Cucumis melo var. makuwa]1.2e-24451.14Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMSREYD+ ++ FI++GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GVRCFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

TYK18888.1 transposase [Cucumis melo var. makuwa]6.8e-24551.14Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMSREYD+G++ FI+ GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GV+CFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

TYK20367.1 transposase [Cucumis melo var. makuwa]5.7e-24451.03Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMS EYD+G++ FI+ GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GV+CFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

TrEMBL top hitse value%identityAlignment
A0A5A7UBV8 Transposon protein, putative, CACTA, En/Spm sub-class3.5e-22339.85Show/hide
Query:  MVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV----------------------------------------------------
        M+QS  E+SCN SNTFDTMFDDAKKPL+PGCKKFTKLSALVRLYNLKV                                                    
Subjt:  MVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV----------------------------------------------------

Query:  -----RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKNL----------CPKQPRYDINTYLTPLIDDLQMLWTDG
             RKEY N TKCP CGL RWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKNL           PKQP YDINTYL PLIDDL++LW +G
Subjt:  -----RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKNL----------CPKQPRYDINTYLTPLIDDLQMLWTDG

Query:  VRCFDAYKKE----SAILLWTINDFPAY-------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSEIKVPEGYSANI
        V CFDAYK+E     A+LLWTINDFPAY       GKSKDGMN+RLD  EM IRPEL    T                                      
Subjt:  VRCFDAYKKE----SAILLWTINDFPAY-------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSEIKVPEGYSANI

Query:  SSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPI
                                                                                                            
Subjt:  SSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPI

Query:  GLGAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWEVNY
                        GKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD++TVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WEV+Y
Subjt:  GLGAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWEVNY

Query:  NTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-TCDF
        NTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDF YKC +DD+GDMLPHYPP+S ++ T D 
Subjt:  NTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-TCDF

Query:  DEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYS-----DDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGPNATK
        DE G+ Y R DCEG W+   +RY+ +      F V   S     D + VA P +  E+S+QK+KRGPT MK+ITR  SEG+K+VIQY+EL Q IG NA K
Subjt:  DEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYS-----DDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGPNATK

Query:  LKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDEFVAS
        LKSFIG TV                                                     RL TTYVL F DDVEKLK P  +YSFIDQ HW EFVAS
Subjt:  LKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDEFVAS

Query:  RLKEDFE-------------KYNHRASHKGYANLVEEL-------------------------------------DNLLGNQKTRSLSDYDDVLTQALGG
        RLKEDF+             KYNHR S KGYANL+EEL                                     D L+  Q T +    +D+LT+ALGG
Subjt:  RLKEDFE-------------KYNHRASHKGYANLVEEL-------------------------------------DNLLGNQKTRSLSDYDDVLTQALGG

Query:  KDRQGILRGVG--------ITPSIEGDSNNETEDVAS----DSVPKKATDVVTE----------------------------------------------
        KDR GILR VG           + +  +N + +D A+    D + K+  ++  E                                              
Subjt:  KDRQGILRGVG--------ITPSIEGDSNNETEDVAS----DSVPKKATDVVTE----------------------------------------------

Query:  ----VKTPCMLAFEIEDTIIAHGTIFNAEGDSENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYL
            V TPC LAFE +D +IA GTI +++ + +N+KV++DVV+D  CAI IP ++G+ K++QEVGSH++WPR +V+T N K D   F+ D STF+     
Subjt:  ----VKTPCMLAFEIEDTIIAHGTIFNAEGDSENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYL

Query:  RALVALRCLSLRDFLSMKPICTSCLDAYMMYLHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYD---------------------QGI
         A VAL  L LR            +  +M YL+T M   R+L+L+KF+D  SIS  S+K+  AQLL ARLL  +YD                     +G 
Subjt:  RALVALRCLSLRDFLSMKPICTSCLDAYMMYLHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYD---------------------QGI

Query:  EYWIDPLKNRIDGDMIEVVQMSFNISKKKKPNWKVVK--------VCGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK
         +WIDPLKNRID D+ EVV+ SFNI  KKKP W+VVK         CG       + +I       +  MK  P  Y QD ID +R E +EF+ K
Subjt:  EYWIDPLKNRIDGDMIEVVQMSFNISKKKKPNWKVVK--------VCGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK

A0A5D3BYY0 ULP_PROTEASE domain-containing protein3.3e-22943.04Show/hide
Query:  LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAYGKSKDGMNTRLDFFEMKIRPE------LTQTFTDNRTY------
        L PKQP YDINTYL PLIDDL++LW +GVRCFDAYK+E     A+LLWTINDFPAY  +   M   +     K  P         + F D + +      
Subjt:  LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAYGKSKDGMNTRLDFFEMKIRPE------LTQTFTDNRTY------

Query:  -IPTACYTLSREEKYRFCKTL----------------SEIKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVR----EVKL--
         +  A Y   +E  +   K                  S  K+P  Y  N+  R  LD +    ++ + C  ++  LL I+ +S    N R    E+ +  
Subjt:  -IPTACYTLSREEKYRFCKTL----------------SEIKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVR----EVKL--

Query:  -CGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGL-----------------------------------------
           P+ +      E  +  L N  +N  RP+GCIAEA +CEEAV  C DFLS LDPIGL                                         
Subjt:  -CGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGL-----------------------------------------

Query:  ------------GAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYD
                    GAW RD+VM EL EGKV+SNTIRWLAH P+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DM FY 
Subjt:  ------------GAWFRDQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYD

Query:  IIQEVWEVNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYAR----WSIVVMPPEKD-------FDAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPL
        II+++WEV+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        ++   K         D +WSVV+MPL+KDF YKC +DD+GDMLPHYPP+
Subjt:  IIQEVWEVNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYAR----WSIVVMPPEKD-------FDAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPL

Query:  SSYH-TCDFDEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYSDDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGP
        S ++ T D DE  +TY R DCEG W+ + ++Y+    Y Y+  +    DD  VATP +  E+S+QK+K GPT MK+ITR  SEG+K+VIQY+EL Q IG 
Subjt:  SSYH-TCDFDEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYSDDEQVATPLQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGP

Query:  NATKLKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDE
        NATKLKSFIG TVRF VP TYS W  +PKE+K+KI+ELIEA F++D +SK++I+QNAGVC+  FK RLTTTYVL F DDVEKLK    +YSFIDQ HW E
Subjt:  NATKLKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKSRLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDE

Query:  FVASRLKEDFE-------------KYNHRASHKGYANLVEELDNLLGNQKTRSLSDYDDVLTQALGGKDRQGILRGVGITPSIEGDSNNETEDVASDSVP
        FVASRLKEDF+             KYNHR S KGYANL+EEL+ L+  Q T +     D++T+ALGGKDR GILRGVG   + +   +  T+  A+    
Subjt:  FVASRLKEDFE-------------KYNHRASHKGYANLVEELDNLLGNQKTRSLSDYDDVLTQALGGKDRQGILRGVGITPSIEGDSNNETEDVASDSVP

Query:  KKAT-------------------------DVVTEVK---------------------------------------TPCMLAFEIEDTIIAHGTIFNAEGD
        +KAT                         D V ++K                                       TPC LAFE +D ++A GTI ++  +
Subjt:  KKAT-------------------------DVVTEVK---------------------------------------TPCMLAFEIEDTIIAHGTIFNAEGD

Query:  SENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYLRALVALRCLSLRDFLSMKPICTSCLDAYMMY
         +N+KV+VDVV+DG CAI IP ++G+ KM+QEVGSH++WPR +V+T N K D   F+ D STF+      A VAL                        Y
Subjt:  SENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTFSSATYLRALVALRCLSLRDFLSMKPICTSCLDAYMMY

Query:  LHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYDQ---------------------GIEYWIDPLKNRIDGDMIEVVQMSFNISKKKKP
        L+T M   R+L+L+KF+D  SIS  S K+  AQLL ARLL  +YDQ                     G  +WIDPLKN ID D+ EVV+ SFN+  KK P
Subjt:  LHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYDQ---------------------GIEYWIDPLKNRIDGDMIEVVQMSFNISKKKKP

Query:  NWKVVKV--------CGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK
        NW+V+K         CG       + +I       +  MK  P  Y QD ID +R E +EF+ K
Subjt:  NWKVVKV--------CGIDQHDSRKKVI----NLDVSQMKSLPPTYPQDKIDEVRLELSEFLSK

A0A5D3CAP5 Transposase5.6e-24551.14Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMSREYD+ ++ FI++GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GVRCFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

A0A5D3D5Q1 Transposase3.3e-24551.14Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMSREYD+G++ FI+ GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GV+CFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

A0A5D3D9W4 Transposase2.8e-24451.03Show/hide
Query:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF
        MDKSWM++NRMS EYD+G++ FI+ GLSHAKGSNSIRCPCL C N LLKDVS ++YHLYANGIDKSYK+WFWHG++LNS+ VT+ MEN  +E  E++DLF
Subjt:  MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLF

Query:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------
        NTVNM+QS  ++SCN SNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV                                                
Subjt:  NTVNMVQSVQEQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKV------------------------------------------------

Query:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------
                 RKEY N TKCPKCGLSRWKI KNS KE SGVAA QMWYFPIVPR IRMFK  ENAKN                                  
Subjt:  ---------RKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKN----------------------------------

Query:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------
               L PKQP YDINTYL PLIDDL++LW +GV+CFDAYK+E     A+LLWTINDFPAY                                     
Subjt:  -------LCPKQPRYDINTYLTPLIDDLQMLWTDGVRCFDAYKKE----SAILLWTINDFPAY-------------------------------------

Query:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE
                                                      GKSKDGMN+RLD  +M IRPEL    T N+TYIP ACYTLSREEKYRFCKTL+E
Subjt:  ----------------------------------------------GKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSE

Query:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------
        IKVPEGYS+NI S V L+DLK   LK HDCHVLMQQLLPIAIRS+LPKN                                                   
Subjt:  IKVPEGYSANISSRVYLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKN---------------------------------------------------

Query:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------
               VREVKLCGPVYLRWMYPFER MKVLK++VRNR RP+GCIAEA +CEEAV  C DFLS LD IGLG+                 + R       
Subjt:  -------VREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGA-----------------WFR-------

Query:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE
                  ++VM EL EGKV+SNTIRWLAHGP+  +MTY+GY++NG SYHTKS DD+RTVQNSGIMLVATTMQVSSAKDK+  + DMSFY II+++WE
Subjt:  ----------DQVMSELQEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWE

Query:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T
        V+YNTF  VLFKC WVEN  GVR DDL++TL+  +R        I+    ++ F      D +WSVVV P +KDFPYKC +DD+GDMLPHYPP+S ++ T
Subjt:  VNYNTFKVVLFKCNWVENNAGVRKDDLNYTLILYARWS------IVVMPPEKDF------DAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYH-T

Query:  CDFDEGGETYARSDCEGIW
         D DE G+ Y R DCEG W
Subjt:  CDFDEGGETYARSDCEGIW

SwissProt top hitse value%identityAlignment
Q945S8 Histone-lysine N-methyltransferase ASHH32.0e-0559.46Show/hide
Query:  GNVFHKLMRHIGNSVNFELPDWFSKWKPMPYTFIRRS
        G VF KL+  IG S  FELP+W +K KP PY FIRR+
Subjt:  GNVFHKLMRHIGNSVNFELPDWFSKWKPMPYTFIRRS

Arabidopsis top hitse value%identityAlignment
AT2G44150.1 histone-lysine N-methyltransferase ASHH31.4e-0659.46Show/hide
Query:  GNVFHKLMRHIGNSVNFELPDWFSKWKPMPYTFIRRS
        G VF KL+  IG S  FELP+W +K KP PY FIRR+
Subjt:  GNVFHKLMRHIGNSVNFELPDWFSKWKPMPYTFIRRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACAAGTCATGGATGATCCAAAATAGGATGTCAAGAGAATATGATTTAGGAATCAAAAGGTTCATTAAAATTGGGTTGAGTCATGCCAAAGGTTCTAATTCAATTAG
ATGTCCATGTTTGAATTGCAGTAATCATTTACTTAAGGATGTCTCAACAATTCAGTATCATTTGTATGCCAATGGAATTGATAAAAGTTACAAGGTATGGTTCTGGCATG
GTAAAGAATTGAACTCAGATAATGTTACCAACACGATGGAAAATACAGTGAATGAAACTGATGAAAACGAAGATTTATTTAATACAGTTAATATGGTTCAATCCGTTCAA
GAACAATCTTGTAATGCATCAAATACCTTCGACACTATGTTTGATGACGCGAAGAAACCCTTATACCCAGGATGTAAGAAATTCACAAAGTTATCAGCCCTCGTGAGATT
GTATAACCTGAAGGTTAGAAAAGAATATGAAAATATGACCAAGTGCCCTAAGTGTGGTTTGTCAAGGTGGAAGATCGCTAAGAACTCAAAAAAGGAAAAAAGTGGTGTGG
CTGCCAACCAGATGTGGTACTTTCCAATAGTTCCAAGACTTATAAGAATGTTCAAGAGATTTGAAAATGCTAAGAACTTGTGTCCAAAGCAACCGAGATACGATATCAAT
ACGTACTTAACACCTTTAATTGATGATCTACAAATGTTGTGGACAGATGGAGTTCGTTGTTTCGATGCATATAAGAAAGAATCGGCTATTTTATTGTGGACAATCAACGA
TTTTCCAGCCTACGGAAAAAGTAAGGATGGGATGAATACTAGACTTGACTTCTTTGAAATGAAAATTAGACCAGAATTGACACAAACGTTTACTGATAATAGAACTTACA
TCCCTACGGCATGTTATACACTGTCAAGAGAAGAGAAGTATCGATTTTGCAAGACTTTGTCAGAAATTAAAGTTCCAGAAGGTTATTCAGCAAATATTAGCAGTCGTGTC
TATTTGGATGACTTAAAACGGACCGACCTTAAGTTGCACGACTGTCATGTGTTAATGCAGCAACTGCTTCCAATTGCAATACGTTCAGTCCTACCAAAGAATGTGAGAGA
AGTAAAGCTTTGTGGACCTGTATATTTGCGGTGGATGTACCCATTTGAACGATGCATGAAAGTGTTGAAAAATTATGTTCGCAATAGAACTCGACCGAAAGGATGCATTG
CAGAAGCAAGCATATGCGAGGAGGCAGTTGACTTATGTTTAGATTTTCTTTCTGAATTAGATCCTATTGGACTTGGTGCATGGTTTAGGGATCAGGTGATGAGTGAACTA
CAAGAAGGGAAGGTTCTCTCGAACACCATACGCTGGCTTGCACATGGACCCCACCCTACTTTAATGACGTACGACGGGTACATAATTAATGGAACGTCTTACCACACAAA
GTCTCATGATGACTATCGAACGGTGCAAAATAGTGGGATTATGTTAGTCGCAACAACAATGCAAGTGTCAAGTGCGAAGGACAAAGACTTGAACGTTTGTGACATGTCTT
TCTATGACATCATTCAAGAGGTTTGGGAAGTCAACTACAATACGTTTAAGGTTGTTCTTTTCAAATGCAATTGGGTTGAAAATAATGCTGGTGTTCGGAAAGACGATCTT
AACTACACACTGATCCTGTATGCAAGATGGTCGATTGTTGTGATGCCACCCGAGAAGGACTTTGATGCAAAATGGTCCGTTGTTGTGATGCCACTCGAGAAGGACTTTCC
CTACAAATGTGTTGATGATGACATAGGCGACATGTTACCTCACTATCCCCCACTCTCTTCTTACCACACATGTGATTTTGACGAAGGTGGTGAAACATATGCTAGATCTG
ACTGTGAAGGCATATGGGAAAATTCCATTTTCAGATATGACTACTTCAGAATGTACCGTTATCAATTTGCAGTAATGGTTTATTCAGATGATGAGCAGGTTGCTACACCC
CTTCAAATAGAGGAAACTAGTAGCCAGAAAAAGAAACGTGGCCCTACGAGAATGAAAGATATAACACGTTTTAGTAGTGAGGGGCGGAAAATAGTTATTCAGTACAGCGA
ACTAGATCAACCTATTGGACCAAATGCAACAAAGTTGAAGAGTTTTATTGGAATGACCGTAAGGTTTCACGTTCCTTTCACGTACTCTATTTGGCATGCTATTCCAAAAG
AGATGAAGGAGAAAATCTATGAACTAATTGAGGCCGACTTCATTCTTGACCATAAGTCGAAGGAAAGCATACTTCAAAATGCGGGAGTATGCTATCATGGATTCAAGTCG
AGATTAACAACTACGTACGTACTTTCATTTAAGGACGATGTGGAGAAATTAAAACACCCACTAGCGAAATATTCATTCATCGATCAAGACCATTGGGATGAATTTGTGGC
TTCACGGTTGAAGGAAGATTTTGAAAAATACAACCATAGAGCATCTCACAAGGGGTATGCAAATCTTGTTGAGGAGTTGGACAATCTATTGGGCAACCAGAAGACACGTA
GTTTATCTGACTATGACGATGTTCTCACCCAGGCATTGGGAGGAAAAGATCGTCAGGGCATACTCCGAGGCGTGGGGATAACACCATCAATCGAAGGGGATTCAAATAAT
GAGACTGAAGACGTGGCATCTGATTCAGTACCAAAGAAAGCGACAGACGTTGTGACAGAAGTTAAAACTCCGTGTATGCTTGCGTTTGAAATAGAAGATACTATCATTGC
ACATGGGACAATATTTAATGCTGAAGGTGACAGTGAGAACATTAAGGTATCGGTGGATGTTGTCCTCGATGGCCAATGTGCAATTCAGATACCGAAAAAGGAAGGAATCA
CCAAAATGACGCAGGAAGTCGGTTCACATTTAATGTGGCCACGACATATTGTTCTTACTAGGAATGATAAAAAGGATACCGTGGGCTTTAGTACGGATCCGTCCACATTC
TCTAGTGCAACATATCTACGTGCTCTAGTGGCACTCCGGTGTTTGTCACTGAGGGATTTCTTGTCAATGAAGCCAATATGCACATCATGCCTAGATGCTTACATGATGTA
CCTTCACACAAGTATGGTAGAAGGACGGAGTTTAAGCTTGTTCAAGTTCATTGATGTCAGATCTATAAGTTTCTCGAGTTACAAACAATCGTGTGCGCAGTTGTTGAATG
CTCGACTGCTCAGAGCCGAGTATGACCAAGGTATTGAATATTGGATTGACCCGTTGAAGAATCGGATTGATGGAGACATGATTGAAGTGGTTCAAATGTCATTCAATATA
TCGAAGAAGAAAAAACCAAATTGGAAGGTTGTTAAGGTATGTGGCATTGACCAACATGATAGTCGTAAAAAAGTTATTAATTTAGACGTTTCACAGATGAAAAGTTTACC
TCCTACATACCCGCAAGATAAAATCGATGAAGTTAGATTGGAATTGTCGGAGTTCCTTTCTAAGCAAGAAGAGTTCAATATATATGTGACTGGCTCATGGAATTTTGTTA
TTTGGGGCCCTATTGCATTGGGATATGAATCTTCTTGCAATCAAATAACTGCCGAAGTACTGCGGTTTTCCAAAATATGGACAACTTGGTCATACACATGGGATAATGCT
GGGCACTTCATTACAAATTCTAAATGCAGTTGGACTGGGAATGTTTTTCACAAGTTGATGAGACATATTGGAAATTCAGTTAATTTTGAACTTCCTGATTGGTTTAGCAA
ATGGAAGCCTATGCCATATACCTTCATCAGGCGTAGTACCCTTCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACAAGTCATGGATGATCCAAAATAGGATGTCAAGAGAATATGATTTAGGAATCAAAAGGTTCATTAAAATTGGGTTGAGTCATGCCAAAGGTTCTAATTCAATTAG
ATGTCCATGTTTGAATTGCAGTAATCATTTACTTAAGGATGTCTCAACAATTCAGTATCATTTGTATGCCAATGGAATTGATAAAAGTTACAAGGTATGGTTCTGGCATG
GTAAAGAATTGAACTCAGATAATGTTACCAACACGATGGAAAATACAGTGAATGAAACTGATGAAAACGAAGATTTATTTAATACAGTTAATATGGTTCAATCCGTTCAA
GAACAATCTTGTAATGCATCAAATACCTTCGACACTATGTTTGATGACGCGAAGAAACCCTTATACCCAGGATGTAAGAAATTCACAAAGTTATCAGCCCTCGTGAGATT
GTATAACCTGAAGGTTAGAAAAGAATATGAAAATATGACCAAGTGCCCTAAGTGTGGTTTGTCAAGGTGGAAGATCGCTAAGAACTCAAAAAAGGAAAAAAGTGGTGTGG
CTGCCAACCAGATGTGGTACTTTCCAATAGTTCCAAGACTTATAAGAATGTTCAAGAGATTTGAAAATGCTAAGAACTTGTGTCCAAAGCAACCGAGATACGATATCAAT
ACGTACTTAACACCTTTAATTGATGATCTACAAATGTTGTGGACAGATGGAGTTCGTTGTTTCGATGCATATAAGAAAGAATCGGCTATTTTATTGTGGACAATCAACGA
TTTTCCAGCCTACGGAAAAAGTAAGGATGGGATGAATACTAGACTTGACTTCTTTGAAATGAAAATTAGACCAGAATTGACACAAACGTTTACTGATAATAGAACTTACA
TCCCTACGGCATGTTATACACTGTCAAGAGAAGAGAAGTATCGATTTTGCAAGACTTTGTCAGAAATTAAAGTTCCAGAAGGTTATTCAGCAAATATTAGCAGTCGTGTC
TATTTGGATGACTTAAAACGGACCGACCTTAAGTTGCACGACTGTCATGTGTTAATGCAGCAACTGCTTCCAATTGCAATACGTTCAGTCCTACCAAAGAATGTGAGAGA
AGTAAAGCTTTGTGGACCTGTATATTTGCGGTGGATGTACCCATTTGAACGATGCATGAAAGTGTTGAAAAATTATGTTCGCAATAGAACTCGACCGAAAGGATGCATTG
CAGAAGCAAGCATATGCGAGGAGGCAGTTGACTTATGTTTAGATTTTCTTTCTGAATTAGATCCTATTGGACTTGGTGCATGGTTTAGGGATCAGGTGATGAGTGAACTA
CAAGAAGGGAAGGTTCTCTCGAACACCATACGCTGGCTTGCACATGGACCCCACCCTACTTTAATGACGTACGACGGGTACATAATTAATGGAACGTCTTACCACACAAA
GTCTCATGATGACTATCGAACGGTGCAAAATAGTGGGATTATGTTAGTCGCAACAACAATGCAAGTGTCAAGTGCGAAGGACAAAGACTTGAACGTTTGTGACATGTCTT
TCTATGACATCATTCAAGAGGTTTGGGAAGTCAACTACAATACGTTTAAGGTTGTTCTTTTCAAATGCAATTGGGTTGAAAATAATGCTGGTGTTCGGAAAGACGATCTT
AACTACACACTGATCCTGTATGCAAGATGGTCGATTGTTGTGATGCCACCCGAGAAGGACTTTGATGCAAAATGGTCCGTTGTTGTGATGCCACTCGAGAAGGACTTTCC
CTACAAATGTGTTGATGATGACATAGGCGACATGTTACCTCACTATCCCCCACTCTCTTCTTACCACACATGTGATTTTGACGAAGGTGGTGAAACATATGCTAGATCTG
ACTGTGAAGGCATATGGGAAAATTCCATTTTCAGATATGACTACTTCAGAATGTACCGTTATCAATTTGCAGTAATGGTTTATTCAGATGATGAGCAGGTTGCTACACCC
CTTCAAATAGAGGAAACTAGTAGCCAGAAAAAGAAACGTGGCCCTACGAGAATGAAAGATATAACACGTTTTAGTAGTGAGGGGCGGAAAATAGTTATTCAGTACAGCGA
ACTAGATCAACCTATTGGACCAAATGCAACAAAGTTGAAGAGTTTTATTGGAATGACCGTAAGGTTTCACGTTCCTTTCACGTACTCTATTTGGCATGCTATTCCAAAAG
AGATGAAGGAGAAAATCTATGAACTAATTGAGGCCGACTTCATTCTTGACCATAAGTCGAAGGAAAGCATACTTCAAAATGCGGGAGTATGCTATCATGGATTCAAGTCG
AGATTAACAACTACGTACGTACTTTCATTTAAGGACGATGTGGAGAAATTAAAACACCCACTAGCGAAATATTCATTCATCGATCAAGACCATTGGGATGAATTTGTGGC
TTCACGGTTGAAGGAAGATTTTGAAAAATACAACCATAGAGCATCTCACAAGGGGTATGCAAATCTTGTTGAGGAGTTGGACAATCTATTGGGCAACCAGAAGACACGTA
GTTTATCTGACTATGACGATGTTCTCACCCAGGCATTGGGAGGAAAAGATCGTCAGGGCATACTCCGAGGCGTGGGGATAACACCATCAATCGAAGGGGATTCAAATAAT
GAGACTGAAGACGTGGCATCTGATTCAGTACCAAAGAAAGCGACAGACGTTGTGACAGAAGTTAAAACTCCGTGTATGCTTGCGTTTGAAATAGAAGATACTATCATTGC
ACATGGGACAATATTTAATGCTGAAGGTGACAGTGAGAACATTAAGGTATCGGTGGATGTTGTCCTCGATGGCCAATGTGCAATTCAGATACCGAAAAAGGAAGGAATCA
CCAAAATGACGCAGGAAGTCGGTTCACATTTAATGTGGCCACGACATATTGTTCTTACTAGGAATGATAAAAAGGATACCGTGGGCTTTAGTACGGATCCGTCCACATTC
TCTAGTGCAACATATCTACGTGCTCTAGTGGCACTCCGGTGTTTGTCACTGAGGGATTTCTTGTCAATGAAGCCAATATGCACATCATGCCTAGATGCTTACATGATGTA
CCTTCACACAAGTATGGTAGAAGGACGGAGTTTAAGCTTGTTCAAGTTCATTGATGTCAGATCTATAAGTTTCTCGAGTTACAAACAATCGTGTGCGCAGTTGTTGAATG
CTCGACTGCTCAGAGCCGAGTATGACCAAGGTATTGAATATTGGATTGACCCGTTGAAGAATCGGATTGATGGAGACATGATTGAAGTGGTTCAAATGTCATTCAATATA
TCGAAGAAGAAAAAACCAAATTGGAAGGTTGTTAAGGTATGTGGCATTGACCAACATGATAGTCGTAAAAAAGTTATTAATTTAGACGTTTCACAGATGAAAAGTTTACC
TCCTACATACCCGCAAGATAAAATCGATGAAGTTAGATTGGAATTGTCGGAGTTCCTTTCTAAGCAAGAAGAGTTCAATATATATGTGACTGGCTCATGGAATTTTGTTA
TTTGGGGCCCTATTGCATTGGGATATGAATCTTCTTGCAATCAAATAACTGCCGAAGTACTGCGGTTTTCCAAAATATGGACAACTTGGTCATACACATGGGATAATGCT
GGGCACTTCATTACAAATTCTAAATGCAGTTGGACTGGGAATGTTTTTCACAAGTTGATGAGACATATTGGAAATTCAGTTAATTTTGAACTTCCTGATTGGTTTAGCAA
ATGGAAGCCTATGCCATATACCTTCATCAGGCGTAGTACCCTTCACTAA
Protein sequenceShow/hide protein sequence
MDKSWMIQNRMSREYDLGIKRFIKIGLSHAKGSNSIRCPCLNCSNHLLKDVSTIQYHLYANGIDKSYKVWFWHGKELNSDNVTNTMENTVNETDENEDLFNTVNMVQSVQ
EQSCNASNTFDTMFDDAKKPLYPGCKKFTKLSALVRLYNLKVRKEYENMTKCPKCGLSRWKIAKNSKKEKSGVAANQMWYFPIVPRLIRMFKRFENAKNLCPKQPRYDIN
TYLTPLIDDLQMLWTDGVRCFDAYKKESAILLWTINDFPAYGKSKDGMNTRLDFFEMKIRPELTQTFTDNRTYIPTACYTLSREEKYRFCKTLSEIKVPEGYSANISSRV
YLDDLKRTDLKLHDCHVLMQQLLPIAIRSVLPKNVREVKLCGPVYLRWMYPFERCMKVLKNYVRNRTRPKGCIAEASICEEAVDLCLDFLSELDPIGLGAWFRDQVMSEL
QEGKVLSNTIRWLAHGPHPTLMTYDGYIINGTSYHTKSHDDYRTVQNSGIMLVATTMQVSSAKDKDLNVCDMSFYDIIQEVWEVNYNTFKVVLFKCNWVENNAGVRKDDL
NYTLILYARWSIVVMPPEKDFDAKWSVVVMPLEKDFPYKCVDDDIGDMLPHYPPLSSYHTCDFDEGGETYARSDCEGIWENSIFRYDYFRMYRYQFAVMVYSDDEQVATP
LQIEETSSQKKKRGPTRMKDITRFSSEGRKIVIQYSELDQPIGPNATKLKSFIGMTVRFHVPFTYSIWHAIPKEMKEKIYELIEADFILDHKSKESILQNAGVCYHGFKS
RLTTTYVLSFKDDVEKLKHPLAKYSFIDQDHWDEFVASRLKEDFEKYNHRASHKGYANLVEELDNLLGNQKTRSLSDYDDVLTQALGGKDRQGILRGVGITPSIEGDSNN
ETEDVASDSVPKKATDVVTEVKTPCMLAFEIEDTIIAHGTIFNAEGDSENIKVSVDVVLDGQCAIQIPKKEGITKMTQEVGSHLMWPRHIVLTRNDKKDTVGFSTDPSTF
SSATYLRALVALRCLSLRDFLSMKPICTSCLDAYMMYLHTSMVEGRSLSLFKFIDVRSISFSSYKQSCAQLLNARLLRAEYDQGIEYWIDPLKNRIDGDMIEVVQMSFNI
SKKKKPNWKVVKVCGIDQHDSRKKVINLDVSQMKSLPPTYPQDKIDEVRLELSEFLSKQEEFNIYVTGSWNFVIWGPIALGYESSCNQITAEVLRFSKIWTTWSYTWDNA
GHFITNSKCSWTGNVFHKLMRHIGNSVNFELPDWFSKWKPMPYTFIRRSTLH