; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002285 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002285
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRNA-directed DNA polymerase
Genome locationchr4:41304830..41310925
RNA-Seq ExpressionLag0002285
SyntenyLag0002285
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR001969 - Aspartic peptidase, active site
IPR004130 - GPN-loop GTPase
IPR005162 - Retrotransposon gag domain
IPR012337 - Ribonuclease H-like superfamily
IPR021109 - Aspartic peptidase domain superfamily
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]3.3e-17731.81Show/hide
Query:  EAQKEGHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM
        E ++E  Q  R   Q  +D+++K DIP + G M +E FL+W   V+ FF  M  PE K+V++VA++LK  A  WWD+L + RQR  K P++ W RMK+LM
Subjt:  EAQKEGHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM

Query:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA
         +RFLP ++EQILY  Y  C QG R+V EYT EF R   R +LGE+    VAR+  GLR+ I+E+  LQ +  + EA S A  ++ ++  +  R++S   
Subjt:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA

Query:  MGEQTSNTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA------------------
                       R + QN+  A   K+         KN                    C+RC   GH SN CP RRT+A                  
Subjt:  MGEQTSNTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA------------------

Query:  -------------SRVMTMWTFL---------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLS
                       ++     L         N   T C+I  K+CNVI+D+GSTEN+V+ KLV  L LP   H  PY + WIKKG + +VT T  +P+S
Subjt:  -------------SRVMTMWTFL---------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLS

Query:  IGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLW----------------MGKK----------------------------------
        IG  YK+++ICDVLDMD CH+L GRPWQY    T++GRDN   F W                 GKK                                  
Subjt:  IGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLW----------------MGKK----------------------------------

Query:  ----------------------------------------------------------------------------------------IVLLP-------
                                                                                                ++L+P       
Subjt:  ----------------------------------------------------------------------------------------IVLLP-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------LNPAKQL------------------------------------------------------------------------------
                       L+  KQ+                                                                              
Subjt:  ---------------LNPAKQL------------------------------------------------------------------------------

Query:  APTTTSSKK-------------------------------------------------------------------------------------------
        AP T   KK                                                                                           
Subjt:  APTTTSSKK-------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG
                                                             DALSR+ASLL  L  EI+  + L + Y  D +F  +W KC    P+ 
Subjt:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG

Query:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD
        DFHI +GFLFK D LC+P SSLRE LI++ H GGL+GHLGRDKT+  LE+RY+WP LR+D    V RC+ CQ  KGQSQNTGLY PLP+P  IW+DLSMD
Subjt:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD

Query:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN
        F+LGLPRTQRG+DSVFVVVDR+SKM HF+ CKKTADA  +A LFF+E+VRLHG+PKTI+SDRD KFLSHFW TLW+ F TTLK S+T+HPQTDGQTEVTN
Subjt:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN

Query:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK
        +TLGN+IR + G++PKQWD+AL Q EFAYN   +  TGKSPF LVYT +P+  +DL K+P     S+ AE MAE+I A    V+  +E    K K AADK
Subjt:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK

Query:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQ
        +RR K FK GD VM+ LRK RFPVGTY KL+  K GPF +L K  +NAY +DLP+
Subjt:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQ

KAG7588770.1 Integrase catalytic core [Arabidopsis suecica]1.9e-18031.88Show/hide
Query:  AQKEGHQYYRYQGQQEN-DFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM
        +++E  +    QG Q N D+++K DIP +SG M +E FL+W   V+ FF  M  PE+K+V++VA++LK  A  WWD+L + RQR  K P+R W RMK+LM
Subjt:  AQKEGHQYYRYQGQQEN-DFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM

Query:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA
         ERFLP ++EQILY  Y +C QG+R+V +YT EF R   R++L E++   VAR+  GL+  ++ ++ LQ +  + EA S A  ++ ++  +  R ++   
Subjt:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA

Query:  MGEQTSNTNKSALGDR-TSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLASRVMTMWTFL-------
             SN       D+    ++S    KG    T+++      D     + +  Y+RPTL KCFRC   GH SN CP R+T+A        F+       
Subjt:  MGEQTSNTNKSALGDR-TSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLASRVMTMWTFL-------

Query:  ------------------------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIG
                                      N   T C+I  K+CN+I+D+GS+EN+V+ KLV  L +P   H  PY + W+KKG +  V+ T  IPLSIG
Subjt:  ------------------------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIG

Query:  ASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVL-------------------------------------------------
          YK++IICDVLDMD CHILLGRPWQY    T++GRDN   F W   KI +                                                 
Subjt:  ASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVL-------------------------------------------------

Query:  --------------------------------------LPLNPAKQL-----------------------------------------------------
                                              + L P   L                                                     
Subjt:  --------------------------------------LPLNPAKQL-----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------------AP
                                                                                                          AP
Subjt:  --------------------------------------------------------------------------------------------------AP

Query:  TTTSSKK---------------------------------------------------------------------------------------------
         T   KK                                                                                             
Subjt:  TTTSSKK---------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDF
                                                           DALSR+ASLLT L  EI+  + L + Y  D +F  +W KC    P+ DF
Subjt:  ---------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDF

Query:  HIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFI
        HI EG+LFK D LC+P SSLRE LI+E H GGL+GHLGRDKT+  LE+RY+WP LRKD    V RCF CQV KGQSQNTGLY PL +P  IW+DLSMDF+
Subjt:  HIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFI

Query:  LGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKT
        LGLPRTQRG+DSVFVVVD++SKM HF+ C+KTADA  +A LFF+E+VRLHG+PK+IVSDRD KFLSHFW TLW+ F T+LK S+T+HPQ+DGQTEVTN+T
Subjt:  LGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKT

Query:  LGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKR
        LGN+IR + G+KPKQWD+AL Q EFAYN   +  TGKSPF LVYT +P+  +DL  +P +   S  A+ MA+ I      VR  +E    K K AADKK+
Subjt:  LGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKR

Query:  RFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
        R K FK GD VM+ L+K RFPVGTY KL+ RK GPF IL K  +NAY +DLP
Subjt:  RFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense]5.1e-18631.79Show/hide
Query:  PREDQQIYPYQYQQEAQKE---GHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQ
        PR  + +    +  E  +E   G+Q  R+  + + D++++ DIP + GK++IE FL+W+  VE FF +    E ++V+LVA KL+ GA  WW++L+++R+
Subjt:  PREDQQIYPYQYQQEAQKE---GHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQ

Query:  RYGKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAA--
        R GK PIR W RMK+LM  RFLP ++EQ ++  YQNC QG ++V +YTEE+ RL  R NL E++   V+R+ GGL++ I++++ LQ +  ++EA + A  
Subjt:  RYGKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAA--

Query:  AVIEEQIANRFKRTYSRRAMGEQTSNTNKSALGDRTSQQNSMAALK--GKQVETNTTRDNKNSDNTTT--------RRTSNTYSRPTLGKCFRCNQVGHL
        A + E+ +NRF  ++ ++ MGE   ++N +A   R S  +     K    QV+  +  +N +    T         R   N YSRP + KC+RC Q GH 
Subjt:  AVIEEQIANRFKRTYSRRAMGEQTSNTNKSALGDRTSQQNSMAALK--GKQVETNTTRDNKNSDNTTT--------RRTSNTYSRPTLGKCFRCNQVGHL

Query:  SNECPQRR-----------------------------------------TLASRVMTMWTFLNQMTTR-------CTINGKICNVIIDSGSTENVVASKL
        SNECP R+                                          +  RVM   T L+ ++ R       C+I GK+C++I+DSGS EN V+ KL
Subjt:  SNECPQRR-----------------------------------------TLASRVMTMWTFLNQMTTR-------CTINGKICNVIIDSGSTENVVASKL

Query:  VSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------
        +  L L    HP PY + WI+KG +A VT    +P+SIG  Y++++ CDV+DMDA H+LLGRPWQ+ V  T++GRDN   F W GK+I ++P        
Subjt:  VSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------LNPAKQ--------------------------------------------------------------
                                        L P K                                                               
Subjt:  --------------------------------LNPAKQ--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------LAPTTTSSKK--------------------------------------------------------------------------------
                  +AP T   KK                                                                                
Subjt:  ----------LAPTTTSSKK--------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTI
                                                                        DALSR+ASLL  ++ EII  + L + Y+DD+DF  +
Subjt:  ----------------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTI

Query:  WKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPI
        W+ C   Q  G+FH+ EG+LF  + LC+P SSLRE LI+E H GGL GHLGRDKT++ + +RY+WPQL++DV NFV +C+  Q  KGQ+QNTGLY PLP+
Subjt:  WKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPI

Query:  PQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSH
        P  IWEDL+MDF+LGLPRTQRG+DSVFVVVDR+SKM HF+PC+KT+DA +VA LFF+E+VRLHG+P++I SDRD KFLSHFW TLW++ DT LK+S+T+H
Subjt:  PQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSH

Query:  PQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEK
        PQTDGQTE  N+TLGNLIR + G+KPKQWD+ALAQAEFAYN+  +  TGKSPF LVY + P+  +DL ++P     S+ AE MAEQ++ +  +V+  +E+
Subjt:  PQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEK

Query:  QTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
        +  KYK AAD KRR K F  GD VM+ LRK RFPVG+YNKLK RK GP+ ++ K   NAY IDLP
Subjt:  QTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]5.5e-18031.15Show/hide
Query:  RYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFE
        R      +D+++K DIP + G M +E FL+W   V+ FF  M+ PE K+V++VA++LK  A  WWD+L + R R  K PIR W +MK+LM ERFLP ++E
Subjt:  RYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFE

Query:  QILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNK
        QILY  Y  C QG RSV EYT EF R   R  LGES+   VAR+  GL+  ++E++ LQ +  + EA + A  ++ ++  +  R +S        +NT  
Subjt:  QILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNK

Query:  SALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA----------------------------
        S  GD+     +  +  G +   + +  +        ++ +N Y +PT+  CFRCN  GH SN CP RR  A                            
Subjt:  SALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA----------------------------

Query:  --SRVMTMWTFL-----------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQI
           RV  +   +           N   T C++  K+C++IID+GSTEN+V+ KLV+   LP  PH  PY + W+ KG + +VT +  +P+SIG  YK+++
Subjt:  --SRVMTMWTFL-----------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQI

Query:  ICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPL-----NPAKQ-------------------------------------------
         CDV+DMD CHILLGRPWQ+    T++GRDN   F W   KI + P+     NP  +                                           
Subjt:  ICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPL-----NPAKQ-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------LAPTTTSSKK
                                                                                                   AP T   KK
Subjt:  ------------------------------------------------------------------------------------------LAPTTTSSKK

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDFHIIEGFL
                                                    DALSR+ SLL  L  E++  + L + Y +D +F  +W KC++H P  DFH+ EGFL
Subjt:  --------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDFHIIEGFL

Query:  FKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFILGLPRTQ
        FK + LC+P SSLRE LI++ H GGL+GH+GRDKT+  LE+R++WP LRKD    V +C+TCQV KGQSQNTGLY PLPIP  IW+DL+MDF+LGLPRTQ
Subjt:  FKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFILGLPRTQ

Query:  RGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRC
        RG+DSVFVVVDR+SKM+HF+ CKKTADA  +A LFF+E+V LHG+PK+I SDRD KFLSHFW TLWK FDT+L  S+T+HPQTDGQTEVTN+TLGN+IRC
Subjt:  RGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRC

Query:  LSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKI
        + G+KPKQWD+AL Q EFAYN   +  TGK+PF LVYT +PR  +DL K+P +  FS+ AE MAE+I A+ + V+  +E    K K AADK++R K F +
Subjt:  LSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKI

Query:  GDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
        GD VM+ LRK RFPVGTY+KL+ RK GPF +  K  +NAY + LP
Subjt:  GDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]4.8e-21647.23Show/hide
Query:  MATKKVSTNVTGDTSSAGKETEPTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKQILEMICEKLGCRSDQQVAEPRIHKNVERRQQEHQGEDFRPRQRQD
        MA ++V+   TG+ +   +  E  TVLSP+++TVRLL+VE+    L      + + +E++  +      ++  +P     +  +  +  GE    R R+ 
Subjt:  MATKKVSTNVTGDTSSAGKETEPTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKQILEMICEKLGCRSDQQVAEPRIHKNVERRQQEHQGEDFRPRQRQD

Query:  RQIVEQKFAQETHSAPRSYQERRTGQQEVKYNPLFRRQVEWNDDSSSEEDYDYQERCKEGQQNFRYQFQQETPREDQQIYPYQYQQEAQK-EGHQYYRYQ
        R  V         + PR  Q RR G             ++   D   +ED   QE  +E Q++       ++   D+Q   + +  EA+    +Q    +
Subjt:  RQIVEQKFAQETHSAPRSYQERRTGQQEVKYNPLFRRQVEWNDDSSSEEDYDYQERCKEGQQNFRYQFQQETPREDQQIYPYQYQQEAQK-EGHQYYRYQ

Query:  GQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFEQIL
          + +D+KMKID+P Y GK  IEAFL+W++  ENFF+YM+TPE KKV LVALKL+ GA AWWDQLE+NRQR GK+P+R WE+MK+L+K RFLP N+EQ L
Subjt:  GQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFEQIL

Query:  YDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNKSAL
        Y+QYQNCRQG RSV EY EEFHRL ARTNL E++Q+ VARF GGLR DIKE++ LQP  +L+EAIS A  +EE IA R K    R A     +N+ KS  
Subjt:  YDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNKSAL

Query:  GDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA---------------------------SRVM
         D   Q ++    KGK+++       +  + T      N+YSRP+LGKCFRC Q GHLS+ CPQR+T+A                            RV 
Subjt:  GDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA---------------------------SRVM

Query:  TMWTFL-------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDV
         +   L                 TRCTING++C+VIIDSGS+EN VA KLV+ LNL    HP PYK+ W++KGGEA V+   T+PLSIG +YKDQI+CDV
Subjt:  TMWTFL-------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDV

Query:  LDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------LNPAKQLAPTTTSSKKDALSRKASLLTLLKGEIIAMDHLPD-------TY
        ++MD CH+LLGRPWQY  Q+ HKGR+NTY+F WMG+K+VLLP        L   KQL   T S K     R+  +L L+  E      + D         
Subjt:  LDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------LNPAKQLAPTTTSSKKDALSRKASLLTLLKGEIIAMDHLPD-------TY

Query:  NDDQDFSTIWKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQN
        ++      IW KC +   A D+HI+EG+LFK + LC+PH+SLREALIKEAH+GGLAGH G++KTL I  +RY+WPQ+RKD +NFV RC  CQ  KG S N
Subjt:  NDDQDFSTIWKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQN

Query:  TGLYTPLPIPQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDT
         GLY+PLPIP  IWEDLS+DF++GLP+TQR  DS+ V+VDR+SKM HF+ CKKT DA+Y+ANLFF+E+VRLHG+PK+IV DRDV+FLSHFW+TLWKKFDT
Subjt:  TGLYTPLPIPQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDT

Query:  TLKY
        TLK+
Subjt:  TLKY

TrEMBL top hitse value%identityAlignment
A0A2N9F1X7 Uncharacterized protein9.5e-17835.1Show/hide
Query:  GKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIE
        GK  +R W RMK+L+  RFLP ++EQ L+  YQNC QG+R+V +YT EF RL  R +L E++   VAR+  GL+  ++E++ LQ +  + EA + A   E
Subjt:  GKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIE

Query:  EQIANRFKRTYSRRAMGEQTS-NTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTR-----RTSNTYSRPTLGKCFRCNQVGHLSNECPQRR
               +  Y RR+  E +S N  K       S Q    ++ G    TN++ DN  +   TT      R  N Y R    KC+RC + GH SN CP+RR
Subjt:  EQIANRFKRTYSRRAMGEQTS-NTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTR-----RTSNTYSRPTLGKCFRCNQVGHLSNECPQRR

Query:  TLASRVMTMWTFLNQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDAC
                                      +D+GS EN VA +LV  L LP   HP PY + WIKKG   +VT    +P+SIG  YKD+++CD++DMDA 
Subjt:  TLASRVMTMWTFLNQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDAC

Query:  HILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPL---------------------------------------------------------------
        H+LLGRPWQY V  T+KGRDN Y F W   KI ++PL                                                               
Subjt:  HILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPL---------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------NPAKQ-------------------------------------------------------
                                                NP +                                                        
Subjt:  ----------------------------------------NPAKQ-------------------------------------------------------

Query:  --LAPTTTSSKK----------------------------------------------------------------------------------------
          +AP T   KK                                                                                        
Subjt:  --LAPTTTSSKK----------------------------------------------------------------------------------------

Query:  -----------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLRE
                                     DALSR+A+LL  L  EI+  D L + Y +D DF  IW+KC   QP  DF+  EG+LF+ + LCVP +SLRE
Subjt:  -----------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLRE

Query:  ALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSK
         LI++ H GGL GHLGRDKT++ LE+RY+WPQL++DV N V +C+ CQV KGQSQNTGLY PLPIP HIWEDLSMDF+LGLPRTQRG+DSVFVVVDRYSK
Subjt:  ALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSK

Query:  MAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQ
        M HF+PC+KT+DA ++A LFF+E+V+LHG+P++I SDRD KFLSHFW TLWK FDT+L  STT+HPQTDGQTE  N+TLGNLIR + G++PKQWD ALAQ
Subjt:  MAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQ

Query:  AEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPV
        AEFAYN   +  TG+SPF +VY K P+ T+DL ++P     S+ AE MAEQ+QA+  +V++ +E+ T KYK AADK RR K FK GD VM+ LRK RFPV
Subjt:  AEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPV

Query:  GTYNKLKQRKLGPFAILDKYGENAYKIDLPQ
        GTYNKLK +K GP+ +L K  +NAY IDLP+
Subjt:  GTYNKLKQRKLGPFAILDKYGENAYKIDLPQ

A0A5B7BER3 Uncharacterized protein2.0e-21234.19Show/hide
Query:  QGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFEQI
        +GQ   +++MKID+P+++G + IE+FL+W+  VE FF  M   + K+V+LVA KLKGGA AWWDQ++ NR+R GK+P+R W++M+RL++ERFLP+++EQ+
Subjt:  QGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFEQI

Query:  LYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNKSA
        LY QYQNCRQG RSV EY++EF+ L +R NL E++   VAR+ GGLR  I++QL+L+ I  LNEA S A  +E Q         SR+ +  Q  N+ +S 
Subjt:  LYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNKSA

Query:  LGDRTSQQNSMAALKG--KQVETNTTRD------NKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTL-----------------------
             +QQN    ++G   Q +  T RD      N+N+    +++++N Y+RP  GKCFRC Q GH SNECP RR +                       
Subjt:  LGDRTSQQNSMAALKG--KQVETNTTRD------NKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTL-----------------------

Query:  ---------------ASRVMTMWTFL----------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTST
                        S V+     +          N   TRCTIN K+C+VIIDSGS+EN+V+  LV AL L    HP PYK+ WIKKG E +VT    
Subjt:  ---------------ASRVMTMWTFL----------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTST

Query:  IPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP-----------------------------------------
        +P SIG  YKD++ CD++DMDACH+LLGRPWQ+ V ATHKG+DNTY F W  KK+VL+P                                         
Subjt:  IPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP-----------------------------------------

Query:  ---------------------------------------------------------------------------------------------LNPAKQ-
                                                                                                     L P K  
Subjt:  ---------------------------------------------------------------------------------------------LNPAKQ-

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --LAPTTTSSKK----------------------------------------------------------------------------------------
          +AP T   KK                                                                                        
Subjt:  --LAPTTTSSKK----------------------------------------------------------------------------------------

Query:  --------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQ
                                                                DALSR+A+LL ++  EI + + L + Y +D+DF   W KC+  Q
Subjt:  --------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQ

Query:  PAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDL
         + +FHI +G+LFK + LC+P +SLRE ++++ H+GGL GHLGRDKT+ ++E+RY+WPQL++DV  FV +C  CQ  KGQ+QNTGLYTPLP+P+ IWEDL
Subjt:  PAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDL

Query:  SMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTE
        +MDFILGLPRTQRG+DSVFVVVDR+SKMAHF+PCKKT+DA +VANLFF+EIVRLHG+PK+I SDRDVKFLSHFW+TLW+KFDT+L+YS+T+HPQTDGQTE
Subjt:  SMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTE

Query:  VTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEA
        VTN+TLGNLIRC SG++PKQWD+ L Q EFAYN M NR+T K+PFE+VYTK P+  +DL  +P     S+ AE  A++   +  +V++++EK    YK A
Subjt:  VTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEA

Query:  ADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
        ADK RR K F  GDLVM+ LRK+RFPVGTYNKLK RK GPF +  K  +NAY ++LP
Subjt:  ADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

A0A5C7HZV4 Reverse transcriptase2.5e-18631.79Show/hide
Query:  PREDQQIYPYQYQQEAQKE---GHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQ
        PR  + +    +  E  +E   G+Q  R+  + + D++++ DIP + GK++IE FL+W+  VE FF +    E ++V+LVA KL+ GA  WW++L+++R+
Subjt:  PREDQQIYPYQYQQEAQKE---GHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQ

Query:  RYGKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAA--
        R GK PIR W RMK+LM  RFLP ++EQ ++  YQNC QG ++V +YTEE+ RL  R NL E++   V+R+ GGL++ I++++ LQ +  ++EA + A  
Subjt:  RYGKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAA--

Query:  AVIEEQIANRFKRTYSRRAMGEQTSNTNKSALGDRTSQQNSMAALK--GKQVETNTTRDNKNSDNTTT--------RRTSNTYSRPTLGKCFRCNQVGHL
        A + E+ +NRF  ++ ++ MGE   ++N +A   R S  +     K    QV+  +  +N +    T         R   N YSRP + KC+RC Q GH 
Subjt:  AVIEEQIANRFKRTYSRRAMGEQTSNTNKSALGDRTSQQNSMAALK--GKQVETNTTRDNKNSDNTTT--------RRTSNTYSRPTLGKCFRCNQVGHL

Query:  SNECPQRR-----------------------------------------TLASRVMTMWTFLNQMTTR-------CTINGKICNVIIDSGSTENVVASKL
        SNECP R+                                          +  RVM   T L+ ++ R       C+I GK+C++I+DSGS EN V+ KL
Subjt:  SNECPQRR-----------------------------------------TLASRVMTMWTFLNQMTTR-------CTINGKICNVIIDSGSTENVVASKL

Query:  VSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------
        +  L L    HP PY + WI+KG +A VT    +P+SIG  Y++++ CDV+DMDA H+LLGRPWQ+ V  T++GRDN   F W GK+I ++P        
Subjt:  VSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLP--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------LNPAKQ--------------------------------------------------------------
                                        L P K                                                               
Subjt:  --------------------------------LNPAKQ--------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------LAPTTTSSKK--------------------------------------------------------------------------------
                  +AP T   KK                                                                                
Subjt:  ----------LAPTTTSSKK--------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTI
                                                                        DALSR+ASLL  ++ EII  + L + Y+DD+DF  +
Subjt:  ----------------------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTI

Query:  WKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPI
        W+ C   Q  G+FH+ EG+LF  + LC+P SSLRE LI+E H GGL GHLGRDKT++ + +RY+WPQL++DV NFV +C+  Q  KGQ+QNTGLY PLP+
Subjt:  WKKCQDHQPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPI

Query:  PQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSH
        P  IWEDL+MDF+LGLPRTQRG+DSVFVVVDR+SKM HF+PC+KT+DA +VA LFF+E+VRLHG+P++I SDRD KFLSHFW TLW++ DT LK+S+T+H
Subjt:  PQHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSH

Query:  PQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEK
        PQTDGQTE  N+TLGNLIR + G+KPKQWD+ALAQAEFAYN+  +  TGKSPF LVY + P+  +DL ++P     S+ AE MAEQ++ +  +V+  +E+
Subjt:  PQTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEK

Query:  QTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
        +  KYK AAD KRR K F  GD VM+ LRK RFPVG+YNKLK RK GP+ ++ K   NAY IDLP
Subjt:  QTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

A0A6D2HLB5 Reverse transcriptase1.6e-17731.81Show/hide
Query:  EAQKEGHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM
        E ++E  Q  R   Q  +D+++K DIP + G M +E FL+W   V+ FF  M  PE K+V++VA++LK  A  WWD+L + RQR  K P++ W RMK+LM
Subjt:  EAQKEGHQYYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLM

Query:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA
         +RFLP ++EQILY  Y  C QG R+V EYT EF R   R +LGE+    VAR+  GLR+ I+E+  LQ +  + EA S A  ++ ++  +  R++S   
Subjt:  KERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRA

Query:  MGEQTSNTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA------------------
                       R + QN+  A   K+         KN                    C+RC   GH SN CP RRT+A                  
Subjt:  MGEQTSNTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRTLA------------------

Query:  -------------SRVMTMWTFL---------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLS
                       ++     L         N   T C+I  K+CNVI+D+GSTEN+V+ KLV  L LP   H  PY + WIKKG + +VT T  +P+S
Subjt:  -------------SRVMTMWTFL---------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLS

Query:  IGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLW----------------MGKK----------------------------------
        IG  YK+++ICDVLDMD CH+L GRPWQY    T++GRDN   F W                 GKK                                  
Subjt:  IGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLW----------------MGKK----------------------------------

Query:  ----------------------------------------------------------------------------------------IVLLP-------
                                                                                                ++L+P       
Subjt:  ----------------------------------------------------------------------------------------IVLLP-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------LNPAKQL------------------------------------------------------------------------------
                       L+  KQ+                                                                              
Subjt:  ---------------LNPAKQL------------------------------------------------------------------------------

Query:  APTTTSSKK-------------------------------------------------------------------------------------------
        AP T   KK                                                                                           
Subjt:  APTTTSSKK-------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG
                                                             DALSR+ASLL  L  EI+  + L + Y  D +F  +W KC    P+ 
Subjt:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG

Query:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD
        DFHI +GFLFK D LC+P SSLRE LI++ H GGL+GHLGRDKT+  LE+RY+WP LR+D    V RC+ CQ  KGQSQNTGLY PLP+P  IW+DLSMD
Subjt:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD

Query:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN
        F+LGLPRTQRG+DSVFVVVDR+SKM HF+ CKKTADA  +A LFF+E+VRLHG+PKTI+SDRD KFLSHFW TLW+ F TTLK S+T+HPQTDGQTEVTN
Subjt:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN

Query:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK
        +TLGN+IR + G++PKQWD+AL Q EFAYN   +  TGKSPF LVYT +P+  +DL K+P     S+ AE MAE+I A    V+  +E    K K AADK
Subjt:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK

Query:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQ
        +RR K FK GD VM+ LRK RFPVGTY KL+  K GPF +L K  +NAY +DLP+
Subjt:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQ

A0A6N2LVR1 Uncharacterized protein4.8e-19832.77Show/hide
Query:  YYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMN
        Y++ + +++  F+MK+D+P+++G+++IE FL+W+  VE FF YM  PE KKV+LVA +L GGA AWW+QL+L R R  K  ++ W +M+RL++ R+LP +
Subjt:  YYRYQGQQENDFKMKIDIPTYSGKMEIEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMN

Query:  FEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNT
        +EQIL+ QYQNC+QG R+V  Y EEFHRL +R NL E+    VARF GGLR +I++++S+  I  L EAI+ A   E Q+    + T + R   + T N+
Subjt:  FEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLGESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNT

Query:  NKSALGDRTSQQNSMA-ALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRT---------------------------
                    NSM+   KG             +         N YSRPT  KC+RC Q GH SN CP+R                             
Subjt:  NKSALGDRTSQQNSMA-ALKGKQVETNTTRDNKNSDNTTTRRTSNTYSRPTLGKCFRCNQVGHLSNECPQRRT---------------------------

Query:  ----------LASRVMTMWTFL------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIP
                  L S+ + +   +            N   TRCT+N K+C+VIIDSGS+EN+++  +V+ L L    H  PYK+ WIKKG E +VT T    
Subjt:  ----------LASRVMTMWTFL------------NQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIP

Query:  LSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPLN-----------------------------------------
         SIG +Y D+I+CDV++MDACH++LGRPWQY V  T+KG+DN Y F+  G+K++L PL                                          
Subjt:  LSIGASYKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPLN-----------------------------------------

Query:  --------------------------------------------------------------------------------------------PAKQ----
                                                                                                    P K     
Subjt:  --------------------------------------------------------------------------------------------PAKQ----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------L
                                                                                                           +
Subjt:  ---------------------------------------------------------------------------------------------------L

Query:  APTTTSSKK-------------------------------------------------------------------------------------------
        AP T   KK                                                                                           
Subjt:  APTTTSSKK-------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG
                                                             DALSRK SLLT L+ E+I  + + D Y  D+DF   W KCQ      
Subjt:  -----------------------------------------------------DALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDHQPAG

Query:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD
          H  +G+LF+ + LC+P SSLRE +I E H GGL GHLGRDKT+ + E+RY+WPQL++D+ N V RC TCQ  KGQ+QNTGLY PLPIP   WEDLSMD
Subjt:  DFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMD

Query:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN
        FILGLPRTQRG+DSVFVVVDR+SKMAHF+ CKKT+DAV+VANLFFKE+VRLHG+PK+I SDRD KFLSHFW+TLW++FDTTL +S+TSHPQTDGQTEV N
Subjt:  FILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTN

Query:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK
        +TLGNLIRCLSGE+PKQWD+ LAQAEFAYN M NR+TGK+PF++VY + P+  +DL  +P     ++ AE MA++++A+  +VR+++E    KYK AADK
Subjt:  KTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADK

Query:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQITTFAP
        KRR K FK GDLVM+ LRK R P GT +KL  +K GP+ IL K  +NAY++DLP   T +P
Subjt:  KRRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLPQITTFAP

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein5.2e-5635.07Show/hide
Query:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP
        +D +   +  + +G L   KD + +P+ + L   +IK+ H  G   H G +   NI+ +R+ W  +RK +  +V  C TCQ+ K ++    G   P+P  
Subjt:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP

Query:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP
        +  WE LSMDFI  LP +  G +++FVVVDR+SKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  WK    K++  +K+S    P
Subjt:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP

Query:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ
        QTDGQTE TN+T+  L+RC+    P  W   ++  + +YN+  +  T  +PFE+V+   P L+    ++PS   FS + +E +++   +   V+EH+   
Subjt:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ

Query:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
          K K+  D K +  +EF+ GDLVM++  K+ F +   NKL     GPF +L K G N Y++DLP
Subjt:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

P0CT41 Transposon Tf2-12 polyprotein5.2e-5635.07Show/hide
Query:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP
        +D +   +  + +G L   KD + +P+ + L   +IK+ H  G   H G +   NI+ +R+ W  +RK +  +V  C TCQ+ K ++    G   P+P  
Subjt:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP

Query:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP
        +  WE LSMDFI  LP +  G +++FVVVDR+SKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  WK    K++  +K+S    P
Subjt:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP

Query:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ
        QTDGQTE TN+T+  L+RC+    P  W   ++  + +YN+  +  T  +PFE+V+   P L+    ++PS   FS + +E +++   +   V+EH+   
Subjt:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ

Query:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
          K K+  D K +  +EF+ GDLVM++  K+ F +   NKL     GPF +L K G N Y++DLP
Subjt:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.7e-5733.33Show/hide
Query:  DNTYDFLWMGKKIVLLPLNPAKQLAPTTTSSKKDALSRKASLLT--LLKGEIIAMDHLPD---TYNDDQDFSTIWKKCQDHQP-AGDFHIIEGFLFKKDV
        D T ++L   K +V   ++ A       TS   D  S K+   +  L    +I M  L     T  D   F +  KK +  +    ++ + +  ++ +D 
Subjt:  DNTYDFLWMGKKIVLLPLNPAKQLAPTTTSSKKDALSRKASLLT--LLKGEIIAMDHLPD---TYNDDQDFSTIWKKCQDHQP-AGDFHIIEGFLFKKDV

Query:  LCVPHSSLREALIKEAHAGGL-AGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIPQHIWEDLSMDFILGLPRTQRGL
        L VP    + A+++  H   L  GH G   TL  +   Y+WP+L+  +  ++  C  CQ+ K       GL  PLPI +  W D+SMDF+ GLP T   L
Subjt:  LCVPHSSLREALIKEAHAGGL-AGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIPQHIWEDLSMDFILGLPRTQRGL

Query:  DSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSG
        + + VVVDR+SK AHF+  +KT DA  + +L F+ I   HG P+TI SDRDV+  +  ++ L K+       S+ +HPQTDGQ+E T +TL  L+R    
Subjt:  DSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSG

Query:  EKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDL
           + W + L Q EF YN    RT GKSPFE+    LP  T  +         S  A E+A+ ++AL  Q +E +E    + +   +++R+     IGD 
Subjt:  EKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDL

Query:  VMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDL
        V++  R + F  G Y K++Q  +GPF ++ K  +NAY++DL
Subjt:  VMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDL

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.6e-5733.33Show/hide
Query:  DNTYDFLWMGKKIVLLPLNPAKQLAPTTTSSKKDALSRKASLLT--LLKGEIIAMDHLPD---TYNDDQDFSTIWKKCQDHQP-AGDFHIIEGFLFKKDV
        D T ++L   K +V   ++ A       TS   D  S K+   +  L    +I M  L     T  D   F +  KK +  +    ++ + +  ++ +D 
Subjt:  DNTYDFLWMGKKIVLLPLNPAKQLAPTTTSSKKDALSRKASLLT--LLKGEIIAMDHLPD---TYNDDQDFSTIWKKCQDHQP-AGDFHIIEGFLFKKDV

Query:  LCVPHSSLREALIKEAHAGGL-AGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIPQHIWEDLSMDFILGLPRTQRGL
        L VP    + A+++  H   L  GH G   TL  +   Y+WP+L+  +  ++  C  CQ+ K       GL  PLPI +  W D+SMDF+ GLP T   L
Subjt:  LCVPHSSLREALIKEAHAGGL-AGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIPQHIWEDLSMDFILGLPRTQRGL

Query:  DSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSG
        + + VVVDR+SK AHF+  +KT DA  + +L F+ I   HG P+TI SDRDV+  +  ++ L K+       S+ +HPQTDGQ+E T +TL  L+R  + 
Subjt:  DSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSG

Query:  EKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDL
           + W + L Q EF YN    RT GKSPFE+    LP  T  +         S  A E+A+ ++AL  Q +E +E    + +   +++R+     IGD 
Subjt:  EKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDL

Query:  VMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDL
        V++  R + F  G Y K++Q  +GPF ++ K  +NAY++DL
Subjt:  VMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDL

Q9UR07 Transposon Tf2-11 polyprotein5.2e-5635.07Show/hide
Query:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP
        +D +   +  + +G L   KD + +P+ + L   +IK+ H  G   H G +   NI+ +R+ W  +RK +  +V  C TCQ+ K ++    G   P+P  
Subjt:  QDHQPAGDFHIIEGFLF-KKDVLCVPH-SSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNT-GLYTPLPIP

Query:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP
        +  WE LSMDFI  LP +  G +++FVVVDR+SKMA  +PC K+  A   A +F + ++   G PK I++D D  F S  WK    K++  +K+S    P
Subjt:  QHIWEDLSMDFILGLPRTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHP

Query:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ
        QTDGQTE TN+T+  L+RC+    P  W   ++  + +YN+  +  T  +PFE+V+   P L+    ++PS   FS + +E +++   +   V+EH+   
Subjt:  QTDGQTEVTNKTLGNLIRCLSGEKPKQWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQ

Query:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP
          K K+  D K +  +EF+ GDLVM++  K+ F +   NKL     GPF +L K G N Y++DLP
Subjt:  TTKYKEAADKK-RRFKEFKIGDLVMIQLRKSRFPVGTYNKLKQRKLGPFAILDKYGENAYKIDLP

Arabidopsis top hitse value%identityAlignment
AT4G12790.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-6474.21Show/hide
Query:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS
        QIELF+HVPVLK FVEHLK+KNFNVC VYLLDSQF+TD+TKFISGCM+SL+AM+QLELPH+NILSKMDL+ +K +I+D+LNPEPR LL+ELN+RM PQ++
Subjt:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS

Query:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN
        KLNKALIE+V EY MV+F+P++LRKE SI+YVL+QID CIQ+GEDADV IKD D   D+
Subjt:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN

AT4G12790.2 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-6474.21Show/hide
Query:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS
        QIELF+HVPVLK FVEHLK+KNFNVC VYLLDSQF+TD+TKFISGCM+SL+AM+QLELPH+NILSKMDL+ +K +I+D+LNPEPR LL+ELN+RM PQ++
Subjt:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS

Query:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN
        KLNKALIE+V EY MV+F+P++LRKE SI+YVL+QID CIQ+GEDADV IKD D   D+
Subjt:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN

AT4G12790.3 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-6474.21Show/hide
Query:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS
        QIELF+HVPVLK FVEHLK+KNFNVC VYLLDSQF+TD+TKFISGCM+SL+AM+QLELPH+NILSKMDL+ +K +I+D+LNPEPR LL+ELN+RM PQ++
Subjt:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS

Query:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN
        KLNKALIE+V EY MV+F+P++LRKE SI+YVL+QID CIQ+GEDADV IKD D   D+
Subjt:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN

AT4G12790.4 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-6474.21Show/hide
Query:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS
        QIELF+HVPVLK FVEHLK+KNFNVC VYLLDSQF+TD+TKFISGCM+SL+AM+QLELPH+NILSKMDL+ +K +I+D+LNPEPR LL+ELN+RM PQ++
Subjt:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS

Query:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN
        KLNKALIE+V EY MV+F+P++LRKE SI+YVL+QID CIQ+GEDADV IKD D   D+
Subjt:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN

AT4G12790.5 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.1e-6474.21Show/hide
Query:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS
        QIELF+HVPVLK FVEHLK+KNFNVC VYLLDSQF+TD+TKFISGCM+SL+AM+QLELPH+NILSKMDL+ +K +I+D+LNPEPR LL+ELN+RM PQ++
Subjt:  QIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMDLVTNKKDIEDFLNPEPRVLLSELNQRMAPQFS

Query:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN
        KLNKALIE+V EY MV+F+P++LRKE SI+YVL+QID CIQ+GEDADV IKD D   D+
Subjt:  KLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAACCAAGAAAGTGTCGACCAACGTTACGGGCGATACTTCCTCGGCTGGAAAAGAGACGGAGCCTACCACCGTCCTCTCACCACGATCTACAACCGTACGCTTGCT
GTCGGTTGAACAAGATACTAAAATCTTGAAAGAAGATGTGGGTGAGATAAAGCAGATTTTAGAGATGATTTGCGAAAAGTTGGGCTGCAGATCTGACCAACAAGTTGCTG
AACCAAGAATTCACAAAAACGTGGAGCGGAGACAGCAAGAACACCAAGGGGAAGATTTCAGACCAAGGCAAAGGCAAGACAGACAAATTGTGGAGCAAAAATTTGCTCAA
GAAACACATTCTGCACCAAGATCCTATCAAGAAAGACGCACAGGGCAGCAAGAAGTCAAATACAATCCTTTATTTCGAAGGCAAGTTGAATGGAATGATGACAGCTCAAG
TGAAGAAGATTATGACTACCAGGAAAGATGCAAGGAAGGCCAACAAAATTTTCGTTATCAATTCCAGCAAGAAACGCCAAGGGAAGATCAACAAATTTATCCATATCAAT
ATCAGCAAGAAGCACAAAAGGAGGGGCATCAATATTATAGGTACCAAGGTCAGCAAGAAAATGATTTTAAAATGAAGATTGATATACCAACTTATAGTGGGAAGATGGAG
ATCGAAGCCTTCCTCGAATGGGTTCGACATGTCGAAAATTTTTTCAGCTACATGAACACGCCCGAGACCAAAAAGGTGAGATTAGTGGCTCTAAAATTAAAAGGAGGAGC
CCAAGCATGGTGGGATCAATTGGAACTTAACCGCCAAAGATATGGGAAAAGACCTATCCGACGATGGGAAAGAATGAAAAGACTCATGAAGGAAAGATTCCTACCTATGA
ATTTCGAGCAAATTCTCTATGACCAATACCAAAATTGCAGACAAGGAGCTCGCTCGGTCATGGAATACACGGAAGAATTTCATCGGTTAGGGGCAAGAACAAATCTTGGA
GAGAGCCAACAATACCTAGTTGCAAGATTCAAAGGAGGCCTTCGGAATGACATTAAAGAACAGCTTTCTTTACAACCGATTGGCTATCTAAATGAAGCAATTTCTGCTGC
AGCTGTAATTGAAGAGCAAATTGCCAATCGCTTCAAAAGAACATACTCAAGAAGAGCCATGGGTGAACAAACCAGCAACACCAACAAATCTGCCCTTGGCGACCGAACCT
CGCAGCAAAATTCGATGGCAGCCCTCAAGGGAAAGCAAGTAGAGACCAATACAACTAGGGATAACAAGAACTCAGATAATACAACCACAAGGAGGACCTCCAACACTTAC
TCTCGGCCGACGCTAGGAAAGTGCTTTCGATGCAATCAAGTAGGCCATTTATCCAATGAGTGTCCCCAACGAAGGACTCTTGCGTCGAGAGTGATGACAATGTGGACTTT
CTTGAACCAGATGACGACACGATGCACAATCAATGGGAAAATATGCAACGTGATCATCGATAGTGGAAGCACTGAAAATGTTGTAGCTAGCAAGCTAGTTTCGGCGCTCA
ATCTTCCTTTACACCCACATCCGACACCCTATAAGGTAAGTTGGATCAAAAAAGGTGGCGAGGCTCAGGTCACTCATACAAGTACCATTCCGTTATCAATTGGAGCAAGC
TACAAAGACCAAATAATCTGTGATGTGTTGGATATGGATGCATGCCATATACTTTTGGGACGTCCATGGCAATACGGTGTCCAAGCAACACATAAAGGGAGAGACAATAC
ATATGACTTCCTATGGATGGGCAAGAAGATAGTTCTCCTACCACTCAATCCAGCAAAACAACTAGCTCCAACGACCACCTCTTCAAAGAAAGATGCCTTGAGCCGGAAAG
CAAGCCTTTTAACTCTTTTGAAGGGAGAAATTATTGCAATGGATCATCTTCCAGACACCTATAATGATGATCAAGATTTTTCAACAATCTGGAAAAAATGCCAAGACCAC
CAGCCAGCAGGGGATTTTCATATTATCGAAGGATTTCTCTTTAAAAAAGATGTCCTCTGTGTGCCACACTCTTCATTGCGTGAAGCATTAATCAAGGAAGCACATGCTGG
CGGTCTTGCTGGACACCTGGGACGTGACAAAACTCTGAACATTCTGGAACAAAGGTACTTTTGGCCCCAGCTCCGAAAGGATGTAAGTAATTTTGTTAGTCGATGCTTCA
CTTGCCAAGTTTTTAAGGGACAATCTCAAAACACTGGTTTATATACACCCTTGCCAATCCCACAACATATTTGGGAAGATCTTTCCATGGATTTCATACTTGGCCTTCCC
CGAACTCAACGTGGACTTGACTCAGTTTTTGTAGTAGTCGATCGTTACAGTAAAATGGCCCATTTTCTGCCATGTAAGAAAACTGCAGACGCTGTATATGTTGCTAATCT
GTTTTTTAAGGAGATTGTTCGACTCCATGGTATTCCAAAAACCATAGTATCTGATCGAGATGTTAAGTTTTTGAGTCACTTTTGGAAAACACTATGGAAAAAATTCGATA
CAACACTCAAATACAGCACAACCAGCCATCCTCAAACCGATGGGCAAACTGAGGTAACAAACAAGACCCTAGGGAACTTAATTCGATGCCTAAGTGGAGAGAAGCCAAAA
CAATGGGACATGGCATTGGCCCAAGCTGAATTCGCCTACAATCATATGAAGAACCGAACCACAGGGAAGTCCCCATTTGAACTTGTGTACACTAAATTACCTAGACTTAC
CATGGATCTAACCAAAGTTCCCTCTTCTATTGATTTTAGTATGGAGGCTGAAGAAATGGCCGAACAAATCCAAGCGCTTCACAATCAAGTACGAGAACATATAGAGAAGC
AAACAACTAAATACAAAGAAGCAGCGGACAAGAAGCGACGATTCAAGGAATTCAAGATAGGCGATCTCGTTATGATCCAACTGCGGAAAAGTCGATTTCCAGTAGGAACT
TACAACAAATTGAAGCAAAGGAAGCTCGGCCCATTTGCTATTCTTGACAAATATGGCGAAAATGCATACAAAATTGACCTCCCCCAGATTACAACATTTGCACCACATTT
AATGTTGCTGACTTGTCAGATAGAGCTCTTTTCCCATGTTCCGGTTCTTAAAATCTTTGTGGAGCATTTAAAGCGAAAAAATTTCAATGTCTGTGCTGTGTATTTGCTTG
ATTCTCAATTCATGACAGACATCACAAAGTTTATCAGTGGTTGTATGGCATCTCTTTCAGCAATGGTCCAACTAGAATTACCACATATTAACATCCTCTCCAAAATGGAT
CTTGTGACTAACAAAAAGGATATTGAAGACTTCTTAAATCCAGAGCCTCGAGTTTTGTTGTCTGAGTTGAATCAACGTATGGCTCCACAATTTTCAAAGCTAAATAAAGC
TCTAATTGAACTGGTGGACGAATATAACATGGTGAGTTTTGTGCCACTTGACTTGAGGAAAGAAAGCAGTATAAGATACGTATTGGCTCAGATAGATAACTGCATTCAGT
ACGGTGAAGATGCAGATGTGAAGATTAAGGACTTTGATCCAGAGGATGATAATGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAACCAAGAAAGTGTCGACCAACGTTACGGGCGATACTTCCTCGGCTGGAAAAGAGACGGAGCCTACCACCGTCCTCTCACCACGATCTACAACCGTACGCTTGCT
GTCGGTTGAACAAGATACTAAAATCTTGAAAGAAGATGTGGGTGAGATAAAGCAGATTTTAGAGATGATTTGCGAAAAGTTGGGCTGCAGATCTGACCAACAAGTTGCTG
AACCAAGAATTCACAAAAACGTGGAGCGGAGACAGCAAGAACACCAAGGGGAAGATTTCAGACCAAGGCAAAGGCAAGACAGACAAATTGTGGAGCAAAAATTTGCTCAA
GAAACACATTCTGCACCAAGATCCTATCAAGAAAGACGCACAGGGCAGCAAGAAGTCAAATACAATCCTTTATTTCGAAGGCAAGTTGAATGGAATGATGACAGCTCAAG
TGAAGAAGATTATGACTACCAGGAAAGATGCAAGGAAGGCCAACAAAATTTTCGTTATCAATTCCAGCAAGAAACGCCAAGGGAAGATCAACAAATTTATCCATATCAAT
ATCAGCAAGAAGCACAAAAGGAGGGGCATCAATATTATAGGTACCAAGGTCAGCAAGAAAATGATTTTAAAATGAAGATTGATATACCAACTTATAGTGGGAAGATGGAG
ATCGAAGCCTTCCTCGAATGGGTTCGACATGTCGAAAATTTTTTCAGCTACATGAACACGCCCGAGACCAAAAAGGTGAGATTAGTGGCTCTAAAATTAAAAGGAGGAGC
CCAAGCATGGTGGGATCAATTGGAACTTAACCGCCAAAGATATGGGAAAAGACCTATCCGACGATGGGAAAGAATGAAAAGACTCATGAAGGAAAGATTCCTACCTATGA
ATTTCGAGCAAATTCTCTATGACCAATACCAAAATTGCAGACAAGGAGCTCGCTCGGTCATGGAATACACGGAAGAATTTCATCGGTTAGGGGCAAGAACAAATCTTGGA
GAGAGCCAACAATACCTAGTTGCAAGATTCAAAGGAGGCCTTCGGAATGACATTAAAGAACAGCTTTCTTTACAACCGATTGGCTATCTAAATGAAGCAATTTCTGCTGC
AGCTGTAATTGAAGAGCAAATTGCCAATCGCTTCAAAAGAACATACTCAAGAAGAGCCATGGGTGAACAAACCAGCAACACCAACAAATCTGCCCTTGGCGACCGAACCT
CGCAGCAAAATTCGATGGCAGCCCTCAAGGGAAAGCAAGTAGAGACCAATACAACTAGGGATAACAAGAACTCAGATAATACAACCACAAGGAGGACCTCCAACACTTAC
TCTCGGCCGACGCTAGGAAAGTGCTTTCGATGCAATCAAGTAGGCCATTTATCCAATGAGTGTCCCCAACGAAGGACTCTTGCGTCGAGAGTGATGACAATGTGGACTTT
CTTGAACCAGATGACGACACGATGCACAATCAATGGGAAAATATGCAACGTGATCATCGATAGTGGAAGCACTGAAAATGTTGTAGCTAGCAAGCTAGTTTCGGCGCTCA
ATCTTCCTTTACACCCACATCCGACACCCTATAAGGTAAGTTGGATCAAAAAAGGTGGCGAGGCTCAGGTCACTCATACAAGTACCATTCCGTTATCAATTGGAGCAAGC
TACAAAGACCAAATAATCTGTGATGTGTTGGATATGGATGCATGCCATATACTTTTGGGACGTCCATGGCAATACGGTGTCCAAGCAACACATAAAGGGAGAGACAATAC
ATATGACTTCCTATGGATGGGCAAGAAGATAGTTCTCCTACCACTCAATCCAGCAAAACAACTAGCTCCAACGACCACCTCTTCAAAGAAAGATGCCTTGAGCCGGAAAG
CAAGCCTTTTAACTCTTTTGAAGGGAGAAATTATTGCAATGGATCATCTTCCAGACACCTATAATGATGATCAAGATTTTTCAACAATCTGGAAAAAATGCCAAGACCAC
CAGCCAGCAGGGGATTTTCATATTATCGAAGGATTTCTCTTTAAAAAAGATGTCCTCTGTGTGCCACACTCTTCATTGCGTGAAGCATTAATCAAGGAAGCACATGCTGG
CGGTCTTGCTGGACACCTGGGACGTGACAAAACTCTGAACATTCTGGAACAAAGGTACTTTTGGCCCCAGCTCCGAAAGGATGTAAGTAATTTTGTTAGTCGATGCTTCA
CTTGCCAAGTTTTTAAGGGACAATCTCAAAACACTGGTTTATATACACCCTTGCCAATCCCACAACATATTTGGGAAGATCTTTCCATGGATTTCATACTTGGCCTTCCC
CGAACTCAACGTGGACTTGACTCAGTTTTTGTAGTAGTCGATCGTTACAGTAAAATGGCCCATTTTCTGCCATGTAAGAAAACTGCAGACGCTGTATATGTTGCTAATCT
GTTTTTTAAGGAGATTGTTCGACTCCATGGTATTCCAAAAACCATAGTATCTGATCGAGATGTTAAGTTTTTGAGTCACTTTTGGAAAACACTATGGAAAAAATTCGATA
CAACACTCAAATACAGCACAACCAGCCATCCTCAAACCGATGGGCAAACTGAGGTAACAAACAAGACCCTAGGGAACTTAATTCGATGCCTAAGTGGAGAGAAGCCAAAA
CAATGGGACATGGCATTGGCCCAAGCTGAATTCGCCTACAATCATATGAAGAACCGAACCACAGGGAAGTCCCCATTTGAACTTGTGTACACTAAATTACCTAGACTTAC
CATGGATCTAACCAAAGTTCCCTCTTCTATTGATTTTAGTATGGAGGCTGAAGAAATGGCCGAACAAATCCAAGCGCTTCACAATCAAGTACGAGAACATATAGAGAAGC
AAACAACTAAATACAAAGAAGCAGCGGACAAGAAGCGACGATTCAAGGAATTCAAGATAGGCGATCTCGTTATGATCCAACTGCGGAAAAGTCGATTTCCAGTAGGAACT
TACAACAAATTGAAGCAAAGGAAGCTCGGCCCATTTGCTATTCTTGACAAATATGGCGAAAATGCATACAAAATTGACCTCCCCCAGATTACAACATTTGCACCACATTT
AATGTTGCTGACTTGTCAGATAGAGCTCTTTTCCCATGTTCCGGTTCTTAAAATCTTTGTGGAGCATTTAAAGCGAAAAAATTTCAATGTCTGTGCTGTGTATTTGCTTG
ATTCTCAATTCATGACAGACATCACAAAGTTTATCAGTGGTTGTATGGCATCTCTTTCAGCAATGGTCCAACTAGAATTACCACATATTAACATCCTCTCCAAAATGGAT
CTTGTGACTAACAAAAAGGATATTGAAGACTTCTTAAATCCAGAGCCTCGAGTTTTGTTGTCTGAGTTGAATCAACGTATGGCTCCACAATTTTCAAAGCTAAATAAAGC
TCTAATTGAACTGGTGGACGAATATAACATGGTGAGTTTTGTGCCACTTGACTTGAGGAAAGAAAGCAGTATAAGATACGTATTGGCTCAGATAGATAACTGCATTCAGT
ACGGTGAAGATGCAGATGTGAAGATTAAGGACTTTGATCCAGAGGATGATAATGATTAA
Protein sequenceShow/hide protein sequence
MATKKVSTNVTGDTSSAGKETEPTTVLSPRSTTVRLLSVEQDTKILKEDVGEIKQILEMICEKLGCRSDQQVAEPRIHKNVERRQQEHQGEDFRPRQRQDRQIVEQKFAQ
ETHSAPRSYQERRTGQQEVKYNPLFRRQVEWNDDSSSEEDYDYQERCKEGQQNFRYQFQQETPREDQQIYPYQYQQEAQKEGHQYYRYQGQQENDFKMKIDIPTYSGKME
IEAFLEWVRHVENFFSYMNTPETKKVRLVALKLKGGAQAWWDQLELNRQRYGKRPIRRWERMKRLMKERFLPMNFEQILYDQYQNCRQGARSVMEYTEEFHRLGARTNLG
ESQQYLVARFKGGLRNDIKEQLSLQPIGYLNEAISAAAVIEEQIANRFKRTYSRRAMGEQTSNTNKSALGDRTSQQNSMAALKGKQVETNTTRDNKNSDNTTTRRTSNTY
SRPTLGKCFRCNQVGHLSNECPQRRTLASRVMTMWTFLNQMTTRCTINGKICNVIIDSGSTENVVASKLVSALNLPLHPHPTPYKVSWIKKGGEAQVTHTSTIPLSIGAS
YKDQIICDVLDMDACHILLGRPWQYGVQATHKGRDNTYDFLWMGKKIVLLPLNPAKQLAPTTTSSKKDALSRKASLLTLLKGEIIAMDHLPDTYNDDQDFSTIWKKCQDH
QPAGDFHIIEGFLFKKDVLCVPHSSLREALIKEAHAGGLAGHLGRDKTLNILEQRYFWPQLRKDVSNFVSRCFTCQVFKGQSQNTGLYTPLPIPQHIWEDLSMDFILGLP
RTQRGLDSVFVVVDRYSKMAHFLPCKKTADAVYVANLFFKEIVRLHGIPKTIVSDRDVKFLSHFWKTLWKKFDTTLKYSTTSHPQTDGQTEVTNKTLGNLIRCLSGEKPK
QWDMALAQAEFAYNHMKNRTTGKSPFELVYTKLPRLTMDLTKVPSSIDFSMEAEEMAEQIQALHNQVREHIEKQTTKYKEAADKKRRFKEFKIGDLVMIQLRKSRFPVGT
YNKLKQRKLGPFAILDKYGENAYKIDLPQITTFAPHLMLLTCQIELFSHVPVLKIFVEHLKRKNFNVCAVYLLDSQFMTDITKFISGCMASLSAMVQLELPHINILSKMD
LVTNKKDIEDFLNPEPRVLLSELNQRMAPQFSKLNKALIELVDEYNMVSFVPLDLRKESSIRYVLAQIDNCIQYGEDADVKIKDFDPEDDND