; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007991 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007991
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr9:9574111..9580602
RNA-Seq ExpressionLag0007991
SyntenyLag0007991
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]4.4e-16633.96Show/hide
Query:  SSIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEM
        + IA ++++  TS+ +W   + L G+  +S+I  L     + RK   KM +YL  MK + D L LAG  V  S LI   + GLD+EY P++ +++ ++ +
Subjt:  SSIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEM

Query:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY
        +W +    L++FE+ +  +N+L      TNL+ N  A     N +        +  N     N RG+RG GRGR    G + K  CQVCG   H A+ C+
Subjt:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY

Query:  FRFERSIG-ANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL
         RF+++   +N S+ +   G++ A++A+   + D+ WY DSGA++H T          E+ G   LVVGNG KL I   G        S    L+L D+L
Subjt:  FRFERSIG-ANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL

Query:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSS--SPISH-TKSIFTAFNESKFTTICPST
        +VP ITKNL+SVSKL ADN++ VEF  N CFVKDK T +++L+G LK+GLYQ   L  ++ N   F   K S    + H    +     ES    + PS 
Subjt:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSS--SPISH-TKSIFTAFNESKFTTICPST

Query:  FVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENYF
                  DN S FC+ACQ GK H LPFK S + + +PLELV +D+WGP P+ +SSG+++Y+ FVDDF+RFT IYPL+ K + +++FI FKN  EN F
Subjt:  FVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENYF

Query:  ECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP---------------------------------------
          +IK +Q D GGE++P   +    GI FR  CP+TS+QNG  ERKHRHI E GLTLL +                                        
Subjt:  ECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP---------------------------------------

Query:  --------------PCLS-------NIDLSICTWLA------------------------------PI--GTTQFASPLPPVSSTPAS------------
                      PCL            + C +L                               P   G     SPL    + P++            
Subjt:  --------------PCLS-------NIDLSICTWLA------------------------------PI--GTTQFASPLPPVSSTPAS------------

Query:  ----PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPT-----------------KSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKE
            P L + +P           ++++  T+N P +   T                 ++   +H + TR+K+GI +PK       +       EP   KE
Subjt:  ----PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPT-----------------KSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKE

Query:  ALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELR
        AL  P WK+ M  E+ AL  N+TW L+P  N+ ++V +KWVF+ K  PDGS+ + KARLVAKGF Q   ID+ ETFSPVIK STVR+IL+I V  NWE+R
Subjt:  ALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELR

Query:  QIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------------------------------
        Q+DINNAFLNG L+E V+M QPEGF+D ++P++ICKL K++YGLKQAPR W                                                 
Subjt:  QIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------------------------------

Query:  -------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM-----------------------
                                       D SG+YL Q KYI  +L+K  M+N  P  TP++ G+  T  +GE +                       
Subjt:  -------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM-----------------------

Query:  -----EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
             ++   ++SP+  HWQ +KR+LRYL+GT  + LH  PS  L I  FSDADWA   DD +
Subjt:  -----EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]3.3e-16635.05Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        IA ++++  TS+ +W   + L G+  KSRI+ L      TRK   KM EYL  MK ++D L LAG  +  S L+   + GLDAEY P++ +++ +  ++W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY
         +    L++FE+ L+  N+   SG   N SANFA K +  GN F+           +RG N+ +  FRG   GRG   G  S   CQVC   GH A+ C 
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY

Query:  FRFERS-IGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL
        +RF+R   G N S+     G+++A+IA+P    D+ WY DSGA +H T   +      E+ G   L+VGNG KLKI   G   L       + L+L DVL
Subjt:  FRFERS-IGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL

Query:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI-CPSTFV
        +VP+ITKNL+SVSKLTADN++ VEF +N C VKDK T + LL+GRLK+GLYQ+    P                      ++ +  ES    +  P+  V
Subjt:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI-CPSTFV

Query:  TS---KNASVKDNES---GFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
             K+ +VK + S    FC+ACQ GK H LPFK S +   +PL L+ SD+WGP P+ S SG+++Y+ F+DDF+RFT I+PL+ K D + +FI FKN  
Subjt:  TS---KNASVKDNES---GFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL--------------------------------------
        EN F  KIK +Q D GGE++    V    GI FR  CP+TS+QNG  ERKHRH+ E+GLTLL                                      
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL--------------------------------------

Query:  ---------VKP------PCL--------------------------------------------------------------------SNIDLSICTWL
                 +KP      PCL                                                                    S+I L  C+  
Subjt:  ---------VKP------PCL--------------------------------------------------------------------SNIDLSICTWL

Query:  APIGTTQFASPLPPVSSTPASPNLHS------NDPPPLIPFTQSPTSTNSSPTSN------------------SPPQPPPTKSLANTHPMQTRAKNGIFQ
        A   TTQ A  + P ++T +  N HS      N+    +  ++   +TN+S T +                     Q    +  +NTH M+TR+K+GI +
Subjt:  APIGTTQFASPLPPVSSTPASPNLHS------NDPPPLIPFTQSPTSTNSSPTSN------------------SPPQPPPTKSLANTHPMQTRAKNGIFQ

Query:  PKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETF
        PK       +   S   EPK VKEAL  P WK+ M  EY AL  N TWTL+P   + +++ +KW+F+ K   DGS+ + KARLVAKGF Q   +DF ETF
Subjt:  PKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETF

Query:  SPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------
        SPV+K STVR+IL I V  NWE+RQ+DINNAFLNG L+E V+M QPEG+ID ++P++ICKL K++YGLKQAPR W                         
Subjt:  SPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------

Query:  -------------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEP
                                                               D SG+YL Q KYI  +L+K  MEN     TP+V G+   A +GE 
Subjt:  -------------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEP

Query:  M----------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
        M                            ++   +++PT  HWQ +KR+LRYL+GT  H LH  PS +L I  F DADWA   DD +
Subjt:  M----------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]3.0e-16735.43Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        +A ++++  TS+ +W+  + L G+  +SRI+ L      TRK G KM EYL+ MK+IAD+L+LAG +V    L+T  + GLD EY PI+ Q++ K  + W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYFR
         E    L+++EN L  +N      + +NL+ N ++      +++ + +S      RG   N+    GRGRGR        +  CQVC + GH+A  CY R
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYFR

Query:  FERS-IGANGSSSNT-----LNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLK
        F ++ IG N     +      N    AY+A+P  + D  WY DSGA++H T D N + +  E  G  +L VGNG+ LKI   G   L   D+    L+LK
Subjt:  FERS-IGANGSSSNT-----LNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLK

Query:  DVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSS----PISHTKSIFTAFNESKFTTI
        D+L+VPKITKNL+S+SKLT DN ++VEFH   CFVKDK T RILL+G++K+GLYQ+ P   + +N +   F          + H  S     NE     +
Subjt:  DVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSS----PISHTKSIFTAFNESKFTTI

Query:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
        C      +  AS  +N   FC+ACQ GK+H LPF+ S + + +PL+LV SD+WGP P++S SG+++Y+ F+DD++RFT IYPL+ K D  ++FI F+N V
Subjt:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP-----------------------------------
        EN F  +IKTLQ D GGEF+    VL   GI  R  CP+TS QNG  ERKHRH+VE GLTLL +                                    
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP-----------------------------------

Query:  ------------------PCLS-------NIDLSICTWLAPIGT---------------------------------------------TQFASPLPPVS
                          PCL            + C +L   G+                                             T    P+ P  
Subjt:  ------------------PCLS-------NIDLSICTWLAPIGT---------------------------------------------TQFASPLPPVS

Query:  STPAS--PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPTKSLANT-----------HPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALE
        S  A+    LH+N+       T+S      +   N+        + AN+           H M TR+K GI +PK          L    EP+   EALE
Subjt:  STPAS--PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPTKSLANT-----------HPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALE

Query:  SPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID
        +P WK  M+ E+ AL  N+TWTL+P   + +++  KWVF+ K   DG++ + KARLVAKGF Q   +D+ ETFSPVIK  TVR+IL+I V  NWE+RQ+D
Subjt:  SPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID

Query:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------
        INNAFLNG L+E V+M QPEGF+D SRP +ICKL K++YGLKQAPR W                                                    
Subjt:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------

Query:  ----------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPME-----------------ILPCIA--
                                    D SGLYL Q KY+L +L+K  +E++    TP+V G+ L+  + E M+                   P IA  
Subjt:  ----------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPME-----------------ILPCIA--

Query:  ---------SPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
                 +PT+ HWQ+VKRV RYLKGT  H LH  PS  L I  FSDADWA   +D +
Subjt:  ---------SPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]3.0e-16733.33Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        IA +V++  TS+ +W   + L G+  +SRI  L      T K   KM +YL+ MK +AD L LAG  +  S L+   + GLD+EY P++ +++ ++ ++W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFR-GRGRGRGGRPGNNSKPTCQVCGRYGHSALVC
         +F   L++FE+ L+ +N+  +     N SANFA+K +  GN F           +      N RG R GRGR R  +P    +P CQ+CG++GH+A  C
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFR-GRGRGRGGRPGNNSKPTCQVCGRYGHSALVC

Query:  YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL
        Y+RF++S       +    G+++A++A+P    D+ WY DSGA++H T  +  +    E  G   L+VGNG KLKI   G   L+        ++L++VL
Subjt:  YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL

Query:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTK------SIFTAFNESKFTTIC
        +VP+ITKNL+SVSKLT DN+  VEF  N C+VKDK T + LL+GRLK+GLYQ+       +N     +  S   I H K       +     +     I 
Subjt:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTK------SIFTAFNESKFTTIC

Query:  PSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVE
        PS   T            FC+ACQ GK H LPFK S + + +PL+L+ +D+WGP P+ S S +++Y+ F+DDF+RFT I+PL+ K + + +F  FKN VE
Subjt:  PSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVE

Query:  NYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL---------------------------------------
        N F  KIK ++ D GGE++P      + GI F+  CP+TS+QNG  ERKHRH+ E+GLTLL                                       
Subjt:  NYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL---------------------------------------

Query:  --------VKP------PCLSNID---------------------------------------------------------LSICTWLAPIGTTQFASPL
                +KP      PCL   +                                                         + + T   PIG   F + +
Subjt:  --------VKP------PCLSNID---------------------------------------------------------LSICTWLAPIGTTQFASPL

Query:  PPVSSTPASPNLHSNDPPPLIPFT----QSPTS-----TNSSPTSNSPPQ----------------------PPPTKSLANTHPMQTRAKNGIFQPKSFL
           ++  A+ N+     P L        QS  S     T+ +  SN   +                      PPP + + NTH M+TR+K G+++PK   
Subjt:  PPVSSTPASPNLHSNDPPPLIPFT----QSPTS-----TNSSPTSNSPPQ----------------------PPPTKSLANTHPMQTRAKNGIFQPKSFL

Query:  STNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIK
            +       EP+ V EAL  P W + M  EY AL  N+TWTL+P   + +V+ +KW+F+ K   DG++ + KARLVA+GF Q   +D+ ETFSPV+K
Subjt:  STNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIK

Query:  PSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------------------
         STVR+IL+I V  +WE+RQ+DINNAFLNG L+E V+M QPEG+ID ++P +IC+L K++YGLKQAPR W                              
Subjt:  PSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------------------

Query:  --------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----
                                                          D+SG+YLTQ KYI  +L+K  MEN     TP++ G+  T  +GEPM    
Subjt:  --------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----

Query:  ------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
                                ++   + SPT+ HWQ +KR+LRYL G+T  GLH  PS  L I  FSDADWA   DD +
Subjt:  ------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

RVX01903.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]2.8e-16535.95Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEMN
        I  +++   +S   W  LE ++ +++++R+ QL    QTTRK    M EY+  +K +ADNL+  GE V     I  ++ GL A+Y  I+  +  +  EM+
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEMN

Query:  WHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGR-PGNNSKPTCQVCGRYGHSALVCY
         H  H  L++ E  L++ NS+    +   +SAN A          PQ Q  N +++ G N  Q GF  R    GGR   +  +P CQ+CG++GH+ + CY
Subjt:  WHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGR-PGNNSKPTCQVCGRYGHSALVCY

Query:  FRFERSIGANGSSSNTL--NGANA-----AYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLL
         RF+ +      + +T+  N  NA     A +A+P  I+D AW+ D+GAT H +   + ++    Y GN+ ++VGNG  L+I H G      F S++   
Subjt:  FRFERSIGANGSSSNTL--NGANA-----AYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLL

Query:  HLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEP-LVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI
         L+ VLHVP I  NL+SVS+  ADN+ F EFH  + FVKD+ T++ILLQG L++GLY+     VPS +      + +SS+    T +             
Subjt:  HLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEP-LVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI

Query:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
             +TS N   + +++  C ACQ  KSHKLPF +S ++++ PL L+ +DLWGP  + S++G R++I FVDDF+R++ IYPL SK  AL  FI FK+ V
Subjt:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKPPCLSNIDLSICTWLAPIGTTQFA----------SP
        EN F  +I+ L+SD GGEF+ F   L   GI  +  CP+T EQNG  ERK RHI+E GL L      L+   L    WL    T  F             
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKPPCLSNIDLSICTWLAPIGTTQFA----------SP

Query:  LPPVSSTPASPNLHSNDPPPLIPFTQSPTSTNSSP-------TSNSPPQPPPTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALES
         PPVSS  +     ++ P    P T  P+ST S P          S  +P PT    N HPM TRAKNGI + K + S++ S       EP    +A++ 
Subjt:  LPPVSSTPASPNLHSNDPPPLIPFTQSPTSTNSSP-------TSNSPPQPPPTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALES

Query:  PHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDI
         +W   M  +++ALQ+N TW L+PP +  +++G KWV+++K  PDG+V +YKARLVA+GF Q   +D+ ETFSPV+K ST+R+IL + +S NW + Q+D+
Subjt:  PHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDI

Query:  NNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----DTSGL--------------------------------------------
         NAFL+G L+E V+M QP GFI+   P ++CKL K+LYGLKQAPR W    +TS L                                            
Subjt:  NNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----DTSGL--------------------------------------------

Query:  --------------------------------YLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM---------------------------
                                        +L+Q KY   +L +  M + KPA+TP + G+ L+  DGEP                            
Subjt:  --------------------------------YLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM---------------------------

Query:  -EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD
         +    +A+PT+ HW AVKR+LRYLKGT ++G+    S SL I  ++DADWA CPDD
Subjt:  -EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-16735.43Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        +A ++++  TS+ +W+  + L G+  +SRI+ L      TRK G KM EYL+ MK+IAD+L+LAG +V    L+T  + GLD EY PI+ Q++ K  + W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYFR
         E    L+++EN L  +N      + +NL+ N ++      +++ + +S      RG   N+    GRGRGR        +  CQVC + GH+A  CY R
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYFR

Query:  FERS-IGANGSSSNT-----LNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLK
        F ++ IG N     +      N    AY+A+P  + D  WY DSGA++H T D N + +  E  G  +L VGNG+ LKI   G   L   D+    L+LK
Subjt:  FERS-IGANGSSSNT-----LNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLK

Query:  DVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSS----PISHTKSIFTAFNESKFTTI
        D+L+VPKITKNL+S+SKLT DN ++VEFH   CFVKDK T RILL+G++K+GLYQ+ P   + +N +   F          + H  S     NE     +
Subjt:  DVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSS----PISHTKSIFTAFNESKFTTI

Query:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
        C      +  AS  +N   FC+ACQ GK+H LPF+ S + + +PL+LV SD+WGP P++S SG+++Y+ F+DD++RFT IYPL+ K D  ++FI F+N V
Subjt:  CPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP-----------------------------------
        EN F  +IKTLQ D GGEF+    VL   GI  R  CP+TS QNG  ERKHRH+VE GLTLL +                                    
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP-----------------------------------

Query:  ------------------PCLS-------NIDLSICTWLAPIGT---------------------------------------------TQFASPLPPVS
                          PCL            + C +L   G+                                             T    P+ P  
Subjt:  ------------------PCLS-------NIDLSICTWLAPIGT---------------------------------------------TQFASPLPPVS

Query:  STPAS--PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPTKSLANT-----------HPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALE
        S  A+    LH+N+       T+S      +   N+        + AN+           H M TR+K GI +PK          L    EP+   EALE
Subjt:  STPAS--PNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPTKSLANT-----------HPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALE

Query:  SPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID
        +P WK  M+ E+ AL  N+TWTL+P   + +++  KWVF+ K   DG++ + KARLVAKGF Q   +D+ ETFSPVIK  TVR+IL+I V  NWE+RQ+D
Subjt:  SPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID

Query:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------
        INNAFLNG L+E V+M QPEGF+D SRP +ICKL K++YGLKQAPR W                                                    
Subjt:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------

Query:  ----------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPME-----------------ILPCIA--
                                    D SGLYL Q KY+L +L+K  +E++    TP+V G+ L+  + E M+                   P IA  
Subjt:  ----------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPME-----------------ILPCIA--

Query:  ---------SPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
                 +PT+ HWQ+VKRV RYLKGT  H LH  PS  L I  FSDADWA   +D +
Subjt:  ---------SPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)1.5e-16733.33Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        IA +V++  TS+ +W   + L G+  +SRI  L      T K   KM +YL+ MK +AD L LAG  +  S L+   + GLD+EY P++ +++ ++ ++W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFR-GRGRGRGGRPGNNSKPTCQVCGRYGHSALVC
         +F   L++FE+ L+ +N+  +     N SANFA+K +  GN F           +      N RG R GRGR R  +P    +P CQ+CG++GH+A  C
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFR-GRGRGRGGRPGNNSKPTCQVCGRYGHSALVC

Query:  YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL
        Y+RF++S       +    G+++A++A+P    D+ WY DSGA++H T  +  +    E  G   L+VGNG KLKI   G   L+        ++L++VL
Subjt:  YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL

Query:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTK------SIFTAFNESKFTTIC
        +VP+ITKNL+SVSKLT DN+  VEF  N C+VKDK T + LL+GRLK+GLYQ+       +N     +  S   I H K       +     +     I 
Subjt:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTK------SIFTAFNESKFTTIC

Query:  PSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVE
        PS   T            FC+ACQ GK H LPFK S + + +PL+L+ +D+WGP P+ S S +++Y+ F+DDF+RFT I+PL+ K + + +F  FKN VE
Subjt:  PSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVE

Query:  NYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL---------------------------------------
        N F  KIK ++ D GGE++P      + GI F+  CP+TS+QNG  ERKHRH+ E+GLTLL                                       
Subjt:  NYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL---------------------------------------

Query:  --------VKP------PCLSNID---------------------------------------------------------LSICTWLAPIGTTQFASPL
                +KP      PCL   +                                                         + + T   PIG   F + +
Subjt:  --------VKP------PCLSNID---------------------------------------------------------LSICTWLAPIGTTQFASPL

Query:  PPVSSTPASPNLHSNDPPPLIPFT----QSPTS-----TNSSPTSNSPPQ----------------------PPPTKSLANTHPMQTRAKNGIFQPKSFL
           ++  A+ N+     P L        QS  S     T+ +  SN   +                      PPP + + NTH M+TR+K G+++PK   
Subjt:  PPVSSTPASPNLHSNDPPPLIPFT----QSPTS-----TNSSPTSNSPPQ----------------------PPPTKSLANTHPMQTRAKNGIFQPKSFL

Query:  STNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIK
            +       EP+ V EAL  P W + M  EY AL  N+TWTL+P   + +V+ +KW+F+ K   DG++ + KARLVA+GF Q   +D+ ETFSPV+K
Subjt:  STNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIK

Query:  PSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------------------
         STVR+IL+I V  +WE+RQ+DINNAFLNG L+E V+M QPEG+ID ++P +IC+L K++YGLKQAPR W                              
Subjt:  PSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------------------

Query:  --------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----
                                                          D+SG+YLTQ KYI  +L+K  MEN     TP++ G+  T  +GEPM    
Subjt:  --------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----

Query:  ------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
                                ++   + SPT+ HWQ +KR+LRYL G+T  GLH  PS  L I  FSDADWA   DD +
Subjt:  ------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.6e-16635.05Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        IA ++++  TS+ +W   + L G+  KSRI+ L      TRK   KM EYL  MK ++D L LAG  +  S L+   + GLDAEY P++ +++ +  ++W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY
         +    L++FE+ L+  N+   SG   N SANFA K +  GN F+           +RG N+ +  FRG   GRG   G  S   CQVC   GH A+ C 
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQ--GNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCY

Query:  FRFERS-IGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL
        +RF+R   G N S+     G+++A+IA+P    D+ WY DSGA +H T   +      E+ G   L+VGNG KLKI   G   L       + L+L DVL
Subjt:  FRFERS-IGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVL

Query:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI-CPSTFV
        +VP+ITKNL+SVSKLTADN++ VEF +N C VKDK T + LL+GRLK+GLYQ+    P                      ++ +  ES    +  P+  V
Subjt:  HVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTI-CPSTFV

Query:  TS---KNASVKDNES---GFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
             K+ +VK + S    FC+ACQ GK H LPFK S +   +PL L+ SD+WGP P+ S SG+++Y+ F+DDF+RFT I+PL+ K D + +FI FKN  
Subjt:  TS---KNASVKDNES---GFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL--------------------------------------
        EN F  KIK +Q D GGE++    V    GI FR  CP+TS+QNG  ERKHRH+ E+GLTLL                                      
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL--------------------------------------

Query:  ---------VKP------PCL--------------------------------------------------------------------SNIDLSICTWL
                 +KP      PCL                                                                    S+I L  C+  
Subjt:  ---------VKP------PCL--------------------------------------------------------------------SNIDLSICTWL

Query:  APIGTTQFASPLPPVSSTPASPNLHS------NDPPPLIPFTQSPTSTNSSPTSN------------------SPPQPPPTKSLANTHPMQTRAKNGIFQ
        A   TTQ A  + P ++T +  N HS      N+    +  ++   +TN+S T +                     Q    +  +NTH M+TR+K+GI +
Subjt:  APIGTTQFASPLPPVSSTPASPNLHS------NDPPPLIPFTQSPTSTNSSPTSN------------------SPPQPPPTKSLANTHPMQTRAKNGIFQ

Query:  PKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETF
        PK       +   S   EPK VKEAL  P WK+ M  EY AL  N TWTL+P   + +++ +KW+F+ K   DGS+ + KARLVAKGF Q   +DF ETF
Subjt:  PKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETF

Query:  SPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------
        SPV+K STVR+IL I V  NWE+RQ+DINNAFLNG L+E V+M QPEG+ID ++P++ICKL K++YGLKQAPR W                         
Subjt:  SPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------

Query:  -------------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEP
                                                               D SG+YL Q KYI  +L+K  MEN     TP+V G+   A +GE 
Subjt:  -------------------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEP

Query:  M----------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE
        M                            ++   +++PT  HWQ +KR+LRYL+GT  H LH  PS +L I  F DADWA   DD +
Subjt:  M----------------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDDEE

A0A803PEH4 Uncharacterized protein9.2e-17038.11Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW
        IA EV+   ++  + + LE LYG+ +KS++      +QTTRK  + MSEYL   K  ++ L+LAG+  P + L+ +V+ GLDAEYL I+ QI  +S   W
Subjt:  IAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNW

Query:  HEFHPTLISFENTLNYMNSLQSSGDMTNLS---ANFAAKRQGNNFSKPQSQSPNAEQNRGHNF-NQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALV
         E    L+SF++ +  + +L  + +    S   AN AAK   N   +   QS NA  N G  F N RG   R RGRG  PG+ S+PTCQV G+YGH+A V
Subjt:  HEFHPTLISFENTLNYMNSLQSSGDMTNLS---ANFAAKRQGNNFSKPQSQSPNAEQNRGHNF-NQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALV

Query:  CYFRFERS-IGA------NGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNAS
        CY RF+ S +G+      N + +   N  ++A++ATPE++   AW+ADSGA++H T D  N+ QK +Y G E +VVGNGSKL+I H+G   L+    N  
Subjt:  CYFRFERS-IGA------NGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNAS

Query:  LLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTT
         L LKD+L VPKI KNL+SVSKL  DN+V +EF+SN C VKDK T+++LL G LK+ LYQ++      S+     + +S+   + T S+ +  N+S+  +
Subjt:  LLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTT

Query:  ICPSTF------------------VTSKNASVKDNE-SGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHI
        +  S                    + S N SV  N     CDACQ GK+H LPF+ S+T++   L+L+ +DLWGP P+ S+  + +YI FVDD++R+T +
Subjt:  ICPSTF------------------VTSKNASVKDNE-SGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHI

Query:  YPLRSKVDALKSFILFKNQVENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP---PCLSN-------
        YPL+ K DAL +FI FK  VEN F  KIK+L+SD GGE++PFV ++Q  GI F+HPCPHTS QNG  +RKHRH VEMGLTLL +    PCL +       
Subjt:  YPLRSKVDALKSFILFKNQVENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKP---PCLSN-------

Query:  ------IDLSIC------TWLAPIG---------------------TTQFASPLPPV--------------SSTPASPNLHSNDPPPLIPFTQSPTSTNS
              ++L           L+P G                        + +P P +              S +P+S    S D  P      SP +++S
Subjt:  ------IDLSIC------TWLAPIG---------------------TTQFASPLPPV--------------SSTPASPNLHSNDPPPLIPFTQSPTSTNS

Query:  SPTSNSP--------------------------------------------PQPPPTKSLA--NTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYV
        SP + SP                                             Q  PT+  A   +HPM TR K GIF+P++F+S   S   S   EP  V
Subjt:  SPTSNSP--------------------------------------------PQPPPTKSLA--NTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYV

Query:  KEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWE
         EA++ P W   M  E+ AL+   T  L+PPS+  ++VGNKWV+R+K N DG+V + KARLVAKGFHQRP I++ ETFSP+IK +TVR+IL IVVS  WE
Subjt:  KEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWE

Query:  LRQIDINNAFLNGTLQEDVYMVQPEGFID---------PSRPDYICKLQKSLYGLKQAPREW--------DTSGLYLTQQKYILSML-QKLGMENLKPAS
        +      + F     +++  ++    ++D              +I +L K        P ++        D SGLYLTQ +YI  +L +K+ M  +K   
Subjt:  LRQIDINNAFLNGTLQEDVYMVQPEGFID---------PSRPDYICKLQKSLYGLKQAPREW--------DTSGLYLTQQKYILSML-QKLGMENLKPAS

Query:  TPVVHGKYLTAFDGEPMEILPCIASPTSCHWQAVKRVLRYL--------KGTTTHGLHFAPSNSLSIVAFSDADWACCPDD
          +  GK L+  DG+ ME      +PT   +++    L+YL         GT T GLH + S+ LSI  FSDADWACCPDD
Subjt:  TPVVHGKYLTAFDGEPMEILPCIASPTSCHWQAVKRVLRYL--------KGTTTHGLHFAPSNSLSIVAFSDADWACCPDD

A0A803QCY3 Uncharacterized protein8.8e-18140.42Show/hide
Query:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMN
        +IA EV+   ++ G+W  LE+LYG+ +KS++      +QTT+K G+ M EYL   K  AD+L+LAGE  P + L T+V++ LD  YL ++ QI  +++ +
Subjt:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMN

Query:  WHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYF
        W E    L+SFE+ +  +                    +GNN             +RG +FN RG  GR RGRG    NNSKPTCQVCG+Y HSA+VCY 
Subjt:  WHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYF

Query:  RFERS-IGANGSSSN-----TLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHL
         F+ S +G++  SSN       N   +A+IATPE ++  AW+ADSGA+++ T D + + QK EYGG E + VGNG KL I H G   L  +      L L
Subjt:  RFERS-IGANGSSSN-----TLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHL

Query:  KDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTICPS
         ++L VP I KN +SVSKLT DN V +EFHSN CFVKD +T R+LLQG LK+GLYQ++                        KS +  F+ SKF      
Subjt:  KDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTICPS

Query:  TFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENY
          V+  N    D+   FCDACQ GKSH LPFK S++K+ K L+LV +DLWGP+P+TS+  +++Y+ FVDD TRFT IYPL++K +A  +F+ FK+  EN 
Subjt:  TFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENY

Query:  FECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKPPCLSNIDLSICTWLAPIGTTQF---ASPLPPVSSTPAS
        FE KIK L++D GGE++     +   GI F H CPHTS QNG  ERKHRHIVEMGLTLL +        + +  W     T  +     P P +      
Subjt:  FECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKPPCLSNIDLSICTWLAPIGTTQF---ASPLPPVSSTPAS

Query:  PNLHSNDP-------------PPLIPFTQ-------------------------SPTS--------------------------------TNSSPTSNSP
          LH   P             P L P+                           SPT                                   SS  S  P
Subjt:  PNLHSNDP-------------PPLIPFTQ-------------------------SPTS--------------------------------TNSSPTSNSP

Query:  PQPPPTKS---LANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNP
           P T S    AN       A+    Q ++F + NS        EP  V EAL    W   M +E  AL+ N+T+ L+PP+   +++GNKWVFR K N 
Subjt:  PQPPPTKS---LANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNP

Query:  DGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAP
        DG++ + KARLVAKGFHQR  IDF ET+SPVIK STVR+IL I VS  W++RQ+DINNAFLNGTL+EDV+M QPEGF +  +  ++CKL KSLYGLKQA 
Subjt:  DGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAP

Query:  REWDTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPMEILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWA
        R WD +GL+LTQ KYI  +L++    NLK   TP + GK ++  DGEP+  +          ++++   L+YL        H  P  + ++    + DWA
Subjt:  REWDTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPMEILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWA

Query:  CCPDDEE
        CCPDD +
Subjt:  CCPDDEE

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.8e-3328.81Show/hide
Query:  WKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINN
        W++ +  E  A + N TWT+       ++V ++WVF VK N  G+  +YKARLVA+GF Q+ +ID+ ETF+PV + S+ R IL++V+  N ++ Q+D+  
Subjt:  WKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINN

Query:  AFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------------------------------------
        AFLNGTL+E++YM  P+G       D +CKL K++YGLKQA R W                                                       
Subjt:  AFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW-------------------------------------------------------

Query:  ---------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPV---VHGKYLTAFD----------GEPMEILPCI--------
                                       +YL+Q  Y+  +L K  MEN    STP+   ++ + L + +          G  M I+ C         
Subjt:  ---------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPV---VHGKYLTAFD----------GEPMEILPCI--------

Query:  -------ASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSL--SIVAFSDADWA
               +   S  WQ +KRVLRYLKGT    L F  + +    I+ + D+DWA
Subjt:  -------ASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSL--SIVAFSDADWA

P04146 Copia protein2.8e-1427.25Show/hide
Query:  GNNS-KPTCQVCGRYGHSALVCYFRFERSIG----ANGSSSNTLNGANAAY----IATPEMINDHAWYADSGATSHTTPDANNMNQKME-YGGNEYLVVG
        GN+  K  C  CGR GH    C F ++R +      N     T      A+    +    ++++  +  DSGA+ H   D +     +E     +  V  
Subjt:  GNNS-KPTCQVCGRYGHSALVCYFRFERSIG----ANGSSSNTLNGANAAY----IATPEMINDHAWYADSGATSHTTPDANNMNQKME-YGGNEYLVVG

Query:  NGSKLKIEHVGKVDLHCFDSNASLLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFA
         G  +     G V L     N   + L+DVL   +   NLMSV +L  +  + +EF        DKS   I      KNGL  ++      +N+    F 
Subjt:  NGSKLKIEHVGKVDLHCFDSNASLLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFA

Query:  KSSSPISHTKSIFTAFNESKFTTICPSTFVTSKNASVKDNES---------GFCDACQLGKSHKLPFKLSHTKS--TKPLELVFSDLWGP-TPVTSSSGY
          S    H K+ F  ++E +F  I     +  K  ++  ++S           C+ C  GK  +LPFK    K+   +PL +V SD+ GP TPVT     
Subjt:  KSSSPISHTKSIFTAFNESKFTTICPSTFVTSKNASVKDNES---------GFCDACQLGKSHKLPFKLSHTKS--TKPLELVFSDLWGP-TPVTSSSGY

Query:  RFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENYFECKIKTLQSDWGGEF--RPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL
         +++ FVD FT +   Y ++ K D    F  F  + E +F  K+  L  D G E+            GI +    PHT + NG  ER  R I E   T++
Subjt:  RFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQVENYFECKIKTLQSDWGGEF--RPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-5423.88Show/hide
Query:  IAAEVINIPTSRGVWKVLEELYGS---TNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQI-NGKS
        +   +I+  T+RG+W  LE LY S   TNK  + +   AL  +   G+    +L+    +   L+  G  +        ++  L + Y  +   I +GK+
Subjt:  IAAEVINIPTSRGVWKVLEELYGS---TNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQI-NGKS

Query:  EMNWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALV
         +   +    L+                    L+     K +    +        + Q   +N+ + G RG+ + R      N    C  C + GH    
Subjt:  EMNWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALV

Query:  CYFRFERSIGANGSSSNTLNGANAAYIATPE-----MINDH-----------AWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVD
        C           G +S   N  N A +          IN+             W  D+ A+ H TP   ++  +   G    + +GN S  KI  +G + 
Subjt:  CYFRFERSIGANGSSSNTLNGANAAYIATPE-----MINDH-----------AWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVD

Query:  LHCFDSNAS-LLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIF
          C  +N    L LKDV HVP +  NL  +S +  D   +  + +N  +   K +  ++ +G  +  LY+    +  Q  L     A+    +       
Subjt:  LHCFDSNAS-LLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIF

Query:  TAFNESKFTTICPSTFVT-SKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVD
           +E     +   + ++ +K  +VK      CD C  GK H++ F+ S  +    L+LV+SD+ GP  + S  G +++++F+DD +R   +Y L++K  
Subjt:  TAFNESKFTTICPSTFVT-SKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVD

Query:  ALKSFILFKNQVENYFECKIKTLQSDWGGEF--RPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL-------------VKPPC------
          + F  F   VE     K+K L+SD GGE+  R F     + GI      P T + NG  ER +R IVE   ++L             V+  C      
Subjt:  ALKSFILFKNQVENYFECKIKTLQSDWGGEF--RPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL-------------VKPPC------

Query:  --------------------LSNIDLSICTWLAPI--------------------GTTQFASPL-PPV--------------SSTPASPNLHSNDPPPLI
                             S++ +  C   A +                    G  +F   L  PV              S    + ++       +I
Subjt:  --------------------LSNIDLSICTWLAPI--------------------GTTQFASPL-PPV--------------SSTPASPNLHSNDPPPLI

Query:  PFTQSPTSTNSSPT---------SNSPPQP------------------PPTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPH
        P   +  ST+++PT         S    QP                   PT+      P++ R++    + + + ST     +S   EP+ +KE L  P 
Subjt:  PFTQSPTSTNSSPT---------SNSPPQP------------------PPTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPH

Query:  WKDV---MLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID
           +   M +E  +LQKN T+ L+        +  KWVF++K + D  + +YKARLV KGF Q+  IDF E FSPV+K +++R IL++  S + E+ Q+D
Subjt:  WKDV---MLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQID

Query:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------
        +  AFL+G L+E++YM QPEGF    +   +CKL KSLYGLKQAPR+W                                                    
Subjt:  INNAFLNGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW----------------------------------------------------

Query:  -------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVV-HGK--------------------YLTAFDGEPMEI
                                        +  L+L+Q+KYI  +L++  M+N KP STP+  H K                    Y +A  G  M  
Subjt:  -------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVV-HGK--------------------YLTAFDGEPMEI

Query:  LPC---------------IASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWA
        + C               + +P   HW+AVK +LRYL+GTT   L F  S+ + +  ++DAD A
Subjt:  LPC---------------IASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWA

P92520 Uncharacterized mitochondrial protein AtMg008202.5e-2346.09Show/hide
Query:  MQTRAKNGI--FQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKG
        M TR+K GI    PK  L+  ++       EPK V  AL+ P W   M +E  AL +N+TW L+PP    +++G KWVF+ K++ DG++ + KARLVAKG
Subjt:  MQTRAKNGI--FQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKG

Query:  FHQRPEIDFVETFSPVIKPSTVRVILNI
        FHQ   I FVET+SPV++ +T+R ILN+
Subjt:  FHQRPEIDFVETFSPVIKPSTVRVILNI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-11929.63Show/hide
Query:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEM
        S+   V    T+  +W+ L ++Y + +   ++QL   L+   K    + +Y+  +    D L+L G+ +     +  V+  L  EY P+I QI  K +  
Subjt:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEM

Query:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAE-QNRGHNFNQRGFRGRGRGRGGRPGNN-SKP---TCQVCGRYGHS
           E H  L++ E+ +  +    SS  +  ++AN  + R     +   + + N    NR +N N + +  +       P NN SKP    CQ+CG  GHS
Subjt:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAE-QNRGHNFNQRGFRGRGRGRGGRPGNN-SKP---TCQVCGRYGHS

Query:  ALVC--YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLL
        A  C     F  S+ +    S        A +A     + + W  DSGAT H T D NN++    Y G + ++V +GS + I H G   L    + +  L
Subjt:  ALVC--YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLL

Query:  HLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTIC
        +L ++L+VP I KNL+SV +L   N V VEF      VKD +T   LLQG+ K+ LY+     P  S+     FA  SS  +H+         ++     
Subjt:  HLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTIC

Query:  PS---TFVTSKNASVKDNESGF--CDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILF
        PS   + +++ + SV +    F  C  C + KS+K+PF  S   ST+PLE ++SD+W  +P+ S   YR+Y+ FVD FTR+T +YPL+ K    ++FI F
Subjt:  PS---TFVTSKNASVKDNESGF--CDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILF

Query:  KNQVENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL-----------------------VKPPCLS----
        KN +EN F+ +I T  SD GGEF          GI      PHT E NG  ERKHRHIVE GLTLL                       +  P L     
Subjt:  KNQVENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLL-----------------------VKPPCLS----

Query:  ---------NIDL--------------------------------------------------------------------------------SICTWLA
                 N D                                                                                 S C W +
Subjt:  ---------NIDL--------------------------------------------------------------------------------SICTWLA

Query:  PIGTTQFASPLPPVSS---------TPASP------------NLHS-----------------NDPPPLIPFTQSPTSTNSS-------PTSNSPPQ---
        P  T    +P+ P  S          P+SP            NL S                 N P P    TQ+ T T+SS       PT+ SP Q   
Subjt:  PIGTTQFASPLPPVSS---------TPASP------------NLHS-----------------NDPPPLIPFTQSPTSTNSS-------PTSNSPPQ---

Query:  -----------------------------------PPPTKSLA--------NTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDV
                                           PPP   +         NTH M TRAK GI +P    S   + +L+   EP+   +AL+   W++ 
Subjt:  -----------------------------------PPPTKSLA--------NTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDV

Query:  MLDEYTALQKNRTWTLI-PPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFL
        M  E  A   N TW L+ PP + V++VG +W+F  K N DGS+ +YKARLVAKG++QRP +D+ ETFSPVIK +++R++L + V  +W +RQ+D+NNAFL
Subjt:  MLDEYTALQKNRTWTLI-PPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFL

Query:  NGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------DTS-------------------------------------
         GTL +DVYM QP GFID  RP+Y+CKL+K+LYGLKQAPR W                  DTS                                     
Subjt:  NGTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------DTS-------------------------------------

Query:  -------------------------GLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDG----EPMEILPCIAS--------------------
                                 GL+L+Q++YIL +L +  M   KP +TP+     L+ + G    +P E    + S                    
Subjt:  -------------------------GLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDG----EPMEILPCIAS--------------------

Query:  ----PTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD
            PT  H QA+KR+LRYL GT  HG+     N+LS+ A+SDADWA   DD
Subjt:  ----PTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-11729.1Show/hide
Query:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEM
        S+   V    T+  +W+ L ++Y + +   ++QL    +                    D L+L G+ +     +  V+  L  +Y P+I QI  K +  
Subjt:  SIAAEVINIPTSRGVWKVLEELYGSTNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGK-SEM

Query:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNS-KP---TCQVCGRYGHSA
        +  E H  LI+ E+ L  +N    S ++  ++AN    R  N      ++  N   N   N N R    +    G R  N   KP    CQ+C   GHSA
Subjt:  NWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFAAKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNS-KP---TCQVCGRYGHSA

Query:  LVC--YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLH
          C    +F+ +     S+S        A +A     N + W  DSGAT H T D NN++    Y G + +++ +GS + I H G   L    +++  L 
Subjt:  LVC--YFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHTTPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLH

Query:  LKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTICP
        L  VL+VP I KNL+SV +L   N V VEF      VKD +T   LLQG+ K+ LY+  P+  SQ+      FA   S  +H+ S  +         +  
Subjt:  LKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLVPSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTICP

Query:  STFVTSKNASVKDNESGF--CDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV
        ++ +++ +  V +       C  C + KSHK+PF  S   S+KPLE ++SD+W  +P+ S   YR+Y+ FVD FTR+T +YPL+ K     +FI+FK+ V
Subjt:  STFVTSKNASVKDNESGF--CDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDFTRFTHIYPLRSKVDALKSFILFKNQV

Query:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLV-------------------------------------
        EN F+ +I TL SD GGEF      L   GI      PHT E NG  ERKHRHIVEMGLTLL                                      
Subjt:  ENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLV-------------------------------------

Query:  --KPPCLSNIDLSICT---WLA-------------------------------------------------PIGTTQF----------------------
          +PP    + +  C    WL                                                  P  TT F                      
Subjt:  --KPPCLSNIDLSICT---WLA-------------------------------------------------PIGTTQF----------------------

Query:  ------------------ASPLPPVSSTP------ASPNL-------------------------------HSNDPPPLI-----------------PFT
                           SP PP S +P      +S NL                               +SN   P++                 P  
Subjt:  ------------------ASPLPPVSSTP------ASPNL-------------------------------HSNDPPPLI-----------------PFT

Query:  QSPTSTNSSPT---------------SNSPPQPP----------PTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVM
        QSP S+   PT               +++PP PP            ++  NTH M TRAK+GI +P    S   +++L+   EP+   +A++   W+  M
Subjt:  QSPTSTNSSPT---------------SNSPPQPP----------PTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVM

Query:  LDEYTALQKNRTWTLI-PPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFLN
          E  A   N TW L+ PP   V++VG +W+F  K N DGS+ +YKARLVAKG++QRP +D+ ETFSPVIK +++R++L + V  +W +RQ+D+NNAFL 
Subjt:  LDEYTALQKNRTWTLI-PPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFLN

Query:  GTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------DTS--------------------------------------
        GTL ++VYM QP GF+D  RPDY+C+L+K++YGLKQAPR W                  DTS                                      
Subjt:  GTLQEDVYMVQPEGFIDPSRPDYICKLQKSLYGLKQAPREW------------------DTS--------------------------------------

Query:  ------------------------GLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDG----EPMEILPCIAS---------------------
                                GL+L+Q++Y L +L +  M   KP +TP+     LT   G    +P E    + S                     
Subjt:  ------------------------GLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDG----EPMEILPCIAS---------------------

Query:  ---PTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD
           PT  HW A+KRVLRYL GT  HG+     N+LS+ A+SDADWA   DD
Subjt:  ---PTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPDD

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein6.4e-0625.17Show/hide
Query:  KYRNEVECYYCHKKGHLKYQCRKFKEDQKRKSEANIVEEVVLACVESDTKYSNHLSDWILDSAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKTRGIG
        K ++E  C  C+K  H +  C+      K + E  IV +  L  V +    +     WI+   A +++      FT+        V   +G      G G
Subjt:  KYRNEVECYYCHKKGHLKYQCRKFKEDQKRKSEANIVEEVVLACVESDTKYSNHLSDWILDSAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKTRGIG

Query:  DVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFG
        DV ++ + G K  +R+V +VP +  N++S GK+    Y    G
Subjt:  DVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFG

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.0e-4029.22Show/hide
Query:  EPKYVKEALESPHWKDVMLDEYTALQKNRTWTL--IPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNI
        EP    EA E   W   M DE  A++   TW +  +PP+ K   +G KWV+++K N DG++ +YKARLVAKG+ Q+  IDF+ETFSPV K ++V++IL I
Subjt:  EPKYVKEALESPHWKDVMLDEYTALQKNRTWTL--IPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNI

Query:  VVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFI----DPSRPDYICKLQKSLYGLKQAPREW------------------------------------
            N+ L Q+DI+NAFLNG L E++YM  P G+     D   P+ +C L+KS+YGLKQA R+W                                    
Subjt:  VVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGFI----DPSRPDYICKLQKSLYGLKQAPREW------------------------------------

Query:  --------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----------
                                                      +G+ + Q+KY L +L + G+   KP+S P+      +A  G             
Subjt:  --------------------------------------------DTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPM----------

Query:  ------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPD
                          ++     +P   H QAV ++L Y+KGT   GL ++    + +  FSDA +  C D
Subjt:  ------------------EILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPD

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.9e-0651.22Show/hide
Query:  QAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPD
        QAV +VL Y+KGT   GL ++ ++ L + AF+D+DWA CPD
Subjt:  QAVKRVLRYLKGTTTHGLHFAPSNSLSIVAFSDADWACCPD

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-1032.26Show/hide
Query:  SGLYLTQQKYILSMLQKLGMENLKPASTPV---VHGKYLTAFDGEPMEILPCIAS------------------------PTSCHWQAVKRVLRYLKGTTT
        SGL+L+Q KY   +L   GM + KP STP+   ++    TA   +P +    + +                        PT   +  +KRVLRY+KGT  
Subjt:  SGLYLTQQKYILSMLQKLGMENLKPASTPV---VHGKYLTAFDGEPMEILPCIAS------------------------PTSCHWQAVKRVLRYLKGTTT

Query:  HGLHFAPSNSLSIVAFSDADWACC
        HGL+   ++ L++ AF D+DWA C
Subjt:  HGLHFAPSNSLSIVAFSDADWACC

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.8e-2446.09Show/hide
Query:  MQTRAKNGI--FQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKG
        M TR+K GI    PK  L+  ++       EPK V  AL+ P W   M +E  AL +N+TW L+PP    +++G KWVF+ K++ DG++ + KARLVAKG
Subjt:  MQTRAKNGI--FQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEYTALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKG

Query:  FHQRPEIDFVETFSPVIKPSTVRVILNI
        FHQ   I FVET+SPV++ +T+R ILN+
Subjt:  FHQRPEIDFVETFSPVIKPSTVRVILNI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTAAAGGAGAGACCGAAAGATATGAAGGACAAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TCAAAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGGCAGAGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAGGGTAAAGATAAAGTTGA
TGAAGATAATGAACCGAGTAGCAGTAAGAAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTGAAGTATCAGTGTAGGAAATTTA
AAGAGGATCAGAAAAGAAAATCAGAGGCAAATATAGTGGAAGAGGTTGTCCTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTTATCAGATTGGATATTAGAC
AGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAATTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTACGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGCAG
ATGATGGTTACATGTGTGAGTTTGGCAGTCACCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAAT
GTTGCCAAAGGTTCAGAGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTGCAGTTGAGCCAGCAGCAAGGATAGCCAAGTTCGATCAGTTCGATCA
AGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAG
TTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAATAGAGGATTCAAGCCATTCTCAGATTGTATCTTC
TTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTGTCAAGAAGATCGAGATAACAGTTGTAATGGGAGTAGATCCTTGGAGTTTGCC
AAGATGTTTAGAGTTTTGTTGTAGTGGGAGACAAGATCGTTGTTTTGTCTCCAACAGAGTGAAAAAAAGAGAAAACGACAGTGGCCTTTTTCGATTGAAGGAATTTCATG
CGGCAGTGAGTGAAAGAGAGAAGGGATTTTTCTCTTCCATTGCTGCTGAAGTAATTAATATTCCAACCTCAAGAGGGGTTTGGAAAGTTCTTGAAGAGCTTTATGGGTCT
ACAAATAAATCAAGGATTAGCCAACTAAACAGAGCCTTACAAACCACCAGAAAGAATGGCTCCAAAATGTCTGAATATTTAAGCACCATGAAGCAAATAGCCGACAACCT
ATCTTTGGCCGGAGAAACAGTTCCGGTAAGCTCTCTTATAACTAGTGTTATCACTGGTTTAGATGCTGAATATTTACCCATAATATGTCAAATCAATGGCAAAAGTGAAA
TGAATTGGCATGAATTCCATCCTACCTTGATAAGCTTTGAAAACACGTTGAACTATATGAACTCTCTCCAAAGTTCTGGTGATATGACCAATCTTTCTGCTAATTTTGCT
GCCAAAAGACAAGGAAACAACTTCTCAAAACCTCAAAGTCAATCCCCTAATGCTGAACAAAATAGGGGACACAACTTTAATCAAAGAGGATTTAGAGGTAGAGGACGAGG
TCGTGGTGGTCGACCTGGAAATAACTCAAAGCCTACTTGTCAGGTATGTGGTCGATATGGCCATTCTGCTCTTGTTTGTTACTTTAGATTTGAGAGGTCAATTGGTGCAA
ATGGAAGCTCTTCAAATACCCTTAATGGAGCCAATGCTGCTTATATTGCTACACCTGAAATGATTAATGATCATGCTTGGTATGCGGATAGTGGAGCTACATCACATACT
ACACCAGATGCAAACAACATGAATCAGAAAATGGAATATGGAGGAAATGAATACTTGGTTGTAGGTAATGGTTCTAAACTTAAGATAGAGCATGTTGGTAAAGTAGATCT
TCATTGTTTTGACTCTAATGCTTCTTTGTTACACCTCAAAGATGTTCTACATGTACCAAAAATAACAAAAAATCTTATGAGTGTATCCAAACTTACTGCTGATAATCACG
TATTTGTTGAATTTCATTCAAATGTTTGCTTTGTGAAGGACAAGTCAACCGAGAGAATACTGCTGCAAGGGAGACTTAAAAATGGATTGTACCAAATGGAGCCTCTTGTG
CCCTCCCAATCAAATCTCAAAGGGTTCAAATTTGCTAAATCCAGCTCTCCAATTAGTCACACAAAGTCTATCTTTACTGCTTTCAATGAGTCCAAATTTACTACTATTTG
TCCTTCTACTTTTGTTACTTCCAAGAATGCGTCTGTCAAAGATAATGAAAGTGGTTTCTGTGATGCCTGTCAATTGGGAAAATCTCACAAATTACCTTTTAAGTTGTCTC
ACACAAAATCAACAAAACCCCTTGAACTTGTATTCTCTGATTTATGGGGACCGACACCAGTAACATCTTCCTCTGGATATAGATTTTACATTAGTTTTGTGGATGATTTC
ACCCGATTTACTCACATCTACCCTTTAAGATCCAAAGTTGATGCCTTAAAGTCTTTTATCTTATTCAAAAATCAAGTCGAAAACTACTTTGAATGCAAAATTAAAACCCT
TCAATCTGACTGGGGTGGTGAATTTCGACCATTCGTTCCAGTTCTTCAAAATTTTGGTATAATTTTTAGACACCCTTGTCCTCATACAAGTGAGCAAAATGGAACAGTTG
AACGAAAACATCGACATATTGTCGAAATGGGTCTTACTTTACTTGTCAAGCCTCCATGCCTCTCAAATATTGATTTGTCCATCTGCACATGGCTCGCTCCAATTGGTACA
ACTCAGTTTGCAAGTCCCTTGCCGCCAGTTTCTTCCACTCCAGCAAGTCCAAACCTTCATTCTAATGATCCACCACCATTAATACCTTTTACTCAATCGCCTACATCCAC
AAATTCCTCCCCCACATCCAACTCCCCACCTCAACCTCCACCCACGAAATCATTGGCTAATACTCACCCAATGCAAACTCGAGCGAAGAATGGAATTTTTCAGCCCAAAT
CTTTTCTCTCCACCAACTCTTCCTCCGCCTTAAGCCCTTACCATGAACCCAAATATGTTAAAGAAGCATTAGAGTCCCCACATTGGAAAGATGTTATGCTTGACGAATAT
ACTGCTCTACAAAAGAACAGAACTTGGACTCTCATTCCTCCCTCCAATAAAGTCAGTGTGGTTGGAAATAAGTGGGTTTTTCGTGTCAAGATGAATCCAGATGGGTCGGT
TGCCAAATATAAGGCTAGATTGGTAGCTAAAGGGTTTCACCAAAGACCAGAGATTGATTTTGTGGAAACTTTCAGCCCTGTGATCAAACCCTCTACAGTTCGTGTTATCT
TAAACATTGTTGTCTCCAATAATTGGGAGCTTCGACAAATTGATATAAATAATGCCTTTCTTAATGGCACTTTGCAGGAAGATGTTTACATGGTTCAACCAGAAGGTTTT
ATTGATCCATCTCGACCTGATTATATTTGCAAGCTACAAAAATCATTGTACGGTCTCAAGCAAGCACCTCGAGAGTGGGACACAAGTGGACTTTATTTAACCCAACAAAA
ATACATTCTCAGTATGCTACAAAAGTTGGGAATGGAGAATCTTAAACCAGCATCCACTCCTGTAGTTCATGGCAAGTATCTTACAGCATTTGATGGTGAGCCAATGGAAA
TCCTACCTTGTATCGCATCACCTACCTCCTGTCATTGGCAAGCCGTTAAACGTGTACTACGTTACCTCAAAGGTACAACAACTCATGGTTTACATTTTGCTCCATCTAAT
TCTCTTTCTATTGTAGCATTTTCGGATGCCGATTGGGCATGTTGCCCAGATGATGAAGAAAAGGGAAGAACCGTTACAAACCTCCGTTGTAGACTTGACCGTTCTTCGCC
AACGACGCTTCACGAACGAAAACGCAGCAGCCGCTTTAAGGTTTTCTTCGCTGGAAATTCTTCTTTGATTGACACAATCACCACAGATGTTACACCGGTATGTCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTTGTAGAGCCAAAAAGTTTCGATGGAGTCATGAAGTTCGATGGGAAAAATTTTGGATATTGGAAGATGCAAGTCAAGGATTATTTAACTTGCAAGAAAGTGCA
TAAGGCATTAAAGGAGAGACCGAAAGATATGAAGGACAAAGATTGGGAAGCTCTAGATGAAGAGGCAGTTGCAACCATTAGGATGTGTTTGTCGATGGATGTGGCAAGTC
TTGTAGCCCATGAGACAACTGCAGTCAAGTTGATGGAATCGCTTACAAACAGTTGGGAAACGATGAAGACAGCAGTGTCTAATTCGACTGGAAATAACACTTTAAAATTT
TCAAAAGTTTGTGATTTAGCCATAGCTGAGGAAATTCGTAGGCAGAGTAGTAATAAAGAGTCTACTGTAGGGTCAGCTTTGGTTATGACTAAGGGTAAAGATAAAGTTGA
TGAAGATAATGAACCGAGTAGCAGTAAGAAAAAGTGGAAATATAGGAATGAGGTAGAATGTTATTACTGCCATAAGAAAGGTCACTTGAAGTATCAGTGTAGGAAATTTA
AAGAGGATCAGAAAAGAAAATCAGAGGCAAATATAGTGGAAGAGGTTGTCCTAGCTTGTGTTGAGAGTGACACAAAGTATAGTAACCACTTATCAGATTGGATATTAGAC
AGTGCAGCTTCTGTTCACATAGCTTCAGATAGGAATTTGTTCACATCATTCACAGGAGGGCATCATGGCCTAGTGAGGATGGGGAATGGTAGAACCTCCAAGACTAGAGG
GATTGGAGATGTTAGTCTGAAGACAGAATGTGGAGGTAAATTGGTACTACGAGATGTCAGGTACGTGCCTAATATCAAGATGAATCTTATTTCTATTGGTAAATTGGCAG
ATGATGGTTACATGTGTGAGTTTGGCAGTCACCAGTGTAAACTCAAGTTCGGATCCCAGGTAGTGGCAGTTGGTCACAGGAAATCTACACTGTACAGATGTCAGTTGAAT
GTTGCCAAAGGTTCAGAGAGACAGTGGATGCCGGTTAAAGCTGCAGATGGTAGTTGTAGAGGTGCAGTTGAGCCAGCAGCAAGGATAGCCAAGTTCGATCAGTTCGATCA
AGATCCTTCAGTTCAGAAACAATTGGGAAGTCTAGGAGAGAAAGTTGATGGCTATCGTGAATCCCCAGTTGTCAGACGGTCGAATGAATTGAAGAAGTCGCTTAGGCGAG
TTGAGGCATCAAAGTGGAAGGCCAGAGCAGTTGCTAAGGTCAAAGGTCAGGTCTCTAGCTTGGTAACAGGTTTGAATAGAGGATTCAAGCCATTCTCAGATTGTATCTTC
TTCAGGAACAGTTGTTCGGGTTGGAAGAAGATGACAGGTATTTTGAGCCTTGGAGTTGTCAAGAAGATCGAGATAACAGTTGTAATGGGAGTAGATCCTTGGAGTTTGCC
AAGATGTTTAGAGTTTTGTTGTAGTGGGAGACAAGATCGTTGTTTTGTCTCCAACAGAGTGAAAAAAAGAGAAAACGACAGTGGCCTTTTTCGATTGAAGGAATTTCATG
CGGCAGTGAGTGAAAGAGAGAAGGGATTTTTCTCTTCCATTGCTGCTGAAGTAATTAATATTCCAACCTCAAGAGGGGTTTGGAAAGTTCTTGAAGAGCTTTATGGGTCT
ACAAATAAATCAAGGATTAGCCAACTAAACAGAGCCTTACAAACCACCAGAAAGAATGGCTCCAAAATGTCTGAATATTTAAGCACCATGAAGCAAATAGCCGACAACCT
ATCTTTGGCCGGAGAAACAGTTCCGGTAAGCTCTCTTATAACTAGTGTTATCACTGGTTTAGATGCTGAATATTTACCCATAATATGTCAAATCAATGGCAAAAGTGAAA
TGAATTGGCATGAATTCCATCCTACCTTGATAAGCTTTGAAAACACGTTGAACTATATGAACTCTCTCCAAAGTTCTGGTGATATGACCAATCTTTCTGCTAATTTTGCT
GCCAAAAGACAAGGAAACAACTTCTCAAAACCTCAAAGTCAATCCCCTAATGCTGAACAAAATAGGGGACACAACTTTAATCAAAGAGGATTTAGAGGTAGAGGACGAGG
TCGTGGTGGTCGACCTGGAAATAACTCAAAGCCTACTTGTCAGGTATGTGGTCGATATGGCCATTCTGCTCTTGTTTGTTACTTTAGATTTGAGAGGTCAATTGGTGCAA
ATGGAAGCTCTTCAAATACCCTTAATGGAGCCAATGCTGCTTATATTGCTACACCTGAAATGATTAATGATCATGCTTGGTATGCGGATAGTGGAGCTACATCACATACT
ACACCAGATGCAAACAACATGAATCAGAAAATGGAATATGGAGGAAATGAATACTTGGTTGTAGGTAATGGTTCTAAACTTAAGATAGAGCATGTTGGTAAAGTAGATCT
TCATTGTTTTGACTCTAATGCTTCTTTGTTACACCTCAAAGATGTTCTACATGTACCAAAAATAACAAAAAATCTTATGAGTGTATCCAAACTTACTGCTGATAATCACG
TATTTGTTGAATTTCATTCAAATGTTTGCTTTGTGAAGGACAAGTCAACCGAGAGAATACTGCTGCAAGGGAGACTTAAAAATGGATTGTACCAAATGGAGCCTCTTGTG
CCCTCCCAATCAAATCTCAAAGGGTTCAAATTTGCTAAATCCAGCTCTCCAATTAGTCACACAAAGTCTATCTTTACTGCTTTCAATGAGTCCAAATTTACTACTATTTG
TCCTTCTACTTTTGTTACTTCCAAGAATGCGTCTGTCAAAGATAATGAAAGTGGTTTCTGTGATGCCTGTCAATTGGGAAAATCTCACAAATTACCTTTTAAGTTGTCTC
ACACAAAATCAACAAAACCCCTTGAACTTGTATTCTCTGATTTATGGGGACCGACACCAGTAACATCTTCCTCTGGATATAGATTTTACATTAGTTTTGTGGATGATTTC
ACCCGATTTACTCACATCTACCCTTTAAGATCCAAAGTTGATGCCTTAAAGTCTTTTATCTTATTCAAAAATCAAGTCGAAAACTACTTTGAATGCAAAATTAAAACCCT
TCAATCTGACTGGGGTGGTGAATTTCGACCATTCGTTCCAGTTCTTCAAAATTTTGGTATAATTTTTAGACACCCTTGTCCTCATACAAGTGAGCAAAATGGAACAGTTG
AACGAAAACATCGACATATTGTCGAAATGGGTCTTACTTTACTTGTCAAGCCTCCATGCCTCTCAAATATTGATTTGTCCATCTGCACATGGCTCGCTCCAATTGGTACA
ACTCAGTTTGCAAGTCCCTTGCCGCCAGTTTCTTCCACTCCAGCAAGTCCAAACCTTCATTCTAATGATCCACCACCATTAATACCTTTTACTCAATCGCCTACATCCAC
AAATTCCTCCCCCACATCCAACTCCCCACCTCAACCTCCACCCACGAAATCATTGGCTAATACTCACCCAATGCAAACTCGAGCGAAGAATGGAATTTTTCAGCCCAAAT
CTTTTCTCTCCACCAACTCTTCCTCCGCCTTAAGCCCTTACCATGAACCCAAATATGTTAAAGAAGCATTAGAGTCCCCACATTGGAAAGATGTTATGCTTGACGAATAT
ACTGCTCTACAAAAGAACAGAACTTGGACTCTCATTCCTCCCTCCAATAAAGTCAGTGTGGTTGGAAATAAGTGGGTTTTTCGTGTCAAGATGAATCCAGATGGGTCGGT
TGCCAAATATAAGGCTAGATTGGTAGCTAAAGGGTTTCACCAAAGACCAGAGATTGATTTTGTGGAAACTTTCAGCCCTGTGATCAAACCCTCTACAGTTCGTGTTATCT
TAAACATTGTTGTCTCCAATAATTGGGAGCTTCGACAAATTGATATAAATAATGCCTTTCTTAATGGCACTTTGCAGGAAGATGTTTACATGGTTCAACCAGAAGGTTTT
ATTGATCCATCTCGACCTGATTATATTTGCAAGCTACAAAAATCATTGTACGGTCTCAAGCAAGCACCTCGAGAGTGGGACACAAGTGGACTTTATTTAACCCAACAAAA
ATACATTCTCAGTATGCTACAAAAGTTGGGAATGGAGAATCTTAAACCAGCATCCACTCCTGTAGTTCATGGCAAGTATCTTACAGCATTTGATGGTGAGCCAATGGAAA
TCCTACCTTGTATCGCATCACCTACCTCCTGTCATTGGCAAGCCGTTAAACGTGTACTACGTTACCTCAAAGGTACAACAACTCATGGTTTACATTTTGCTCCATCTAAT
TCTCTTTCTATTGTAGCATTTTCGGATGCCGATTGGGCATGTTGCCCAGATGATGAAGAAAAGGGAAGAACCGTTACAAACCTCCGTTGTAGACTTGACCGTTCTTCGCC
AACGACGCTTCACGAACGAAAACGCAGCAGCCGCTTTAAGGTTTTCTTCGCTGGAAATTCTTCTTTGATTGACACAATCACCACAGATGTTACACCGGTATGTCTCTGA
Protein sequenceShow/hide protein sequence
MGFVEPKSFDGVMKFDGKNFGYWKMQVKDYLTCKKVHKALKERPKDMKDKDWEALDEEAVATIRMCLSMDVASLVAHETTAVKLMESLTNSWETMKTAVSNSTGNNTLKF
SKVCDLAIAEEIRRQSSNKESTVGSALVMTKGKDKVDEDNEPSSSKKKWKYRNEVECYYCHKKGHLKYQCRKFKEDQKRKSEANIVEEVVLACVESDTKYSNHLSDWILD
SAASVHIASDRNLFTSFTGGHHGLVRMGNGRTSKTRGIGDVSLKTECGGKLVLRDVRYVPNIKMNLISIGKLADDGYMCEFGSHQCKLKFGSQVVAVGHRKSTLYRCQLN
VAKGSERQWMPVKAADGSCRGAVEPAARIAKFDQFDQDPSVQKQLGSLGEKVDGYRESPVVRRSNELKKSLRRVEASKWKARAVAKVKGQVSSLVTGLNRGFKPFSDCIF
FRNSCSGWKKMTGILSLGVVKKIEITVVMGVDPWSLPRCLEFCCSGRQDRCFVSNRVKKRENDSGLFRLKEFHAAVSEREKGFFSSIAAEVINIPTSRGVWKVLEELYGS
TNKSRISQLNRALQTTRKNGSKMSEYLSTMKQIADNLSLAGETVPVSSLITSVITGLDAEYLPIICQINGKSEMNWHEFHPTLISFENTLNYMNSLQSSGDMTNLSANFA
AKRQGNNFSKPQSQSPNAEQNRGHNFNQRGFRGRGRGRGGRPGNNSKPTCQVCGRYGHSALVCYFRFERSIGANGSSSNTLNGANAAYIATPEMINDHAWYADSGATSHT
TPDANNMNQKMEYGGNEYLVVGNGSKLKIEHVGKVDLHCFDSNASLLHLKDVLHVPKITKNLMSVSKLTADNHVFVEFHSNVCFVKDKSTERILLQGRLKNGLYQMEPLV
PSQSNLKGFKFAKSSSPISHTKSIFTAFNESKFTTICPSTFVTSKNASVKDNESGFCDACQLGKSHKLPFKLSHTKSTKPLELVFSDLWGPTPVTSSSGYRFYISFVDDF
TRFTHIYPLRSKVDALKSFILFKNQVENYFECKIKTLQSDWGGEFRPFVPVLQNFGIIFRHPCPHTSEQNGTVERKHRHIVEMGLTLLVKPPCLSNIDLSICTWLAPIGT
TQFASPLPPVSSTPASPNLHSNDPPPLIPFTQSPTSTNSSPTSNSPPQPPPTKSLANTHPMQTRAKNGIFQPKSFLSTNSSSALSPYHEPKYVKEALESPHWKDVMLDEY
TALQKNRTWTLIPPSNKVSVVGNKWVFRVKMNPDGSVAKYKARLVAKGFHQRPEIDFVETFSPVIKPSTVRVILNIVVSNNWELRQIDINNAFLNGTLQEDVYMVQPEGF
IDPSRPDYICKLQKSLYGLKQAPREWDTSGLYLTQQKYILSMLQKLGMENLKPASTPVVHGKYLTAFDGEPMEILPCIASPTSCHWQAVKRVLRYLKGTTTHGLHFAPSN
SLSIVAFSDADWACCPDDEEKGRTVTNLRCRLDRSSPTTLHERKRSSRFKVFFAGNSSLIDTITTDVTPVCL