CuGenDBv2

Lag0035094 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0035094
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:14774313..14778096
RNA-Seq Expression: Lag0035094
Synteny: Lag0035094
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
CAB4268750.1 unnamed protein product [Prunus armeniaca], e value 2.1e-101, identity 29.29%
Query:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLR----------------------------LSVRLRLDHFLAN
        RFTGFYG P  + R  SW+L+ RL   N   W+  GD NEIL  DEK G       +++                              +R+RLD  LA 
Subjt:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLR----------------------------LSVRLRLDHFLAN

Query:  ANFCALFE------------------------------------------DHKDCASA---------------------------LGGWGFRQNKRLMND
         ++C  F                                            H++C                              L GW       L + 
Subjt:  ANFCALFE------------------------------------------DHKDCASA---------------------------LGGWGFRQNKRLMND

Query:  IRALKDKIKQAYDSAM-PIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNGV-----------------
        I++++ K+ +  ++ + P       +L  +LDSL+   E+YW+QRSR  WLK GDRNT++FH KASSR ++N + G+ED NG+                 
Subjt:  IRALKDKIKQAYDSAM-PIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNGV-----------------

Query:  ------------------PTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPK
                            RV+ EMN+ LLA +T EE+  A+   HPSKAPG DGF  LFYQQYW  VG   VS+ L  L+ G  ++  N+T++ LIPK
Subjt:  ------------------PTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPK

Query:  VHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMS
        V  P+ ++  RPISL NV+YKI  KV+AN+LK++L  +I + QSAF+PGR ISDN I+  E LHF+HK+   + GY+ALKLDMSKAYDRVEW++L  +M 
Subjt:  VHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMS

Query:  KMEFHVNWIKLIMNCITIATFSICLN-------------------------------------------------------AAHEF--------------
         M F   W++LIM C+T  ++S  LN                                                        +H F              
Subjt:  KMEFHVNWIKLIMNCITIATFSICLN-------------------------------------------------------AAHEF--------------

Query:  --GVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLD------------------------
             K IL+ YE+ SGQ +N  KS V FSRNV  + ++     L +        YLGLP+   R + + F +L D                        
Subjt:  --GVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLD------------------------

Query:  --------------RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL------------------------------------
                       IPK +  +I  + A++WWG+   +RK+ W  W++LC+PK  GGL FR+L                                    
Subjt:  --------------RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL------------------------------------

Query:  ----------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINS-SMQWDVVKLSQVLLREDIEVIKSLPI
                   +VW        +++ G R  +G+G+++ +++  WLP P +FKV+S+P     +  VS  I+  ++QW    L      E+  +I  +P+
Subjt:  ----------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINS-SMQWDVVKLSQVLLREDIEVIKSLPI

Query:  S-NSAPDKWIWHYDR
        S    PD  IWH++R
Subjt:  S-NSAPDKWIWHYDR

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis], e value 1.7e-103, identity 32.12%
Query:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG---VLIVKVDKLRL
        L GGL LLW  D+                        R T  YG+P+   + H+W+L+RRL   +   W+  GD NEI   +EK G       +V + R 
Subjt:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG---VLIVKVDKLRL

Query:  SVR--LRLDHFLANANFC-------ALFEDHKDCASALGG---WGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFG-AIHQLERQLDSLLLDEEMYWKQR
        +VR    LD  L    F        A    +K    +LG    W  ++       +  L++K+K    S    D G  + + E Q+D++L DEE++WKQR
Subjt:  SVR--LRLDHFLANANFC-------ALFEDHKDCASALGG---WGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFG-AIHQLERQLDSLLLDEEMYWKQR

Query:  SRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG------------------------VPT-------------RVTSEMNQMLLAPYTREEVVAAV
        SR +WLK GD+NT++FH KAS+R KKN +GGI D  G                         PT             +V  EMN  L AP+  EE+V A+
Subjt:  SRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG------------------------VPT-------------RVTSEMNQMLLAPYTREEVVAAV

Query:  NGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQ
            P+KAPG DG PA F+Q++W +V +  +++ L ILND  ++   NHT + LIPK  +P+ VS+FRPISLCNV+Y+I+ K +AN LK +LD ++   Q
Subjt:  NGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQ

Query:  SAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNA----------------
        SAFI  R I+DN+I+G+E L+ + + + +K G +ALKLD+SKAYDRVEW++L   M K+ F  NWI+L MNCIT  +FS+ +N                 
Subjt:  SAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNA----------------

Query:  -------------------------------------------AHEFGVF-----------KSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNI
                                                   A +  VF           K I   Y  ASGQ  NY KSS+ FS NV     + +  I
Subjt:  -------------------------------------------AHEFGVF-----------KSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNI

Query:  LNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQW
          +        YLGLPS+  R K   F  +  RI                                      P  I   I    A FWWGS  D+R + W
Subjt:  LNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQW

Query:  KKWEDLCKPKEIGGLNFRDLGFVWGMDLMKMGLR------KNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLRE
         KWE LC+ K  GG+ FRD        + K G R        +     + +++  WLPRP  FK  S P + ++ V V++ I+ +  W    + Q  ++E
Subjt:  KKWEDLCKPKEIGGLNFRDLGFVWGMDLMKMGLR------KNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLRE

Query:  DIEVIKSLPISNS-APDKWIWHYDR
        D  +I  + +  +  PD+++WHYD+
Subjt:  DIEVIKSLPISNS-APDKWIWHYDR

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia], e value 2.0e-123, identity 35.7%
Query:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLRLSVRL------------------------------RLDHFL
        RFTGFYG+P    R  +W L+RR+ + + + W+I GD+N ILW  E +        ++     +                              RLD FL
Subjt:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLRLSVRL------------------------------RLDHFL

Query:  ANANFCALFEDH-----------------KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWL
         N  F  +F D                  +  +SAL  WG      L   I+A K  I  AY+  +P+DF  IH LE  L  LL  EE++WKQRSRE+WL
Subjt:  ANANFCALFEDH-----------------KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWL

Query:  KWGDRNTRWFHQKASSRHKKNWVGGIEDSNGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGAS
        KWG         +A               N +PTR+TSE+N+ LLAPYT+EE+  A+    P+KA G DGFPALFYQ YW  VG KT+ + L  LN+G  
Subjt:  KWGDRNTRWFHQKASSRHKKNWVGGIEDSNGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGAS

Query:  VQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKA
        ++ WN T + LIPK+ QPR +SDFRPISLCNV YKI++K I N+LK V+ +VI + QSAF+P R ISDN+I+GHE LH ++  +   IG  ALKLD+SKA
Subjt:  VQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKA

Query:  YDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAA----------------------------------HEFG--------------------
        +DRVEW+YL  IM KM F+  WI+ I+ CI+   FSI LN +                                  HE                      
Subjt:  YDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAA----------------------------------HEFG--------------------

Query:  -----------------VFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRIPKGILAKI
                           + +L  Y RASGQC+N+SKS++ FS NV  + + YL  ILN+      G+YLGLPS F R +                   
Subjt:  -----------------VFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRIPKGILAKI

Query:  STLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL---------------------------------------------------GFVWGMDLM
                    G+ RK+ W KW  +C PKE GGLNFRDL                                                   GF+WG DL+
Subjt:  STLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL---------------------------------------------------GFVWGMDLM

Query:  KMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR
          GLR  VGNG +I  F  PWLPRP+TFK +      +++  V+ FI +   WDV  +S     ED ++I S+PIS+ +  D W+WHYD+
Subjt:  KMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR

XP_024196188.1 uncharacterized protein LOC112199393 [Rosa chinensis], e value 5.4e-105, identity 31.25%
Query:  FSPD--IVKRPKVDGFDCLSGGLCLLWKDDID------------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWK
        FS D   V R K  G    +GGLCLLW +D+                         RFTGFYG P    R  SW L+R L  +    WVI GDLNEI+  
Subjt:  FSPD--IVKRPKVDGFDCLSGGLCLLWKDDID------------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWK

Query:  DEKAGVLIVKVDKLRLSVRLRLDHFLANAN-----------FCA------LFEDH-KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAI
         +K G ++  + ++ L +   LD  L +AN           F +      +F  + K    AL  W F Q   L  +I  ++ ++   YD+   I    +
Subjt:  DEKAGVLIVKVDKLRLSVRLRLDHFLANAN-----------FCA------LFEDH-KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAI

Query:  H-QLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG-----------------------------------VPTRVT
          +LE +L++LL  E ++W+QR++  WL+ GD NT++FHQ+AS+R KKN++ G+ D  G                                   +   ++
Subjt:  H-QLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG-----------------------------------VPTRVT

Query:  SEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIV
         + N +L+   + +EV  A+   HPSKAPG DGF   FY+ +W  VG   V +    L     ++  N T + LIPKV + + V+  RPISLCNV+YK+ 
Subjt:  SEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIV

Query:  TKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCI-------
        +KV+AN++K ++D V+   QSAF+PG  ISDN ++ +E  HFL K+R+ K G+ ALKLDMSKAYD+VEW++L  +M KM F   W+K IM C+       
Subjt:  TKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCI-------

Query:  -----------------------------TIA-----------TFSICLNAAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNM
                                     T A           +F  C   + +    K +L+ YE  SGQ VN  KS++ FSRNV    +  L   L +
Subjt:  -----------------------------TIA-----------TFSICLNAAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNM

Query:  CESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQWKKW
           +    YLGLP      K   F +L++R+                                      PK +  ++  L A FWWG   +  K+ W  W
Subjt:  CESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQWKKW

Query:  EDLCKPKEIGGLNFR--------DLG--FVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQ-WDVVKL
        E +C  K++GGL F         + G  F W     G ++++ GLR  VGNG  I +++  W+P P  FK  + PP G++ ++V++ I+     W +  L
Subjt:  EDLCKPKEIGGLNFR--------DLG--FVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQ-WDVVKL

Query:  SQVLLREDIEVIKSLPIS-NSAPDKWIWHYDR
         ++   +++ +I S+P+S   A D+ IWHYDR
Subjt:  SQVLLREDIEVIKSLPIS-NSAPDKWIWHYDR

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata], e value 7.8e-104, identity 29.59%
Query:  GGLCLLWKDDI-------------------DRF----TGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKL-------
        GGL  LWK+D+                   D F    TGFYG P+   + +SW L++ L       WV+ GD N  L   EK      +  ++       
Subjt:  GGLCLLWKDDI-------------------DRF----TGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKL-------

Query:  -----------------------RLSVRLRLDHFLANANFCALFE------------DH------------------------------KDCASAL-GGW
                                 + ++RLD  +AN  +   F+            DH                               +CA+ +   W
Subjt:  -----------------------RLSVRLRLDHFLANANFCALFE------------DH------------------------------KDCASAL-GGW

Query:  GFRQNKRLMNDIRALKDKIK------QAYDSAM-PIDFGAIHQLERQL-----------------------DSLLLDEEMYWKQRSRENWLKWGDRNTRW
        G     R  + + A+++KIK       A+ S++   D GAI ++++QL                       D LL  +E+YW QRSR NWL+ GDRNT++
Subjt:  GFRQNKRLMNDIRALKDKIK------QAYDSAM-PIDFGAIHQLERQL-----------------------DSLLLDEEMYWKQRSRENWLKWGDRNTRW

Query:  FHQKASSRHKKNWVGGIEDSNG-----------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPAL
        FH KAS R +KN++ GI +S G                                   V T+VT +M + L   +T EEV AA+    P+KAPG DG  AL
Subjt:  FHQKASSRHKKNWVGGIEDSNG-----------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPAL

Query:  FYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGH
        FYQ++W  VGD  VS+ L  LN+G  + + NHTN+VLIPKV  P  +S+FRPISLCNV+YKI++KV+AN+LK VL  +I   QSAF+PGR I+DN+++ +
Subjt:  FYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGH

Query:  EALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN----------------------------------
        E LH +H ++K K G VALKLD+SKAYDRVEW +L  IM KM F   WI+ +M+C+T  +FSI +N                                  
Subjt:  EALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN----------------------------------

Query:  -------------------------------------AAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLP
                                                E      IL+ YERASGQ +N  KSS +FS N +   +  +  IL + E +    YLGLP
Subjt:  -------------------------------------AAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLP

Query:  SVFHRGKSRDFKFLLDR--------------------------------------IPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLN
        ++  R K   F  L DR                                      IP  + +++  LCA FWWG  G++RK+ WK W+ L  PK+ GG+ 
Subjt:  SVFHRGKSRDFKFLLDR--------------------------------------IPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLN

Query:  FRDL----------------------------------------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPV
        FRDL                                               FVW        +++ G    VGNG SI   +  WLP   T KV++S   
Subjt:  FRDL----------------------------------------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPV

Query:  GMESVMVSEFINSSMQ-WDVVKLSQVLLREDIEVIKSLPIS-NSAPDKWIWHY
            ++V+E IN     W+  ++  +  R++ E I  +P+S    PD   W Y
Subjt:  GMESVMVSEFINSSMQ-WDVVKLSQVLLREDIEVIKSLPIS-NSAPDKWIWHY

TrEMBL top hits (e value, % identity, alignment)
A0A2N9H997 Uncharacterized protein, e value 1.6e-107, identity 31.7%
Query:  GGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEI---------LWKDEKAGVLIVKV-
        GGL LLW DD+D                       R TGFYGNP+ +LR  SW L+RRL   +   W++ GD NEI          W + +    +V+V 
Subjt:  GGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEI---------LWKDEKAGVLIVKV-

Query:  ---------------------------DKLRLSVRL---------------RLDHFLANANFCA------------------LFEDHKDCASALGGWGFR
                                   D + L V L               R DH     N C                   L +  K C   L  W   
Subjt:  ---------------------------DKLRLSVRL---------------RLDHFLANANFCA------------------LFEDHKDCASALGGWGFR

Query:  QNKRLMNDIRALKDKIKQAYDSAMPIDF--GAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG---------
        Q +     I   K ++++  +   P D+  GA++ + R+L  L+  EE +W+QRSR  WL+ GD NTR+FH+ AS R K N V G+ DS           
Subjt:  QNKRLMNDIRALKDKIKQAYDSAMPIDF--GAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNG---------

Query:  --------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWN
                                  V T VT EMN  LL P++ +EV  A+   HPSKAPG DG  ALF+Q+YW  VG     + L  +  G  +   N
Subjt:  --------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWN

Query:  HTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVE
         TN+VLIPKV  P  +S FRPISLCNV+YKI++K++ N++K +L  VI +CQSAF+PGR I+DN+I+  E LHFL  KR+ K   +A+KLDMSKAYDRVE
Subjt:  HTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVE

Query:  WSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAAH-----------------------------------EFGVFKSILKDYERASGQCVNYSKSSV
        W YL  I+ K+ FH  W+ LI+ C++  T+S+ +N                                      E  V   +L  YE ASGQ VN  K++V
Subjt:  WSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAAH-----------------------------------EFGVFKSILKDYERASGQCVNYSKSSV

Query:  FFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTL
        FFS N     ++ +  +     +     YLGLP V  + K + F  + DR+                                      P G+ +++S+L
Subjt:  FFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRI--------------------------------------PKGILAKISTL

Query:  CATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL-----------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMS
           FWWG     RK+ W   + L K K  GG+ FRDL                  + W        ++K+G+R  VG G +I ++Q PWL   S+ KV+S
Subjt:  CATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL-----------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMS

Query:  SPPVGMESVMVSEFINSS-MQWDVVKLSQVLLREDIEVIKSLPISNSAP-DKWIW
           +   +  V   INSS M+W    + Q+ +  + E+IK +P+S   P D  IW
Subjt:  SPPVGMESVMVSEFINSS-MQWDVVKLSQVLLREDIEVIKSLPISNSAP-DKWIW

A0A6J1DX30 uncharacterized protein LOC111024874, e value 9.5e-124, identity 35.7%
Query:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLRLSVRL------------------------------RLDHFL
        RFTGFYG+P    R  +W L+RR+ + + + W+I GD+N ILW  E +        ++     +                              RLD FL
Subjt:  RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLRLSVRL------------------------------RLDHFL

Query:  ANANFCALFEDH-----------------KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWL
         N  F  +F D                  +  +SAL  WG      L   I+A K  I  AY+  +P+DF  IH LE  L  LL  EE++WKQRSRE+WL
Subjt:  ANANFCALFEDH-----------------KDCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWL

Query:  KWGDRNTRWFHQKASSRHKKNWVGGIEDSNGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGAS
        KWG         +A               N +PTR+TSE+N+ LLAPYT+EE+  A+    P+KA G DGFPALFYQ YW  VG KT+ + L  LN+G  
Subjt:  KWGDRNTRWFHQKASSRHKKNWVGGIEDSNGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGAS

Query:  VQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKA
        ++ WN T + LIPK+ QPR +SDFRPISLCNV YKI++K I N+LK V+ +VI + QSAF+P R ISDN+I+GHE LH ++  +   IG  ALKLD+SKA
Subjt:  VQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKA

Query:  YDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAA----------------------------------HEFG--------------------
        +DRVEW+YL  IM KM F+  WI+ I+ CI+   FSI LN +                                  HE                      
Subjt:  YDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAA----------------------------------HEFG--------------------

Query:  -----------------VFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRIPKGILAKI
                           + +L  Y RASGQC+N+SKS++ FS NV  + + YL  ILN+      G+YLGLPS F R +                   
Subjt:  -----------------VFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRIPKGILAKI

Query:  STLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL---------------------------------------------------GFVWGMDLM
                    G+ RK+ W KW  +C PKE GGLNFRDL                                                   GF+WG DL+
Subjt:  STLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL---------------------------------------------------GFVWGMDLM

Query:  KMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR
          GLR  VGNG +I  F  PWLPRP+TFK +      +++  V+ FI +   WDV  +S     ED ++I S+PIS+ +  D W+WHYD+
Subjt:  KMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR

A0A803PV25 Uncharacterized protein, e value 1.8e-106, identity 29.08%
Query:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG--------------
        LSGGL L+WK DI                         FTGFYGNPD   R  SW L+R L    +  W+  GD NEI+   EK G              
Subjt:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG--------------

Query:  ------VLIVKVDKLRLS---------VRLRLDHFLANANFCALFE------------DHK---------------------------------------
               +     K  L+         +  RLD  L    +   FE            DH+                                       
Subjt:  ------VLIVKVDKLRLS---------VRLRLDHFLANANFCALFE------------DHK---------------------------------------

Query:  ----------------------DCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTR
                               C  AL  W  ++  +L +++  LK  + +      P  +  I Q+E +L+ LL  +E YW+QRSR  WL+WGDRNT+
Subjt:  ----------------------DCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTR

Query:  WFHQKASSRHKKNWVGGIEDSNG-------------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGF
        +FH KAS+R KKN + G++D  G                                     V  +V+S MN  LLA +  EEV+ AV   +P+KAPG DG 
Subjt:  WFHQKASSRHKKNWVGGIEDSNG-------------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGF

Query:  PALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMI
        PALFYQ++W+ +    V+  L +LN+GA +Q  N T + LIPKV +P+ + +FRPISLCNV+YKIV+K +AN++++ L  V+ + QSAF+ GR I DN I
Subjt:  PALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMI

Query:  LGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN-------------------------------
        +G+E+LH + K R R    VALKLDM+KAYDRVEW +L  +M K+ +   W+  IMNC+T   FS  +N                               
Subjt:  LGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN-------------------------------

Query:  ----------------------------------------AAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYL
                                                   E   F+ +L+ Y  ASGQ VN+ KS + F R+V+A  R +L   + +   ++ G YL
Subjt:  ----------------------------------------AAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYL

Query:  GLPSVFHRGKSRDFKFL-------------------------------------LDRIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGG
        GLPS   R K + F+F+                                       R+PK  +  I ++ A FWWGS     K+ W KW  LCK KE GG
Subjt:  GLPSVFHRGKSRDFKFL-------------------------------------LDRIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGG

Query:  LNFRDLG---------------------------------------------------FVWGMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSP
        L FRDLG                                                    VWG  +++ G R  +GNG S+ +   PWLPRP TFK+   P
Subjt:  LNFRDLG---------------------------------------------------FVWGMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSP

Query:  PVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR
        P+  +++ V +    + +WD   +  V    D E+I  +  S     DK +WHY +
Subjt:  PVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISN-SAPDKWIWHYDR

A0A803PWX1 Uncharacterized protein, e value 6.2e-107, identity 30.01%
Query:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG----------VLIV
        LSGGL L+WK ++                          TGFYGNP+ SLR  SW L+R L    +  W+  GD NEI+   EK G              
Subjt:  LSGGLCLLWKDDID-----------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAG----------VLIV

Query:  KVDKLRL-------------------SVRLRLDHFLANANFCALFE------------DHK---------------------------------------
         +D  R                     +  RLD  L N  +   FE            DH+                                       
Subjt:  KVDKLRL-------------------SVRLRLDHFLANANFCALFE------------DHK---------------------------------------

Query:  ----------------------DCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTR
                               C  AL  W  ++  RL N+I   K  + +      P  + AI  +E +L+ LL  +E YW+QRSR  WL+WGDRNT+
Subjt:  ----------------------DCASALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTR

Query:  WFHQKASSRHKKNWVGGIEDSNG-------------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGF
        +FH KASSR KKN + G++D  G                                     V  +V+  MN+ L+  ++ EEVV AV G +P+KAPG DG 
Subjt:  WFHQKASSRHKKNWVGGIEDSNG-------------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGF

Query:  PALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMI
        PALFYQ++W+ + D+ ++  L +LN+GA +   N T + LIPKV +P+ + +FRPISLCNV+YKIV+K +AN+L+  LD V+ + QSAF+ GR I DN I
Subjt:  PALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMI

Query:  LGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNA------------------------------
        +G+E LH + K R R    VALKLDM+KAYDRVEW +L  +M K+ + V W+  IM C+T   FS  +N                               
Subjt:  LGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNA------------------------------

Query:  ------------AHEFGV-----------------------------FKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYL
                     H  G                              FK +L+ Y +ASGQ VN+ KS + F R VT   R +L NI+ +   ++ G YL
Subjt:  ------------AHEFGV-----------------------------FKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYL

Query:  GLPSVFHRGKSRDFKFLLD--------------------------------------RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIG
        GLPS   R K + F+F+ +                                      R+PK  +  I ++ A FWWGS     K+ W KW+ LCK KE G
Subjt:  GLPSVFHRGKSRDFKFLLD--------------------------------------RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIG

Query:  GLNFRDLG---------------------------------------------------FVWGMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSS
        GL FRDLG                                                    VWG  +++ G R  +GNG S+ +   PWLPRP TFK+   
Subjt:  GLNFRDLG---------------------------------------------------FVWGMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSS

Query:  PPV
        PP+
Subjt:  PPV

M5VU98 Reverse transcriptase domain-containing protein, e value 1.6e-110, identity 29.7%
Query:  SGGLCLLWKDDID------------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVK---------
        SGGL LLWK+++D                        R T FYG P    R  SW L+ +L  +N+  W+  GD NEIL  DEK G  +           
Subjt:  SGGLCLLWKDDID------------------------RFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVK---------

Query:  -VDKLRLS-------------------VRLRLDHFLANA-----------------------------------------NFCALFEDHKDCAS------
         VDKL                      VR+RLD  LA                                           +F A++  H DC        
Subjt:  -VDKLRLS-------------------VRLRLDHFLANA-----------------------------------------NFCALFEDHKDCAS------

Query:  ---------------------ALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQ-LERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQ
                              L  W       +  + R L+ K+   + +          + +++ LD LL   E+YW QRSRENWLK GD+NT +FHQ
Subjt:  ---------------------ALGGWGFRQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQ-LERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQ

Query:  KASSRHKKNWVGGIEDSNG-----------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQ
        KA++R ++N + G+EDSNG                                   +  +VT++M Q+L+A ++ +E+  AV    PSKAPG DG P LFYQ
Subjt:  KASSRHKKNWVGGIEDSNG-----------------------------------VPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQ

Query:  QYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEAL
        +YW  VGD  V++  A L     ++  NHT + LIPKV +PR ++  RPISLCNV+Y+I  K +AN++K V+  VI E QSAF+PGR I+DN I+  E  
Subjt:  QYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEAL

Query:  HFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN-------------------------------------
        HFL ++R+ + G +ALKLDMSKAYDRVEW +L ++M  M F + W++++M+C+T  ++S  +N                                     
Subjt:  HFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLN-------------------------------------

Query:  ------------------AAHEF----------------GVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVF
                           +H F                GV K I + YE ASGQ +N  KS V FS N+  DT+  L ++L +   +S  +YLGLP + 
Subjt:  ------------------AAHEF----------------GVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLCNILNMCESESLGSYLGLPSVF

Query:  HRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRD
         R K+  F++L +R+                                      P+G+  +I  + A FWWG  G+ RK+ W +WE LCK K  GG+ FR 
Subjt:  HRGKSRDFKFLLDRI--------------------------------------PKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRD

Query:  L----------------------------------------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGME
        L                                                VW        +++MG R  +G+G+S+ ++   W+PRP+TF V++SP  GME
Subjt:  L----------------------------------------------GFVW-----GMDLMKMGLRKNVGNGRSILMFQHPWLPRPSTFKVMSSPPVGME

Query:  SVMVSEFI--NSSMQWDVVKLSQVLLREDIEVIKSLPIS-NSAPDKWIWHYDR
        +  VSE I    S QWD+ KL+ + L  D+  I  +P+S  + PD+ +W+YD+
Subjt:  SVMVSEFI--NSSMQWDVVKLSQVLLREDIEVIKSLPIS-NSAPDKWIWHYDR

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein, e value 8.8e-18, identity 28.99%
Query:  RVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSD-FRPISLCNVV
        R+  E  + L  P T  E+VA +N     K+PG DGF A FYQ+Y   +    +  F +I  +G     +   +++LIPK  +     + FRPISL N+ 
Subjt:  RVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSD-FRPISLCNVV

Query:  YKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIA
         KI+ K++AN+++  +  +I   Q  FIPG     N+      +   H  R +   +V + +D  KA+D+++  ++ + ++K+     ++K+I       
Subjt:  YKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIA

Query:  TFSICLN
        T +I LN
Subjt:  TFSICLN

P08548 LINE-1 reverse transcriptase homolog, e value 1.8e-15, identity 26.09%
Query:  RVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKV-HQPRLVSDFRPISLCNVV
        R++ +  +ML  P +  E+ + +      K+PG DGF + FYQ +   +    ++ F  I  +G     +   N+ LIPK    P    ++RPISL N+ 
Subjt:  RVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKV-HQPRLVSDFRPISLCNVV

Query:  YKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIA
         KI+ K++ N+++  +  +I   Q  FIPG     N+      +  ++K + +   ++ L +D  KA+D ++  ++ + + K+     ++KLI    +  
Subjt:  YKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIA

Query:  TFSICLN
        T +I LN
Subjt:  TFSICLN

P11369 LINE-1 retrotransposable element ORF2 protein, e value 1.1e-15, identity 28.34%
Query:  LLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQ-PRLVSDFRPISLCNVVYKIVTKVIA
        L +P + +E+ A +N     K+PG DGF A FYQ +   +       F  I  +G     +    + LIPK  + P  + +FRPISL N+  KI+ K++A
Subjt:  LLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQ-PRLVSDFRPISLCNVVYKIVTKVIA

Query:  NKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEF---HVNWIKLI
        N+++  +  +I   Q  FIPG     N+      +H+++K + +   ++ + LD  KA+D+++  ++ +++ +      ++N IK I
Subjt:  NKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEF---HVNWIKLI

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.9e-20 | 32.23
Query:  NGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISL
        +G+P  V+    + L  P T +E+  A+     +K+PGLDG    F+Q +W T+G             G          L L+PK    RL+ ++RP+SL
Subjt:  NGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISL

Query:  CNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNC
         +  YKIV K I+ +LK VL  VI   QS  +PGR I DN+ L  + LHF    R+  +    L LD  KA+DRV+  YL   +    F   ++  +   
Subjt:  CNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNC

Query:  ITIATFSICLN
           A   + +N
Subjt:  ITIATFSICLN

P93295 Uncharacterized mitochondrial protein AtMg00310 | 1.7e-08 | 44.62
Query:  RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKE-IGGLNFRDLGFVWGMDLMKMGLR
        R+ K +  K+++    FWW SC +KRK+ W  W+ LCK KE  GGL FRDLG+     L K   R
Subjt:  RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKE-IGGLNFRDLGFVWGMDLMKMGLR

Arabidopsis top hits | e value | %identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | 1.8e-18 | 25.4
Query:  KDCASALGGWGF----RQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGG
        K C   L   GF     + K  ++ + +++ ++      ++   F   H   ++ +      E +++Q+SR  WL+ GD NTR+FH+   +   KN +  
Subjt:  KDCASALGGWGF----RQNKRLMNDIRALKDKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGG

Query:  IEDSNGVPTRVTSEMNQMLLAPYTR---------------------------------------EEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDK
        +   + V     +++ +M++A YT                                        +E+ AAV     +KAPG D F A F+ + W  V D 
Subjt:  IEDSNGVPTRVTSEMNQMLLAPYTR---------------------------------------EEVVAAVNGFHPSKAPGLDGFPALFYQQYWATVGDK

Query:  TVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVT
        T+++       G  ++ +N T + LIPKV     +S FRP+S C VVYKI+T
Subjt:  TVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVT

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 1.8e-10 | 37.35
Query:  IANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWI
        +  +LK ++  +I   Q++FIPGR  +DN++   EA+H + +K+  K G++ LKLD+ KAYDR+ W YL   +    F   W+
Subjt:  IANKLKMVLDVVIDECQSAFIPGRCISDNMILGHEALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWI

AT4G29090.1 Ribonuclease H-like superfamily protein | 2.2e-08 | 22.44
Query:  IPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL----------------------------------------------GFVW-
        +PK +  +I ++ A FWW +  + + M WK W+ L   K  GG+ F+D+                                               FVW 
Subjt:  IPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDL----------------------------------------------GFVW-

Query:  ----GMDLMKMGLRKNVGNGRSILMFQHPWL---PRPSTFKVMSSPPVGMESV----MVSEFIN-SSMQWDVVKLSQVLLREDIEVIKSL-PISNSAPDK
              ++++ G R  VGNG  I++++H WL   P  +  ++   PP    SV     VS+ I+ S  +W    +  +    + ++I  L P      D 
Subjt:  ----GMDLMKMGLRKNVGNGRSILMFQHPWL---PRPSTFKVMSSPPVGMESV----MVSEFIN-SSMQWDVVKLSQVLLREDIEVIKSL-PISNSAPDK

Query:  WIWHY
        + W Y
Subjt:  WIWHY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.2e-09 | 44.62
Query:  RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKE-IGGLNFRDLGFVWGMDLMKMGLR
        R+ K +  K+++    FWW SC +KRK+ W  W+ LCK KE  GGL FRDLG+     L K   R
Subjt:  RIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKE-IGGLNFRDLGFVWGMDLMKMGLR


Sequences
CDS sequence
ATGCAGCCGTTATCTTCAGAAGTAGGGGAAGGGTCGTCTGACTTCTGGCAAATTTTTAAGGTCAGTAACGTTCAAATATCAAATAACTTAATGAAAATTTTTAATCAGCC
TTGTTCGGTAATGGAAGACGCAGTGAAGGATCTTGAGGATGATGGCGTCGGTTACGGTAATGGTGGAGATTCTGAAATAGAAAGTTACCCTGAATCGGTGGACGAGATTG
GACCATTGGATCTGGATGGGTCTGCTGGACCAGGTGAAGCCAAGATTGAAGCAGCAACTGATGGGCCTCTGTTGAAGGTATTGGGCCCACGGAATGATATCGAGTCCTCT
TTTGAGGATGGGCCTTTGCAACCTATTGTGGGGAGCGGTTTGCAGTGGGATGAAGGGGTCCAACTCAGTTCTGATGGGCTTGGGGAGCAATCCATTTTTCAGCCGGTGTT
ACCTTTGACTACTCAGGTTGAGGACACTCTTCAAAGCCATATGTGGAAGAAGCGGGCACGTGCAGGCTATGTCCCACTTGGTTTGAGTTTGGAAGTGGTGGAGGAGTTTA
AGAAGCGTAAAACTGGTCCGATTTTATTTTCTCCTGATATTGTTAAGCGGCCTAAGGTCGATGGTTTTGATTGTCTCAGTGGTGGCCTATGCTTATTATGGAAAGATGAC
ATTGATCGGTTTACTGGTTTTTATGGTAATCCGGATCCTAGCTTACGCTCTCACTCTTGGAATTTGATTCGTCGGTTGTATGACAATAATGAGGCTGCATGGGTGATTGA
GGGTGATCTGAATGAGATTTTATGGAAAGATGAGAAAGCGGGGGTCCTGATCGTGAAAGTGGACAAATTGAGGCTTTCCGTGCGTTTAAGATTGGACCACTTTCTTGCTA
ATGCTAATTTCTGTGCTCTTTTCGAAGATCATAAGGATTGTGCGTCGGCATTGGGAGGTTGGGGTTTTCGTCAAAATAAGCGTTTGATGAATGACATCCGAGCACTTAAG
GATAAGATCAAACAGGCTTATGATAGTGCCATGCCTATTGATTTTGGGGCAATACATCAGTTGGAGCGCCAATTGGATTCTCTTCTTCTCGATGAGGAGATGTATTGGAA
ACAAAGATCGCGAGAGAACTGGCTTAAATGGGGTGATCGCAACACCCGATGGTTTCACCAGAAGGCATCTTCGAGGCATAAGAAGAACTGGGTTGGGGGAATTGAAGACT
CTAATGGGGTTCCCACTAGAGTTACTTCAGAAATGAACCAGATGCTTTTGGCTCCTTATACTCGAGAGGAAGTTGTTGCTGCCGTTAATGGCTTCCATCCTTCCAAGGCG
CCAGGTCTGGATGGTTTTCCTGCTCTTTTCTACCAACAATATTGGGCGACAGTGGGTGACAAAACTGTGTCTAGTTTTCTTGCCATTCTGAATGATGGAGCCTCGGTTCA
GGATTGGAATCATACAAATCTTGTCCTTATTCCAAAGGTGCACCAACCGAGGTTAGTATCTGATTTTCGTCCTATTAGCTTATGTAATGTGGTATATAAAATTGTTACAA
AGGTCATAGCGAATAAACTAAAGATGGTTCTGGACGTGGTTATCGATGAGTGTCAATCTGCGTTCATCCCTGGTAGATGTATATCAGATAATATGATTTTGGGGCATGAA
GCGCTTCATTTTTTGCACAAGAAGCGGAAAAGAAAAATTGGATATGTTGCCTTGAAGCTTGACATGAGCAAAGCGTACGATCGAGTTGAGTGGTCGTACTTATCTCAGAT
TATGTCTAAGATGGAATTCCATGTTAATTGGATTAAGTTGATTATGAATTGTATCACGATAGCTACATTTTCCATTTGTTTGAATGCGGCTCATGAGTTTGGGGTGTTTA
AATCCATTCTGAAGGATTATGAACGGGCTTCTGGTCAATGTGTTAATTATTCCAAATCATCTGTGTTTTTCTCCAGGAATGTTACTGCAGATACCAGAGATTATTTGTGC
AATATATTAAATATGTGTGAGTCTGAGTCTCTGGGCTCTTACCTTGGTTTACCTTCCGTTTTCCACCGAGGCAAATCTAGAGATTTTAAATTTTTGTTGGATAGAATTCC
TAAAGGCATTTTAGCAAAAATATCAACACTCTGTGCTACGTTTTGGTGGGGTTCGTGTGGAGACAAACGTAAGATGCAATGGAAAAAATGGGAGGACCTGTGTAAGCCAA
AAGAGATCGGAGGTTTAAATTTTCGAGATTTGGGATTTGTGTGGGGGATGGACCTTATGAAGATGGGTTTACGGAAAAATGTCGGGAACGGGAGATCGATTCTAATGTTT
CAACATCCATGGCTGCCAAGGCCATCTACTTTTAAGGTCATGTCTTCACCTCCTGTTGGCATGGAGAGTGTTATGGTCTCGGAGTTCATAAACTCTTCTATGCAATGGGA
CGTGGTTAAGTTAAGTCAAGTTTTGTTGAGGGAGGACATCGAGGTGATAAAATCTCTTCCTATTAGTAATTCAGCTCCTGACAAGTGGATTTGGCATTATGATCGAATAT
GA
mRNA sequence
ATGCAGCCGTTATCTTCAGAAGTAGGGGAAGGGTCGTCTGACTTCTGGCAAATTTTTAAGGTCAGTAACGTTCAAATATCAAATAACTTAATGAAAATTTTTAATCAGCC
TTGTTCGGTAATGGAAGACGCAGTGAAGGATCTTGAGGATGATGGCGTCGGTTACGGTAATGGTGGAGATTCTGAAATAGAAAGTTACCCTGAATCGGTGGACGAGATTG
GACCATTGGATCTGGATGGGTCTGCTGGACCAGGTGAAGCCAAGATTGAAGCAGCAACTGATGGGCCTCTGTTGAAGGTATTGGGCCCACGGAATGATATCGAGTCCTCT
TTTGAGGATGGGCCTTTGCAACCTATTGTGGGGAGCGGTTTGCAGTGGGATGAAGGGGTCCAACTCAGTTCTGATGGGCTTGGGGAGCAATCCATTTTTCAGCCGGTGTT
ACCTTTGACTACTCAGGTTGAGGACACTCTTCAAAGCCATATGTGGAAGAAGCGGGCACGTGCAGGCTATGTCCCACTTGGTTTGAGTTTGGAAGTGGTGGAGGAGTTTA
AGAAGCGTAAAACTGGTCCGATTTTATTTTCTCCTGATATTGTTAAGCGGCCTAAGGTCGATGGTTTTGATTGTCTCAGTGGTGGCCTATGCTTATTATGGAAAGATGAC
ATTGATCGGTTTACTGGTTTTTATGGTAATCCGGATCCTAGCTTACGCTCTCACTCTTGGAATTTGATTCGTCGGTTGTATGACAATAATGAGGCTGCATGGGTGATTGA
GGGTGATCTGAATGAGATTTTATGGAAAGATGAGAAAGCGGGGGTCCTGATCGTGAAAGTGGACAAATTGAGGCTTTCCGTGCGTTTAAGATTGGACCACTTTCTTGCTA
ATGCTAATTTCTGTGCTCTTTTCGAAGATCATAAGGATTGTGCGTCGGCATTGGGAGGTTGGGGTTTTCGTCAAAATAAGCGTTTGATGAATGACATCCGAGCACTTAAG
GATAAGATCAAACAGGCTTATGATAGTGCCATGCCTATTGATTTTGGGGCAATACATCAGTTGGAGCGCCAATTGGATTCTCTTCTTCTCGATGAGGAGATGTATTGGAA
ACAAAGATCGCGAGAGAACTGGCTTAAATGGGGTGATCGCAACACCCGATGGTTTCACCAGAAGGCATCTTCGAGGCATAAGAAGAACTGGGTTGGGGGAATTGAAGACT
CTAATGGGGTTCCCACTAGAGTTACTTCAGAAATGAACCAGATGCTTTTGGCTCCTTATACTCGAGAGGAAGTTGTTGCTGCCGTTAATGGCTTCCATCCTTCCAAGGCG
CCAGGTCTGGATGGTTTTCCTGCTCTTTTCTACCAACAATATTGGGCGACAGTGGGTGACAAAACTGTGTCTAGTTTTCTTGCCATTCTGAATGATGGAGCCTCGGTTCA
GGATTGGAATCATACAAATCTTGTCCTTATTCCAAAGGTGCACCAACCGAGGTTAGTATCTGATTTTCGTCCTATTAGCTTATGTAATGTGGTATATAAAATTGTTACAA
AGGTCATAGCGAATAAACTAAAGATGGTTCTGGACGTGGTTATCGATGAGTGTCAATCTGCGTTCATCCCTGGTAGATGTATATCAGATAATATGATTTTGGGGCATGAA
GCGCTTCATTTTTTGCACAAGAAGCGGAAAAGAAAAATTGGATATGTTGCCTTGAAGCTTGACATGAGCAAAGCGTACGATCGAGTTGAGTGGTCGTACTTATCTCAGAT
TATGTCTAAGATGGAATTCCATGTTAATTGGATTAAGTTGATTATGAATTGTATCACGATAGCTACATTTTCCATTTGTTTGAATGCGGCTCATGAGTTTGGGGTGTTTA
AATCCATTCTGAAGGATTATGAACGGGCTTCTGGTCAATGTGTTAATTATTCCAAATCATCTGTGTTTTTCTCCAGGAATGTTACTGCAGATACCAGAGATTATTTGTGC
AATATATTAAATATGTGTGAGTCTGAGTCTCTGGGCTCTTACCTTGGTTTACCTTCCGTTTTCCACCGAGGCAAATCTAGAGATTTTAAATTTTTGTTGGATAGAATTCC
TAAAGGCATTTTAGCAAAAATATCAACACTCTGTGCTACGTTTTGGTGGGGTTCGTGTGGAGACAAACGTAAGATGCAATGGAAAAAATGGGAGGACCTGTGTAAGCCAA
AAGAGATCGGAGGTTTAAATTTTCGAGATTTGGGATTTGTGTGGGGGATGGACCTTATGAAGATGGGTTTACGGAAAAATGTCGGGAACGGGAGATCGATTCTAATGTTT
CAACATCCATGGCTGCCAAGGCCATCTACTTTTAAGGTCATGTCTTCACCTCCTGTTGGCATGGAGAGTGTTATGGTCTCGGAGTTCATAAACTCTTCTATGCAATGGGA
CGTGGTTAAGTTAAGTCAAGTTTTGTTGAGGGAGGACATCGAGGTGATAAAATCTCTTCCTATTAGTAATTCAGCTCCTGACAAGTGGATTTGGCATTATGATCGAATAT
GA
Protein sequence
MQPLSSEVGEGSSDFWQIFKVSNVQISNNLMKIFNQPCSVMEDAVKDLEDDGVGYGNGGDSEIESYPESVDEIGPLDLDGSAGPGEAKIEAATDGPLLKVLGPRNDIESS
FEDGPLQPIVGSGLQWDEGVQLSSDGLGEQSIFQPVLPLTTQVEDTLQSHMWKKRARAGYVPLGLSLEVVEEFKKRKTGPILFSPDIVKRPKVDGFDCLSGGLCLLWKDD
IDRFTGFYGNPDPSLRSHSWNLIRRLYDNNEAAWVIEGDLNEILWKDEKAGVLIVKVDKLRLSVRLRLDHFLANANFCALFEDHKDCASALGGWGFRQNKRLMNDIRALK
DKIKQAYDSAMPIDFGAIHQLERQLDSLLLDEEMYWKQRSRENWLKWGDRNTRWFHQKASSRHKKNWVGGIEDSNGVPTRVTSEMNQMLLAPYTREEVVAAVNGFHPSKA
PGLDGFPALFYQQYWATVGDKTVSSFLAILNDGASVQDWNHTNLVLIPKVHQPRLVSDFRPISLCNVVYKIVTKVIANKLKMVLDVVIDECQSAFIPGRCISDNMILGHE
ALHFLHKKRKRKIGYVALKLDMSKAYDRVEWSYLSQIMSKMEFHVNWIKLIMNCITIATFSICLNAAHEFGVFKSILKDYERASGQCVNYSKSSVFFSRNVTADTRDYLC
NILNMCESESLGSYLGLPSVFHRGKSRDFKFLLDRIPKGILAKISTLCATFWWGSCGDKRKMQWKKWEDLCKPKEIGGLNFRDLGFVWGMDLMKMGLRKNVGNGRSILMF
QHPWLPRPSTFKVMSSPPVGMESVMVSEFINSSMQWDVVKLSQVLLREDIEVIKSLPISNSAPDKWIWHYDRI