; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021735 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021735
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:11284880..11287483
RNA-Seq ExpressionLag0021735
SyntenyLag0021735
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.0e-17741.32Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I  I ++ G W   ++ I   A  +F  ++ SS P+   I++VT  +  K+++  N  L+R F+ EE+   +K ++P+KAPG DG+ A FFQKYW +V
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G +   + L +LN    +  LNKT I+LIPK  NPKRM DF PISLCNV+YK+++K LANR+K ++  IIS++Q+AF   R ITDNV + FE +H ++++
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GKEG+MA+KLDMSKA+DRVEW F+ KVME + FC  W   +M C+ SV YSILING+      P RGLRQGDPLSP LFL+CAEGLS LIN+      
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
          G+ IN+  P ++HLFFADDS++FC+A   +C  +++IL +YEEASGQ I  DKS    S NT  E   ++  ILG  Q +   +YLG+PS   +SK  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
        +F  L+E+V   L  WK  L S+GGKE+LIKA+AQAIPTYTMSCF +P+GLCD++ +    FWWG   +  K+ W+SWK+MC SK SGG+GFR +  FN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTP-DGLKDERVCN
        AMLAKQ WRIL NP+SL+ +VL+                                    RWRVG+GK I I  D W+       ++S      +   V +
Subjt:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTP-DGLKDERVCN

Query:  LLKENGS-WDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEG-CSDPSNTKAIWKQLWNLKIIPRAKICLWKM
        L+  +   W  E +R  F+  E E IL+IPL      D++IW  N KG F VKSAY +  +  D +  G CS+    + +WK+LW L +  + KI  W+ 
Subjt:  LLKENGS-WDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEG-CSDPSNTKAIWKQLWNLKIIPRAKICLWKM

Query:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGF-EFWKEISQRLNLEELNIADLIIWNIWAYQNRV
          + +PT  N+  +GI  + TC +C    E   H +  C+     W  +    +++  + +S    F +    +      + L +  ++ W IW  +N++
Subjt:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGF-EFWKEISQRLNLEELNIADLIIWNIWAYQNRV

Query:  M
        +
Subjt:  M

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.4e-19042.24Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I+ I++++G W+   + I  VA  +F  ++ SS PT   I +V   +   +++  N  L++ F+ EE+   +  M+P+KAPG DG+ A FFQKYW +V
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D   + L +LN    +  +NKT ITL+PK++NP +M DF PISLCNV+YK+++K LANR+K I+  IIS++Q+AF+ GR ITDNV + FE +H + ++
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GKEG+ A+KLDMSKAYDRVEW F+++VME + F E+W   +M C+ SV YSIL+NG       P RGLRQGDP+SPY+FL+CA+G S L+N       
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
          G+ I +  P I+HLFFADDSL+FC+A   +C+ + +IL+ YE+ASGQ I +DKS    S NT  E+  ++ ++LG  Q+    +YLG+PS   KSK+ 
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
        +F  ++ERVE+ L  WKE L S+GG+E+LIKA+AQAIPTYTMSCF+IPK LC+EI     RFWWG   +  KI W+SWKK+CK+K +GGMGFR +  FN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKD-ERVCN
        AMLAKQGWR++ NP+SL+A++ +                                    RWRVG+G+ I I  D W+       ++S P    D  RV  
Subjt:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKD-ERVCN

Query:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKA-IWKQLWNLKIIPRAKICLWKM
        L+ +E   W ++++R+ F+  EA  IL IPL      D+IIW  N KG F VKSAY + V   DN   G S   ++++ +W++LW+L I P+ +I  WKM
Subjt:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKA-IWKQLWNLKIIPRAKICLWKM

Query:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVM
          N +PT +NL+ KG++    C  C  + E+N H+  +C+V K+ W+ ++ N  + +       D  +   +I       +L I  ++ W IW  +N+++
Subjt:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVM

Query:  ATGIS
           +S
Subjt:  ATGIS

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]3.5e-18142.38Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I  + N+DG W    + I   A  +F  ++ SS PT   I +V + + ++++D  N +L + F+SEE+   +K ++P+KAPG DG+ A FF  YW++V
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G     + L +LN    +  +NKT I+LIPK   P RM +F PISLCN  YKI++K LANR K I+  IIS++Q+AF P R ITDNV + FE +H +N++
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GKE +M++KLDMSKA+DRVEW F++ VME L F E+W   IM CV SV YS+LING       P RG+RQGDPLSP LFL+CAEGLS LI+       
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
          G+ I +  P I+HLFFADDSL+FC+AKE +C  + +IL  YEEASGQ I  DKS    S NT  E    +  ILG  Q++   +YLG+PS   KSK  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
        +F  +++RV K L  WK  L S+GG+E+LIKA+AQA+PTYTMSCF++PK LC ++      FWWG   +  KI W+SW+KMC+SKL GGMGFR I  FN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNL
        AMLAKQGWRIL NP+SL+A+V +                                    RWRVG+G+ I I  D W+       +VS      D  + + 
Subjt:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNL

Query:  L--KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNT-KAIWKQLWNLKIIPRAKICLWKM
        L   +   W  ++I   F+  EA  ILKIPL      D +IW  N +G F VKSAY +  +  D+  EG S   N+   +WK++W LK+ P+ KI  W+ 
Subjt:  L--KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNT-KAIWKQLWNLKIIPRAKICLWKM

Query:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVM
          N +PT  NL  +G+  +  C LC    ET  H +  C+  K  W  +  +  +   SC    D  E   +I  + +L +L +   + W+IW  +N+ +
Subjt:  LKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVM

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]1.7e-18340.16Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I ++   DG   T +K+IG+    +F  +F S+ P+N    ++  G+  K++ + N DL R F+++E+   +K M P  APG DG+   F++  W  +
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D     L ILN G     LN T I+LIPK+++P++  DF PISLCNV+YKIV+K++ANR+KK++  ++S+SQ+AF+  R I+DN+ + FE +H +  +
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
         +GK G+MA+KLDMSKAYDRVEW FL KVME L F   W T +  C+ SV +S+L+NG P   F P RGLRQGDPLSPYLFL+CAEGL  LI + E    
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
         KG+ +    P +SHLFFADDSL+FCRA   +   I  IL++YEEASGQ I  +K+    S NT      +++ +LGV    +  +YLG+PS   + K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F  +RER+   +Q WKE L S GG+EVLIKA+ QA+PT+TM CF+IPK LC +I     +FWWG   E RKIHW+ WKK+CKSK  GG+GF+ I LFN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLR-----------------------------------GRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVCN
        AML KQ WR++ N DSL  KV +                                    +WR+GDG  ++I  D W+    +  +VS       + RVC 
Subjt:  AMLAKQGWRILRNPDSLLAKVLR-----------------------------------GRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVCN

Query:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML
        L+ +EN  W E+ IRE F+  EAE IL +PL   G  D +IWAE   G +  KSAY+L++   + +  G S+P++ K  W++LW+L +  + +  LW+  
Subjt:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML

Query:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSW-RDGFEFWKEISQRLNLEEL-NIADL---IIWNIWAYQ
         + +PTK NL+ + I  + TC  C  + E   H IW C+++KQ W          L  CR +  + F  + ++ Q +  +++ N A+L   I W+IW  +
Subjt:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSW-RDGFEFWKEISQRLNLEEL-NIADL---IIWNIWAYQ

Query:  NRVMATGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPSSS
        N       S   +KI ++    L E   VQ+  +  +   +    + W PPS S
Subjt:  NRVMATGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPSSS

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]7.2e-17939.46Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I +++  DG+  TG+K+IG+    +F  +F S+ P+N    ++  G+  K++ + N DL R F+++E+   +K M P  APG DG+   F++  W  +
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D     L ILN G     LN T I+LIPK+++P++  DF PISLCNV+YKIV+K++ANR+KK++  ++S+SQ+AF+  R I+DN+ + FE +H +  +
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
         +GK G+MA+KLDMSKAYDRVEW FL KVME L F   W T +  C+ SV +S+L+NG P   F P RGLRQGDPLSPYLFL+CAEGL  LI + E   +
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
         KG+ +    P +SHLFFADDSL+FCRA   +   I  IL++YEEASGQ I  +K+    S NT      +++ +LGV    +  +YLG+PS   + K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F  +RERV + +Q WKE L S GG+EVLIKA+ QA+PT+TM CF++PK LC +I     +FWWG   E RKIHW+ WKK+CKSK  GG+GF+ I LFN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLR-----------------------------------GRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVCN
        AML KQ WR++ N DSL  KV +                                    +WR+GDG  ++I  D W+    +  +VS       + RVC 
Subjt:  AMLAKQGWRILRNPDSLLAKVLR-----------------------------------GRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVCN

Query:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML
        L+ +EN  W E+ IRE F+  EAE IL +PL   G  D +IWAE   G +  KSAY+L++   + +    S+ +  K  W++LW+L +  + +  LW+  
Subjt:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML

Query:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSW-RDGFEFWKEISQR-LNLEELNIADL---IIWNIWAYQ
         + +P K NL  + I  +  C  C    E   H +W C+++KQ W          L  C+ +  + F  + ++ Q  L  +  N+A+L   I W+IW  +
Subjt:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSW-RDGFEFWKEISQR-LNLEELNIADL---IIWNIWAYQ

Query:  NRVMATGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPSSS
        N       S   +KI +     L E   V+++ +  +        + W P S S
Subjt:  NRVMATGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPSSS

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein2.8e-18444.88Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I  I +    W+  + EI  V  H+F  ++ ++ P    I  V   + + +S   N++LL+PF+ EE+   +  M+PSKAPG DG+ A FFQK+W +V
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D     L  LN+G  ++ LN T I LIPKV++P+ M  F PISLCNV+YKI++K L NRMK I+ +++S SQ+AFVPGR I+DN+ I FE IH + N+
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GK   MA KLDMSKAY+RVEW +L+K+M  L F E+W   IM CV SV YSIL+NG P+   +P RGLRQGDPLSPYLFLICAEGLS L+ + E    
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
         +G+ +++  P +SHLFFADDSLIFCRA E DC  ++ IL  YE ASGQ I  DK+    S+N        + ++ G        +YLG+P    +SK  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F  +++R+ + LQ WKE   S  GKE+LIKA+ QAIPTY MSCF++P GLCDEI+    RFWWG     RKIHWLS KK+C++K+ GGMGFR +  FNQ
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLL-----AKVLRG---RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDE-RVCNLLKENG-SWDEEMIRECFMVSEAEEIL
        A+LA+QGWR+L+NP+SL+     AK + G   RWRVG+G++I+I +D WI       ++S    L++   V +L+ ++  +W+  ++ E F+  + E I+
Subjt:  AMLAKQGWRILRNPDSLL-----AKVLRG---RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDE-RVCNLLKENG-SWDEEMIRECFMVSEAEEIL

Query:  KIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAI---WKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFLC
        KIPL  R   D ++W    KGVF V+SAY ++++    SV   ++ S++K +   W++LW+++  P+ K+ +W+  +NI+PT+  L  +GI  + TC  C
Subjt:  KIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAI---WKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFLC

Query:  MSKRETNGHLIWECKVVKQAWK
        M + ET  H++W C+  ++ W+
Subjt:  MSKRETNGHLIWECKVVKQAWK

A0A2N9FN47 Uncharacterized protein1.5e-17740.14Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N +  + + +  W+T    I  +   +F  +F SS P  T I +VT  +  +++   N  LL PFSSEE+ + +  M+PSKAPG DG+ A FFQKYW VV
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G       L  LN G  +  +N T I LIPKV+ P  M  F PISLCNVIYKI++K L NRMK ++  +IS SQ+AFVPGR ITDN+ + FE +H + N+
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GK G MA KLDMSKAYDRVEW +LR ++  L F E W + IMMCV SV YS+L+NG  +   KP RGLRQGDPLSPYLFLICAEGLS L+ + E    
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
          G+ I +  P +SHLFFADDS+IFC A  +DC+ +  +L  YE+ASGQ +   K+    S NT  +    + Q+ G        +YLG+P    ++K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F  L++RV + LQ WKE L S  G+E+LIKA+ QAIPTY MSCF++P GLC EI+    ++WWG     RK+HWLS +++  +K  GGMGFR +SLFN 
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRGR-----------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLK-DERVCN
        AMLA+QGWR+++ P+SLL +VL+ +                                   WRVG G++I+I +D W+       ++S P  L+ +  V  
Subjt:  AMLAKQGWRILRNPDSLLAKVLRGR-----------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLK-DERVCN

Query:  LLK-ENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML
        L+  E G W+  +I + F+ S+AE I +IPL  R  +D++IWA N KGVF VK+AY+L++   +   E  S  S TK  W  +W+ K+ P+ +  +W+  
Subjt:  LLK-ENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML

Query:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVMA
        +NI+PT+  L  K I ++ +C  C  + ET  H++W C   ++ W        + +    S+ D   F     + L   E+ I   + W +W  +N +M 
Subjt:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVMA

Query:  TGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPS-SSSEIELKC
         GI+++   I         E   ++   +E ++ +   S  SW PP   S ++ + C
Subjt:  TGISADQKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPS-SSSEIELKC

A0A2N9HYE3 Reverse transcriptase domain-containing protein5.1e-17838.54Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N +  +  +DG W     ++  +   ++ ++FQ++ P    +++V   +   ++   N  L+  F++ E+   +K M P KAPG D L   F+QKYW ++
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D     L  LN G+ ++ +N T ITLIPKVQNP+ + +F PISLCNVIYK+++K LANR+K ++  I+ +SQ+AF+PGR ITDN+ + FE +H + ++
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          GK G MA+KLDMSKAYDRVEW +L+ VME + F  +W T +M C+ +V YSIL+NG P    KP RGLRQGDPLSPYLFL+CAEGL  LI +++    
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
         KG+ I++  P I+HLFFADDSL+FC+A  +D   I+ IL +YE+ASGQ +   K+    SK+T     + +Q +LGV       +YLG+PS   ++K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F +++ERV   L+ WKE L S  G+E+LIK++AQAIP Y MSCFR+P  L  EI     RFWWG   ++ K+HWL W+ +CKSK +GGMG R +  FN+
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRGR-----------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVS-TPDGLKDERVCN
        A+LAKQ WR+L NP SL +KV + +                                   WRVG G HI+I RD W+    +  +VS  P  +    V +
Subjt:  AMLAKQGWRILRNPDSLLAKVLRGR-----------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVS-TPDGLKDERVCN

Query:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML
        L+  E  SW  E+++  F+  EA  IL IPL  R   D ++W     G + V+S Y L++N    +    SD +    +W  +W+L + P+ +  LW+  
Subjt:  LL-KENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKML

Query:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCR--SWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRV
         N +PT+ NL  + I  +P+C  C ++ E+  H +W+CK +K  W++ IP    + R  R  S+    +   +  Q L+  EL +  +  W IW  +NR+
Subjt:  KNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCR--SWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRV

Query:  MATGISADQKKINQLIEASLDEHFRVQKQYQEDMQ---SKNQRSQSSWSPP
            +      ++QLI  +LD     Q     D Q     N    ++W PP
Subjt:  MATGISADQKKINQLIEASLDEHFRVQKQYQEDMQ---SKNQRSQSSWSPP

A0A2N9I335 Reverse transcriptase domain-containing protein3.2e-18039.58Show/hide
Query:  MNKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEV
        +N I  + + +G  +T   ++ ++A  +F ++F SS P +  I     GL + +++  N  LL  F+SEE++  +K M P+KAPG DG+ A F+Q YW++
Subjt:  MNKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEV

Query:  VGADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINN
        VG +  +  L IL+ G  V  +N T I LIPKV+NP+R+ DF PISLCNVIYKIV+K LANR+KK++  +IS+SQ+AFVPGR ITDNV + FE +H ++ 
Subjt:  VGADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINN

Query:  RVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLS
        +  G+ G MA+KLDMSKAYDRVEW+F+  +M  L F E+W   IMMC++SV YS+LING     FK  RG+RQGD LSPYLFL+CAEGLS L+ +     
Subjt:  RVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLS

Query:  NFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKM
           G+  ++  P ++HLFFADDSL+FC+A   +C  + +IL++YE  SGQ +   K+    ++NT ++   Q+Q++  V +  S  +YLG+PS   +SK 
Subjt:  NFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKM

Query:  VLFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFN
          F  L+ RV + +  WKE   S GG+EVLIKA+AQAIPTYTMSCF++P  LC ++N     FWWG   + +K HW+ WKK+C SK  GGMGFR +  FN
Subjt:  VLFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFN

Query:  QAMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVC
         A+LAKQGWR+L+ P SL+ +VL+                                    RW +GDGK ++I +DPW+      + +S  + +   ERV 
Subjt:  QAMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGL-KDERVC

Query:  NLLKENG-SWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEG-CSDPSNTKAIWKQLWNLKIIPRAKICLWK
         L+ E+  SW+ E I   F   EA  I  IPL  R   D + W +   G+F VKSAY L +  +  + EG CS     +  WK LW++ I P+ K  LW+
Subjt:  NLLKENG-SWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEG-CSDPSNTKAIWKQLWNLKIIPRAKICLWK

Query:  MLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRV
            I+PT   L  + +  +  C  C+   E+  H +W C      W         + R   S+   F+    +  RL+ EE+N+   + + IW  +N++
Subjt:  MLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRV

Query:  MATGISADQKKI---NQLIEASLDEHFRVQKQ-YQEDMQSKNQRSQSS-WSPPS
        +   +S +   +    QL+ +   E    Q+   Q   ++ NQRS    W PPS
Subjt:  MATGISADQKKI---NQLIEASLDEHFRVQKQ-YQEDMQSKNQRSQSS-WSPPS

A0A2N9I509 Uncharacterized protein3.0e-17842.07Show/hide
Query:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV
        N I  + + D AW     +I  ++  +F  +FQSS P  T +++V   +   ++ + N DLLRPFS EE+   +  M+PSKAPG DG+ A FFQK+W VV
Subjt:  NKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVV

Query:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
        G D     L  L+ G  ++ +N T I LIPKV+ P+RM  F PISLCNV+YKIV+K L NRMK ++  IIS SQ+AFVPGR ITDNV + FE +H + N 
Subjt:  GADTKRICLQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
          G    MA KLDMSKAYDRVEW +LR ++  L F + W   +M CV S  YS+++NG P+    P RGLRQGDPLSPYLFLICAEGLS L+ + E  S 
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV
         KG+ I +  P ISHLFFADDS+IFCRA  +DC  I+NIL  YE+ASGQ +  DK+    S NT       +  + G   +    +YLG+P    + K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMV

Query:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ
         F  +++R+ + LQ WKE L S  GKEVLIKA+ QA+PTY MSCF+ P GLC EI+     FWWG     RKIHWLS  K+ K K  GG+GFR + LFN+
Subjt:  LFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQ

Query:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNL
        A+LA+QGWR+L++P SL+ + L+                                    RWRVG+G+ IK+  D W+       ++S P  L D    + 
Subjt:  AMLAKQGWRILRNPDSLLAKVLRG-----------------------------------RWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNL

Query:  LKENGS--WDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVV---NHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLW
        L E G+  W  +++   F   +AE I KIPL  R   D +IW  +  GVF V+SAY +++   N ++ SV   S  S     W  LW++++ P+ K+ +W
Subjt:  LKENGS--WDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVV---NHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLW

Query:  KMLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWK-NFIPNTTNFLRSCRSWRDGF-EFWKEISQRLNLEELNIADLIIWNIWAYQ
        K  KNIVPT+  L  KG+ ++ +C  C+ + ET  H++W C+  +  WK + +P +T         R  F E  +     L    L I+    W +W  +
Subjt:  KMLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWK-NFIPNTTNFLRSCRSWRDGF-EFWKEISQRLNLEELNIADLIIWNIWAYQ

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.1e-3625.52Show/hide
Query:  NKIDEILNKDGAWRTGDKE----IGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKY
        N+ID I N  G   T   E    I +   H +    ++ +  +T +   T     +L+  +   L RP +  E+ A+I  +   K+PG DG  A F+Q+Y
Subjt:  NKIDEILNKDGAWRTGDKE----IGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKY

Query:  WEVVGADTKRICLQILNEGEDVRPLNKTLITLIPKV-QNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIH
         E +     ++   I  EG       +  I LIPK  ++  + ++F PISL N+  KI+ K LANR+++ +  +I   Q  F+PG Q   N+      I 
Subjt:  WEVVGADTKRICLQILNEGEDVRPLNKTLITLIPKV-QNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIH

Query:  GINNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRK
         I NR + K   + + +D  KA+D+++  F+ K +  L     +   I    +    +I++NG   E F  + G RQG PLSP LF I  E L+  I ++
Subjt:  GINNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRK

Query:  ETLSNFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKS-VFMVSKNTKAEEAI--QLQQILGVKQENSIGQYLGMPS
        + +   KG+++ K    +S   FADD +++        + +  ++  + + SG  I + KS  F+ + N + E  I  +L   +  K+   +G  L    
Subjt:  ETLSNFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKS-VFMVSKNTKAEEAI--QLQQILGVKQENSIGQYLGMPS

Query:  QNIKSKMVLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGM
        +++  +   +K L + +++    WK    S  G+  ++K   + + I  +     ++P     E+ +   +F W   + R     LS K       +GG+
Subjt:  QNIKSKMVLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGM

Query:  GFRGISLFNQAMLAKQGWRILRNPD
              L+ +A + K  W   +N D
Subjt:  GFRGISLFNQAMLAKQGWRILRNPD

P08548 LINE-1 reverse transcriptase homolog1.4e-3625.58Show/hide
Query:  IDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLS----KKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWE
        I  I N +    T   EI  + + ++  ++ S K  N  +K++   L      +LS  +   L RP SS E+ + I+ +   K+PG DG  + F+Q + E
Subjt:  IDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLS----KKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWE

Query:  VVGADTKRICLQILNEGEDVRPLNKTLITLIPKV-QNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGI
         +      +   I  EG       +  ITLIPK  ++P R +++ PISL N+  KI+ K L NR+++ +  II   Q  F+PG Q   N+      I  I
Subjt:  VVGADTKRICLQILNEGEDVRPLNKTLITLIPKV-QNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGI

Query:  NNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKET
         N+++ K+  M + +D  KA+D ++  F+ + ++ +     +   I         +I++NG+  + F    G RQG PLSP LF I  E L+  I  ++ 
Subjt:  NNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKET

Query:  LSNFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSV-FMVSKNTKAEEAIQ--LQQILGVKQENSIGQYLGMPSQN
        +   KG+ I      I    FADD +++     +    +  +++EY   SG  I   KSV F+ + N +AE+ ++  +   +  K+   +G YL    ++
Subjt:  LSNFKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSV-FMVSKNTKAEEAIQ--LQQILGVKQENSIGQYLGMPSQN

Query:  IKSKMVLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGF
        +  +   ++ LR+ + + +  WK    S  G+  ++K   + +AI  +     + P     ++ +    F W   K +     LS K       +GG+  
Subjt:  IKSKMVLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGF

Query:  RGISLFNQAMLAKQGW
          + L+ ++++ K  W
Subjt:  RGISLFNQAMLAKQGW

P0C2F6 Putative ribonuclease H protein At1g657501.4e-3123.34Show/hide
Query:  FKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQA
        F  + ERV   +  W+E   S  G+  L KA+  ++P ++MS   +P+ + + ++Q    F WG+T E++K H + W K+C  K  GG+G R     N+A
Subjt:  FKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQA

Query:  MLAKQGWRILRNPDSLLAKVLRGR--------------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVC
        +++K GWR+L+  +SL   VL+ +                                      W  GDG+ I+   D W+  K    L+   +G +     
Subjt:  MLAKQGWRILRNPDSLLAKVLRGR--------------------------------------WRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVC

Query:  NLLKEN-----GSWDEEMIRECFMVSEAEEILKIPLGR-RGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKI
         ++ ++       WD   I      +   E+  + L    G+RD + W  +  G F V+SAY+++      +V+    P N  + +  LW +++  R K 
Subjt:  NLLKEN-----GSWDEEMIRECFMVSEAEEILKIPLGR-RGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKI

Query:  CLWKMLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTN---FLRSCRSWRDGFEFWKEISQRLNLEEL---NIADLII
         LW +    V T+     + +  +  C +C    E+  H++ +C      W   +P       F +S   W      +  +  R   E++    I  +II
Subjt:  CLWKMLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTN---FLRSCRSWRDGFEFWKEISQRLNLEEL---NIADLII

Query:  WNIWAYQ
        W  W ++
Subjt:  WNIWAYQ

P11369 LINE-1 retrotransposable element ORF2 protein9.9e-4629.57Show/hide
Query:  IDEILNKDGAWRTGDKEIGDVASHHFIAMFQSS-KPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVVG
        I++I N+ G   T  +EI +     +  ++ +  +  + + K +      KL+  Q   L  P S +E+ AVI  +   K+PG DG  A F+Q + E + 
Subjt:  IDEILNKDGAWRTGDKEIGDVASHHFIAMFQSS-KPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVVG

Query:  ADTKRICLQILNEGEDVRPLNKTLITLIPKVQ-NPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR
            ++  +I  EG       +  ITLIPK Q +P ++++F PISL N+  KI+ K LANR+++ +  II   Q  F+PG Q   N+      IH I N+
Subjt:  ADTKRICLQILNEGEDVRPLNKTLITLIPKVQ-NPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNR

Query:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN
        ++ K   M + LD  KA+D+++  F+ KV+E       +   I         +I +NG   E    + G RQG PLSPYLF I  E L+  I +++ +  
Subjt:  VRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSN

Query:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSV-FMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMP-SQNIKSK
         KG++I K    IS L  ADD +++    +N  R + N++  + E  G  I  +KS+ F+ +KN +AE+ I+      +   N   +YLG+  ++ +K  
Subjt:  FKGLKINKIFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSV-FMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMP-SQNIKSK

Query:  M-VLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSK-LSGGMGFRG
            FK L++ +++ L+ WK+   S  G+  ++K   + +AI  +     +IP    +E+  A  +F W   K R        K + K K  SGG+    
Subjt:  M-VLFKRLRERVEKVLQSWKENLFSLGGKEVLIK--AIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSK-LSGGMGFRG

Query:  ISLFNQAMLAKQGW
        + L+ +A++ K  W
Subjt:  ISLFNQAMLAKQGW

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-3125.15Show/hide
Query:  KDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVVGADTKRIC
        +DG      + I D A   +  +F     +    +++  GL   +S+ +   L  P + +EL+  ++ M  +K+PG DGL   FFQ +W+ +G D  R+ 
Subjt:  KDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVVGADTKRIC

Query:  LQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNRVRGKEGWM
         +   +GE      + +++L+PK  + + ++++ P+SL +  YKIVAK+++ R+K ++  +I   Q+  VPGR I DNV +  + +H      R      
Subjt:  LQILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNRVRGKEGWM

Query:  AMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSNFKGLKINK
         + LD  KA+DRV+  +L   +++ SF  ++   +     S E  + IN     P    RG+RQG PLS  L+ +  E    L+ ++ T     GL + +
Subjt:  AMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSNFKGLKINK

Query:  IFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGM--------PSQNIKSKMV
            +    +ADD ++  +    D    +     Y  AS   I   KS  ++  + K +          +  E+ I +YLG+         SQN      
Subjt:  IFPSISHLFFADDSLIFCRAKENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGM--------PSQNIKSKMV

Query:  LFKRLRERVEKVLQSWK--ENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMG
         F  L E V   L  WK    + S+ G+ ++I  +  +   Y + C    +    +I +    F W         HW+S          GG G
Subjt:  LFKRLRERVEKVLQSWK--ENLFSLGGKEVLIKAIAQAIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMG

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein5.8e-2527.11Show/hide
Query:  FRGISLFNQAMLAKQ--GWRILRNPDSLLAKVLRGRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNLLKENGS---WDEEMIRECFMVSEA
        F+ +S+ +  +  +Q  GW  L +  +LL K    R  +GDG++I+IG D  +     R L +T +  K+  + NL +  GS   WD+  I +    S+ 
Subjt:  FRGISLFNQAMLAKQ--GWRILRNPDSLLAKVLRGRWRVGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDERVCNLLKENGS---WDEEMIRECFMVSEA

Query:  EEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFL
          I +I L +    D+IIW  N  G + V+S Y L+ +    ++   + P  +  +  ++WNL I+P+ K  LW+ L   + T   L ++G+  +P+C  
Subjt:  EEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFL

Query:  CMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLN-LEELNIADL-------IIWNIWAYQNRVM
        C  + E+  H ++ C     AW+    + ++ +R+     D   F + IS  LN +++  ++D        +IW IW  +N V+
Subjt:  CMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLN-LEELNIADL-------IIWNIWAYQNRVM

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.6e-1440.7Show/hide
Query:  LANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKI
        +  R+K +M  +I  +QA+F+PGR  TDN+    E +H +  R +G +GWM +KLD+ KAYDR+ W +L   + S  F E W  +I
Subjt:  LANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNRVRGKEGWMAMKLDMSKAYDRVEWIFLRKVMESLSFCEEWTTKI

AT4G29090.1 Ribonuclease H-like superfamily protein9.2e-3926.15Show/hide
Query:  AIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQAMLAKQGWRILRNPDSLLAKVLRGR----------
        A+PTYTM+CF +PK +C +I    A FWW   +E + +HW +W  +   K  GG+GF+ I  FN A+L KQ WR+L  P+SL+AKV + R          
Subjt:  AIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQAMLAKQGWRILRNPDSLLAKVLRGR----------

Query:  ---------WR----------------VGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDE--------RVCNLLKENG-SWDEEMIRECFMVSEAEEILK
                 W+                VG+G+ I I R  W+  K     +        E        +V +L+ E+G  W +++I   F   E + I +
Subjt:  ---------WR----------------VGDGKHIKIGRDPWIYRKGNRFLVSTPDGLKDE--------RVCNLLKENG-SWDEEMIRECFMVSEAEEILK

Query:  IPLGRRGSRDEIIWAENPKGVFLVKSAYQLV--VNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFLCMS
        +  G R   D   W     G + VKS Y ++  + ++ +S +  S+PS    I++++W  +  P+ +  LWK L N +P    L  + +     C  C S
Subjt:  IPLGRRGSRDEIIWAENPKGVFLVKSAYQLV--VNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICLWKMLKNIVPTKVNLISKGIDTNPTCFLCMS

Query:  KRETNGHLIWECKVVKQAWK-NFIPNTTNFLRSCRSWRD--------------GFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVMATGISADQKKIN
         +ET  HL+++C   +  W  + IP     +     W D              G   W++ SQ        +   ++W +W  +N ++  G   + +++ 
Subjt:  KRETNGHLIWECKVVKQAWK-NFIPNTTNFLRSCRSWRD--------------GFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVMATGISADQKKIN

Query:  QLIEASLDE-HFRVQKQYQEDMQSKNQRSQSSWSPP
        +  E  L+E   R + +        N+ S   W PP
Subjt:  QLIEASLDE-HFRVQKQYQEDMQSKNQRSQSSWSPP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-2439.16Show/hide
Query:  AIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKL-SGGMGFRGISLFNQAMLAKQGWRILRNPDSLLAKVLRGR---------
        A+P Y MSCFR+ K LC ++  A   FWW + + +RKI W++W+K+CKSK   GG+GFR +  FNQA+LAKQ +RI+  P +LL+++LR R         
Subjt:  AIPTYTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKL-SGGMGFRGISLFNQAMLAKQGWRILRNPDSLLAKVLRGR---------

Query:  ----------WR----------------VGDGKHIKIGRDPWI
                  WR                +GDG H K+  D WI
Subjt:  ----------WR----------------VGDGKHIKIGRDPWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.4e-1552.94Show/hide
Query:  LINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSNFKGLKINKIFPSISHLFFADDS
        +ING PQ    P RGLRQGDPLSPYLF++C E LSGL  R +      G++++   P I+HL FADD+
Subjt:  LINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSNFKGLKINKIFPSISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGATAGACGAGATTCTTAATAAGGATGGGGCCTGGAGGACAGGGGATAAAGAGATTGGGGATGTGGCTTCTCATCACTTTATAGCCATGTTCCAGTCCTCTAA
GCCCACAAACACTCTTATAAAAAAAGTGACTAGTGGGCTAAGCAAGAAACTCTCGGACAGCCAAAATAGGGATCTTTTAAGACCATTCTCATCCGAAGAGCTAAATGCAG
TGATCAAAGGGATGAACCCCTCAAAAGCCCCAGGAAGAGACGGGCTACAAGCAGCCTTCTTCCAAAAATACTGGGAGGTGGTAGGAGCGGACACCAAGCGGATTTGCCTC
CAAATTCTGAATGAAGGAGAAGATGTGAGACCTTTAAATAAGACTCTTATAACCCTTATCCCTAAAGTCCAAAATCCAAAAAGAATGCAAGACTTCATACCAATAAGTCT
ATGCAATGTCATCTATAAAATAGTAGCGAAATCCCTAGCTAACAGAATGAAAAAGATTATGGATGTGATCATCTCGCAATCCCAAGCAGCATTTGTGCCCGGAAGGCAAA
TCACTGATAATGTGGCGATTGGGTTTGAGTGCATCCACGGAATAAACAACAGAGTTAGAGGGAAGGAAGGTTGGATGGCCATGAAGCTTGATATGAGTAAAGCCTACGAT
CGAGTGGAATGGATCTTCCTAAGGAAGGTTATGGAGAGTCTTAGCTTCTGCGAAGAATGGACAACCAAGATAATGATGTGCGTGGAATCTGTGGAATACTCCATTTTGAT
CAATGGCATACCGCAGGAACCTTTCAAGCCAGAGAGAGGCCTTAGGCAAGGGGACCCTTTATCCCCTTACCTTTTCCTAATATGCGCTGAAGGATTATCAGGTCTTATAA
ACAGGAAAGAAACTCTCTCTAATTTCAAAGGTCTTAAAATTAATAAAATTTTCCCCTCTATATCTCACTTGTTTTTCGCTGATGATAGTTTAATATTTTGTAGGGCGAAG
GAGAATGACTGTAGGTGTATAAAAAATATCCTAAGGGAATATGAAGAAGCCTCGGGTCAAACCATTAAACTGGACAAGTCTGTTTTCATGGTTAGCAAGAATACAAAGGC
AGAGGAAGCAATCCAGCTACAACAAATCCTTGGTGTCAAGCAAGAGAATTCCATAGGACAGTACCTCGGAATGCCATCACAAAACATCAAAAGCAAAATGGTGTTGTTTA
AAAGGCTGAGGGAAAGGGTGGAAAAGGTGTTGCAATCTTGGAAGGAGAACCTATTCTCCTTGGGTGGAAAAGAAGTTCTTATTAAGGCTATAGCGCAAGCGATCCCCACA
TATACCATGTCTTGTTTTCGAATCCCCAAAGGGCTGTGTGATGAGATCAACCAGGCATGTGCGCGTTTTTGGTGGGGAGCCACGAAGGAAAGAAGGAAAATCCATTGGTT
AAGCTGGAAGAAGATGTGCAAAAGCAAGCTCTCAGGGGGTATGGGTTTCCGGGGCATTAGCTTGTTCAATCAAGCTATGTTAGCTAAACAAGGCTGGCGAATCTTAAGGA
ACCCAGATAGCCTCCTAGCAAAAGTACTAAGGGGGAGATGGAGAGTCGGCGATGGCAAACACATAAAGATTGGCCGAGATCCGTGGATATACAGGAAAGGAAACAGGTTT
TTAGTCTCTACACCGGATGGGCTAAAAGACGAAAGAGTTTGCAACCTTCTCAAAGAGAATGGGTCCTGGGATGAGGAGATGATTCGAGAGTGCTTCATGGTTTCCGAGGC
CGAAGAAATCCTCAAAATCCCTCTCGGTAGAAGAGGTTCCAGAGATGAAATTATATGGGCTGAAAATCCTAAAGGCGTTTTCTTGGTGAAATCGGCTTATCAACTCGTTG
TTAATCATGAAGACAACTCAGTGGAAGGATGTTCTGATCCCTCCAATACAAAGGCTATATGGAAACAACTTTGGAACTTAAAAATCATTCCAAGAGCAAAAATCTGTCTG
TGGAAAATGCTGAAAAACATAGTACCTACAAAAGTGAATCTAATTTCAAAAGGAATTGATACTAACCCAACATGTTTTCTATGCATGAGCAAAAGGGAAACGAATGGCCA
CCTTATCTGGGAATGTAAGGTTGTTAAACAAGCATGGAAAAATTTTATTCCTAACACAACCAATTTCCTCCGTTCTTGTAGGTCCTGGAGGGATGGATTTGAATTTTGGA
AGGAGATTTCGCAGAGGCTCAACTTGGAAGAACTAAACATAGCAGACCTAATTATTTGGAATATCTGGGCGTATCAGAACAGAGTTATGGCAACAGGCATCTCAGCAGAC
CAAAAGAAAATCAACCAACTAATTGAAGCTAGCTTGGATGAACACTTCAGGGTTCAAAAGCAATACCAGGAAGATATGCAGTCGAAGAACCAGCGGAGTCAAAGCTCGTG
GTCCCCCCCCTCCTCAAGCTCTGAAATTGAACTCAAATGCCTCGTGAAGCAATTCGTCAAGTACAGGCGGACTGGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGATAGACGAGATTCTTAATAAGGATGGGGCCTGGAGGACAGGGGATAAAGAGATTGGGGATGTGGCTTCTCATCACTTTATAGCCATGTTCCAGTCCTCTAA
GCCCACAAACACTCTTATAAAAAAAGTGACTAGTGGGCTAAGCAAGAAACTCTCGGACAGCCAAAATAGGGATCTTTTAAGACCATTCTCATCCGAAGAGCTAAATGCAG
TGATCAAAGGGATGAACCCCTCAAAAGCCCCAGGAAGAGACGGGCTACAAGCAGCCTTCTTCCAAAAATACTGGGAGGTGGTAGGAGCGGACACCAAGCGGATTTGCCTC
CAAATTCTGAATGAAGGAGAAGATGTGAGACCTTTAAATAAGACTCTTATAACCCTTATCCCTAAAGTCCAAAATCCAAAAAGAATGCAAGACTTCATACCAATAAGTCT
ATGCAATGTCATCTATAAAATAGTAGCGAAATCCCTAGCTAACAGAATGAAAAAGATTATGGATGTGATCATCTCGCAATCCCAAGCAGCATTTGTGCCCGGAAGGCAAA
TCACTGATAATGTGGCGATTGGGTTTGAGTGCATCCACGGAATAAACAACAGAGTTAGAGGGAAGGAAGGTTGGATGGCCATGAAGCTTGATATGAGTAAAGCCTACGAT
CGAGTGGAATGGATCTTCCTAAGGAAGGTTATGGAGAGTCTTAGCTTCTGCGAAGAATGGACAACCAAGATAATGATGTGCGTGGAATCTGTGGAATACTCCATTTTGAT
CAATGGCATACCGCAGGAACCTTTCAAGCCAGAGAGAGGCCTTAGGCAAGGGGACCCTTTATCCCCTTACCTTTTCCTAATATGCGCTGAAGGATTATCAGGTCTTATAA
ACAGGAAAGAAACTCTCTCTAATTTCAAAGGTCTTAAAATTAATAAAATTTTCCCCTCTATATCTCACTTGTTTTTCGCTGATGATAGTTTAATATTTTGTAGGGCGAAG
GAGAATGACTGTAGGTGTATAAAAAATATCCTAAGGGAATATGAAGAAGCCTCGGGTCAAACCATTAAACTGGACAAGTCTGTTTTCATGGTTAGCAAGAATACAAAGGC
AGAGGAAGCAATCCAGCTACAACAAATCCTTGGTGTCAAGCAAGAGAATTCCATAGGACAGTACCTCGGAATGCCATCACAAAACATCAAAAGCAAAATGGTGTTGTTTA
AAAGGCTGAGGGAAAGGGTGGAAAAGGTGTTGCAATCTTGGAAGGAGAACCTATTCTCCTTGGGTGGAAAAGAAGTTCTTATTAAGGCTATAGCGCAAGCGATCCCCACA
TATACCATGTCTTGTTTTCGAATCCCCAAAGGGCTGTGTGATGAGATCAACCAGGCATGTGCGCGTTTTTGGTGGGGAGCCACGAAGGAAAGAAGGAAAATCCATTGGTT
AAGCTGGAAGAAGATGTGCAAAAGCAAGCTCTCAGGGGGTATGGGTTTCCGGGGCATTAGCTTGTTCAATCAAGCTATGTTAGCTAAACAAGGCTGGCGAATCTTAAGGA
ACCCAGATAGCCTCCTAGCAAAAGTACTAAGGGGGAGATGGAGAGTCGGCGATGGCAAACACATAAAGATTGGCCGAGATCCGTGGATATACAGGAAAGGAAACAGGTTT
TTAGTCTCTACACCGGATGGGCTAAAAGACGAAAGAGTTTGCAACCTTCTCAAAGAGAATGGGTCCTGGGATGAGGAGATGATTCGAGAGTGCTTCATGGTTTCCGAGGC
CGAAGAAATCCTCAAAATCCCTCTCGGTAGAAGAGGTTCCAGAGATGAAATTATATGGGCTGAAAATCCTAAAGGCGTTTTCTTGGTGAAATCGGCTTATCAACTCGTTG
TTAATCATGAAGACAACTCAGTGGAAGGATGTTCTGATCCCTCCAATACAAAGGCTATATGGAAACAACTTTGGAACTTAAAAATCATTCCAAGAGCAAAAATCTGTCTG
TGGAAAATGCTGAAAAACATAGTACCTACAAAAGTGAATCTAATTTCAAAAGGAATTGATACTAACCCAACATGTTTTCTATGCATGAGCAAAAGGGAAACGAATGGCCA
CCTTATCTGGGAATGTAAGGTTGTTAAACAAGCATGGAAAAATTTTATTCCTAACACAACCAATTTCCTCCGTTCTTGTAGGTCCTGGAGGGATGGATTTGAATTTTGGA
AGGAGATTTCGCAGAGGCTCAACTTGGAAGAACTAAACATAGCAGACCTAATTATTTGGAATATCTGGGCGTATCAGAACAGAGTTATGGCAACAGGCATCTCAGCAGAC
CAAAAGAAAATCAACCAACTAATTGAAGCTAGCTTGGATGAACACTTCAGGGTTCAAAAGCAATACCAGGAAGATATGCAGTCGAAGAACCAGCGGAGTCAAAGCTCGTG
GTCCCCCCCCTCCTCAAGCTCTGAAATTGAACTCAAATGCCTCGTGAAGCAATTCGTCAAGTACAGGCGGACTGGGTAG
Protein sequenceShow/hide protein sequence
MNKIDEILNKDGAWRTGDKEIGDVASHHFIAMFQSSKPTNTLIKKVTSGLSKKLSDSQNRDLLRPFSSEELNAVIKGMNPSKAPGRDGLQAAFFQKYWEVVGADTKRICL
QILNEGEDVRPLNKTLITLIPKVQNPKRMQDFIPISLCNVIYKIVAKSLANRMKKIMDVIISQSQAAFVPGRQITDNVAIGFECIHGINNRVRGKEGWMAMKLDMSKAYD
RVEWIFLRKVMESLSFCEEWTTKIMMCVESVEYSILINGIPQEPFKPERGLRQGDPLSPYLFLICAEGLSGLINRKETLSNFKGLKINKIFPSISHLFFADDSLIFCRAK
ENDCRCIKNILREYEEASGQTIKLDKSVFMVSKNTKAEEAIQLQQILGVKQENSIGQYLGMPSQNIKSKMVLFKRLRERVEKVLQSWKENLFSLGGKEVLIKAIAQAIPT
YTMSCFRIPKGLCDEINQACARFWWGATKERRKIHWLSWKKMCKSKLSGGMGFRGISLFNQAMLAKQGWRILRNPDSLLAKVLRGRWRVGDGKHIKIGRDPWIYRKGNRF
LVSTPDGLKDERVCNLLKENGSWDEEMIRECFMVSEAEEILKIPLGRRGSRDEIIWAENPKGVFLVKSAYQLVVNHEDNSVEGCSDPSNTKAIWKQLWNLKIIPRAKICL
WKMLKNIVPTKVNLISKGIDTNPTCFLCMSKRETNGHLIWECKVVKQAWKNFIPNTTNFLRSCRSWRDGFEFWKEISQRLNLEELNIADLIIWNIWAYQNRVMATGISAD
QKKINQLIEASLDEHFRVQKQYQEDMQSKNQRSQSSWSPPSSSSEIELKCLVKQFVKYRRTG