; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010376 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010376
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:46784810..46788937
RNA-Seq ExpressionLag0010376
SyntenyLag0010376
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]6.6e-27039.48Show/hide
Query:  SYSSFHIDASVNWNGKLWRFTGIYGQPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTW
        S S   +D S  ++G   RFTG YG P    R  TWELLRR+ N D S W++GGD N ILW+ E     +   + +  F + ++ C L D+GF G +FTW
Subjt:  SYSSFHIDASVNWNGKLWRFTGIYGQPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTW

Query:  CNRRPISDRIFERLDRYVGNEEFHRMFPHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQ
        CN R   D++++RLDR++ N+ F+ +FP                                                      A+G WS N +   + S  
Subjt:  CNRRPISDRIFERLDRYVGNEEFHRMFPHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQ

Query:  LSRCSARLKQWGRKTNVFLKSEICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMN
        +   S+ L+ WGR     L  +I  Q+ AI DAY +  P+DF+ IH LE  L  LLE EEI+W QRSRE+WL+WG                      I+N
Subjt:  LSRCSARLKQWGRKTNVFLKSEICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMN

Query:  ANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASC
        A                              +    I++ I   +++++N +LLAP+ + EIE  ++QMFPTKA GPDGFPA+FYQ+YW VVGP+T+ +C
Subjt:  ANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASC

Query:  LEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFA
        L  LN    +K WN T IALIPK++ PRS+ DFRPISLCNVSYKI++K + NRLK V+  VIS+AQSAFVP RAI+DNVIIGHECLH I     G  G A
Subjt:  LEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFA

Query:  ALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGS
        ALKLD+ KA+DRVE ++LE +MR+MGF + W+  I+ CISTV+F++ LNG   G   PSRG+RQGDPLSPYLF+LCAEGLS  +N  + +  ++G+    
Subjt:  ALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGS

Query:  LCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTER
           S++HL FADDSL+F ++  +E   ++R+L+ Y +ASGQC+N+ KSAL  SPN+H + Q  L  +L V +V   G YLGLPSHF R +G         
Subjt:  LCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTER

Query:  IKKVVQGWKGSFFSVGGKETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVW
                                                                       K+HW  W +MC PKE GGLNFRD+EGFNQAL+AK VW
Subjt:  IKKVVQGWKGSFFSVGGKETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVW

Query:  RVLTSPDLLLSRILKGRYFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQR
        R L  P+LL+S++LK +YF ++S+L+ S  +  SYFW+ FLWGR+LL  GLR RVG+G++++ + DPW+PR  TFKP+    F   +L + V+ FI    
Subjt:  RVLTSPDLLLSRILKGRYFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQR

Query:  EWDVEKLDAVMCPEDLEVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTN
         WDV  +    C ED +++LS+PI      D W+WHYDK GNYSV+SGYKL M  +  A S+ +      W  +W   +P+KIKIF+WR+    +PT  N
Subjt:  EWDVEKLDAVMCPEDLEVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTN

Query:  LILRGIDAAPTCRGCSSQPETIDHALVTCDRSLAFWNLLLPQVD-WQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKC
        L+LRGI   P C  C  + E+I HA   C R+   W  L P +       N SF +    L E L  +D  L  +  W +W DRN++  GK V  V+ KC
Subjt:  LILRGIDAAPTCRGCSSQPETIDHALVTCDRSLAFWNLLLPQVD-WQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKC

Query:  QWVVEYLDAFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLA
        +W+  +LD+   AQ  +   S    ++  P V  W+      +KLN DAAC  +      G + R S+  L AA+   +P  L PL AE R ++EGLK A
Subjt:  QWVVEYLDAFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLA

Query:  QLLGLARIVVESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGDA--LWFSQYPEWLLHFVSKDVNSS
               + VESD    I +I   + +  ++  WV  + AL   F  + F H  R CN  AH LAK       A   W   +P WLL  V +D  S+
Subjt:  QLLGLARIVVESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGDA--LWFSQYPEWLLHFVSKDVNSS

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]7.3e-26938.14Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQ---------GASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNG--KLW
        +G P T   +   ++   P V+FL ETK   + M++++  L Y   F+++ Q           +GGLCLLW +GIDV + ++S  HID  +   G    W
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQ---------GASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNG--KLW

Query:  RFTGIYGQPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYV
        RFTG+YG     LR  TW L+ ++   +   W++GGDFNEIL   EK GG  +    M  F   +  C L DL F G  FTW  +R   + I  RLDR++
Subjt:  RFTGIYGQPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYV

Query:  GNEEFHRMFPHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVF
            +  +FP  RV++L  + SDH PI L   RS +   R RR   F+F   W+  AEC +++    +    N     +  ++ +    L  W  +    
Subjt:  GNEEFHRMFPHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVF

Query:  LKSEICRQRQAIQDAY-RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFL
        LK+EI R R  +   Y + ++         LE KL+ LL  E  YW QRSR  WL  GD NT++FHHRAS RK+RN I G+ N +GVW   DS++E + L
Subjt:  LKSEICRQRQAIQDAY-RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFL

Query:  DYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTN
        DYF  +F T  P  ++  +  +    +V+  MNS+L+  F + EI + + QM P KAPGPDGF  +FYQ YW VVG   +A+    +N    ++  N T 
Subjt:  DYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTN

Query:  IALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSF
        + LIPKV+   +++  RPISLCNV YK+ +KVLANRLK +L+ +I+  QSAFVPGR I+DN ++  E  H+++RR+ G  G+ ALKLDM KAYDRVE  F
Subjt:  IALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSF

Query:  LEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVF
        +E +MR MGF   W+  I+ C++TV ++ LLNGE RG ++P+RGLRQGD +SPYLF+LCAEGLS  L+       + G+ I    PS++HLFFADDS VF
Subjt:  LEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVF

Query:  FKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGG
         KA   E   VK IL  YE ASGQ VN+ KS +  S N+    Q  L  + GV  V     YLGLP+    +K   F+ + E+ +  ++ WK    SV G
Subjt:  FKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGG

Query:  KETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGR
        KE +IKSV Q++PTY MSCF LPK +C+ + + +A FWWG S    K+HW +WDKMC+PKE GGL FR++E FNQALLAKQ WR+L  PD LL + LK +
Subjt:  KETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGR

Query:  YFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFIN-DQREWDVEKLDAVMCPEDL
        YFPN+  +  S     SY WR  + G+ LL  GLR +VG GT + ++ DPWIPR  +F+P    M  L  L+  V+D I+ D ++W V+ L+ +   +++
Subjt:  YFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFIN-DQREWDVEKLDAVMCPEDL

Query:  EVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTS-----ARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPT
        +++  IP+      D  +WH+DK G YSVKSGY +  C  + + SSH  TS        W  +W  R+  K++ F+WR  + I+PT  NL  R       
Subjt:  EVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTS-----ARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPT

Query:  CRGCSSQPETIDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAV--RGGKVVPVVDVKCQWVVEYLDAF
        C  C  + ET  H  + C+     W      +  + +  +S ++  + + + L+     ++ +  WA+W++RN +   GG   P+  V   W +  L  +
Subjt:  CRGCSSQPETIDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAV--RGGKVVPVVDVKCQWVVEYLDAF

Query:  LNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVV
                   S  GA+     + W  PP G +K+NVD A    +G  G+GVV R   G    A  + +P        EA A   GL +A   G  ++ +
Subjt:  LNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVV

Query:  ESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHA
        E+DC+ L   ++  +    E    + +       F  +  +H+ R  N+ A+ LA  A
Subjt:  ESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHA

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]2.5e-26937.48Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKLWRFTGIYGQPN
        +G+   FR +   +    P VLFL ETK  D +++R KNLL++S    +   G  GGL LLW D +DV + S ++ + D  + + +G  W F+ +YG P 
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDE-SAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF
           +  TW+L+RRL +      W++ GD NEI  +  K GG  +    M  F + L+ C L ++   GD FTW   R  +  + ERLD    N  +   F
Subjt:  DSLRFRTWELLRRLHNFDE-SAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF

Query:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQR
           ++S+L++  SDHR +   +D +     +  R + F+F   W++  EC +II  +   S      + L   L  C+  L  W  +    +K +I   +
Subjt:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQR

Query:  QAIQ--DAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFC
        + +   +     +P   + +H+ E  LD LL  EE YW QRSR +WL+ GD+NTK+FH +AS R   N I+ + + +G  V +   +  V  DYF+E+F 
Subjt:  QAIQ--DAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFC

Query:  TQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQ
                 + +L  I   +S + N  L   F   E+   ++ +   K+PG DG  A+FY   W++VG       L++LN   + + +N+T I LIPK++
Subjt:  TQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQ

Query:  SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRM
         P++++DFRPISLCNV+YKI++K+LA R KEVL SVISE QSAF+  R ITDN+++  E +H ++ R++G +GFAALKLDM KA+DRVE SFL  +M +M
Subjt:  SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRM

Query:  GFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEG
        GF    + LI+ C+ T  F+ L+NGE  G V+P RGLRQGDPLSPYLF++C+EGLS  L        + G+ +    PS++HL FADDSL+F +AN    
Subjt:  GFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEG

Query:  WHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSV
          +KR L+ Y +ASGQ +N  KS +  SPN     +     +LG+PI      YLGLP++  R K   F  + ERI K++  W    FS+GGKE L+K+V
Subjt:  WHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSV

Query:  AQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVL
         QAIPTYAMSCFRL    C+ I   +ARFWWGSS    K+HW++W  +C  K  GGL FR    FNQA LAKQ WR+  +P+ LLSR+LKGRY+  +  +
Subjt:  AQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVL

Query:  EVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPIG
            +   S  W+  +WGRELL  GL  ++GDGT V    D WIP    FKP+    F   S S+LV+D+I D REWD+E L     P D++ +L+IP+ 
Subjt:  EVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPIG

Query:  RSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDHA
         + + D W WHYD  G+Y+VKSGY L      +  SS S +   WW   W   LPSK++IF WR     LP   NL  R +  + TC  CS   E+I HA
Subjt:  RSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDHA

Query:  LVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLGA
        L +C  + + W     Q+D+         D  + L   L+  +        W +W+DRN     K +            YL  F + +       S + A
Subjt:  LVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLGA

Query:  SLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHGSLV
            +   W  P    +K+NVDAA    + +IG+GV+ R STGR+ AA  K +  +    E EA+A+  GL+ A+ L L    VE+DC  L+  + G   
Subjt:  SLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHGSLV

Query:  SFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHAC-WEGDALWFSQYPEWLLHFVSKD
                V ++    S F      HV R  N  AH LAK A   + D +W  + P  +   V  D
Subjt:  SFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHAC-WEGDALWFSQYPEWLLHFVSKD

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]2.7e-27137.5Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN
        +G P  FR +   ++   P +L L E+K    S++++++ L +     +  QG  GGL LLW   + V I ++S+ HID  +  N G  + FTG YG P+
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDE-SAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF
         S R  TW +L+R ++    + W+V GDFNEIL   +K+GG  +  + +  F + +  C L  L F+GD  TW ++   S  + ERLD    N+ +   F
Subjt:  DSLRFRTWELLRRLHNFDE-SAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF

Query:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSS-VAALSQQLSRCSARLKQWGRKTNVFLKSEICRQ
            V +L++  SDHR I++ +     +       S F+F   W++  +C  +I  N  WS  +S+ +A ++  +S  S++L+ W   T   LK +I   
Subjt:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSS-VAALSQQLSRCSARLKQWGRKTNVFLKSEICRQ

Query:  RQAIQDAY--RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF
        ++ +   +  RR +      + T E+ LD LL +EE YWHQRSR +WL+ GD NTK+FH  A+ R++ N+IR +++ NG    +   V  +  DY+ ++F
Subjt:  RQAIQDAY--RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF

Query:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV
         ++       ++ILD I   +     + + APF   ++   ++ M   K+PG DG   +FY +YW +VGP   A+ L +LN       +N T + LIPKV
Subjt:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV

Query:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRR
        + P  +  +RPISLCNV YK+V+K +  RLK  L  VISE QSAF+  R ITDN+++  E LH ++ R +G +GFAA+KLDM KA+DRVE  F+  +M +
Subjt:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRR

Query:  MGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTE
        MGF    V+LIL C+ +V ++ LLNG  +G V+PSRG+RQGDPLSPYLF++CAEGLS  L    +A  + G++I    PSVSHLFFADDS++F +AN   
Subjt:  MGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTE

Query:  GWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS
           + R L  Y +ASGQ +N  K  L  S N     Q     LLG+PI      YLGLPS   + K   F  +T++I K++  WK   FS GGKE L+K+
Subjt:  GWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS

Query:  VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSV
        V QAIPTYAMSCFRLP ++C  I   +ARFWWGS+     +HW++W+ +C  K  GGL FR+   FNQALLAKQ WR+L  P+ LLS +L+ RYF N + 
Subjt:  VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSV

Query:  LEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPI
        L     ++ S  WR  +WG+ELL  GLR RVG G  +    D W+P   TFKP F   F     + LV+D I + R WD+  L+      D+  VLSIP+
Subjt:  LEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPI

Query:  GRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDH
              D  +W+    G Y+VKSGY   +    +  S+ S +   WW+  W  +LP K++IF+W+ F   LP    L  R I  +P C  C+S  ET+ H
Subjt:  GRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDH

Query:  ALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLG
        AL +C R+ A W L    +D+Q     S  D  ++L  SLSS +  L+ V CW++W +RNA+  G  V        +   YL  F  A+       ++ G
Subjt:  ALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLG

Query:  ASLCPQVSS-------WQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLI
        A+   + SS       W  PP G +KLN DAA  +++  IG+G V R S G + AA  K    +    E EA  +   L       L+   +E+D   ++
Subjt:  ASLCPQVSS-------WQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLI

Query:  AMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACW-EGDALWFSQYPEWLL
          +  S          + N+  L S F +    HV R  N  AH LAK A   + D +W   +P  L+
Subjt:  AMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACW-EGDALWFSQYPEWLL

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]1.6e-27137.76Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN
        +G  R  R +SD +    P V+FL ET+     + RL+  L +   F +  +   GGL LLWN+ +++ IR++S  HIDA VN      WRFTG YG P 
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP
         + R  +W +LR L +     WV  GDFNEI    EK GG  +P   M +F   L+ CGL+DLGF G  FTWCNRR     ++ RLDR V   ++   FP
Subjt:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP

Query:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQRQ
          R+ +L  + SDH+P+ L  D    R  R ++   F+F A W+    C  ++ +  D +     V+ +  ++  C  +LK W +     ++  + R R+
Subjt:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQRQ

Query:  AIQDAY-RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQ
         +  A    M   + + +  L E++ RL++ EE  W QRS+  WLR+GD+NTK+FH RA+ER ++N I G+ NA+G W++ + ++  +   Y+  +F T 
Subjt:  AIQDAY-RRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQ

Query:  CPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSP
         P+ ++   +L  +QP VS  MNS+LL PF   E+   ++QM P  APGPDGFP +FY+++W+ VG +   + L +LN        N T I+LIPK +SP
Subjt:  CPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSP

Query:  RSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGF
        R V +FRPISLCNV YK++AKVLANRLK +L  +ISE+QSAF+  R ITDN++I HE LH+++ +  G+ GF ++KLDM KAYDRVE ++LE +M RMGF
Subjt:  RSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGF

Query:  ADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWH
           W++LI  CI +V ++++LNG+  GL+ P+RGLRQGDPLSPYLF+L  EGL      A  +  I GV +  + P +SHL FADDSLVF +A   E   
Subjt:  ADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWH

Query:  VKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSVAQ
        ++ IL  YE ASGQC+N  K+ LF S N H + QA + + LG+ +      YLGLPS   R+K   F  + ERI K ++GWK    S  G+E L+K+V Q
Subjt:  VKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSVAQ

Query:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLEV
        AIP Y MSCFRLPK +   I   I +FWWG    + K+HW SW+K+C PK+ GG+ F+++  FN ++LAKQ+WR+  + + L  R+ K ++FPN SV + 
Subjt:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLEV

Query:  SAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFIN-DQREWDVEKLDAVMCPEDLEVVLSIPIGR
          +   SY W+  +  R ++ LG   R+GDG SV++  D W+P+  T   I  P+  L    S+V D IN D+ +W  + +     P + +++  IP+  
Subjt:  SAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFIN-DQREWDVEKLDAVMCPEDLEVVLSIPIGR

Query:  SGSGDGWVWHYDKFGNYSVKSGYKLLMC--RRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDH
            D  VWH    G Y+ +S YKLL    R      S +  S+ +W G+W+ ++P K++ F+WRA    LPT+ NL  R +     C+ C +  E   H
Subjt:  SGSGDGWVWHYDKFGNYSVKSGYKLLMC--RRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDH

Query:  ALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQ-DFGLWCVGCWALWADRNAVR-GGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSS
        AL  C R +  W      +     F  S     + L  SL  + D  L C+  W +W  RN+VR    VV +  ++ +  +  LD F +AQ     G   
Subjt:  ALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQ-DFGLWCVGCWALWADRNAVR-GGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSS

Query:  LGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHG
           + C     W+ P     K+N D A        G+GVV R S G +  A  + +P+ +     EA A    L  A+  G++ +V E D   ++  +  
Subjt:  LGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHG

Query:  SLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFVSKD
          V   E G  + + + LA  F+   F H+ RV N+ AH LA+HA +  +  +W    PE +   V++D
Subjt:  SLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFVSKD

TrEMBL top hitse value%identityAlignment
A0A2N9FNH6 Reverse transcriptase domain-containing protein8.4e-27137.83Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNGKL-WRFTGIYGQPN
        +G P+T R +   + H  P V+FL ET+   +++++++  +   G   +   G  GGL LLW +GI V   S+SS HID ++   G   W FTG YG P+
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNGKL-WRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP
         S R  +W LLRRL  +D+  W+V GDFNE+L  SEK G +A+P   M  F  AL+DC L+D+G+ G+ FTW N R  +D ++ERLDR V + ++  +FP
Subjt:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP

Query:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQRQ
          ++ +++++ SDH    L +D S  +         F+F   W++   C +++        +   +  +SQ++  C   L++W R  N   K+ + + R 
Subjt:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQRQ

Query:  AIQDAYR-----RMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREI
        A  D+ +     R N   ++  +     L+ +L +EE YW QRS  +WLR GD+NT++FH  AS+RK++N I G+ +ANG      + +  +   YF  I
Subjt:  AIQDAYR-----RMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREI

Query:  FCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPK
        F T  PS I Q  ++  +   V+  MN  LLAPF   EI   + QM PTKAPGPDG  A+FYQ +W +VG     + LE L+  K +K  N T+I LIPK
Subjt:  FCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPK

Query:  VQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMR
        + SP  +  FRPISLCNV YKI++KVLANRLK VL  +IS+ QSAFVPGR ITDN+++  E LHY++ + KGR    A+KLDM KAYDRVE +FLE++M 
Subjt:  VQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMR

Query:  RMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGT
        ++GF   WV+LI+ C+++V ++V+LNGE  G + P+RG+RQGDPLSPYLF++CAEGL+  L  A  +  + G+ I    P +SHLFFADDSL+F +AN  
Subjt:  RMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGT

Query:  EGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIK
        E  ++  IL+ YEQASGQ +N+ K++LF S N H D +  + +LL      DLG YLGLP    R K   F ++ ++I K + GWKG   S  G+E LIK
Subjt:  EGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIK

Query:  SVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSS
        SVAQAIP Y MSCFR+P ++C  I   +++FWWG      K+HW+ W  MC  K  GG+ FRD+  FNQALLAKQ WR+L  P+ LL R+LK +YFPN S
Subjt:  SVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSS

Query:  VLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFI-NDQREWDVEKLDAVMCPEDLEVVLSI
         +E    +  S+ WR     R ++  G R R+G+GT V ++ D WI      K +      +   ++ VSD I ++ R+W+   +D++  P +   + +I
Subjt:  VLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFI-NDQREWDVEKLDAVMCPEDLEVVLSI

Query:  PIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSA--RWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPE
        P+ R    D  VW     G ++ +S Y L +  + + + S S  S    +W  LW   +PSKIK FMWRA   ILPT TNL  RG+  +  C  C+   E
Subjt:  PIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSA--RWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPE

Query:  TIDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAV----RGGKVVPVVDVKCQWVVEYLDAFLNAQECL
        T+ H+L  C+ +   W          +    S+ D    +   L + +  L+    W +WA RNA          + +      +V E+LD+   A    
Subjt:  TIDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAV----RGGKVVPVVDVKCQWVVEYLDAFLNAQECL

Query:  GGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNL
                  +     +W  PP    KLNV     + +  +G+  V R S G L AA  + +P +   L   AR++ + L   Q  G   I+VE     L
Subjt:  GGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNL

Query:  IAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFV
         ++I    V   E    +A +   ++ F  + F  + R CN  A  LA +A    + ++W  + P  LL  V
Subjt:  IAMIHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFV

A0A2N9HYE3 Reverse transcriptase domain-containing protein1.3e-27138.13Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN
        +G PRT + ++       P V+FL ET   +  + +L+  L +   F +N +   GGLCL W   + + ++S+S  HIDA VN N    WRFTG YG P 
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWN-GKLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP
           R  +W+LLRRL+   +  W   GDFNE++   EK G   +  + M  F   L++CG  DLGF G  FTW N RP  D  +ERLDR V   ++   FP
Subjt:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP

Query:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALS--QQLSRCSARLKQWGRKTNVFLKSEICRQ
          RVS+L    SDH+PI +  + + +      +   F+F   W     C  +I     W  + + V   +  Q++  C   L+ W R T   + S I   
Subjt:  HFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALS--QQLSRCSARLKQWGRKTNVFLKSEICRQ

Query:  RQAIQDAYRR-MNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFC
         + ++ A    M   D   ++ L+ +L  LL +EE  W QRSR  WL  GD+NT++FH RA++RKR+N +  +   +G W  + ++V  +F++Y++ +F 
Subjt:  RQAIQDAYRR-MNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFC

Query:  TQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQ
        T  P  ++Q  +++ IQ +V+A+MNS+L++ F  VE+E  ++QM P KAPGPD  P +FYQ YW ++G    A+ L  LN  + +K  N T+I LIPKVQ
Subjt:  TQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQ

Query:  SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRM
        +P  V +FRPISLCNV YK+++KVLANRLK +L S++ E+QSAF+PGR ITDN+++  E LH++Q +  G+ G  ALKLDM KAYDRVE  +L+ +M +M
Subjt:  SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRM

Query:  GFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEG
        GF + WV L+++CISTV +++L+NGE  G + PSRGLRQGDPLSPYLF+LCAEGL   +    ++  + GV I    P ++HLFFADDSL+F KA   + 
Subjt:  GFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEG

Query:  WHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSV
          ++ IL++YEQASGQ VN  K+ LF S +    +Q  + ++LGVP +     YLGLPS   RAK   F ++ ER+   ++GWK    S  G+E LIKSV
Subjt:  WHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKSV

Query:  AQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVL
        AQAIP YAMSCFRLP  + + I   I RFWWG    + KMHW  W  +C  K  GG+  RD+  FN+ALLAKQVWR+L +P  L S++ K +YFP+ S+L
Subjt:  AQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVL

Query:  EVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPIG
        E    +  SY W+  +  R+L+  G   RVG G+ +R++ D W+P +     I  P  P  S+S +     ++ R W  E +  +  P +  V+L IP+ 
Subjt:  EVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPIG

Query:  RSGSGDGWVWHYDKFGNYSVKSGYKLLM--CRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETID
             D  VW   K G Y+V+SGY LL+  C + E  SS +    + W  +W+  +P KI+ F+WRA    LPT +NL  R I A P+C  C++Q E+  
Subjt:  RSGSGDGWVWHYDKFGNYSVKSGYKLLM--CRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETID

Query:  HALVTCDRSLAFWNLLLPQVDWQVNFNH----SFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGG
        HAL  C      W      + W           F D      ++LS+ +  L+ +  W +W  RN +R   +   VD   Q +   LD  L  Q      
Subjt:  HALVTCDRSLAFWNLLLPQVDWQVNFNH----SFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGG

Query:  SSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAM
                  + ++W+ P  G  K+N D A   ++   GVGV+ R   G +  +    +P        EA A    ++ A+ LG   I +E D   ++  
Subjt:  SSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAM

Query:  IHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHA
        +          G  + ++   A   Q V F H+ R  NA AH LAK A
Subjt:  IHGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHA

A0A2N9I509 Uncharacterized protein2.8e-27437.93Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKLWRFTGIYGQPN
        +G P T   + + +    P ++FL ET+   + +  L+  L   GCF ++  G  GGL LLW+D + + I+SYS+ HIDA V   NG  WR T  YG P 
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP
         +LR  TW LLR+L +  +  W+V GDFNEI+   E+ G  A+ +  M  F   L  C L DLGF G  FTW NRR  S  +  RLDR V N+ +  +FP
Subjt:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP

Query:  HFRVSNLNWACSDHRPIELCLD--RSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQ
           V N   + SDH  I + L    S    +  RR   F+F   W+R A C D+I          +++  LS+++  C   L +W   ++V   S I   
Subjt:  HFRVSNLNWACSDHRPIELCLD--RSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQ

Query:  RQA-IQDAYRR-MNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF
        ++A ++D   +     D   I+ L ++L+ LLE+EEI W Q+SR +WLR GD+NTK+FH  AS+R++ N I G+ +++  W +  +++  + +DYF  +F
Subjt:  RQA-IQDAYRR-MNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF

Query:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV
         +  P+ +++  ++ H+  +V+  MNS LL PF   EI+  + QM P+KAPGPDG  A+F+Q +W VVG     + L+ L+    +K  N T+I LIPKV
Subjt:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV

Query:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRR
        ++P  +  FRPISLCNV YKIV+KVL NR+K +L  +IS++QSAFVPGR ITDNVI+  E LHY++    G     A KLDM KAYDRVE S+L  ++ +
Subjt:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRR

Query:  MGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTE
        +GFAD WV L+++C+++  ++V++NGE +G V+PSRGLRQGDPLSPYLF++CAEGLS  +  A     I G+ I    P +SHLFFADDS++F +AN ++
Subjt:  MGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTE

Query:  GWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS
           ++ IL  YE+ASGQ VN  K+A+F S N  ++ +A + +L G  +      YLGLP    R K   F ++ +RI + +QGWK    S  GKE LIK+
Subjt:  GWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS

Query:  VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSV
        V QA+PTYAMSCF+ P  +C  I+     FWWG      K+HW S  K+  PK  GG+ FRD++ FN+ALLA+Q WR+L  P  L+ R LK +YFPNSS 
Subjt:  VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSV

Query:  LEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPI
        LE +   + S+ WR     R++L +GLR RVG+G  ++++ D W+P   TFK +  P   LS  +++          W ++ L    CP D E++  IP+
Subjt:  LEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIPI

Query:  GRSGSGDGWVWHYDKFGNYSVKSGYKLLMC---RRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPET
              D  +W   K G +SV+S Y +L+    R   + SS     +++W+ LW+ ++P K+K+F+W+A + I+PT T L  +G+ ++ +C  C  +PET
Subjt:  GRSGSGDGWVWHYDKFGNYSVKSGYKLLMC---RRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPET

Query:  IDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSS
          H L  C+ +   W      +  +V+   +F++       SL S    +     WALW  RN       VP VD  C         FL ++  L G S 
Subjt:  IDHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSS

Query:  SLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGP-LEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMI
           AS+      W+ P  G  KLNV   C     R+GVG++ R S G + AASM+   +S G  L+  A   +  +K A  +GL ++ VE  C+ L+ +I
Subjt:  SLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGP-LEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMI

Query:  HGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFVSKD
                  G+ + ++      FQ + F  + + CN  A  LA  A       +W   +P  +   V  D
Subjt:  HGSLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGD-ALWFSQYPEWLLHFVSKD

A0A7N2R0C3 Reverse transcriptase domain-containing protein6.6e-28439.52Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKL-WRFTGIYGQP
        MG+    RA++D + +  P ++FL ETKAS R +  L+  L  +   ++ S G SGGL +LW +G+DV ++S S+ HID  V   NG + WR TG YG P
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNW-NGKL-WRFTGIYGQP

Query:  NDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF
        +  +R  +W+LL  L       WVV GDFNEIL   EKLG + +    M  F   L++CGL DLGF G  FTWCN R    R   RLDR V NEE+  +F
Subjt:  NDSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMF

Query:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRK----TNVFLKSEI
        P  +V + + A SDH  + L + R   R    RR   F F   W R   CR++I    D  G N  +  +  +L  C  +L+ W R+     N  LK + 
Subjt:  PHFRVSNLNWACSDHRPIELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRK----TNVFLKSEI

Query:  CRQRQAIQDAYRRMNPIDFSF--IHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFR
        CR +Q        +N +  S   +  L+++++ ++  EEI W+QRSR  W+++GD+NT++FH  A+ R+R+N+I GI+++ G W +++ EVE + L+YF+
Subjt:  CRQRQAIQDAYRRMNPIDFSF--IHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFR

Query:  EIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALI
        EI+ +  P+  +    L  +   V+ DMN  LL  F + E+ + + QM PTK+PGPDG   +F+Q YWDVVGPQ V S +  L       G N+T I LI
Subjt:  EIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALI

Query:  PKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHL
        PKV+ P+ + ++RPISLCNV YK+V+KVLANRLK VL  V+ EAQSAFVPGR ITDNV++  E +H I +R KG+EG  A+KLDM KAYDRVE  +LE +
Subjt:  PKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHL

Query:  MRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKAN
        MRRMGF + W+ L++ C++TV F+VL+NGE RG +VP+RGLRQGDP+SPYLF+LCAEGLS  L    +   +SGV+I    P +SHL FADD +VF KA+
Subjt:  MRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKAN

Query:  GTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETL
          EG  V +IL +YE+ SGQ +N  K++LF S N   + +  +  L G  I+H    YLGLP    R K   F ++ +++ + +  WKG   S  G+E L
Subjt:  GTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETL

Query:  IKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPN
        IK+VAQA PTY M+CF LP S+C  +   +  FWWG      K+ W +W K+C PK  GG+ F+D++ FN ALLAKQ WR+  +P  L  R+LK RYFP+
Subjt:  IKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPN

Query:  SSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQR-EWDVEKLDAVMCPEDLEVVL
        S+ +E    N  SY WR  L  RE++  G R  +G+G  VR++ D W+P   +FK +  P  P      +V   IN Q   WD   +  V  P + E +L
Subjt:  SSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQR-EWDVEKLDAVMCPEDLEVVL

Query:  SIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARS-------SHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCR
        SIPI  S   D   W +   G ++V S YK+      E RS       S      ++W  LW    PSK+K F+WRA + ILPT   L LR +     C 
Subjt:  SIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARS-------SHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCR

Query:  GCSSQPETIDHALVTCDRSLAFW---NLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVR---GGKVVPVVDVKCQWVVEYLD
        G   + E+  H L  C+ + A W    L+LP++    N +  F +    L E     D+  +    W +W +RN+++    GK   V+  + + +VE   
Subjt:  GCSSQPETIDHALVTCDRSLAFW---NLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVR---GGKVVPVVDVKCQWVVEYLD

Query:  AFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARI
        +        G     L   +  ++ +W+ P  GW K NVD A  ++    G+GVV R   G++  A  K L + LG +E EA+A  EGL+LA  LGL ++
Subjt:  AFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARI

Query:  VVESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVV----FQHVGRVCNAPAHCLAKHA-CWEGDALWFSQYPEWLLHFVSKDV
        ++E D   +   + G  +        +  ++A ASR++Q V      HV R  N  AH +A++A C     +W    P  +   V  DV
Subjt:  VVESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVV----FQHVGRVCNAPAHCLAKHA-CWEGDALWFSQYPEWLLHFVSKDV

A0A803PV25 Uncharacterized protein3.3e-27539Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNG-KLWRFTGIYGQPN
        +G P T RA+   +  L+P ++FL E+K +     R+ ++L Y   ++++  G SGGL L+W   I V + S S  HI A+V   G   W FTG YG P+
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNG-KLWRFTGIYGQPN

Query:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP
           R  +W+LLR L    +  W+  GDFNEI+  SEK+GG  +    M  F   L+DC   D        TWCN    + RI ERLDR +  EE+   F 
Subjt:  DSLRFRTWELLRRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFP

Query:  HFRVSNLNWACSDHRP--IELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSS--VAALSQQLSRCSARLKQWGRKTNVFLKSEIC
           +S L+W  SDHR   +++ +   G +  + +R S F F   W +  EC +I+  +  WS  N S  V +   ++++C   L+ W RK    L SE+ 
Subjt:  HFRVSNLNWACSDHRP--IELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSS--VAALSQQLSRCSARLKQWGRKTNVFLKSEIC

Query:  RQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF
        + ++A+ +      P  +  I  +E KL+ LLE++E YW QRSR  WL+WGD+NTK+FHH+AS R+++NEI+G+ +  GVW D    V  +  DY+ ++F
Subjt:  RQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIF

Query:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV
         +   +    N +L  +QP VS+ MN+ LLA F + E+ R V++M PTKAPG DG PA+FYQ +W  +    VA  L +LN    ++  N T +ALIPKV
Subjt:  CTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKV

Query:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGF-AALKLDMCKAYDRVELSFLEHLMR
          P+ + +FRPISLCNV YKIV+K LANR++  L SV+S++QSAF+ GR I DN I+G+E LH + R+ + R G   ALKLDM KAYDRVE  FLE +M 
Subjt:  QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSKGREGF-AALKLDMCKAYDRVELSFLEHLMR

Query:  RMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGT
        ++G++  WV  I++C+++VQF+ ++NGE +G V+P RGLRQGDPLSP+LF+LCAE  S  +  A     + GV  G     VSHLFFADDSLVF  A   
Subjt:  RMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGT

Query:  EGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIK
        E    + +L +Y  ASGQ VN+HKS +    ++ +  +  L + +GV +V + G YLGLPS   R K   F  +  ++   ++GWKGSFFS  GKE LIK
Subjt:  EGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIK

Query:  SVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSS
        ++ QAIPTY MSCFRLPK     I    ARFWWGSS    K+HW  W  +C  KE GGL FRD+  FNQALLAKQ+WR +  P+ L S++LK  Y+PN  
Subjt:  SVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSS

Query:  VLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIP
        VLE    N  S+ WR  +WG++++  G R R+G+G SVR+  DPW+PR VTFK    P  P  +L   V D      EWD E + AV  P D E++L + 
Subjt:  VLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVVLSIP

Query:  IGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQP-ETI
               D  +WHY K G YSV+SGY++     +    S++  + RWW  LW  ++P K+K F+W+     +PT + L  R +   P C  CSS   E +
Subjt:  IGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQP-ETI

Query:  DHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSS
         HAL +C  +   W +       +           + +  SL+ +DF  + V  W LW  RN+V  G   PV     +W  ++L  F ++      G++ 
Subjt:  DHALVTCDRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSS

Query:  LGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHG
          A        W  P  G   +NVDA     +G   V  V R   GR+  A+++V+   L PL AE  A+ +G+K      L    VE+DC   + ++  
Subjt:  LGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHG

Query:  SLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACW-EGDALWFSQYP
             ++    V  +  L    +      V R  N  AH LA  A   +  A+W    P
Subjt:  SLVSFKEDGIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACW-EGDALWFSQYP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-4022.91Show/hide
Query:  ICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSR---------------ENWLRWGDKNTKWFHHRAS-----------ERKRRNEIR
        +CR +    +AY+R    + S I TL  +L  L ++E+ +     R               +  L+  +++  WF  R +           +++ +N+I 
Subjt:  ICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSR---------------ENWLRWGDKNTKWFHHRAS-----------ERKRRNEIR

Query:  GIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILD-HIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQ
         I N  G      +E++    +Y++ ++  +  +  + +  LD +  P ++ +    L  P    EI  ++  +   K+PGPDGF A FYQ Y + + P 
Subjt:  GIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILD-HIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQ

Query:  TVASCLEILNGRKSVKGWNQTNIALIPKV-QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSK
         +     I         + + +I LIPK  +      +FRPISL N+  KI+ K+LANR+++ ++ +I   Q  F+PG     N+      + +I R   
Subjt:  TVASCLEILNGRKSVKGWNQTNIALIPKV-QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSK

Query:  GREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYIS
          +    + +D  KA+D+++  F+   + ++G    ++ +I          ++LNG+         G RQG PLSP LF +  E L+ A+      + I 
Subjt:  GREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYIS

Query:  GVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLC-
        G+++G     V    FADD +V+ +       ++ ++++ + + SG  +N  KS  F+  N       ++G  L   I      YLG+       K L  
Subjt:  GVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLC-

Query:  --FRKVTERIKKVVQGWKGSFFSVGGKETLIKSVAQAIPTYAMSC--FRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEG
          ++ + + IK+    WK    S  G+  ++K        Y  +    +LP +    + K   +F W    AR      +   +    + GG+   D + 
Subjt:  --FRKVTERIKKVVQGWKGSFFSVGGKETLIKSVAQAIPTYAMSC--FRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEG

Query:  FNQALLAKQVW
        + +A + K  W
Subjt:  FNQALLAKQVW

P08548 LINE-1 reverse transcriptase homolog1.2e-4322.21Show/hide
Query:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGI---DVLIRSYSSFH-IDASVNWNGKLWRFTGIYG
        +  P     ++D +  LKP +  + E+  + +   RLK +  +S  F  N +    G+ +L+ D I      IR     H I    N          IY 
Subjt:  MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGI---DVLIRSYSSFH-IDASVNWNGKLWRFTGIYG

Query:  QPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEIL--WDSEKLGGVAKPVAHMTEFFSALNDCGL-RDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEE
         PN +      E L  + N   S  +V GDFN  L   D      ++K +  +      L+   + R    +   +T+ +    +   + ++D  +G++ 
Subjt:  QPNDSLRFRTWELLRRLHNFDESAWVVGGDFNEIL--WDSEKLGGVAKPVAHMTEFFSALNDCGL-RDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEE

Query:  FHRMFPHFRVSNLNWACSDHR--PIELCLDRSGVRVDRFRRYSGFKFNAQWV---RHAECRDIIVANGDWSGN-----NSSVAALSQQLSRCSARLKQWG
            F   ++  +    SDH    +EL  +R+     +  + +       WV      E    +  N +   N     +++ A L  +     A LK+  
Subjt:  FHRMFPHFRVSNLNWACSDHR--PIELCLDRSGVRVDRFRRYSGFKFNAQWV---RHAECRDIIVANGDWSGN-----NSSVAALSQQLSRCSARLKQWG

Query:  RKTNVFLKSEICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEV
        R+    L   +   +Q  ++ +    P     I  +  +L+ +  +  I    +S+  +    +K  K   +   +++ ++ I  I N N       SE+
Subjt:  RKTNVFLKSEICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEV

Query:  EMVFLDYFREIFCTQCPS--TIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSV
        + +  +Y+++++  +  +   I Q +   H+ P +S      L  P    EI   +Q +   K+PGPDGF + FYQ++ + + P  +     I       
Subjt:  EMVFLDYFREIFCTQCPS--TIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSV

Query:  KGWNQTNIALIPKV-QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSK-GREGFAALKLDMCK
          + + NI LIPK  + P    ++RPISL N+  KI+ K+L NR+++ ++ +I   Q  F+PG     N+    + ++ IQ  +K   +    L +D  K
Subjt:  KGWNQTNIALIPKV-QSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLHYIQRRSK-GREGFAALKLDMCK

Query:  AYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHL
        A+D ++  F+   ++++G    ++ LI    S     ++LNG          G RQG PLSP LF +  E L+ A+      + I G+ IGS    +   
Subjt:  AYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHL

Query:  FFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLG--LPSHFPRAKGLCFRKVTERIKKVVQ
         FADD +V+ +        +  ++ EY   SG  +N HKS  FI  N ++ ++  +   +   +V     YLG  L           +  + + I + V 
Subjt:  FFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLG--LPSHFPRAKGLCFRKVTERIKKVVQ

Query:  GWKGSFFSVGGKETLIKS--VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVW
         WK    S  G+  ++K   + +AI  +     + P S  + + K I  F W     +P++   +   +    + GG+   D+  + ++++ K  W
Subjt:  GWKGSFFSVGGKETLIKS--VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVW

P11369 LINE-1 retrotransposable element ORF2 protein1.2e-4326.31Show/hide
Query:  IRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQ-PIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVG
        I  I N  G       E++     +++ ++ T+  +  + +  LD  Q P ++ D    L +P    EIE V+  +   K+PGPDGF A FYQ++ + + 
Subjt:  IRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQ-PIVSADMNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVG

Query:  PQTVASCLEILNGRKSVKG-----WNQTNIALIPKVQ-SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLH
        P      L  L  +  V+G     + +  I LIPK Q  P  + +FRPISL N+  KI+ K+LANR++E ++++I   Q  F+PG     N+      +H
Subjt:  PQTVASCLEILNGRKSVKG-----WNQTNIALIPKVQ-SPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNVIIGHECLH

Query:  YIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSA
        YI +     +    + LD  KA+D+++  F+  ++ R G    ++++I    S     + +NGE    +    G RQG PLSPYLF +  E L+ A+   
Subjt:  YIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSA

Query:  HVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLG--LPSH
           + I G++IG     +S L  ADD +V+          +  ++N + +  G  +N +KS  F+    +  ++  +       IV +   YLG  L   
Subjt:  HVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLG--LPSH

Query:  FPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS--VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLN
                F+ + + IK+ ++ WK    S  G+  ++K   + +AI  +     ++P      +   I +F W +   R        DK    +  GG+ 
Subjt:  FPRAKGLCFRKVTERIKKVVQGWKGSFFSVGGKETLIKS--VAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLN

Query:  FRDIEGFNQALLAKQVW
          D++ + +A++ K  W
Subjt:  FRDIEGFNQALLAKQVW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.8e-3624.72Show/hide
Query:  RSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERV
        RSR   L   D+ +++F+    ++  R +I  +   +G  ++    +      +++ +F     S      + D + P+VS     +L  P    E+ + 
Subjt:  RSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERV

Query:  VQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEA
        ++ M   K+PG DG    F+Q +WD +GP       E     +      +  ++L+PK    R ++++RP+SL +  YKIVAK ++ RLK VL  VI   
Subjt:  VQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEA

Query:  QSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQG
        QS  VPGR I DNV +  + LH+ +R        A L LD  KA+DRV+  +L   ++   F   +V  +    ++ +  V +N      +   RG+RQG
Subjt:  QSAFVPGRAITDNVIIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQG

Query:  DPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSD--SQAL
         PLS  L+ L  E     L      + ++G+ +      V    +ADD ++    +  +    +     Y  AS   +N+ KS+  +  ++  D    A 
Subjt:  DPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSD--SQAL

Query:  LGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKG--SFFSVGGKETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLA
                I+  LG YL     +P ++   F ++ E +   +  WKG     S+ G+  +I  +  +   Y + C    +     I + +  F W     
Subjt:  LGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTERIKKVVQGWKG--SFFSVGGKETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLA

Query:  RPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVL
            HW S     LP + GG     I         +Q+ R L
Subjt:  RPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVL

P93295 Uncharacterized mitochondrial protein AtMg003104.3e-3848.03Show/hide
Query:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKE-VGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLE
        A+P YAMSCFRL K +C+ +T  +  FWW S   + K+ W +W K+C  KE  GGL FRD+  FNQALLAKQ +R++  P  LLSR+L+ RYFP+SS++E
Subjt:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKE-VGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLE

Query:  VSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPI
         S     SY WR  + GRELL  GL + +GDG   +++ D WI  E    P+
Subjt:  VSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.9e-2927.67Show/hide
Query:  VVGGDFNEILWDSEKLG--GVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFPH----FRVSNLNWACSDHRP
        ++ GDF++I   S+       + P+  + EF + L D  L D+   G  +TW N +   + I  +LDR + N ++   FP     F +S +    SDH P
Subjt:  VVGGDFNEILWDSEKLG--GVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFPH----FRVSNLNWACSDHRP

Query:  IELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQL---SRCSARLKQWGRKTNVFLKSEICRQRQAIQDAYRRMNPID
          + L+    R  +  RY    F+           + VA  +     S + +L + L    +C   L + G         E     ++IQ +    NP D
Subjt:  IELCLDRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQL---SRCSARLKQWGRKTNVFLKSEICRQRQAIQDAYRRMNPID

Query:  --FSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQ----CPSTIQQN
          F   H   +K +      E ++ Q+SR  WL+ GD NT++FH      + +N I+ +   + V V++ ++V+ + + Y+  +  +      P ++Q+ 
Subjt:  --FSFIHTLEEKLDRLLEEEEIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQ----CPSTIQQN

Query:  VILDHIQPIVSAD-MNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFR
          +  I P    D + S+L A     EI   V  M   KAPGPD F A F+   W VV   T+A+  E       +K +N T I LIPKV     +  FR
Subjt:  VILDHIQPIVSAD-MNSKLLAPFCQVEIERVVQQMFPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFR

Query:  PISLCNVSYKIV
        P+S C V YKI+
Subjt:  PISLCNVSYKIV

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-4027.9Show/hide
Query:  LKGRYFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQRE-------WDVEK
        +K RYF + S+L+       SY W   L G  LL  G R  +GDG ++R+  D  +           P  PL++  +     IN+  E       WD  K
Subjt:  LKGRYFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQRE-------WDVEK

Query:  LDAVMCPEDLEVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPT--SARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILR
        +   +   D   +  I + +S   D  +W+Y+  G Y+V+SGY LL         + +P   S    T +W   +  K+K F+WRA    L T   L  R
Subjt:  LDAVMCPEDLEVVLSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPT--SARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILR

Query:  GIDAAPTCRGCSSQPETIDHALVTCDRSLAFWNL----LLPQVDWQVNFNHSFQDRCIVLQESLSSQDF----GLWCVGCWALWADRNAV-----RGGKV
        G+   P+C  C  + E+I+HAL TC  +   W L    L+       +F  +  +    +Q++  S DF     +W +  W +W  RN V     R    
Subjt:  GIDAAPTCRGCSSQPETIDHALVTCDRSLAFWNL----LLPQVDWQVNFNHSFQDRCIVLQESLSSQDF----GLWCVGCWALWADRNAV-----RGGKV

Query:  VPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGR-LHAASMKVLPVSLGPLEAEAR
          V+  K +   ++L+A  + ++     + S    +      W+ PP  +VK N DA    QK     G + R   G  +   SMK+   S  PLEAE +
Subjt:  VPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGR-LHAASMKVLPVSLGPLEAEAR

Query:  AVIEGLKLAQLLGLARIVVESDCSNLIAMI-----HGSLVSFKED-GIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGDALWFS-QYPEWL
        A++  L+   + G  ++ +E DC  LI +I     H SL +  ED   W       A++F  + F  + R  N  AH LAK+ C        S   P WL
Subjt:  AVIEGLKLAQLLGLARIVVESDCSNLIAMI-----HGSLVSFKED-GIWVANVVALASRFQQVVFQHVGRVCNAPAHCLAKHACWEGDALWFS-QYPEWL

Query:  LHFVSKDVN
          +   D N
Subjt:  LHFVSKDVN

AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-6128.64Show/hide
Query:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLEV
        A+PTY M+CF LPK+VC+ I   +A FWW +      MHW++WD +   K  GG+ F+DIE FN ALL KQ+WR+L+ P+ L++++ K RYF  S  L  
Subjt:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLEV

Query:  SAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWI---PREVTFKPIFGPMFPLSSLSSL--VSDFINDQ-REWDVEKLDAVMCPEDLEVVLS
           +  S+ W+     +E+L  G R  VG+G  + ++   W+   P     +    P    +S+SS+  VSD I++  REW  + ++ +    + +++  
Subjt:  SAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWI---PREVTFKPIFGPMFPLSSLSSL--VSDFINDQ-REWDVEKLDAVMCPEDLEVVLS

Query:  IPIGRSGSGDGWVWHYDKFGNYSVKSGYKLL---MCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQ
        +  G     D + W Y   G+Y+VKSGY +L   + +R   +    P+    +  +W  +   KI+ F+W+     LP    L  R +     C  C S 
Subjt:  IPIGRSGSGDGWVWHYDKFGNYSVKSGYKLL---MCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQ

Query:  PETIDHALVTCDRSLAFWNLLLPQV----DW----QVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRN-AVRGGKVVPVVDV--KCQWVVEYLD
         ET++H L  C  +   W +    +    +W     VN    F       Q   +SQ    W +  W LW +RN  V  G+     +V  + +  +E   
Subjt:  PETIDHALVTCDRSLAFWNLLLPQV----DW----QVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRN-AVRGGKVVPVVDV--KCQWVVEYLD

Query:  AFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARI
            A+ C  G    +  S C     W+ PP  WVK N DA   R   R G+G V R   G +     + LP     LEAE  A+   +          +
Subjt:  AFLNAQECLGGGSSSLGASLCPQVSSWQRPPCGWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARI

Query:  VVESDCSNLIAMIHGSLVSFKEDGIW------VANVVALASRFQQVVFQHVGRVCNAPAHCLAKHAC--WEGDALWFSQYPEW
        + ESD   LI +++        D IW      + ++  L S+F +V F  + R  N  A  +A+ +      D   +S  P W
Subjt:  VVESDCSNLIAMIHGSLVSFKEDGIW------VANVVALASRFQQVVFQHVGRVCNAPAHCLAKHAC--WEGDALWFSQYPEW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.1e-3948.03Show/hide
Query:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKE-VGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLE
        A+P YAMSCFRL K +C+ +T  +  FWW S   + K+ W +W K+C  KE  GGL FRD+  FNQALLAKQ +R++  P  LLSR+L+ RYFP+SS++E
Subjt:  AIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKE-VGGLNFRDIEGFNQALLAKQVWRVLTSPDLLLSRILKGRYFPNSSVLE

Query:  VSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPI
         S     SY WR  + GRELL  GL + +GDG   +++ D WI  E    P+
Subjt:  VSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)6.9e-1552.94Show/hide
Query:  LLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDS
        ++NG  +GLV PSRGLRQGDPLSPYLF+LC E LS     A     + G+R+ +  P ++HL FADD+
Subjt:  LLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHVARYISGVRIGSLCPSVSHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACCCCTAGAACATTTCGTGCTGTAAGTGACGCGTTAAGTCACCTTAAACCCCATGTTCTGTTTTTATGTGAAACTAAAGCTAGTGATAGATCAATGAATCGTTT
AAAGAATCTTTTACATTATTCTGGCTGTTTTTCAATAAATAGTCAGGGTGCTAGTGGTGGTTTATGTTTGTTGTGGAATGATGGGATTGATGTTCTAATTCGCTCGTATT
CGTCTTTTCATATTGATGCTTCGGTAAATTGGAATGGTAAATTATGGAGATTCACGGGTATTTATGGTCAGCCGAATGATTCTCTTCGTTTTCGAACATGGGAACTTCTG
CGCCGTTTGCATAATTTTGATGAGTCTGCCTGGGTCGTTGGTGGCGATTTCAATGAAATTTTATGGGATTCAGAAAAATTGGGAGGCGTGGCTAAACCGGTGGCTCACAT
GACGGAGTTCTTCTCAGCATTAAATGATTGTGGGCTTCGGGATTTGGGTTTCGATGGGGATGTGTTTACTTGGTGTAATCGACGACCTATATCGGATCGTATTTTTGAAC
GTTTGGATAGATATGTTGGGAATGAAGAGTTTCACCGTATGTTTCCTCATTTTCGAGTATCTAATTTGAATTGGGCTTGTTCTGATCATAGGCCCATCGAGTTATGTTTG
GACAGGTCTGGGGTTCGGGTTGACCGATTTCGAAGGTATTCTGGCTTCAAATTTAACGCTCAGTGGGTTAGGCATGCTGAGTGTCGTGACATCATTGTTGCAAACGGGGA
TTGGTCTGGTAACAATTCTTCGGTGGCGGCATTGAGTCAACAATTGTCTCGTTGTTCGGCGCGGTTGAAGCAGTGGGGGCGGAAAACTAATGTGTTTTTAAAGTCTGAAA
TCTGTCGGCAGCGACAGGCTATTCAGGATGCCTATCGTCGTATGAATCCTATTGATTTTTCTTTTATTCATACTTTGGAGGAGAAGCTTGACAGGCTACTTGAGGAGGAA
GAAATTTATTGGCATCAGAGATCTAGAGAGAATTGGCTTAGATGGGGGGACAAGAATACGAAATGGTTCCATCATAGGGCATCGGAAAGGAAAAGACGCAATGAGATTCG
GGGAATTATGAATGCTAATGGTGTTTGGGTTGATTCGGATTCTGAGGTGGAGATGGTGTTTCTGGATTACTTTCGAGAGATTTTTTGCACTCAGTGTCCATCCACAATTC
AGCAAAATGTTATCCTTGATCATATCCAACCTATTGTTTCTGCTGATATGAATTCTAAATTGTTAGCACCTTTTTGTCAAGTAGAGATTGAGAGGGTTGTACAGCAGATG
TTCCCTACCAAAGCGCCAGGGCCGGATGGGTTTCCGGCTGTGTTTTATCAGTCCTATTGGGATGTTGTTGGGCCTCAGACTGTTGCGAGTTGCCTTGAGATTTTGAATGG
TCGCAAGTCGGTGAAGGGGTGGAATCAGACTAATATTGCCCTTATCCCTAAGGTTCAGTCTCCACGGTCAGTGCGGGATTTTCGCCCTATTAGTCTATGTAATGTGTCAT
ATAAAATTGTAGCGAAGGTGTTGGCTAATCGTTTGAAGGAGGTTCTCCGGTCGGTGATCTCGGAGGCCCAATCTGCATTTGTGCCGGGGAGGGCAATTACAGATAATGTT
ATTATTGGGCATGAGTGTTTGCATTATATCCAACGTCGCTCAAAGGGCAGGGAGGGTTTTGCTGCGCTGAAATTGGACATGTGCAAGGCTTACGACAGGGTCGAGTTGTC
TTTCTTAGAGCATTTAATGCGACGAATGGGGTTTGCTGATGCATGGGTTGATTTAATTCTTGATTGTATCTCCACAGTTCAGTTTGCTGTTCTGCTTAATGGTGAGGCTC
GTGGCCTTGTTGTTCCTAGCAGAGGTCTTCGCCAAGGTGACCCTCTTTCCCCTTATTTATTTGTGTTATGTGCTGAAGGTTTATCACACGCTTTGAACTCGGCTCATGTG
GCTCGATATATTTCGGGGGTGCGAATTGGATCGTTGTGTCCTTCTGTTTCGCATTTATTTTTTGCCGATGATAGTTTGGTGTTTTTTAAGGCTAATGGGACCGAAGGGTG
GCATGTCAAGAGAATTCTAAATGAATATGAGCAAGCTTCGGGTCAGTGTGTTAATTACCATAAGTCTGCCTTATTTATATCGCCTAATATTCATTCTGATAGTCAGGCTC
TGTTGGGTTCACTGTTGGGTGTTCCGATCGTACATGATTTGGGCTATTATTTAGGTCTCCCATCTCATTTTCCTCGGGCCAAAGGGTTGTGTTTTCGGAAAGTCACCGAA
CGGATTAAGAAAGTGGTTCAAGGATGGAAGGGGTCTTTTTTTTCGGTAGGTGGTAAGGAAACTCTTATCAAGAGTGTTGCCCAAGCAATTCCCACATATGCCATGAGCTG
TTTTCGTCTTCCTAAATCGGTATGTCGTGGCATTACAAAGGAAATTGCTAGATTTTGGTGGGGTTCTTCGTTAGCCCGGCCGAAAATGCATTGGAGGAGCTGGGACAAGA
TGTGTTTACCAAAGGAAGTCGGTGGTCTTAATTTTCGTGATATTGAGGGTTTTAACCAAGCTCTTTTGGCGAAGCAGGTGTGGAGGGTCTTAACATCCCCTGATTTATTA
TTATCTCGCATTTTGAAGGGTCGTTACTTTCCAAACTCCTCTGTTCTTGAGGTGTCTGCTGCGAATTCTGTGTCCTATTTTTGGCGAGTATTTCTTTGGGGACGAGAGCT
CCTTGGGTTGGGTTTGCGGAAGCGAGTAGGTGATGGCACTAGTGTCCGTATGTATTGTGATCCGTGGATCCCTCGAGAGGTGACGTTTAAGCCAATTTTTGGGCCAATGT
TTCCGTTGTCTTCTCTTTCTTCACTGGTTAGCGACTTTATTAATGATCAAAGGGAATGGGATGTTGAAAAGCTCGACGCGGTGATGTGTCCGGAAGATCTGGAGGTTGTT
TTGAGTATTCCGATTGGGCGGTCGGGGTCTGGGGATGGATGGGTTTGGCACTATGACAAATTTGGGAACTATTCCGTCAAAAGTGGGTACAAATTGTTAATGTGTCGGAG
GATGGAGGCACGCTCTTCCCATTCTCCTACCAGTGCTCGTTGGTGGACTGGCTTGTGGGCTCAGCGGTTGCCATCGAAAATCAAAATATTTATGTGGCGAGCTTTTCAGG
TGATTCTTCCGACTATGACAAATCTTATTCTTCGTGGGATTGATGCCGCGCCTACTTGTCGAGGGTGTTCCTCTCAGCCTGAAACAATTGATCATGCTTTGGTTACTTGT
GATCGTTCTTTGGCATTTTGGAATTTGTTGTTGCCGCAGGTTGATTGGCAGGTGAATTTTAATCACAGTTTCCAAGATCGTTGTATTGTGCTTCAGGAGTCTCTTTCTTC
GCAGGATTTTGGTTTATGGTGTGTGGGGTGTTGGGCGTTATGGGCTGACCGTAATGCTGTCAGGGGTGGGAAGGTGGTTCCTGTGGTTGATGTCAAATGTCAGTGGGTCG
TGGAGTATTTGGATGCATTTTTGAATGCGCAGGAGTGTTTGGGTGGTGGATCCTCTTCCTTGGGAGCTTCTTTGTGTCCTCAAGTTTCCTCTTGGCAGCGGCCTCCTTGT
GGTTGGGTTAAATTAAATGTGGATGCGGCTTGTCCCAGGCAGAAGGGTAGAATCGGAGTCGGAGTTGTTTTCCGCACTTCAACGGGGCGTCTTCATGCAGCGTCAATGAA
AGTTCTTCCAGTTTCACTTGGTCCGTTGGAAGCTGAGGCTCGGGCGGTGATTGAGGGGTTGAAGTTGGCTCAGTTGTTGGGTTTGGCGCGGATTGTGGTGGAATCGGATT
GTAGTAATCTCATTGCCATGATTCATGGGTCGCTGGTTTCGTTTAAAGAAGATGGTATCTGGGTGGCCAACGTTGTGGCTCTGGCCTCTCGGTTTCAACAGGTTGTTTTT
CAGCATGTTGGTCGCGTTTGCAATGCCCCAGCTCATTGTTTGGCGAAACACGCCTGTTGGGAGGGGGATGCATTATGGTTTTCTCAATATCCTGAGTGGTTATTGCATTT
TGTCTCTAAGGATGTTAATTCTAGTTGCTGTAGCCCGTGTGGCAATCTTTTGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGACCCCTAGAACATTTCGTGCTGTAAGTGACGCGTTAAGTCACCTTAAACCCCATGTTCTGTTTTTATGTGAAACTAAAGCTAGTGATAGATCAATGAATCGTTT
AAAGAATCTTTTACATTATTCTGGCTGTTTTTCAATAAATAGTCAGGGTGCTAGTGGTGGTTTATGTTTGTTGTGGAATGATGGGATTGATGTTCTAATTCGCTCGTATT
CGTCTTTTCATATTGATGCTTCGGTAAATTGGAATGGTAAATTATGGAGATTCACGGGTATTTATGGTCAGCCGAATGATTCTCTTCGTTTTCGAACATGGGAACTTCTG
CGCCGTTTGCATAATTTTGATGAGTCTGCCTGGGTCGTTGGTGGCGATTTCAATGAAATTTTATGGGATTCAGAAAAATTGGGAGGCGTGGCTAAACCGGTGGCTCACAT
GACGGAGTTCTTCTCAGCATTAAATGATTGTGGGCTTCGGGATTTGGGTTTCGATGGGGATGTGTTTACTTGGTGTAATCGACGACCTATATCGGATCGTATTTTTGAAC
GTTTGGATAGATATGTTGGGAATGAAGAGTTTCACCGTATGTTTCCTCATTTTCGAGTATCTAATTTGAATTGGGCTTGTTCTGATCATAGGCCCATCGAGTTATGTTTG
GACAGGTCTGGGGTTCGGGTTGACCGATTTCGAAGGTATTCTGGCTTCAAATTTAACGCTCAGTGGGTTAGGCATGCTGAGTGTCGTGACATCATTGTTGCAAACGGGGA
TTGGTCTGGTAACAATTCTTCGGTGGCGGCATTGAGTCAACAATTGTCTCGTTGTTCGGCGCGGTTGAAGCAGTGGGGGCGGAAAACTAATGTGTTTTTAAAGTCTGAAA
TCTGTCGGCAGCGACAGGCTATTCAGGATGCCTATCGTCGTATGAATCCTATTGATTTTTCTTTTATTCATACTTTGGAGGAGAAGCTTGACAGGCTACTTGAGGAGGAA
GAAATTTATTGGCATCAGAGATCTAGAGAGAATTGGCTTAGATGGGGGGACAAGAATACGAAATGGTTCCATCATAGGGCATCGGAAAGGAAAAGACGCAATGAGATTCG
GGGAATTATGAATGCTAATGGTGTTTGGGTTGATTCGGATTCTGAGGTGGAGATGGTGTTTCTGGATTACTTTCGAGAGATTTTTTGCACTCAGTGTCCATCCACAATTC
AGCAAAATGTTATCCTTGATCATATCCAACCTATTGTTTCTGCTGATATGAATTCTAAATTGTTAGCACCTTTTTGTCAAGTAGAGATTGAGAGGGTTGTACAGCAGATG
TTCCCTACCAAAGCGCCAGGGCCGGATGGGTTTCCGGCTGTGTTTTATCAGTCCTATTGGGATGTTGTTGGGCCTCAGACTGTTGCGAGTTGCCTTGAGATTTTGAATGG
TCGCAAGTCGGTGAAGGGGTGGAATCAGACTAATATTGCCCTTATCCCTAAGGTTCAGTCTCCACGGTCAGTGCGGGATTTTCGCCCTATTAGTCTATGTAATGTGTCAT
ATAAAATTGTAGCGAAGGTGTTGGCTAATCGTTTGAAGGAGGTTCTCCGGTCGGTGATCTCGGAGGCCCAATCTGCATTTGTGCCGGGGAGGGCAATTACAGATAATGTT
ATTATTGGGCATGAGTGTTTGCATTATATCCAACGTCGCTCAAAGGGCAGGGAGGGTTTTGCTGCGCTGAAATTGGACATGTGCAAGGCTTACGACAGGGTCGAGTTGTC
TTTCTTAGAGCATTTAATGCGACGAATGGGGTTTGCTGATGCATGGGTTGATTTAATTCTTGATTGTATCTCCACAGTTCAGTTTGCTGTTCTGCTTAATGGTGAGGCTC
GTGGCCTTGTTGTTCCTAGCAGAGGTCTTCGCCAAGGTGACCCTCTTTCCCCTTATTTATTTGTGTTATGTGCTGAAGGTTTATCACACGCTTTGAACTCGGCTCATGTG
GCTCGATATATTTCGGGGGTGCGAATTGGATCGTTGTGTCCTTCTGTTTCGCATTTATTTTTTGCCGATGATAGTTTGGTGTTTTTTAAGGCTAATGGGACCGAAGGGTG
GCATGTCAAGAGAATTCTAAATGAATATGAGCAAGCTTCGGGTCAGTGTGTTAATTACCATAAGTCTGCCTTATTTATATCGCCTAATATTCATTCTGATAGTCAGGCTC
TGTTGGGTTCACTGTTGGGTGTTCCGATCGTACATGATTTGGGCTATTATTTAGGTCTCCCATCTCATTTTCCTCGGGCCAAAGGGTTGTGTTTTCGGAAAGTCACCGAA
CGGATTAAGAAAGTGGTTCAAGGATGGAAGGGGTCTTTTTTTTCGGTAGGTGGTAAGGAAACTCTTATCAAGAGTGTTGCCCAAGCAATTCCCACATATGCCATGAGCTG
TTTTCGTCTTCCTAAATCGGTATGTCGTGGCATTACAAAGGAAATTGCTAGATTTTGGTGGGGTTCTTCGTTAGCCCGGCCGAAAATGCATTGGAGGAGCTGGGACAAGA
TGTGTTTACCAAAGGAAGTCGGTGGTCTTAATTTTCGTGATATTGAGGGTTTTAACCAAGCTCTTTTGGCGAAGCAGGTGTGGAGGGTCTTAACATCCCCTGATTTATTA
TTATCTCGCATTTTGAAGGGTCGTTACTTTCCAAACTCCTCTGTTCTTGAGGTGTCTGCTGCGAATTCTGTGTCCTATTTTTGGCGAGTATTTCTTTGGGGACGAGAGCT
CCTTGGGTTGGGTTTGCGGAAGCGAGTAGGTGATGGCACTAGTGTCCGTATGTATTGTGATCCGTGGATCCCTCGAGAGGTGACGTTTAAGCCAATTTTTGGGCCAATGT
TTCCGTTGTCTTCTCTTTCTTCACTGGTTAGCGACTTTATTAATGATCAAAGGGAATGGGATGTTGAAAAGCTCGACGCGGTGATGTGTCCGGAAGATCTGGAGGTTGTT
TTGAGTATTCCGATTGGGCGGTCGGGGTCTGGGGATGGATGGGTTTGGCACTATGACAAATTTGGGAACTATTCCGTCAAAAGTGGGTACAAATTGTTAATGTGTCGGAG
GATGGAGGCACGCTCTTCCCATTCTCCTACCAGTGCTCGTTGGTGGACTGGCTTGTGGGCTCAGCGGTTGCCATCGAAAATCAAAATATTTATGTGGCGAGCTTTTCAGG
TGATTCTTCCGACTATGACAAATCTTATTCTTCGTGGGATTGATGCCGCGCCTACTTGTCGAGGGTGTTCCTCTCAGCCTGAAACAATTGATCATGCTTTGGTTACTTGT
GATCGTTCTTTGGCATTTTGGAATTTGTTGTTGCCGCAGGTTGATTGGCAGGTGAATTTTAATCACAGTTTCCAAGATCGTTGTATTGTGCTTCAGGAGTCTCTTTCTTC
GCAGGATTTTGGTTTATGGTGTGTGGGGTGTTGGGCGTTATGGGCTGACCGTAATGCTGTCAGGGGTGGGAAGGTGGTTCCTGTGGTTGATGTCAAATGTCAGTGGGTCG
TGGAGTATTTGGATGCATTTTTGAATGCGCAGGAGTGTTTGGGTGGTGGATCCTCTTCCTTGGGAGCTTCTTTGTGTCCTCAAGTTTCCTCTTGGCAGCGGCCTCCTTGT
GGTTGGGTTAAATTAAATGTGGATGCGGCTTGTCCCAGGCAGAAGGGTAGAATCGGAGTCGGAGTTGTTTTCCGCACTTCAACGGGGCGTCTTCATGCAGCGTCAATGAA
AGTTCTTCCAGTTTCACTTGGTCCGTTGGAAGCTGAGGCTCGGGCGGTGATTGAGGGGTTGAAGTTGGCTCAGTTGTTGGGTTTGGCGCGGATTGTGGTGGAATCGGATT
GTAGTAATCTCATTGCCATGATTCATGGGTCGCTGGTTTCGTTTAAAGAAGATGGTATCTGGGTGGCCAACGTTGTGGCTCTGGCCTCTCGGTTTCAACAGGTTGTTTTT
CAGCATGTTGGTCGCGTTTGCAATGCCCCAGCTCATTGTTTGGCGAAACACGCCTGTTGGGAGGGGGATGCATTATGGTTTTCTCAATATCCTGAGTGGTTATTGCATTT
TGTCTCTAAGGATGTTAATTCTAGTTGCTGTAGCCCGTGTGGCAATCTTTTGCTTTAA
Protein sequenceShow/hide protein sequence
MGTPRTFRAVSDALSHLKPHVLFLCETKASDRSMNRLKNLLHYSGCFSINSQGASGGLCLLWNDGIDVLIRSYSSFHIDASVNWNGKLWRFTGIYGQPNDSLRFRTWELL
RRLHNFDESAWVVGGDFNEILWDSEKLGGVAKPVAHMTEFFSALNDCGLRDLGFDGDVFTWCNRRPISDRIFERLDRYVGNEEFHRMFPHFRVSNLNWACSDHRPIELCL
DRSGVRVDRFRRYSGFKFNAQWVRHAECRDIIVANGDWSGNNSSVAALSQQLSRCSARLKQWGRKTNVFLKSEICRQRQAIQDAYRRMNPIDFSFIHTLEEKLDRLLEEE
EIYWHQRSRENWLRWGDKNTKWFHHRASERKRRNEIRGIMNANGVWVDSDSEVEMVFLDYFREIFCTQCPSTIQQNVILDHIQPIVSADMNSKLLAPFCQVEIERVVQQM
FPTKAPGPDGFPAVFYQSYWDVVGPQTVASCLEILNGRKSVKGWNQTNIALIPKVQSPRSVRDFRPISLCNVSYKIVAKVLANRLKEVLRSVISEAQSAFVPGRAITDNV
IIGHECLHYIQRRSKGREGFAALKLDMCKAYDRVELSFLEHLMRRMGFADAWVDLILDCISTVQFAVLLNGEARGLVVPSRGLRQGDPLSPYLFVLCAEGLSHALNSAHV
ARYISGVRIGSLCPSVSHLFFADDSLVFFKANGTEGWHVKRILNEYEQASGQCVNYHKSALFISPNIHSDSQALLGSLLGVPIVHDLGYYLGLPSHFPRAKGLCFRKVTE
RIKKVVQGWKGSFFSVGGKETLIKSVAQAIPTYAMSCFRLPKSVCRGITKEIARFWWGSSLARPKMHWRSWDKMCLPKEVGGLNFRDIEGFNQALLAKQVWRVLTSPDLL
LSRILKGRYFPNSSVLEVSAANSVSYFWRVFLWGRELLGLGLRKRVGDGTSVRMYCDPWIPREVTFKPIFGPMFPLSSLSSLVSDFINDQREWDVEKLDAVMCPEDLEVV
LSIPIGRSGSGDGWVWHYDKFGNYSVKSGYKLLMCRRMEARSSHSPTSARWWTGLWAQRLPSKIKIFMWRAFQVILPTMTNLILRGIDAAPTCRGCSSQPETIDHALVTC
DRSLAFWNLLLPQVDWQVNFNHSFQDRCIVLQESLSSQDFGLWCVGCWALWADRNAVRGGKVVPVVDVKCQWVVEYLDAFLNAQECLGGGSSSLGASLCPQVSSWQRPPC
GWVKLNVDAACPRQKGRIGVGVVFRTSTGRLHAASMKVLPVSLGPLEAEARAVIEGLKLAQLLGLARIVVESDCSNLIAMIHGSLVSFKEDGIWVANVVALASRFQQVVF
QHVGRVCNAPAHCLAKHACWEGDALWFSQYPEWLLHFVSKDVNSSCCSPCGNLLL