; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0011892 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0011892
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:34567764..34579384
RNA-Seq ExpressionLag0011892
SyntenyLag0011892
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001810 - F-box domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036047 - F-box-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]9.5e-27339.23Show/hide
Query:  VDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIK--GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEIL
        ++++++ + F  GL VPC+G+ GG+ALLW  E++L + S++  HIDA I     +  WR TGFYG P T KR DSW LL  L+    LPW+  GDFNEIL
Subjt:  VDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIK--GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEIL

Query:  TSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIK-----------------------RFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGP
        + +EK GGA R+Q+QM  FR +++ C   DLG+    +TW                         +F  +KVHHL     DH  + VT  +  +R R   
Subjt:  TSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIK-----------------------RFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGP

Query:  RPIKFEGNWLAFNECKEIVKLHW-INSPRSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVI-MERNSATRDADLGFAERELDRLLEEEE
        +   FE  W    +CK I++  W      S+       +  C  +LS W+ T + G I   I+ K   +  + M         ++     E++ LL++EE
Subjt:  RPIKFEGNWLAFNECKEIVKLHW-INSPRSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVI-MERNSATRDADLGFAERELDRLLEEEE

Query:  IYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARS
         YW  R++  WLK GDRNTK+FH++A+ RRK+N I G ++ +G W DNEE + + A  YF N++SSS   Q  I    E +  K+ E   E L + F + 
Subjt:  IYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARS

Query:  EIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGS
        E+  +LK + PNKAPG DG  A FFQ YW ++G+ V+++ L VLN+   +  LNKT I+LIPKT  P++M +FRPISLCNV+YK+I+K +ANRLK +L  
Subjt:  EIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGS

Query:  IISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLR
        IIS  QSAF   R I+DNV++ FE +H L  +  G++G  A+KLDMSKA+DRVEW F+ ++ME MGF   W + VM C+ SV YS+L+NG+      P R
Subjt:  IISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLR

Query:  GIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTK
        G+RQGDPLSP LFLLCAEGLSAL+N+      I+GI IN  CP VTHLFFADDS++FC+A+ EE   ++ IL  YE+ASGQK+N DKS+   S N     
Subjt:  GIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTK

Query:  AGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSE
          EI  ILG  Q++    YLGLP+  GR+KS++F  +KE+V   L GWKG+L SMGGKEILIK+VAQAIPTYTMSCF LP+ +CD++ R+   FWWG   
Subjt:  AGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSE

Query:  QKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGER
        Q+ K  WI WK+MC SK  GGLGFR+++ FN AMLAK +WR+L NP SL+ +VL+ +YF  G+ L AK G++PS +WRSI    ++   G RWRVG+G++
Subjt:  QKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGER

Query:  IYISQDPWLGREGCSKPLWVNA-NWRMKRVRDLLEPNGSWNK-NLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIA
        I+I +D WL      K +     N+    V  L++P+  W K   L  +FLP + + I + P   +  +D++IW    KG FSVKSAYH+A  +   +  
Subjt:  IYISQDPWLGREGCSKPLWVNA-NWRMKRVRDLLEPNGSWNK-NLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIA

Query:  SGSSDASS-KAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEP--
           S+    + +WK +W  N   + KI  W+   D LPT  NI+K+GI  +  C +C    E   H    C+ +  +W          W    D+ E   
Subjt:  SGSSDASS-KAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEP--

Query:  ------IDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASW
              +D    +  + + +   +  +L W IW  RN    N++    +++      ++E+     +   +   +    S + W+ P  G +K+N D + 
Subjt:  ------IDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASW

Query:  LEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDAT--ELCLFVDEIDGFR
         ++     IG  IRDSNG ++    K     +    +EA A+ +G+          G+ +   +++E D++ V+  +N   D++T  EL   +  I    
Subjt:  LEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDAT--ELCLFVDEIDGFR

Query:  RSNRVRSFSKCSRQSNTLAHELARAAAK
         S    +F   +R  N +AHELA+ A +
Subjt:  RSNRVRSFSKCSRQSNTLAHELARAAAK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]9.5e-26539.69Show/hide
Query:  GGGLALLWKDELDLSIISFSPGHIDATIKGSEVW-WRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILTSDEKQGGAERNQNQMSAFRSV
        GGGLA LWK+++ L +I+F+  H+ A +   + + W  TGFYG P  Q++++SW LL+ L      PW++ GDFN  L + EK    +   +Q+ AFR  
Subjt:  GGGLALLWKDELDLSIISFSPGHIDATIKGSEVW-WRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILTSDEKQGGAERNQNQMSAFRSV

Query:  IDACSLIDLGFPVGKFT-----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLH
        + +C L DLGF    +T                       W  RF+  +V HL+ H SDH P+ + + +    R+   R  KFE +WL  +EC  +++  
Subjt:  IDACSLIDLGFPVGKFT-----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLH

Query:  WINSP--RSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERN-SATRDADLGFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKW
        W N    R        K+ +C  +L +W  + +      AIK  +  +  + E   +    A+     +++D LL+++EIYW  RSR  WL+ GDRNTK+
Subjt:  WINSP--RSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERN-SATRDADLGFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKW

Query:  FHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAH
        FH+KA+ RR++N I+G  NS+G WV+N E +G+ A+ YF NLF +   DQ  +   L+ +  K+ E  +E L   F   E++ +L  M P KAPG DG +
Subjt:  FHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAH

Query:  ATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVL
        A F+Q +W ++GD V +  L+ LNNG  +  +N T I LIPK   PE+M EFRPISLCNVIYKII+K +ANRLK+VL  IIS TQSAFVPGR I+DNV++
Subjt:  ATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVL

Query:  GFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLS
         +E +HT+  RK+G+KG  ALKLD+SKAYDRVEW FL+ IME MGF   WI RVMSCV +  +S+L+NG P E  +P RGIRQGDP+SPYLFLLCAEGL+
Subjt:  GFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLS

Query:  ALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLG
        ALLN+ E    I+G+ I    P +T+L FADDSL+FC+A++ E   I EIL +YE+ASGQ +NL+KS+   S N  + + G+I EILGV + + F  YLG
Subjt:  ALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLG

Query:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG
        LP   GR K   F+ +K+RVWK LQGWKG L S  GKEILIK+VAQAIPTYTMS F++P  +C E+  LC+RFWWG    +RK HW  W K+   K  GG
Subjt:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG

Query:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL-WV
        +GFRD++ FN AMLAK  WRL++   SLL +  + +YF   +FL+AKE  N S  WRS++  + +   GY WRVG+G  I   +D WL     +K L  V
Subjt:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL-WV

Query:  NANWRMKRVRDLLEPN-GSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSS-DASSKAIWKSIWKANC
          +     V +L+ P    WN   +  +F  D+A+ I + P  R    D I W Y  +G FSVKSAYH+A+R+ +     G+S    +K IW +IWK   
Subjt:  NANWRMKRVRDLLEPN-GSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSS-DASSKAIWKSIWKANC

Query:  VPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLL
          + K+  W+  ++ LPT  N+  + I  +  C +C    ES  H  W C   + +W     ++  L        + +     +   L+++E  +     
Subjt:  VPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLL

Query:  WHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNF
        W +W  RN+      L   + +  R    I E  N Q  N+   +    S    W PP PG +KLN DA+        G G  IR+  G ++        
Subjt:  WHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNF

Query:  NLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA
         + N    E  A  + L+ +    GF        L+VE D++ V + ++    +A+     +D+I    RS +  S     R  N +AH LA+ A
Subjt:  NLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA

XP_030924668.1 uncharacterized protein LOC115951644 [Quercus lobata]1.1e-25537.32Show/hide
Query:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATI-KGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG
        ETK     +D++   + +     VP    GGGLAL W  + ++ + SFS  HIDA I  G +  WRFTGFYG P T  R++SW LL  LS    LPW+  
Subjt:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATI-KGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG

Query:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECK
        GDFNEIL +DEKQG   R + QM  FR  +D   L DLG                +HHL+   SDH PI ++  +   R  K  RP +FE  WL    C+
Subjt:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECK

Query:  EIVKLHWINSPRSST-HNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRD-ADLGFAERELDRLLEEEEIYWKFRSREEWLKWGD
        E+V   W      +T   F++K+ +C   L  WNK ++ G ++ ++K+K + +K   E     ++   +     E+ +L  +EE  WK RSR  WLK GD
Subjt:  EIVKLHWINSPRSST-HNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRD-ADLGFAERELDRLLEEEEIYWKFRSREEWLKWGD

Query:  RNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPG
        RNTK+FH +AN R +RNLI G  +  G WV++E+ MGK    YF+ +F+SS  +    S  L  +     E  ++ ++  F   E++ +L +M+P  APG
Subjt:  RNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPG

Query:  EDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSIS
         DG    F++S+W ++G++V+ + L  LN G     LN T I LIPK   P+K+ +FRPISLCNV+YK+IAK +ANRLK+ L + +  +QSAF+ GR IS
Subjt:  EDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSIS

Query:  DNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLC
        DN+++ FE +H L  + +G+ G  ALKLDMSKAYDRVEW FL  IM+ +G        ++SC++SV YS+LLNG P    KP RG+RQGDPLSPYLFLLC
Subjt:  DNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLC

Query:  AEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSF
        A GL  LL + E   +I G+ I+ + P V+HLFFADDS++FCRA++ E  R+ +IL+ YE+ +GQK+N +K+    S N        I ++LGV     +
Subjt:  AEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSF

Query:  GYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTS
          YLGLPA  GR K R F  +KERVWK +QGWK +L S+ G+E+LIK+V QAIPTYTMSCFKLPK +  E+  L  +FWWG ++  +K HW+ W+++C +
Subjt:  GYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTS

Query:  KDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCS-
        K+ GG+GF++I+ FN+A+LAK  WR+++NP+SL  +V + ++F + + L AKE N+ S AW+SIL  RD+  +G  WR+GDG+ + I +D WL  +    
Subjt:  KDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCS-

Query:  -----KPLWVNANWRMKRVRDLLEPNGS-WNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAI
              P+   A     RV  L+ P+ S W    ++ +FLP +A  +   P       D I W     G FS  SAY L    SS D AS S+    ++ 
Subjt:  -----KPLWVNANWRMKRVRDLLEPNGS-WNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAI

Query:  WKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKE
        WK IWK     + K  +W++ N+ALPTKSN+ ++ I  +  C LC++  E + H    C                                 I     + 
Subjt:  WKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKE

Query:  ETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSL
         T +   + W +W  RN  +       ++ I   + T +++   +      T  L+  + H  W PP+ G  K+N DA+        G+G  +RD  G  
Subjt:  ETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSL

Query:  IGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHEL
        +G          ++  +EA A +  ++       F   R    +++E DSV ++  + +     +     VD++           F+  +R  N +A  L
Subjt:  IGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHEL

Query:  ARAAA
        A+ AA
Subjt:  ARAAA

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]2.9e-26138.95Show/hide
Query:  GGLALLWKDELDLSIISFSPGHIDATIK-GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILTSDEKQGGAERNQNQMSAFRSVI
        GGLAL WK+ +DL ++  +P +IDA +  G +  WRFTGFYG+PIT  R+ SW LL+ L    +LPWI  GDFNEI  ++EK+GGA R + QM AFR  +
Subjt:  GGLALLWKDELDLSIISFSPGHIDATIK-GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILTSDEKQGGAERNQNQMSAFRSVI

Query:  DACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHW
        D C   DLGF    FTW                          F  V+VHH+    SDH P+ +   +  VR  K  RP +FE  WL    C+ I+K  W
Subjt:  DACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHW

Query:  INSPRSSTHNFD---AKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDAD-LGFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKW
          S R+     D    K+ +C   L +W++T   G+I+  + +K+  +      + A    D +     E+  L+ +EE  W  RSR +WLK GD NT +
Subjt:  INSPRSSTHNFD---AKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDAD-LGFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKW

Query:  FHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAH
        FHS+A  R KRN I     ++G+ V +E+ +G+    YFK +F+S+M       + L+G+  K+  A   DL + F   E+E +LK M P  APG DG  
Subjt:  FHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAH

Query:  ATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVL
          F++S W+ +G +V +  L +LN+G     LN T I+LIPK   PEK  +FRPISLCNV+YKI++K+IANRLK++L  ++S +QSAF+  R ISDN+++
Subjt:  ATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVL

Query:  GFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLS
         FE +H L  + +G+ G  A+KLDMSKAYDRVEW FL ++ME +GF   WI  V SC+ SV +SVL+NG P   F P RG+RQGDPLSPYLFLLCAEGL 
Subjt:  GFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLS

Query:  ALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLG
        +L+ + E    I G+ + +  P V+HLFFADDSL+FCRA+ +EA  I EIL  YE+ASGQ++N +K+    S N D     EI  +LGV+ + ++  YLG
Subjt:  ALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLG

Query:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG
        LP+  GR K + F  ++ER+W  +QGWK RL S GG+E+LIK+V QA+PT+TM CFK+PK++C +I  L  +FWWG   + RK HW+GWKK+C SK HGG
Subjt:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG

Query:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVN
        LGF+DI+LFN AML K  WRL+ N  SL  KV + K+F + + L      N S AW+SIL  R +   G +WR+GDG  + I  D WL     S+ +   
Subjt:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVN

Query:  ANW-RMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAIWKSIWKANCV
         N+    RV  L+ E N  W ++ + E FLP +A+ I   P   +G +D +IW     G ++ KSAY L  + +       S+ A  K  W+ +W  N  
Subjt:  ANW-RMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAIWKSIWKANCV

Query:  PRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDW--EEPIDFWSFIQRNLSKEETRIAILL
         + +  +W+  ND+LPTK N+ K+ I  +  C  C    E   H  W C+  K++W +          +CR++  E+   F   +Q  L+++    A L 
Subjt:  PRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDW--EEPIDFWSFIQRNLSKEETRIAILL

Query:  L---WHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGC
            W IW  RN   + +      +I R     + E  ++QED +   +L  H    HW PP P  +K+N D +        G+G  IRDS G +I    
Subjt:  L---WHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGC

Query:  KKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA
        ++      +  LEA A    +        F        +V E DS  V  ++             +DE        R  +F+   RQ N +A +LA+ A
Subjt:  KKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]1.1e-26538.24Show/hide
Query:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIK-GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG
        ET+S    +  + S L  E          GGGLAL WK+ +DL ++  +P +IDA +  G +  WRFTGFYG+PIT  R+ SW LL+ L    +LPWI  
Subjt:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIK-GSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG

Query:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTLGNRE
        GDFNEI  ++EK+GGA R ++QM AFR  +D C   DLGF    FTW                          F  V+VHH+    SDH P+ +   +  
Subjt:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTLGNRE

Query:  VRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSS-THNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDADLGFAERELDR
        VR  K  RP +FE  WL    C+ ++K  W N            K+ +C   L +W++T      +  I++K+          +      +     E+  
Subjt:  VRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSS-THNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDADLGFAERELDR

Query:  LLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLD
        L+ +EE  W  RSR +WLK GD NT +FHS+A  R KRN I     ++G  V  E+ +G+    YFK +F+S+M       + L+G+  K+  A   DL 
Subjt:  LLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLD

Query:  KPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRL
        + F   E+E +LK M P  APG DG    F++S W+ +G +V +  L +LN+G     LN T I+LIPK   PEK  +FRPISLCNV+YKI++K+IANRL
Subjt:  KPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRL

Query:  KRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQE
        K++L  ++S +QSAF+  R ISDN+++ FE +H L  + +G+ G  A+KLDMSKAYDRVEW FL ++ME +GF   WI  V SC+ SV +SVL+NG P  
Subjt:  KRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQE

Query:  EFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISK
         F P RG+RQGDPLSPYLFLLCAEGL +L+ + E   +I G+ + +  P V+HLFFADDSL+FCRA+ +E   I EIL  YE+ASGQ++N +K+    S 
Subjt:  EFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISK

Query:  NVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRF
        N D     EI  +LGV+ + ++  YLGLP+  GR K + F  ++ERVW+ +QGWK RL S GG+E+LIK+V QA+PT+TM CFKLPK++C +I  L  +F
Subjt:  NVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRF

Query:  WWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWR
        WWG   + RK HW+GWKK+C SK  GGLGF+DI+LFN AML K  WRL+ N  SL  KV + KYF + + L      N S AW+SIL  R +   G +WR
Subjt:  WWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWR

Query:  VGDGERIYISQDPWLGREGCSKPLWVNANW-RMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRL
        +GDG  + I  D WL     S+ +    N+    RV  L+ E N  W ++ + E FLP +A+ I   P   +G +D +IW     G ++ KSAY L  + 
Subjt:  VGDGERIYISQDPWLGREGCSKPLWVNANW-RMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRL

Query:  SSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDW
        +     S S+ A+ K  W+ +W  N   + +  +W+  ND+LP K N+ K+ I  +  C  C +G E   H  W C+  K++W +          +C+D+
Subjt:  SSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDW

Query:  --EEPIDFWSFIQRNLSKEETRIAIL---LLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADA
          E    F   +Q  L+++   +A L   + W IW  RN   + ++   I +I R     + E  +++E+ Q T    +H +  HW P  P  +K+N D 
Subjt:  --EEPIDFWSFIQRNLSKEETRIAIL---LLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADA

Query:  SWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFR
        +        G+G  +RDS G +I    ++      +  LEA A    +        F G      +V E DS  +  ++       +     ++E     
Subjt:  SWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFR

Query:  RSNRVRSFSKCSRQSNTLAHELARAA
         S R  +F+   RQ N +A +LA+ A
Subjt:  RSNRVRSFSKCSRQSNTLAHELARAA

TrEMBL top hitse value%identityAlignment
A0A2N9FNH6 Reverse transcriptase domain-containing protein3.3e-25538.45Show/hide
Query:  IGETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKG-SEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWI
        + ET+   + +DK+R  +   G   V  +G GGGLALLWK+ + ++ +S S  HID TI+      W FTGFYG+P T KR DSW LL +L    ++PW+
Subjt:  IGETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKG-SEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWI

Query:  LGGDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFT-----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGN
        + GDFNE+L + EK G   R   QM  FR  +  C L D+G+   KFT                       W   F + ++ H++   SDH  + V L +
Subjt:  LGGDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFT-----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGN

Query:  REVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSS--THNFDAKMTSCLYKLSSWNKT----RLNGSIQAAIKRKEDHIKVIMERNSATRDADLGF
         + R +  P+  +FE  W+    C+E+V+  W   P+S    +    ++ +C   L  W++T      N   QA     +        R++         
Subjt:  REVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSS--THNFDAKMTSCLYKLSSWNKT----RLNGSIQAAIKRKEDHIKVIMERNSATRDADLGF

Query:  AERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFE
        A R+L+ +L +EE YW+ RS   WL+ GDRNT++FH+ A+ R+K+N I G  ++ G        M      YF N+F +S  +  AI++ +  ++  + +
Subjt:  AERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFE

Query:  AQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIA
           + L  PF   EI  +L  M P KAPG DG +A F+Q +W ++GD+V+N  LE L++GK +  +N T I LIPK   PE M +FRPISLCNV+YKII+
Subjt:  AQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIA

Query:  KSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVL
        K +ANRLK VL  IIS  QSAFVPGR I+DN+++ FE +H +  +++GR    A+KLDMSKAYDRVEW FL  +M  +GF   W+N +M C+ SV YSV+
Subjt:  KSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVL

Query:  LNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDK
        LNG P    KP RGIRQGDPLSPYLFL+CAEGL+ALL + E    + G+ I    P ++HLFFADDSL+FCRA+  E   +  IL+ YE+ASGQK+N +K
Subjt:  LNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDK

Query:  SACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEI
        ++   S N        I  +L  S +   G YLGLP   GR K + F  +K+++ K L GWKG+L S  G+EILIKSVAQAIP YTMSCF++P  +C EI
Subjt:  SACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEI

Query:  NRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLF
        N + S+FWWG   +++K HW  W  MC  K  GG+GFRD+ LFNQA+LAK  WRLL++P +LL ++L+ KYF + +F++A   ++ S AWRSI   R + 
Subjt:  NRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLF

Query:  MEGYRWRVGDGERIYISQDPWLGREGCSKP------LWVNANWRMKRVRDLLE-PNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNF
         +G RWR+G+G ++ I +D W+     SK       L  NA      V DL++     WN +L+  +F P +A +I   P R     D ++W     G F
Subjt:  MEGYRWRVGDGERIYISQDPWLGREGCSKP------LWVNANWRMKRVRDLLE-PNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNF

Query:  SVKSAYHLAKRLSSKDIASGSSDASSK--AIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADF
        + +SAY L  +L  K    GSS   S+  A WK++W+     + K  +W+     LPTK+N+ ++G+  +  C +C +  E+  H  W C++++  W + 
Subjt:  SVKSAYHLAKRLSSKDIASGSSDASSK--AIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADF

Query:  IPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDP
          +     +R   W++ +D   ++ R+L   E  +   L W IW  RN + +NN   D   +  +  + +EE   L  +N+    + N+     W PP  
Subjt:  IPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDP

Query:  GCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKK
          +KLN      +     GI   IRDS G+L+   C++
Subjt:  GCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKK

A0A2N9HYE3 Reverse transcriptase domain-containing protein2.3e-25637.52Show/hide
Query:  VDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKGSEV-WWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILT
        ++K+R  L F+    V    KGGGL L WK ++ LS+ SFS  HIDA +  ++   WRFTGFYG+P T KR++SW+LL +L+    LPW   GDFNE++ 
Subjt:  VDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKGSEV-WWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILT

Query:  SDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFT----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRP
         +EKQG   R+++QM  FR V+D C  +DLGF   KFT                      W+ RF   +V HL    SDH PI V+     + +RK   P
Subjt:  SDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFT----------------------WIKRFEEVKVHHLNRHGSDHHPISVTLGNREVRRRKGPRP

Query:  IKFEGNWLAFNECKEIVKLHWINSPRS-STHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDAD-LGFAERELDRLLEEEEIY
         +FE  W +   C+ +++  W         +    K+ +C   L  W++T   G+I + IK  E  +K+  E +   RD   +   +REL  LL +EE  
Subjt:  IKFEGNWLAFNECKEIVKLHWINSPRS-STHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDAD-LGFAERELDRLLEEEEIY

Query:  WKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEI
        W+ RSR EWL  GDRNT++FH +A  R+++N +      +G W   +  +      Y+K+LF ++  DQ  + + +E +   +       L   F   E+
Subjt:  WKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEI

Query:  EHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSII
        E +LK M+P KAPG D     F+Q YW ++G +V+   L  LN+G+ +  +N T I LIPK   PE++ EFRPISLCNVIYK+I+K +ANRLK +L SI+
Subjt:  EHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSII

Query:  SPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGI
          +QSAF+PGR I+DN+++ FE +H +  +K G+ G  ALKLDMSKAYDRVEW +L+ +ME MGF   W+  +M C+ +V YS+L+NG P    KP RG+
Subjt:  SPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGI

Query:  RQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAG
        RQGDPLSPYLFLLCAEGL +L+ +E+    + G+ I+   P +THLFFADDSL+FC+A+ ++  RI+ IL+ YE+ASGQ+VN  K+    SK+       
Subjt:  RQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAG

Query:  EISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQK
        +I  +LGV     +  YLGLP+  GR K   F ++KERVW  L+GWK +L S  G+EILIKSVAQAIP Y MSCF+LP  +  EI  L  RFWWG    K
Subjt:  EISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQK

Query:  RKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIY
         K HW+ W+ +C SK +GG+G RD+  FN+A+LAK  WRLL NP SL +KV + KYF   + L+A++ +  S AW+SI+  RDL ++G  WRVG G  I 
Subjt:  RKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIY

Query:  ISQDPWL--GREGC--SKPLWVNANWRMKRVRDLLEPN-GSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDI
        I +D WL      C  S P     +  +  V+ L++    SW   L+  +FLP +A  I   P       D ++WK    G ++V+S YHL      +  
Subjt:  ISQDPWL--GREGC--SKPLWVNANWRMKRVRDLLEPN-GSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDI

Query:  ASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPID
         S S       +W +IW  +  P+ +  +W+  +++LPT+SN+  + I  +  C  C N  ES  H  W+CK  K +W   IP      LR   +   ID
Subjt:  ASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPID

Query:  FWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGG
              + LS  E ++  +  W IW  RN   +   + +++++  R   ++ E    Q  +   S   NH+    W PP+ G +K+N D +   +    G
Subjt:  FWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGG

Query:  IGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSK
        +G  IR+  G ++G    +     +++ +EA A    ++  +  G          + +E DS  +V  +   A   T     +++I    ++ +   F  
Subjt:  IGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSK

Query:  CSRQSNTLAHELARAAAKNGDF
         +R+ N +AH LA+ A  N  F
Subjt:  CSRQSNTLAHELARAAAKNGDF

A0A2N9I946 Uncharacterized protein1.2e-25737.82Show/hide
Query:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKGSE-VWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG
        ET+     ++ +R  L   G   V  +G GGGLAL+WK  + + I SFS  HIDA +  ++ + WR TGFYG P    R  SW LL +L  + NLPW++ 
Subjt:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKGSE-VWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG

Query:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTL----
        GDFNE+L+ +E+ G  +RN +QM+AFR  +  CSL DLG+    F+W  R                       F   +VHH+    SDH  + V L    
Subjt:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTL----

Query:  ----GNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNF--DAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDADL
            GNR+       +P +FE  W+  + C++ +K  W + P S T  F    K+ +C  +L  WN++++  + +    +K    ++          +++
Subjt:  ----GNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNF--DAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDADL

Query:  GFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKI
            RE++ L+E+EEI+W+ RSR  WLK GDRNTK++H+ A+ R+K N+I G  + +G+W +    +   A  YF  LF SS  + + I   ++ +   +
Subjt:  GFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKI

Query:  FEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKI
          A  + L + F+  EI+ +L  M P+KAPG DG  A FFQ YW ++G++VS   L+  ++G+ +G +N T I LIPK   PE M +FRPISLCNV+YKI
Subjt:  FEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKI

Query:  IAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYS
         +K + NR+K +L +IIS +QSAFVPGR ISDN+++ FE +H L   + G     A KLDMSKAYDRVEW FL+ I+  +GF   W++ +M+CV S  YS
Subjt:  IAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYS

Query:  VLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNL
        V++NG+P    KP RG+RQGDPLSPYLFLLCAEGLSAL+ + E    I GI I    P ++HLFFADDS+IFCRAS+ + G +  IL +YE+ASGQK+N 
Subjt:  VLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNL

Query:  DKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICD
        +K+A   SKN  ++    I  + G S S+ F  YLGLP   GR+K R FN +K+R+WK LQGWK +L S  G+EILIK+V QAIP Y MSCFKLP  +CD
Subjt:  DKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICD

Query:  EINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRD
        EI  L ++FWWG    +R+ HW    K+   K  GG+GFRD+QLFN+A+LA+  WRLL+ P SL+ ++L+ KYF   +FL+A+  +N S  WRSI   R 
Subjt:  EINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRD

Query:  LFMEGYRWRVGDGERIYISQDPWLGREGCSK---PLWV-NANWRMKRVRDLLEPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFS
        +  +G RWRVG+G  I I +D WL      +   PL V N+   +  +  ++E +  W+++ L ++FLP D   I + P      +D++IW     GNF+
Subjt:  LFMEGYRWRVGDGERIYISQDPWLGREGCSK---PLWV-NANWRMKRVRDLLEPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFS

Query:  VKSAYHLAKRLSSKDIASGSSDA-SSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADF--
        V+SAY L    S  D  S S+   S++ +W +IW A   P+ ++ +W+   D LPTK+ +  KG+  ++ C  C    E+++H+ W+C FS+++W     
Subjt:  VKSAYHLAKRLSSKDIASGSSDA-SSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADF--

Query:  -IPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATS---IEERPNLQEDNQNTSELKNHSSHLHWD
         IP++  + +  +      DF       L K +      + W IW ARN ++  N +  ++ I R+  +S    +E   L       S +   S    W 
Subjt:  -IPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATS---IEERPNLQEDNQNTSELKNHSSHLHWD

Query:  PPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDAT
        PPD G +KLN          + G+G  IRD++GS+     +K  +  +   L+A  ++  +K +  + GF        L V+    E+ +++        
Subjt:  PPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDAT

Query:  ELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAAAKNGDFLAFVNF
         +   VD+I  FRRS     FS      N  A  LA  A  +    A+ ++
Subjt:  ELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAAAKNGDFLAFVNF

A0A2N9IMU5 Uncharacterized protein7.4e-25538.82Show/hide
Query:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDA-TIKGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG
        ET+     ++  R  L   G   V   G GGGLALLW+  + + I S+S  HIDA  +   E+ WR TGFYG P    R  SW LL  L    NLPW++ 
Subjt:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDA-TIKGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWILG

Query:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTL----
        GDFNEI+  +E+ G  +R+  QM+AFR  +  CSL DLG+    F+W  R                       F   +VHH+    SDH  + V L    
Subjt:  GDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKR-----------------------FEEVKVHHLNRHGSDHHPISVTL----

Query:  GNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFDA--KMTSCLYKLSSWN--KTRLNGSIQAAIKRKEDHIK--VIMERNSATRDADL
         +  + R+K     +F+  W+    C+E +K+ W + P S T  F    K+ +C   L  WN  +TR+N  +  + K +   ++   + E NS    +++
Subjt:  GNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFDA--KMTSCLYKLSSWN--KTRLNGSIQAAIKRKEDHIK--VIMERNSATRDADL

Query:  GFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKI
             E++ L  +EEI+W+ RSR  WLK GDRNTK+FH  A  R+K NLI G  +  GVW +    +   A  YF +LF SS  +   I   ++ +   +
Subjt:  GFAERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKI

Query:  FEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKI
          A  E L K  +  EI  +L  M P+KAPG DG  A F+Q YW ++G++VS    +   +G+ +G +N T I LIPK   PE M +FRPISLCNV+YKI
Subjt:  FEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKI

Query:  IAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYS
         +K + NR+K +L  IIS +QSAFVPGR ISDNV++ FE +H L     G     A KLDMSKAYDRVEW FL+ I+   GF   W++ +M+CV +  Y+
Subjt:  IAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYS

Query:  VLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNL
        V++NG P    KP RG+RQGDPLSPYLFLLCAEGLSAL+ + E    I GI I    P ++HLFFADDS+IFCRAS+ +   I  ILN+YE+ASGQK+N 
Subjt:  VLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNL

Query:  DKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICD
        +K+A   SKN  ++   EI  +   S S  F  YLGLP   GR+K R FN +K+R+WK LQGWK  L S  G+E+LIK+V QAIP Y MSCFKLP  +CD
Subjt:  DKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICD

Query:  EINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRD
        +I  + +RFWWG    +RK HW+   K+   K  GG+GFRD+ LFN+A+LA+  WRLL +PQSL+ ++L+ KYF   +FL+A+   N S  WRSI   R 
Subjt:  EINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRD

Query:  LFMEGYRWRVGDGERIYISQDPWLGREGCSKPLW-VNANWRMKRVRDLLEPNG-SWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVK
        +  +G RWRVG+G  I I +D WL      + +  +N +     V  L++ N   W  + L ++FLP D + I + P      +D++IW     GNFSVK
Subjt:  LFMEGYRWRVGDGERIYISQDPWLGREGCSKPLW-VNANWRMKRVRDLLEPNG-SWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVK

Query:  SAYHLAKRLSSKDIASGSSDASS-KAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWAD---FI
        SAY+L    S  D  S S+  S  + +W SIW A   P+ ++ +W+   D LPTK+ +  KG+  ++ C  C    E+A+H+ W+C+F++++W      I
Subjt:  SAYHLAKRLSSKDIASGSSDASS-KAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWAD---FI

Query:  PNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRI-KRRIATSIE-ERPNLQEDNQNTSELKNHSSHLHWDPPD
        P++  + +  R      DF      NLS+ +T +   + W IW ARN  +  N L  +N I +R +  +++ +   LQ        +   +S   W PPD
Subjt:  PNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRI-KRRIATSIE-ERPNLQEDNQNTSELKNHSSHLHWDPPD

Query:  PGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKC-LEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATEL
           +KLN   S   + ++ G+G+ IRD+NGS++    K+     + K  L+A  ++E +K Y    GF        L V+    E+  ++         +
Subjt:  PGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKC-LEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATEL

Query:  CLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA
           VD+I   R S     FS      N  A  LA  A
Subjt:  CLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAA

A0A7N2LIH6 Uncharacterized protein4.2e-26638.43Show/hide
Query:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATI--KGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWIL
        ETK+  E++   ++ L F  G+ VP  G+ GGLALLWK+  D+   S S  HID  +   GS   WR TGFYG P T KR  SW+LLE L+    +PW++
Subjt:  ETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATI--KGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGVSNLPWIL

Query:  GGDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIK-----------------------RFEEVKVHHLNRHGSDHHPISVTLGNR
         GDFNEI+  DEK G  +R+  QM AFR V+  C LIDLGF   +FTW                          F E KVHH++   SDH  +++ L N+
Subjt:  GGDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIK-----------------------RFEEVKVHHLNRHGSDHHPISVTLGNR

Query:  EVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDA-DLGFAERELD
           +R+G +   FE  W    ECKEIV+L W      S      ++  C   L  WN+    G++   IK+K++ ++ +   N     A ++   ++E++
Subjt:  EVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDA-DLGFAERELD

Query:  RLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDL
         L   EE+ WK RSR  WL++GD+N+K+FH+ A+ RR++N I G  +  GVW +++E   K    YFK+++SS+      +S  LE +  ++     ++L
Subjt:  RLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDL

Query:  DKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANR
         K F   E+  +L+ M P KAPG DG    F+Q YWD++G  V+N  L+ LN+G     +NKT I LIPKT  P+K+ EFRPISLCNVIYKII+K +ANR
Subjt:  DKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANR

Query:  LKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQ
        LK+VL  +I   QSAFVPGR I+DNV++ FE +H++  R++G++G+ A+KLDMSKAYDRVEW +L  +M+ MGF   WI+ +M CV SV +SVL+NG P+
Subjt:  LKRVLGSIISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQ

Query:  EEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMIS
          F P RG+RQGDP+SPYLFLLC EGLSA++ ++E    I G+      P ++HLFFADDS+IFCRA+ +E  ++ ++L VYE+ SGQK+N DK++   S
Subjt:  EEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMIS

Query:  KNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSR
        +N  D        I G         YLGLP   GR K + FNR+K++V + + GWKG+L S  G+E+LIK+VAQA PTYTM+ FKLP ++C E+N +   
Subjt:  KNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSR

Query:  FWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRW
        FWWG   +++K  W+ WK +C  K  GG+GF+D++ FN A+LAK  WRL +NP SL  +VL+ KYF + +F++A+ G  PS  WRSI+  +++  EG RW
Subjt:  FWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRW

Query:  RVGDGERIYISQDPWLGREGCSKPLWV-NANWRMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLA-K
         VGDG  I I    WL      K +   + + + +RV  L+ +  G W   L+ + F+P +A+EI   P       D ++W     G F+VKSAY  A K
Subjt:  RVGDGERIYISQDPWLGREGCSKPLWV-NANWRMKRVRDLL-EPNGSWNKNLLSEVFLPDDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLA-K

Query:  RLSSKDIASGSSDASSKA----IWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNA-NPL
         +        + + S K+    IWK+IW   C  + K  +W+     LPTK  +  + I  + CC  C    E++ H  W C  +K+ W     N  NP 
Subjt:  RLSSKDIASGSSDASSKA----IWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNA-NPL

Query:  WLRCRDWEEPIDF----WSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNH-----SSHLHWDPP
               E  ++F    W  ++    K+    AI + W +W  RN  N+ +       + ++  +  EE    +E+ +     K         H  W PP
Subjt:  WLRCRDWEEPIDF----WSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNH-----SSHLHWDPP

Query:  DPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATEL
            +K+N DA+   +    GIG  IR++ G ++G   KK   L+ ++ LEAEA     KA E       +     +VVE D+  V+  +  V D  T +
Subjt:  DPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATEL

Query:  CLFVDEIDGFRRSNRVRSFSKC---SRQSNTLAHELARAAAKNGDFLAFVNFS
           V  I+G RR  ++    K    +R++NT AH LAR +    D++ +V  S
Subjt:  CLFVDEIDGFRRSNRVRSFSKC---SRQSNTLAHELARAAAKNGDFLAFVNFS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.8e-4625.08Show/hide
Query:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG
        +P    R     F  + ERV   + GW+ +  S  G+  L K+V  ++P ++MS   LP++I + +++L   F WGS+ +K+K H + W K+C+ K  GG
Subjt:  LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGG

Query:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKY----FHDGNFLKAKEGNNPSLAWRSILWG-RDLFMEGYRWRVGDGERIYISQDPWLGREGCSK
        LG R  +  N+A+++K  WRLL+   SL   VL+ KY      D  +L  K   + S  WRSI  G RD+   G  W  GDG++I    D W+      K
Subjt:  LGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKY----FHDGNFLKAKEGNNPSLAWRSILWG-RDLFMEGYRWRVGDGERIYISQDPWLGREGCSK

Query:  PLWVNANWRMKR------VRDLLEPNGSWNKNLLSEVFLPDDAKEI-AKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAI
        PL    N            +DL  P   W+   +      +   E+ A      +G++D + WK+   G FSV+SAY +   L+  ++       +  + 
Subjt:  PLWVNANWRMKR------VRDLLEPNGSWNKNLLSEVFLPDDAKEI-AKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAI

Query:  WKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKE
        +  +WK     R K  +W + N A+ T+    ++ +  +  C +C+ G ES  H+   C     +W   +P         +        + ++  NL   
Subjt:  WKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKE

Query:  E-------TRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAI
                + I  +++W  W+ R  +    N    +R+K     ++E       +             + W  P  G  K+N D +      +   G  +
Subjt:  E-------TRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAI

Query:  RDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQS
        RD  G+  G      F+L   +C   +A + G+  Y G   F   ++ P + +E DS  +V  +     D+  L   V    GF + + +       R++
Subjt:  RDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQS

Query:  NTLAHELARAAAKNGDFLAFVNFSLL
        N LA  LA  A      L F +F L+
Subjt:  NTLAHELARAAAKNGDFLAFVNFSLL

P11369 LINE-1 retrotransposable element ORF2 protein6.0e-4426.32Show/hide
Query:  DRNTKWFHSKAN-----------ARRKRNLIKGFYNSEG-VWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTL-KIFEAQKEDLDKPFARSEI
        ++   WF  K N             R + LI    N +G +  D EEI     S ++K L+S+ + + + + + L+   + K+ + Q + L+ P +  EI
Subjt:  DRNTKWFHSKAN-----------ARRKRNLIKGFYNSEG-VWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTL-KIFEAQKEDLDKPFARSEI

Query:  EHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAK-PEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSI
        E  + ++   K+PG DG  A F+Q++ + L   +  +  ++   G       +  I LIPK  K P K+E FRPISL N+  KI+ K +ANR++  + +I
Subjt:  EHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAK-PEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSI

Query:  ISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRG
        I P Q  F+PG     N+      IH +   K   K    + LD  KA+D+++  F+ +++E  G    ++N + +       ++ +NG   E      G
Subjt:  ISPTQSAFVPGRSISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRG

Query:  IRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKS-ACMISKNVDDTK
         RQG PLSPYLF +  E L+  + +++    I GI I      ++ L  ADD +++    K     +  ++N + +  G K+N +KS A + +KN    K
Subjt:  IRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKS-ACMISKNVDDTK

Query:  AGEISEILGVSQSNSFGYYLG--LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKS--VAQAIPTYTMSCFKLPKAICDEINRLCSRFWW
          EI E    S   +   YLG  L  +      + F  +K+ + + L+ WK    S  G+  ++K   + +AI  +     K+P    +E+     +F W
Subjt:  AGEISEILGVSQSNSFGYYLG--LPAQNGRNKSRLFNRVKERVWKALQGWKGRLFSMGGKEILIKS--VAQAIPTYTMSCFKLPKAICDEINRLCSRFWW

Query:  GSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQ
         + + +     I    +   +  GG+   D++L+ +A++ K +W   ++ Q
Subjt:  GSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQ

P14381 Transposon TX1 uncharacterized 149 kDa protein5.6e-5025.4Show/hide
Query:  GLALLWKDELD---LSIISFSPGH-IDATIKGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGV--SNLPWILGGDFNEILTSDEKQGGAERNQNQMSAF
        G+  L+ D      LS  S  PG  +   ++ S   +     Y      +R   +E L        S+   I+GGDFN  L + ++    +R+ ++ S  
Subjt:  GLALLWKDELD---LSIISFSPGH-IDATIKGSEVWWRFTGFYGSPITQKRKDSWELLEKLSGV--SNLPWILGGDFNEILTSDEKQGGAERNQNQMSAF

Query:  RSVIDACSLIDLGFPVGKFTWIKRFEEVKVHHLNRHGSDHHPISVTLGNR-----------------EVRRRKGPRPIK-----FEGNWLAFNECKEIVK
        R +I   SL+D+       T    +  V+  H+++   D   IS  L +R                  +R    P   K     F  + L      + V+
Subjt:  RSVIDACSLIDLGFPVGKFTWIKRFEEVKVHHLNRHGSDHHPISVTLGNR-----------------EVRRRKGPRPIK-----FEGNWLAFNECKEIVK

Query:  LHWIN----SPRSSTHNFDAKMTSCLYKLSSWNKTR-LNGSIQAAIKRKEDHIKVIMERNSATRDADLG---FAERELDRLLEEEEIYWKF-RSREEWLK
          W          +T N    +     KL     T+ ++G   A I+     +  + +R S + D  L       +E  R +E+ +    F RSR + L 
Subjt:  LHWIN----SPRSSTHNFDAKMTSCLYKLSSWNKTR-LNGSIQAAIKRKEDHIKVIMERNSATRDADLG---FAERELDRLLEEEEIYWKF-RSREEWLK

Query:  WGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNK
          DR +++F++    +  R  I   +  +G  +++ E +   A  +++NLFS   +  +A     +GL + + E +KE L+ P    E+  +L+ M  NK
Subjt:  WGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNK

Query:  APGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGR
        +PG DG    FFQ +WD LG +   +  E    G+      + +++L+PK      ++ +RP+SL +  YKI+AK+I+ RLK VL  +I P QS  VPGR
Subjt:  APGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGR

Query:  SISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLF
        +I DNV L  + +H    R+ G   +A L LD  KA+DRV+  +L   ++   F   ++  + +   S    V +N          RG+RQG PLS  L+
Subjt:  SISDNVVLGFECIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLF

Query:  LLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMI--SKNVDDTKAGEISEILGVS
         L  E    LL +      ++G+ +      V    +ADD +I       +  R +E   VY  AS  ++N  KS+ ++  S  VD            +S
Subjt:  LLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMI--SKNVDDTKAGEISEILGVS

Query:  QSNSFGYYLGL-PAQNGRNKSRLFNRVKERVWKALQGWKG--RLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWI
          +    YLG+  +      S+ F  ++E V   L  WKG  ++ SM G+ ++I  +  +   Y + C    +    +I R    F W         HW+
Subjt:  QSNSFGYYLGL-PAQNGRNKSRLFNRVKERVWKALQGWKG--RLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWI

Query:  GWKKMCTSKDHGGLG
                   GG G
Subjt:  GWKKMCTSKDHGGLG

P93295 Uncharacterized mitochondrial protein AtMg003106.0e-3644.74Show/hide
Query:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSK-DHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLK
        A+P Y MSCF+L K +C ++    + FWW S E KRK  W+ W+K+C SK D GGLGFRD+  FNQA+LAK S+R++  P +LL+++LR +YF   + ++
Subjt:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSK-DHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLK

Query:  AKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL
           G  PS AWRSI+ GR+L   G    +GDG    +  D W+  E    PL
Subjt:  AKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-1932.35Show/hide
Query:  FSPMQ--AMVAVSNFNQDCNWYPDSGATNHLTNNLGNMSVSSEYLGNNQIHFGNGTSLTINRLGYSSLISPTNHVFHLHNLLHVPSITKNLIGVSQFSND
        F+P Q  A +A+ +     NW  DSGAT+H+T++  N+S+   Y G + +   +G+++ I+  G +SL S  +   +LHN+L+VP+I KNLI V +  N 
Subjt:  FSPMQ--AMVAVSNFNQDCNWYPDSGATNHLTNNLGNMSVSSEYLGNNQIHFGNGTSLTINRLGYSSLISPTNHVFHLHNLLHVPSITKNLIGVSQFSND

Query:  NSVFFEFHPNFCLVNDQATGQILLHGTLHEGLYKFNLTKSPPSTNSNLNSVGFQSSSPQNSALSCITSTALLSFVQSSNNKSYDVWHQRLGHPAMSVVKI
        N V  EF P    V D  TG  LL G   + LY++ +  S P                            +  F   S+  ++  WH RLGHPA S++  
Subjt:  NSVFFEFHPNFCLVNDQATGQILLHGTLHEGLYKFNLTKSPPSTNSNLNSVGFQSSSPQNSALSCITSTALLSFVQSSNNKSYDVWHQRLGHPAMSVVKI

Query:  GIPN
         I N
Subjt:  GIPN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.8e-1657.33Show/hide
Query:  QVDINNAFLHGVLSETVYMEQPAGFQIKSSSPLVCRLHKALYGLKQAPHAWLERLSMFLNSLGFVNSKADTSLLL
        Q+D+NNAFL G L++ VYM QP GF  K     VC+L KALYGLKQAP AW   L  +L ++GFVNS +DTSL +
Subjt:  QVDINNAFLHGVLSETVYMEQPAGFQIKSSSPLVCRLHKALYGLKQAPHAWLERLSMFLNSLGFVNSKADTSLLL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.3e-1726.52Show/hide
Query:  SDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFD-AKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSAT
        SDH P  + L N   R +K  R   F      F     +    W       +H F   +      K       +  G+IQ   K   D ++ I +    T
Subjt:  SDHHPISVTLGNREVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFD-AKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSAT

Query:  RDADLGF-----AERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLF--SSSMVDQEA
          +D  F     A ++ +      E +++ +SR +WL+ GD NT++FH    A + +NLIK     + V V+N   + +    Y+ +L    S ++  ++
Subjt:  RDADLGF-----AERELDRLLEEEEIYWKFRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLF--SSSMVDQEA

Query:  ISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEF
        + R  +    +  +     L    +  EI  ++  M  NKAPG D   A FF   W V+ D       E    G  +   N T I LIPK    +++  F
Subjt:  ISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKAPGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEF

Query:  RPISLCNVIYKII
        RP+S C V+YKII
Subjt:  RPISLCNVIYKII

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-3124.89Show/hide
Query:  LRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVNANWRMKRVRDLLEPNGS---WNKNLLSEVFLP
        ++ +YF D + L AK     S  W S+L G  L  +G R  +GDG+ I I  D  +      +PL     ++   + +L E  GS   W+ + +S+    
Subjt:  LRGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVNANWRMKRVRDLLEPNGS---WNKNLLSEVFLP

Query:  DDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLC
         D   I +    +S   D+IIW Y   G ++V+S Y L     S +I + +    S  +   IW    +P+ K  +W+ L+ AL T   +  +G+  +  
Subjt:  DDAKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLC

Query:  CCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPI-DFWSFIQ-RNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSI
        C  C    ES  H  + C F+   W     +     L   D+EE I +  +F+Q   +S     + + L+W IW+ARN    N      ++         
Subjt:  CCLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPI-DFWSFIQ-RNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSI

Query:  EERPNLQEDNQNTSELKNH--SSHLHWDPPDPGCWKLNADASW-LEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFE
         +  N  + ++ T         + + W  P     K N DA + ++K E  G GW IR+  G+ I  G  K  +  N    E +A++  L+         
Subjt:  EERPNLQEDNQNTSELKNH--SSHLHWDPPDPGCWKLNADASW-LEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFE

Query:  GNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELAR
          R    + +E D   ++N++N ++  ++ L   +++I  +        F    R+ N LAH LA+
Subjt:  GNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELAR

AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-6829.68Show/hide
Query:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKA
        A+PTYTM+CF LPK +C +I  + + FWW + ++ +  HW  W  +   K  GG+GF+DI+ FN A+L K  WR+L  P+SL+AKV + +YFH  + L A
Subjt:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLKA

Query:  KEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVNANWRMKR--------------VRDLLEPNG-SWNKNLLSEVFLPDD
          G+ PS  W+SI   +++  +G R  VG+GE I I +  WL     SKP   +A  RM+R              V DL++ +G  W K+++  +F   +
Subjt:  KEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVNANWRMKR--------------VRDLLEPNG-SWNKNLLSEVFLPDD

Query:  AKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYH-LAKRLSSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCC
         K I +         D   W Y   G+++VKS Y  L + ++ +      S+ S   I++ IWK+   P+ +  +WK L+++LP    +A + +     C
Subjt:  AKEIAKNPRRRSGSKDEIIWKYEDKGNFSVKSAYH-LAKRLSSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCC

Query:  CLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPID---FWSFIQRNLS---KEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIA
          C + KE+  HL +KC F++  WA    ++ P+ L   +W + I    +W F   N +   ++ +++   LLW +W+ RN         +   + RR  
Subjt:  CLCRNGKESAAHLFWKCKFSKKLWADFIPNANPLWLRCRDWEEPID---FWSFIQRNLS---KEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIA

Query:  TSIEE-RPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLK-AYEGSGG
          +EE R   + ++  T    N SS   W PP     K N DA+W   +E  GIGW +R+  G +  +G +    L ++  LEAE  +E ++ A      
Subjt:  TSIEE-RPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLNADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLK-AYEGSGG

Query:  FEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAAAKNGDFLAFVNFSLLYGEDDRFWREVPFPE
        F+ N     ++ ESDS  ++ ++N   +    L   + ++           F    R+ NTLA  +AR      + L+F+N+      D + +  V  P 
Subjt:  FEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRSFSKCSRQSNTLAHELARAAAKNGDFLAFVNFSLLYGEDDRFWREVPFPE

Query:  WCR
        W R
Subjt:  WCR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.3e-3744.74Show/hide
Query:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSK-DHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLK
        A+P Y MSCF+L K +C ++    + FWW S E KRK  W+ W+K+C SK D GGLGFRD+  FNQA+LAK S+R++  P +LL+++LR +YF   + ++
Subjt:  AIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSK-DHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVLRGKYFHDGNFLK

Query:  AKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL
           G  PS AWRSI+ GR+L   G    +GDG    +  D W+  E    PL
Subjt:  AKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.3e-1451.47Show/hide
Query:  LLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDS
        ++NG PQ    P RG+RQGDPLSPYLF+LC E LS L  R +    + GI ++N+ P + HL FADD+
Subjt:  LLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNISGIHINNHCPTVTHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAGCCAGGGAACACAAAGAAGAACACTGGCATAGAGAACAGTAACGTCAAAAAATGGAAACGCATGGCCAGAGCTGAAATAAGCAAGAATCATCACAATTTAGA
AAATGTTACAGAGAAAGCCATTGGGGAAACGAAAAGTGGTTTTGAGAGAGTGGATAAGGTTAGGAGCCTCCTTGATTTTGAAGGGGGCTTGTGTGTTCCTTGCTCTGGCA
AAGGTGGGGGCCTGGCCCTTTTGTGGAAAGATGAGTTAGATCTTTCGATCATTTCTTTTTCTCCTGGGCATATCGATGCGACTATCAAAGGATCCGAAGTTTGGTGGAGA
TTTACGGGTTTCTACGGAAGTCCGATTACCCAGAAAAGGAAAGACTCCTGGGAACTCTTGGAGAAGTTAAGCGGGGTCTCTAATCTCCCTTGGATCCTGGGAGGGGATTT
TAATGAGATTCTTACTTCAGACGAGAAGCAGGGAGGAGCTGAGAGGAACCAAAACCAGATGAGCGCCTTTAGATCAGTTATTGATGCTTGCAGTCTGATTGACTTGGGGT
TCCCCGTGGGGAAATTCACTTGGATTAAAAGGTTTGAAGAGGTCAAAGTCCATCACCTAAATAGGCATGGCTCAGACCACCACCCAATCTCGGTGACCCTAGGCAACCGC
GAGGTCAGGAGGAGGAAAGGGCCGAGGCCCATCAAATTTGAAGGTAACTGGTTGGCTTTCAATGAGTGTAAAGAGATCGTTAAGCTCCACTGGATCAATTCCCCCCGATC
CTCCACGCATAACTTTGATGCCAAGATGACTTCTTGCCTCTATAAGCTCAGTAGCTGGAACAAAACCAGACTAAATGGCTCAATTCAAGCGGCCATAAAAAGAAAAGAGG
ACCATATTAAGGTTATCATGGAGAGAAACTCGGCAACGAGAGATGCTGACCTGGGGTTTGCCGAGAGAGAGTTGGATAGGCTTTTGGAGGAGGAGGAGATCTATTGGAAG
TTTCGTTCCCGGGAGGAGTGGCTCAAGTGGGGTGACCGTAATACCAAGTGGTTCCACTCAAAGGCCAATGCTAGGAGAAAGCGTAACCTGATTAAAGGCTTCTATAATAG
TGAAGGAGTGTGGGTAGACAACGAAGAAATCATGGGCAAGGAAGCCTCGAGATACTTCAAAAATCTGTTTTCCTCCTCAATGGTCGACCAGGAGGCTATCTCTAGGACTT
TAGAAGGTCTGACCTTGAAAATCTTTGAGGCTCAAAAGGAGGATCTGGATAAACCTTTTGCTAGGAGCGAAATTGAGCACTCCCTTAAGAACATGAGCCCCAATAAAGCC
CCTGGCGAGGATGGAGCGCATGCCACCTTTTTTCAGAGTTATTGGGATGTTCTAGGGGATGAAGTGTCTAATATATGCCTCGAGGTCCTTAATAACGGTAAAGATGTTGG
CCCGCTAAACAAAACTCTCATAGCCCTTATACCTAAGACTGCCAAGCCGGAGAAAATGGAAGAGTTTAGACCCATCAGTCTGTGTAATGTCATCTATAAAATCATTGCGA
AATCCATTGCCAACAGACTCAAACGGGTCCTGGGGTCAATCATTTCCCCCACCCAATCAGCTTTTGTACCTGGTAGATCGATTTCCGATAATGTGGTGCTTGGCTTTGAG
TGCATCCACACTTTGATTGGAAGGAAAAGAGGTAGAAAGGGAATAGCCGCCCTTAAGCTTGACATGAGCAAAGCCTACGATAGGGTGGAGTGGACTTTTCTCAGGCAAAT
CATGGAAATCATGGGATTTTCCTTTAGTTGGATCAATCGTGTTATGAGCTGTGTTGAATCGGTGAGATACTCGGTACTCCTTAATGGTATCCCTCAAGAGGAATTTAAAC
CTCTCAGGGGCATCAGACAAGGGGACCCACTATCTCCTTATCTTTTTCTCCTCTGTGCTGAGGGCCTGTCAGCGCTCCTGAACAGGGAAGAGACTCTTTACAACATTTCT
GGAATTCATATTAATAATCACTGTCCTACCGTCACGCATCTTTTTTTCGCAGATGACAGCCTTATCTTCTGCAGGGCTTCTAAGGAGGAAGCAGGGCGAATAAAGGAGAT
CCTCAACGTATACGAGAAGGCTTCGGGACAGAAAGTCAACCTGGACAAGTCGGCCTGCATGATTAGCAAGAATGTGGATGACACCAAAGCCGGAGAGATTAGTGAGATCT
TAGGAGTTAGCCAATCCAACTCCTTTGGATACTATCTTGGCCTCCCAGCGCAAAATGGAAGAAACAAATCCCGACTTTTCAATAGGGTCAAGGAGAGAGTTTGGAAAGCC
CTGCAAGGGTGGAAAGGGAGACTTTTTTCCATGGGCGGGAAGGAAATTCTCATAAAATCTGTAGCGCAGGCCATCCCAACTTATACGATGAGTTGTTTCAAACTTCCAAA
GGCGATCTGCGATGAGATTAACAGGTTATGTAGCCGATTCTGGTGGGGATCTTCAGAGCAAAAAAGGAAAGCCCACTGGATAGGGTGGAAAAAGATGTGCACGAGTAAGG
ACCATGGAGGCTTGGGTTTTAGAGATATACAACTTTTTAACCAAGCCATGCTAGCTAAGCACAGTTGGAGACTCTTAAAGAACCCTCAAAGCCTTCTAGCCAAAGTCTTA
AGAGGCAAATACTTCCATGATGGAAATTTCCTCAAGGCAAAGGAGGGAAACAACCCATCCTTAGCTTGGAGGAGCATTCTTTGGGGCAGAGATTTGTTCATGGAAGGTTA
CAGATGGCGTGTGGGTGATGGGGAAAGAATTTATATTAGCCAAGATCCTTGGTTAGGGAGAGAAGGGTGCAGCAAACCTCTTTGGGTGAATGCTAACTGGCGTATGAAGC
GGGTTAGGGATCTCTTAGAGCCTAATGGTTCTTGGAATAAGAATCTCTTGAGCGAGGTTTTTCTCCCTGATGATGCTAAAGAGATTGCGAAAAATCCTAGAAGAAGAAGC
GGGTCCAAGGACGAGATAATATGGAAGTATGAGGATAAAGGCAATTTCTCTGTCAAGAGTGCTTACCACCTAGCGAAAAGATTGAGCTCCAAAGATATAGCTTCTGGGTC
CAGCGATGCTAGCTCAAAAGCCATTTGGAAGTCTATATGGAAGGCCAATTGTGTTCCTAGAGCTAAGATCACCGTGTGGAAAATCCTCAATGACGCCCTCCCTACTAAGT
CAAATATTGCTAAAAAAGGGATCCATACTAATCTGTGTTGTTGTCTGTGCAGGAATGGCAAGGAGTCAGCAGCTCACTTATTCTGGAAATGTAAGTTCTCCAAAAAGCTT
TGGGCTGATTTCATCCCTAATGCTAACCCTCTTTGGCTTCGGTGCAGGGACTGGGAGGAGCCTATTGATTTTTGGAGTTTCATTCAAAGGAACCTCTCCAAGGAGGAGAC
CAGAATTGCCATCCTTCTGCTGTGGCATATTTGGGAGGCAAGGAATACAAGCAACATCAATAACAATTTGCCAGACATAAACAGAATCAAAAGGCGAATTGCGACCAGCA
TTGAGGAAAGACCTAATCTTCAAGAGGACAACCAGAACACTTCAGAATTAAAGAACCATTCGAGTCACTTACATTGGGATCCTCCCGACCCTGGCTGCTGGAAGCTTAAT
GCTGATGCTTCCTGGTTAGAGAAAGATGAAATTGGCGGAATTGGATGGGCCATTCGTGACTCTAATGGATCTTTAATTGGCTTGGGCTGCAAAAAAAATTTCAATTTGTG
GAACATCAAATGTCTTGAAGCGGAAGCCATTATTGAAGGTTTGAAAGCTTACGAAGGCAGTGGCGGTTTCGAAGGGAACAGACGAAAGCCACCGCTGGTTGTTGAGTCAG
ATTCCGTCGAAGTCGTGAACGTTGTTAATCGAGTCGCCGATGACGCCACAGAGCTCTGTTTGTTCGTGGATGAAATTGACGGCTTCAGGCGCTCGAACCGGGTGAGATCA
TTCTCCAAATGCTCGAGGCAGAGCAACACTTTGGCGCACGAGCTTGCGCGAGCTGCGGCCAAAAATGGCGATTTCCTGGCTTTTGTAAATTTCTCTCTCCTCTATGGAGA
AGACGATAGGTTTTGGAGGGAAGTTCCTTTCCCCGAGTGGTGTAGGAGGAAGTCTCATCGTTATATGGAGCAATCGCCTTCTTCGGTGCTAACACCGTCATCGCCACCAT
GGGAGGTGCTACTTCTTGTCGCGGAACGCCTCGACCCTAAAACCCTTTCCATGGCCTCCTGTGTATGCAAGTCTTGGTCCATTTCCATGGCCTCTGACCACCTTTGGGAG
CCCATTTTCATCGCCAATTTCCCTTCTCTCTCTAACCTTATCATCTCCGACGCCACATCGCCACCTGTCTCCTTTCGCCGCCTCTTCGGCCTCAGACGCCGTCGTTGTCC
GCCATCACCACTGTTAAGGAAGCTCATTAGTTTTTCAGCCAAGAGTATTCAAGTCCAAGTGGTCCAAGTGAAATTTCAAGTTCTTACTGCTCTTGAAGGTCACGAACTTG
AAGATCACATCAGTGAAGATTGCCAACCTCCTCCAAAATCACTCCAAGTAAGTGAAGGCTCCTCTACGGTTAGTAAACCAAACCCCAACTATAAGAATTTTAATCACCCA
TATGGAAATCAGTTCTCTCCAATGCAAGCTATGGTAGCTGTTTCAAACTTTAACCAGGACTGTAATTGGTACCCTGATTCAGGAGCAACCAACCATTTGACCAACAATCT
TGGTAATATGTCTGTCAGCTCTGAGTACCTTGGGAATAATCAAATTCATTTTGGCAACGGTACAAGTTTGACTATTAATCGACTTGGATATTCTTCTCTTATTTCTCCTA
CTAATCATGTTTTCCATCTTCATAACCTATTACATGTTCCATCCATTACTAAAAACTTGATTGGTGTTAGCCAATTTTCCAATGACAACTCTGTTTTCTTTGAATTTCAC
CCTAATTTTTGTCTTGTGAATGACCAAGCAACTGGACAAATACTTCTCCACGGGACTCTTCATGAAGGACTATACAAGTTTAATCTGACCAAGTCTCCGCCATCCACCAA
TTCTAATCTTAATTCTGTTGGTTTTCAATCTTCTTCTCCACAAAACTCTGCTTTGTCTTGCATTACTTCTACTGCTTTACTTTCTTTTGTTCAGTCTTCGAACAATAAGT
CTTATGATGTTTGGCATCAACGACTGGGCCATCCAGCTATGTCTGTAGTTAAGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCCCTCCCACTCGGTCTT
GTCCCCAAAAAGGAAGGCATATTGAGTCGGCGTATTGGCCACTCTTACCCATGCAATCAAAGGACAATCCCTCATGAGAAGGAGTTCATGACACACTCAGGATTAAGACT
GAGTTACCTAAGTCATCGTATTGAAACAGATACCCCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGCGGACCGTATCACATA
GTGTTACCAGGATAAGGTTACCTTCTATAGTCCTTAGTGGGGCAAGTCCTTTGGAGAAGCTTTTCCAGCGGCAACCGGACTACACATTCCTCAAGGTTTTTGGCTATAAT
ATCTCATCCTCTACTTCACCTAATGCTTCACCCAACTCACATTCTTCTCCTTCGTCTTTGGCTAATAATAATAATGTTGACTCTATCCCTTCTACCTCCCCTGAAATTTT
TCCATCAAGTGTCTCCCAACCAGCACAAATTACGTCTGTGTCAAATCATCATCCTATGGTTACTAGGAGTAAGAGAGGAATATTCAAACTGAAAGTTTTCCTTACTACTT
ATCTTGATGTTGAACCACCTAATGTCAAGGAAGCTCTTAAATGTCCTCATTGGAAAAAGACAATGAAAGCTGAATATGATGCTCTTATACAAGTTGATATTAACAATGCA
TTCTTGCATGGTGTGTTGTCTGAGACAGTTTATATGGAACAACCTGCTGGTTTTCAAATTAAAAGTTCTTCTCCTCTTGTTTGTCGTCTCCACAAGGCTTTATATGGGCT
CAAACAAGCCCCTCACGCTTGGCTAGAAAGATTGAGTATGTTTCTCAATTCTCTTGGCTTCGTGAACTCTAAGGCTGATACTTCCTTACTTCTTCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAGCCAGGGAACACAAAGAAGAACACTGGCATAGAGAACAGTAACGTCAAAAAATGGAAACGCATGGCCAGAGCTGAAATAAGCAAGAATCATCACAATTTAGA
AAATGTTACAGAGAAAGCCATTGGGGAAACGAAAAGTGGTTTTGAGAGAGTGGATAAGGTTAGGAGCCTCCTTGATTTTGAAGGGGGCTTGTGTGTTCCTTGCTCTGGCA
AAGGTGGGGGCCTGGCCCTTTTGTGGAAAGATGAGTTAGATCTTTCGATCATTTCTTTTTCTCCTGGGCATATCGATGCGACTATCAAAGGATCCGAAGTTTGGTGGAGA
TTTACGGGTTTCTACGGAAGTCCGATTACCCAGAAAAGGAAAGACTCCTGGGAACTCTTGGAGAAGTTAAGCGGGGTCTCTAATCTCCCTTGGATCCTGGGAGGGGATTT
TAATGAGATTCTTACTTCAGACGAGAAGCAGGGAGGAGCTGAGAGGAACCAAAACCAGATGAGCGCCTTTAGATCAGTTATTGATGCTTGCAGTCTGATTGACTTGGGGT
TCCCCGTGGGGAAATTCACTTGGATTAAAAGGTTTGAAGAGGTCAAAGTCCATCACCTAAATAGGCATGGCTCAGACCACCACCCAATCTCGGTGACCCTAGGCAACCGC
GAGGTCAGGAGGAGGAAAGGGCCGAGGCCCATCAAATTTGAAGGTAACTGGTTGGCTTTCAATGAGTGTAAAGAGATCGTTAAGCTCCACTGGATCAATTCCCCCCGATC
CTCCACGCATAACTTTGATGCCAAGATGACTTCTTGCCTCTATAAGCTCAGTAGCTGGAACAAAACCAGACTAAATGGCTCAATTCAAGCGGCCATAAAAAGAAAAGAGG
ACCATATTAAGGTTATCATGGAGAGAAACTCGGCAACGAGAGATGCTGACCTGGGGTTTGCCGAGAGAGAGTTGGATAGGCTTTTGGAGGAGGAGGAGATCTATTGGAAG
TTTCGTTCCCGGGAGGAGTGGCTCAAGTGGGGTGACCGTAATACCAAGTGGTTCCACTCAAAGGCCAATGCTAGGAGAAAGCGTAACCTGATTAAAGGCTTCTATAATAG
TGAAGGAGTGTGGGTAGACAACGAAGAAATCATGGGCAAGGAAGCCTCGAGATACTTCAAAAATCTGTTTTCCTCCTCAATGGTCGACCAGGAGGCTATCTCTAGGACTT
TAGAAGGTCTGACCTTGAAAATCTTTGAGGCTCAAAAGGAGGATCTGGATAAACCTTTTGCTAGGAGCGAAATTGAGCACTCCCTTAAGAACATGAGCCCCAATAAAGCC
CCTGGCGAGGATGGAGCGCATGCCACCTTTTTTCAGAGTTATTGGGATGTTCTAGGGGATGAAGTGTCTAATATATGCCTCGAGGTCCTTAATAACGGTAAAGATGTTGG
CCCGCTAAACAAAACTCTCATAGCCCTTATACCTAAGACTGCCAAGCCGGAGAAAATGGAAGAGTTTAGACCCATCAGTCTGTGTAATGTCATCTATAAAATCATTGCGA
AATCCATTGCCAACAGACTCAAACGGGTCCTGGGGTCAATCATTTCCCCCACCCAATCAGCTTTTGTACCTGGTAGATCGATTTCCGATAATGTGGTGCTTGGCTTTGAG
TGCATCCACACTTTGATTGGAAGGAAAAGAGGTAGAAAGGGAATAGCCGCCCTTAAGCTTGACATGAGCAAAGCCTACGATAGGGTGGAGTGGACTTTTCTCAGGCAAAT
CATGGAAATCATGGGATTTTCCTTTAGTTGGATCAATCGTGTTATGAGCTGTGTTGAATCGGTGAGATACTCGGTACTCCTTAATGGTATCCCTCAAGAGGAATTTAAAC
CTCTCAGGGGCATCAGACAAGGGGACCCACTATCTCCTTATCTTTTTCTCCTCTGTGCTGAGGGCCTGTCAGCGCTCCTGAACAGGGAAGAGACTCTTTACAACATTTCT
GGAATTCATATTAATAATCACTGTCCTACCGTCACGCATCTTTTTTTCGCAGATGACAGCCTTATCTTCTGCAGGGCTTCTAAGGAGGAAGCAGGGCGAATAAAGGAGAT
CCTCAACGTATACGAGAAGGCTTCGGGACAGAAAGTCAACCTGGACAAGTCGGCCTGCATGATTAGCAAGAATGTGGATGACACCAAAGCCGGAGAGATTAGTGAGATCT
TAGGAGTTAGCCAATCCAACTCCTTTGGATACTATCTTGGCCTCCCAGCGCAAAATGGAAGAAACAAATCCCGACTTTTCAATAGGGTCAAGGAGAGAGTTTGGAAAGCC
CTGCAAGGGTGGAAAGGGAGACTTTTTTCCATGGGCGGGAAGGAAATTCTCATAAAATCTGTAGCGCAGGCCATCCCAACTTATACGATGAGTTGTTTCAAACTTCCAAA
GGCGATCTGCGATGAGATTAACAGGTTATGTAGCCGATTCTGGTGGGGATCTTCAGAGCAAAAAAGGAAAGCCCACTGGATAGGGTGGAAAAAGATGTGCACGAGTAAGG
ACCATGGAGGCTTGGGTTTTAGAGATATACAACTTTTTAACCAAGCCATGCTAGCTAAGCACAGTTGGAGACTCTTAAAGAACCCTCAAAGCCTTCTAGCCAAAGTCTTA
AGAGGCAAATACTTCCATGATGGAAATTTCCTCAAGGCAAAGGAGGGAAACAACCCATCCTTAGCTTGGAGGAGCATTCTTTGGGGCAGAGATTTGTTCATGGAAGGTTA
CAGATGGCGTGTGGGTGATGGGGAAAGAATTTATATTAGCCAAGATCCTTGGTTAGGGAGAGAAGGGTGCAGCAAACCTCTTTGGGTGAATGCTAACTGGCGTATGAAGC
GGGTTAGGGATCTCTTAGAGCCTAATGGTTCTTGGAATAAGAATCTCTTGAGCGAGGTTTTTCTCCCTGATGATGCTAAAGAGATTGCGAAAAATCCTAGAAGAAGAAGC
GGGTCCAAGGACGAGATAATATGGAAGTATGAGGATAAAGGCAATTTCTCTGTCAAGAGTGCTTACCACCTAGCGAAAAGATTGAGCTCCAAAGATATAGCTTCTGGGTC
CAGCGATGCTAGCTCAAAAGCCATTTGGAAGTCTATATGGAAGGCCAATTGTGTTCCTAGAGCTAAGATCACCGTGTGGAAAATCCTCAATGACGCCCTCCCTACTAAGT
CAAATATTGCTAAAAAAGGGATCCATACTAATCTGTGTTGTTGTCTGTGCAGGAATGGCAAGGAGTCAGCAGCTCACTTATTCTGGAAATGTAAGTTCTCCAAAAAGCTT
TGGGCTGATTTCATCCCTAATGCTAACCCTCTTTGGCTTCGGTGCAGGGACTGGGAGGAGCCTATTGATTTTTGGAGTTTCATTCAAAGGAACCTCTCCAAGGAGGAGAC
CAGAATTGCCATCCTTCTGCTGTGGCATATTTGGGAGGCAAGGAATACAAGCAACATCAATAACAATTTGCCAGACATAAACAGAATCAAAAGGCGAATTGCGACCAGCA
TTGAGGAAAGACCTAATCTTCAAGAGGACAACCAGAACACTTCAGAATTAAAGAACCATTCGAGTCACTTACATTGGGATCCTCCCGACCCTGGCTGCTGGAAGCTTAAT
GCTGATGCTTCCTGGTTAGAGAAAGATGAAATTGGCGGAATTGGATGGGCCATTCGTGACTCTAATGGATCTTTAATTGGCTTGGGCTGCAAAAAAAATTTCAATTTGTG
GAACATCAAATGTCTTGAAGCGGAAGCCATTATTGAAGGTTTGAAAGCTTACGAAGGCAGTGGCGGTTTCGAAGGGAACAGACGAAAGCCACCGCTGGTTGTTGAGTCAG
ATTCCGTCGAAGTCGTGAACGTTGTTAATCGAGTCGCCGATGACGCCACAGAGCTCTGTTTGTTCGTGGATGAAATTGACGGCTTCAGGCGCTCGAACCGGGTGAGATCA
TTCTCCAAATGCTCGAGGCAGAGCAACACTTTGGCGCACGAGCTTGCGCGAGCTGCGGCCAAAAATGGCGATTTCCTGGCTTTTGTAAATTTCTCTCTCCTCTATGGAGA
AGACGATAGGTTTTGGAGGGAAGTTCCTTTCCCCGAGTGGTGTAGGAGGAAGTCTCATCGTTATATGGAGCAATCGCCTTCTTCGGTGCTAACACCGTCATCGCCACCAT
GGGAGGTGCTACTTCTTGTCGCGGAACGCCTCGACCCTAAAACCCTTTCCATGGCCTCCTGTGTATGCAAGTCTTGGTCCATTTCCATGGCCTCTGACCACCTTTGGGAG
CCCATTTTCATCGCCAATTTCCCTTCTCTCTCTAACCTTATCATCTCCGACGCCACATCGCCACCTGTCTCCTTTCGCCGCCTCTTCGGCCTCAGACGCCGTCGTTGTCC
GCCATCACCACTGTTAAGGAAGCTCATTAGTTTTTCAGCCAAGAGTATTCAAGTCCAAGTGGTCCAAGTGAAATTTCAAGTTCTTACTGCTCTTGAAGGTCACGAACTTG
AAGATCACATCAGTGAAGATTGCCAACCTCCTCCAAAATCACTCCAAGTAAGTGAAGGCTCCTCTACGGTTAGTAAACCAAACCCCAACTATAAGAATTTTAATCACCCA
TATGGAAATCAGTTCTCTCCAATGCAAGCTATGGTAGCTGTTTCAAACTTTAACCAGGACTGTAATTGGTACCCTGATTCAGGAGCAACCAACCATTTGACCAACAATCT
TGGTAATATGTCTGTCAGCTCTGAGTACCTTGGGAATAATCAAATTCATTTTGGCAACGGTACAAGTTTGACTATTAATCGACTTGGATATTCTTCTCTTATTTCTCCTA
CTAATCATGTTTTCCATCTTCATAACCTATTACATGTTCCATCCATTACTAAAAACTTGATTGGTGTTAGCCAATTTTCCAATGACAACTCTGTTTTCTTTGAATTTCAC
CCTAATTTTTGTCTTGTGAATGACCAAGCAACTGGACAAATACTTCTCCACGGGACTCTTCATGAAGGACTATACAAGTTTAATCTGACCAAGTCTCCGCCATCCACCAA
TTCTAATCTTAATTCTGTTGGTTTTCAATCTTCTTCTCCACAAAACTCTGCTTTGTCTTGCATTACTTCTACTGCTTTACTTTCTTTTGTTCAGTCTTCGAACAATAAGT
CTTATGATGTTTGGCATCAACGACTGGGCCATCCAGCTATGTCTGTAGTTAAGATTGGAATTCCCAATTCCGTTACAGCGGAAGCAATTGGACCCCTCCCACTCGGTCTT
GTCCCCAAAAAGGAAGGCATATTGAGTCGGCGTATTGGCCACTCTTACCCATGCAATCAAAGGACAATCCCTCATGAGAAGGAGTTCATGACACACTCAGGATTAAGACT
GAGTTACCTAAGTCATCGTATTGAAACAGATACCCCCGCTCGCATGTCTCCTACATGGACGCCTTGGATTAATACGTCTGTATCGAATACAAAGCGGACCGTATCACATA
GTGTTACCAGGATAAGGTTACCTTCTATAGTCCTTAGTGGGGCAAGTCCTTTGGAGAAGCTTTTCCAGCGGCAACCGGACTACACATTCCTCAAGGTTTTTGGCTATAAT
ATCTCATCCTCTACTTCACCTAATGCTTCACCCAACTCACATTCTTCTCCTTCGTCTTTGGCTAATAATAATAATGTTGACTCTATCCCTTCTACCTCCCCTGAAATTTT
TCCATCAAGTGTCTCCCAACCAGCACAAATTACGTCTGTGTCAAATCATCATCCTATGGTTACTAGGAGTAAGAGAGGAATATTCAAACTGAAAGTTTTCCTTACTACTT
ATCTTGATGTTGAACCACCTAATGTCAAGGAAGCTCTTAAATGTCCTCATTGGAAAAAGACAATGAAAGCTGAATATGATGCTCTTATACAAGTTGATATTAACAATGCA
TTCTTGCATGGTGTGTTGTCTGAGACAGTTTATATGGAACAACCTGCTGGTTTTCAAATTAAAAGTTCTTCTCCTCTTGTTTGTCGTCTCCACAAGGCTTTATATGGGCT
CAAACAAGCCCCTCACGCTTGGCTAGAAAGATTGAGTATGTTTCTCAATTCTCTTGGCTTCGTGAACTCTAAGGCTGATACTTCCTTACTTCTTCGTTGA
Protein sequenceShow/hide protein sequence
MEEPGNTKKNTGIENSNVKKWKRMARAEISKNHHNLENVTEKAIGETKSGFERVDKVRSLLDFEGGLCVPCSGKGGGLALLWKDELDLSIISFSPGHIDATIKGSEVWWR
FTGFYGSPITQKRKDSWELLEKLSGVSNLPWILGGDFNEILTSDEKQGGAERNQNQMSAFRSVIDACSLIDLGFPVGKFTWIKRFEEVKVHHLNRHGSDHHPISVTLGNR
EVRRRKGPRPIKFEGNWLAFNECKEIVKLHWINSPRSSTHNFDAKMTSCLYKLSSWNKTRLNGSIQAAIKRKEDHIKVIMERNSATRDADLGFAERELDRLLEEEEIYWK
FRSREEWLKWGDRNTKWFHSKANARRKRNLIKGFYNSEGVWVDNEEIMGKEASRYFKNLFSSSMVDQEAISRTLEGLTLKIFEAQKEDLDKPFARSEIEHSLKNMSPNKA
PGEDGAHATFFQSYWDVLGDEVSNICLEVLNNGKDVGPLNKTLIALIPKTAKPEKMEEFRPISLCNVIYKIIAKSIANRLKRVLGSIISPTQSAFVPGRSISDNVVLGFE
CIHTLIGRKRGRKGIAALKLDMSKAYDRVEWTFLRQIMEIMGFSFSWINRVMSCVESVRYSVLLNGIPQEEFKPLRGIRQGDPLSPYLFLLCAEGLSALLNREETLYNIS
GIHINNHCPTVTHLFFADDSLIFCRASKEEAGRIKEILNVYEKASGQKVNLDKSACMISKNVDDTKAGEISEILGVSQSNSFGYYLGLPAQNGRNKSRLFNRVKERVWKA
LQGWKGRLFSMGGKEILIKSVAQAIPTYTMSCFKLPKAICDEINRLCSRFWWGSSEQKRKAHWIGWKKMCTSKDHGGLGFRDIQLFNQAMLAKHSWRLLKNPQSLLAKVL
RGKYFHDGNFLKAKEGNNPSLAWRSILWGRDLFMEGYRWRVGDGERIYISQDPWLGREGCSKPLWVNANWRMKRVRDLLEPNGSWNKNLLSEVFLPDDAKEIAKNPRRRS
GSKDEIIWKYEDKGNFSVKSAYHLAKRLSSKDIASGSSDASSKAIWKSIWKANCVPRAKITVWKILNDALPTKSNIAKKGIHTNLCCCLCRNGKESAAHLFWKCKFSKKL
WADFIPNANPLWLRCRDWEEPIDFWSFIQRNLSKEETRIAILLLWHIWEARNTSNINNNLPDINRIKRRIATSIEERPNLQEDNQNTSELKNHSSHLHWDPPDPGCWKLN
ADASWLEKDEIGGIGWAIRDSNGSLIGLGCKKNFNLWNIKCLEAEAIIEGLKAYEGSGGFEGNRRKPPLVVESDSVEVVNVVNRVADDATELCLFVDEIDGFRRSNRVRS
FSKCSRQSNTLAHELARAAAKNGDFLAFVNFSLLYGEDDRFWREVPFPEWCRRKSHRYMEQSPSSVLTPSSPPWEVLLLVAERLDPKTLSMASCVCKSWSISMASDHLWE
PIFIANFPSLSNLIISDATSPPVSFRRLFGLRRRRCPPSPLLRKLISFSAKSIQVQVVQVKFQVLTALEGHELEDHISEDCQPPPKSLQVSEGSSTVSKPNPNYKNFNHP
YGNQFSPMQAMVAVSNFNQDCNWYPDSGATNHLTNNLGNMSVSSEYLGNNQIHFGNGTSLTINRLGYSSLISPTNHVFHLHNLLHVPSITKNLIGVSQFSNDNSVFFEFH
PNFCLVNDQATGQILLHGTLHEGLYKFNLTKSPPSTNSNLNSVGFQSSSPQNSALSCITSTALLSFVQSSNNKSYDVWHQRLGHPAMSVVKIGIPNSVTAEAIGPLPLGL
VPKKEGILSRRIGHSYPCNQRTIPHEKEFMTHSGLRLSYLSHRIETDTPARMSPTWTPWINTSVSNTKRTVSHSVTRIRLPSIVLSGASPLEKLFQRQPDYTFLKVFGYN
ISSSTSPNASPNSHSSPSSLANNNNVDSIPSTSPEIFPSSVSQPAQITSVSNHHPMVTRSKRGIFKLKVFLTTYLDVEPPNVKEALKCPHWKKTMKAEYDALIQVDINNA
FLHGVLSETVYMEQPAGFQIKSSSPLVCRLHKALYGLKQAPHAWLERLSMFLNSLGFVNSKADTSLLLR