; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0037795 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0037795
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr2:9255817..9259162
RNA-Seq ExpressionLag0037795
SyntenyLag0037795
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]3.2e-21938.18Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDS----NIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDAN
        L  W+ +  +G  K    ++ + I+ L  T Q  L +     I K E ++  +L +EE YW+ R+R DWL  GD+NTK+FHSKAS R +KN +  ++D  
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDS----NIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDAN

Query:  GAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLD
        G WVDD EGI      +F++LF+S+NP    I+ AL G+   ++++ N  LE PF+  DI RAL ++ P+KAPGPDG  A FFQ+ W IVG  ++K CL 
Subjt:  GAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLD

Query:  ILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAV
        ILN  G +  LN T+I+LIPKV  P K+  FRPISLC VVY+I++KA+ANRLK +L+ +ISP+Q+AFIP RLITDNV+IG+E +H I   +  ++G +A+
Subjt:  ILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAV

Query:  KLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFC
        KLD+SKA+DRVEW ++ + M  LGF  +WI+++M C+ +  ++V ING P     PERG+RQG PLSPYLF++CAE  S LL + E    + G +  +  
Subjt:  KLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFC

Query:  PTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQ
         T+THL FADDS++F KA+  + + +K I   Y   SGQ  N +KS+   S   S    + + S   +K+V     YLG+P   GR+K   F ++K  + 
Subjt:  PTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQ

Query:  KTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRI
          +  W  + FS GGKE+LIKAVAQA+P Y MS FKLP  +C+DI K  A+FWWG+  DK+  HW  W  +  +K  GGLGFR++  FNQA++AKQ WR+
Subjt:  KTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRI

Query:  LKNPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGDWDEQ
        ++ PNSL+A++++ +Y+++++F +A +G NPS  WRSILWG ++ K+G+RWR+G+G  + +  D WIPR  +++P+  +    E+ V +LI     W   
Subjt:  LKNPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGDWDEQ

Query:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN
        ++    +++D EAIL I +     +D ++W  DKKG +SVKS Y LA N        P   + + R WK  W +    K KI  W+++ NILP+  N+  
Subjt:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN

Query:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVS-----DFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIKAEFPD-
        +R     +C  C+ + E+V HVL  C  A+K+W    P+++  S     DFF       + W +        EA   +     +W+ RN+   + +  D 
Subjt:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVS-----DFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIKAEFPD-

Query:  ------PTKIIREINSKSREGAYLQSRSRVIPGE----PGQ---------------KKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKC
                 +++     S+ G    ++ R I  +    P Q               +K G+G I RD++G ++  G K  +    V   E  A+   L+ 
Subjt:  ------PTKIIREINSKSREGAYLQSRSRVIPGE----PGQ---------------KKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKC

Query:  LLENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPLFGSQSE
        +   IS     +IVESD  ++V +LN T    +E+  +  + +  +     V FSF  R  N  AH +A+F L  S ++
Subjt:  LLENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPLFGSQSE

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]7.4e-21639.8Show/hide
Query:  KLAAWNSKRLKGTLKGAIERKLQE----INVLASTS-QRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKD
        +L+ W+S     T+ G I +K+Q+    +N LA      +L   I++  +E++ LL++EE YW  RA+  WL  GDRNTK+FH++AS+R K+NT+  + D
Subjt:  KLAAWNSKRLKGTLKGAIERKLQE----INVLASTS-QRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKD

Query:  ANGAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVC
          G W D+EE I   A+ YF  ++SS++  P+ I      I   +TE+ N  L R F++ ++  ALK I+P+KAPGPDG  A FFQ++W IVG +++ + 
Subjt:  ANGAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVC

Query:  LDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYI
        L++LN++  I +LN T ISLIPK  +P +M  FRPISLC VVYK+ISK LANRLK +L  +IS +Q+AF   RLITDNVL+ FE +H +++K  GK+G++
Subjt:  LDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYI

Query:  AVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINK
        A+KLDMSKAFDRVEW +I K+ME++GF  RW ++VM+C+ SV Y++ ING      +P RG+RQGDPLSP LFL+CAEGLSAL+ +      + G  IN+
Subjt:  AVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINK

Query:  FCPTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKES
         CP +THLFFADDSILF KA       ++ IL +YE  SGQ +N DKS+   S N ++    E+ + LG         YLG+PS  GR K +VF  +KE 
Subjt:  FCPTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKES

Query:  IQKTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSW
        +   L GWK +  S GGKE+LIKAVAQAIPTYTMSCF LP  +CDD+ ++   FWWG    + +  W +W+++C SK  GGLGFR +  FN AMLAKQ+W
Subjt:  IQKTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSW

Query:  RILKNPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGD--
        RIL NPNSL+ ++L+ +YF     L+A LG +PS +WRSI    E+ ++G RWRVGNG  I I +D W+P   +YK +  +    E  +   + D     
Subjt:  RILKNPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGD--

Query:  WDEQKIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLA-SNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQ
        W  + +RS  L  + E IL IP+     +D LIW  +KKG FSVKSAYH+A S I        S+G   +  WK+ W +    K KI AW++ ++ LP+ 
Subjt:  WDEQKIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLA-SNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQ

Query:  INIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIV----GSLWNFRNQVKIKAE
         NI  + I  +  C +C    E V H L  C  A  VW  +       SD+     +    +  +       +AS  + +      ++W  RN++     
Subjt:  INIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIV----GSLWNFRNQVKIKAE

Query:  FPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQ-----------------------KKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLK
           P+++    N+   +     S   +IP    Q                       +   IG I RDS G ++ A  K L   +    +E LA+ + + 
Subjt:  FPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQ-----------------------KKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLK

Query:  CLLENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCF-------REKNIVAHEIAR
         LL        ++I+E D++ ++  LN  S           E  H    + SV+ SF F       R  N+VAHE+A+
Subjt:  CLLENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCF-------REKNIVAHEIAR

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]2.7e-22639.44Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQR-NLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW
        L+ WN K + G +   I+ K + +N L  + +  +L   I+   KE+++LL+ EE  W+ R+R  WL  GDRNTK+FH+KAS R ++NT++ + D NG W
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQR-NLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW

Query:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN
         D  EGI  VAV YF+ ++SS+   P  I+  L  I   +TE+ N  L + F+R +IE AL  ++P+KAPGPDG  A FFQ++W+IVG DI  + LD+LN
Subjt:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN

Query:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD
        ++  + ++N T I+L+PK+ +P KM  FRPISLC VVYK+ISK LANRLK +L ++IS +Q+AF+ GRLITDNVL+ FE +H + +K+ GK+G+ A+KLD
Subjt:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD

Query:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL
        MSKA+DRVEW +I+++MEK+GF ++WI +VM C+ SV Y++ +NG       P RG+RQGDP+SPY+FL+CA+G S+LL        ++G  I + CP +
Subjt:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL

Query:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL
        THLFFADDS+LF KAN +  +T+  IL+ YE  SGQ +N+DKS+   S N       E+   LG         YLG+PS  G+ K ++F ++KE +++ L
Subjt:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL

Query:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN
         GWKE+  S GG+E+LIKAVAQAIPTYTMSCF++P ++C++I  +  +FWWG  G +++  W +W+KLC +K+ GG+GFR +  FN AMLAKQ WR++ N
Subjt:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN

Query:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTE-SRVCELI-ADSGDWDEQK
        PNSL+A+I + +Y+ H     A LG +PS TWRSI  G E+ ++G RWRVGNG  I I +D W+P   +YK +       +  RV  LI  +   W +  
Subjt:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTE-SRVCELI-ADSGDWDEQK

Query:  IRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN
        +R   L  +A  IL+IP+     +D +IW  ++KG FSVKSAY++A   I +  +   S G      W++ W +   PK +I AWK  +N LP+ +N++ 
Subjt:  IRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN

Query:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIK--AEFPD----
        K ++   +C  C  + ES  H+   C  AK+VW  +   L + +D      + +D   K+L      +      +  ++W  RN++  +  ++ P+    
Subjt:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIK--AEFPD----

Query:  -PTKIIREINSKSREGAYL--QSRSRVIPGEPG-------------QKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISRPF
           K I E  + S     +  QS  + +   PG              +   +G I RD+ G +  A    L+  ++V+ +E LA+   L    E   +  
Subjt:  -PTKIIREINSKSREGAYL--QSRSRVIPGEPG-------------QKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISRPF

Query:  PKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPLFGSQSE
        P+II+ESD++ +VS +N ++E    +  + +      +  G    +   RE N  AHE+A++      S+
Subjt:  PKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPLFGSQSE

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]5.0e-22039.94Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQR-NLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW
        L +WN + + G +   I  K + +N + +  Q+ N  + I++  KEL+ LL+ EE  WR R++  W   GDRNTK+FH++AS+R KKN++ RL + +G W
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQR-NLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW

Query:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN
         D +E I   A+ YF+ +++S++P    IN  +  I   +T++ N +L + F+  ++ +ALK ++P+KAPGPDG  A FF  +WDIVG  I+ + L++LN
Subjt:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN

Query:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD
        ++  + ++N T ISLIPK   P +M  FRPISLC   YKIISK LANR K +L  +IS +Q+AF P RLITDNVL+ FE +H +N+K  GK+ Y+++KLD
Subjt:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD

Query:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL
        MSKAFDRVEW +I+ +MEKLGF ++WI+++M CV SV Y+V ING       P RGIRQGDPLSP LFL+CAEGLSAL+        +NG  I + CP +
Subjt:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL

Query:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL
        THLFFADDS+LF KA ++    +  IL  YE  SGQ +N DKS+   S N S+ L   + + LG         YLG+PS  G+ K +VF ++K+ + K L
Subjt:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL

Query:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN
         GWK +  S GG+E+LIKAVAQA+PTYTMSCF+LP ++C D+  L   FWWG    +N+  W +WRK+C SK  GG+GFR I  FN AMLAKQ WRIL N
Subjt:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN

Query:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTE-SRVCELI-ADSGDWDEQK
        PNSL+A++ + KYF  +  L++  G NPS  WRSI    ++ ++G RWRVGNG  I I DD W+P   ++K +  R +  +   V  LI  D+  W    
Subjt:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTE-SRVCELI-ADSGDWDEQK

Query:  IRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN
        I +  L  +A  IL IP+     +D LIW  +K+G F+VKSAY++AS+ + S      + G+     WK  W++K  PK KI AW++ +N LP+  N+ +
Subjt:  IRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIIN

Query:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHF--FPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQV------------
        + +  +  C LC K  E++ H L  C  AK  W+ +   P+ LS S    E   A+D    ++     H+      +  S+W  RNQ             
Subjt:  KRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHF--FPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQV------------

Query:  -----KIKAEF------PDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENIS
              + AEF      P  ++ +     K+    Y +  +     +  ++   IG + RD +G ++ A  K L  ++  +  E LA+ E +    E   
Subjt:  -----KIKAEF------PDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENIS

Query:  RPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIAR
              I ESDS+ I+  ++       E   +       A      +F    RE N VAHE+AR
Subjt:  RPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIAR

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber]6.3e-21538.29Show/hide
Query:  LAAWNSKRLKGTLK--GAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGA
        L AW S +    +     + +K++++N    T +      +  A K+LD LL ++E +W   +R  WL  GDRNTK+FHSKASQR K+N +  +++ +G 
Subjt:  LAAWNSKRLKGTLK--GAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGA

Query:  WVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDIL
        WV+D   +  VA+ YF+ +F S   +   +   L  +   +T +   +L +P++  +++ AL  + P+KAPGPDG +A F+QRFW IVG D+S   LD L
Subjt:  WVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDIL

Query:  NNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKL
        N+  ++ ++N T I LIPKV  P+KM  FRPISLC V+YKIISK LANRLK +L ++ISP+Q+AF+PGRLITDNVL+ +E++HA++ ++ GK   +A+KL
Subjt:  NNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKL

Query:  DMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPT
        D+SKA+DRVEW +++ +M +LGF + WIN VM CV +  ++V ING       P RG+RQGDPLSPYLFL+CAEG ++LL++ E+   L+G +I +  P 
Subjt:  DMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPT

Query:  LTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKT
        +++L FADDS++F +AN+  V+ +   LK Y   SGQ +N +KS+   S N S     ++   LG++ VD    YLG+P+  GR K++ F  +KE + K 
Subjt:  LTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKT

Query:  LQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILK
        LQGW+ R  S  GKEVLIKAVAQ+IPTYTM  F LP+ +C++++ LCA+FWWG  GD+ + HW +W+ L   K+ GG+GFR+I  FN AMLAKQ WR+++
Subjt:  LQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILK

Query:  NPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLW-VRNELTESRVCELIA-DSGDWDEQ
        + NSLL+   + KYF   SFL+A+  PN S  W+S+L  +++ K G  WRVG G+ IR+ +D WIP     K L+   N+  E RV +LI      WD +
Subjt:  NPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLW-VRNELTESRVCELIA-DSGDWDEQ

Query:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLA------SNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPS
        +I     Q DAEAIL IP+ R+  +D L+W   +KG + VKS+YH+A      +N +       SD     R W + W+++   K K+  W++ L+ILPS
Subjt:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLA------SNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPS

Query:  QINIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVN-----IVGS--LWNFRNQVK
        ++N+  +++     C LC++  E+  H +W C+ A+ VW+     +      F          D  ++LF+   A  +V      +V S  +W+ RN + 
Subjt:  QINIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVN-----IVGS--LWNFRNQVK

Query:  IKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNW--NVKCLEMLAVAESLKCLLENISRPFPKIIVES
               P ++ +      +E  Y  +++ ++ G       G     +   G L    F A   ++   V  +   A  E +  L         K+IVE 
Subjt:  IKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNW--NVKCLEMLAVAESLKCLLENISRPFPKIIVES

Query:  DSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF
        D+  ++S +   +  WS +  V ++ +H A  +  V+F+   R  N VAH +AR+
Subjt:  DSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF

TrEMBL top hitse value%identityAlignment
A0A2N9GPZ7 Reverse transcriptase domain-containing protein6.8e-21542.89Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV
        L  W+ +R  G+L  +I+RK +++  L + +     + I + + +L+ LLE+EE +WR R+R  W++ GD+NTK+FH++ ++R + N +  L+D +G W 
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV

Query:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN
         ++  I  +AV+YF+ +F+S+NP    I T L G+++ +T   N  L+  F++ ++  ALK + P+KAPGPDG  A F+Q +WDIVG ++++  L IL++
Subjt:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN

Query:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM
          ++ ++N T I+LIPKV +P+ +  FRPISLC V+YKI+SK LANRLKKVL  VIS +Q+AF+PGRLITDNVL+ FE +H+++ KR GK G +A+KLDM
Subjt:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM

Query:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT
        SKA+DRVEW+++  IM  +GF K WI ++M C+ SV Y+V ING     F   RGIRQGD LSPYLFLICAEGLS LL +      L G   ++  P LT
Subjt:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT

Query:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ
        HLFFADDS+LF +A   N E V  IL++YE  SGQ +N  K++   +K+ S  +   +     +  + S   YLG+PS  GR K   F +IK  + + + 
Subjt:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ

Query:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP
        GWKE+F S  G+EVLIKAVAQ+IPTY+MSCFKLP S+C+D+N + + FWWG      +AHW  W KLC SK  GGLGFR++  FN A+LAKQ WR L++ 
Subjt:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP

Query:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELI--ADSGDWDEQKI
        NSL+ ++ + KYF    F+ A+LG  PS  WRSI   R++ + G++W +G+G  ++I +D W+P   S+K +  ++ L       ++   DS  W+   I
Subjt:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELI--ADSGDWDEQKI

Query:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAA-MASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK
         S   + +A+ I AIP+  +   D L W   K GLF+VKSAYHL    K+A  M   S G   ++FWK  W +   PK K+  W++   ILP+   + N+
Subjt:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAA-MASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK

Query:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD
         +  N LC  C    ES  H LWTC  A  VW+              E    + +WD+ +  F D
Subjt:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD

A0A2N9I335 Reverse transcriptase domain-containing protein2.1e-21641.05Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV
        L  W+  R+ G+L   I+ K   +  L   S       I + + EL+ LLE EE YWR R+R  W+  GD+NTK+FH+  + R + N +  L+D  G   
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV

Query:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN
         D+  +  +AV+YF+ +FSS+NP    IN+ L G++  +TE+ N  L   F+  ++ +ALK + P+KAPGPDG  A F+Q +WDIVG ++++  L IL++
Subjt:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN

Query:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM
          ++ ++N T I+LIPKV +P+++  FRPISLC V+YKI+SK LANRLKKVL  VIS SQ+AF+PGRLITDNVL+ FE +H+++ KR+G+ G +A+KLDM
Subjt:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM

Query:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT
        SKA+DRVEW+++  IM +LGF + WIN++M C++SV Y+V ING     F   RGIRQGD LSPYLFL+CAEGLS LL +      ++G   ++  P LT
Subjt:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT

Query:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ
        HLFFADDS+LF +A   N   V  IL++YE VSGQ +N  K++   ++N S  + +++     +  + S   YLG+PS  GR K+  F ++K  + + + 
Subjt:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ

Query:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP
        GWKE+F S GG+EVLIKAVAQAIPTYTMSCFKLP  +C D+N +   FWWG      +AHW  W+KLC SKE GG+GFR++  FN A+LAKQ WR+L+ P
Subjt:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP

Query:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNEL-TESRVCELI-ADSGDWDEQKI
         SL+ ++L+ KYF    FL+AS+G  PS  WRSI   R +   G+RW +G+G  +RI  D W+P       L  +N + +  RV  LI  DS  W+ + I
Subjt:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNEL-TESRVCELI-ADSGDWDEQKI

Query:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASP-SDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK
             + +A  I +IP+  +   D L W   K G+F+VKSAYHL    +SAA     S     ++FWK  W +   PK K   W++ L ILP+   +  +
Subjt:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASP-SDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK

Query:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD-----------HEASTAVNIVGSLWNFRNQVKIKA
         +  + LC  C    ESV H LW+C  A  VW               E    + +WD+ +  F D            E +    +   +W  RN++  + 
Subjt:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD-----------HEASTAVNIVGSLWNFRNQVKIKA

Query:  EFPDPTKIIRE---INSKSREGAYLQSRSRVIPGEP-GQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCL
          P+PT ++R    + S   E    Q  +R   GE   Q+  G  W    S G          K NW + C+
Subjt:  EFPDPTKIIRE---INSKSREGAYLQSRSRVIPGEP-GQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCL

A0A2N9IPS8 Reverse transcriptase domain-containing protein6.8e-21542.89Show/hide
Query:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV
        L  W+ +R  G+L  +I+RK +++  L + +     + I + + +L+ LLE+EE +WR R+R  W++ GD+NTK+FH++ ++R + N +  L+D +G W 
Subjt:  LAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWV

Query:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN
         ++  I  +AV+YF+ +F+S+NP    I T L G+++ +T   N  L+  F++ ++  ALK + P+KAPGPDG  A F+Q +WDIVG ++++  L IL++
Subjt:  DDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNN

Query:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM
          ++ ++N T I+LIPKV +P+ +  FRPISLC V+YKI+SK LANRLKKVL  VIS +Q+AF+PGRLITDNVL+ FE +H+++ KR GK G +A+KLDM
Subjt:  DGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDM

Query:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT
        SKA+DRVEW+++  IM  +GF K WI ++M C+ SV Y+V ING     F   RGIRQGD LSPYLFLICAEGLS LL +      L G   ++  P LT
Subjt:  SKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLT

Query:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ
        HLFFADDS+LF +A   N E V  IL++YE  SGQ +N  K++   +K+ S  +   +     +  + S   YLG+PS  GR K   F +IK  + + + 
Subjt:  HLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQ

Query:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP
        GWKE+F S  G+EVLIKAVAQ+IPTY+MSCFKLP S+C+D+N + + FWWG      +AHW  W KLC SK  GGLGFR++  FN A+LAKQ WR L++ 
Subjt:  GWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNP

Query:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELI--ADSGDWDEQKI
        NSL+ ++ + KYF    F+ A+LG  PS  WRSI   R++ + G++W +G+G  ++I +D W+P   S+K +  ++ L       ++   DS  W+   I
Subjt:  NSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELI--ADSGDWDEQKI

Query:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAA-MASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK
         S   + +A+ I AIP+  +   D L W   K GLF+VKSAYHL    K+A  M   S G   ++FWK  W +   PK K+  W++   ILP+   + N+
Subjt:  RSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAA-MASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINK

Query:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD
         +  N LC  C    ES  H LWTC  A  VW+              E    + +WD+ +  F D
Subjt:  RIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKD

A0A7N2L6Z9 Reverse transcriptase domain-containing protein3.5e-22741.68Show/hide
Query:  KLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW
        +LA+WNS  L+   K   E++     +        L   I+   +EL+ LL++EE +W  R++  WL  GDRNTK+FH++AS+R K+NT+  + D  G W
Subjt:  KLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAW

Query:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN
         +D + I   AV YF+ ++S++N  P++++   A I   ITE+ N +L R F+R +I  ALK I+P+K+PGPDG  A FFQ++WDIVG ++S + L++LN
Subjt:  VDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILN

Query:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD
            +  +N T I LIPK  +P +M  FRPISLC V+YK+ISK LANRLK  L  +I+ +Q+AF   RLITDNVLI +E +H + +K+ GKD ++A KLD
Subjt:  NDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLD

Query:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL
        MSKAFDRVEW +I ++M K+GF + WI+++M+C+ SV Y+V ING       P RG+RQGDPLSPYLFL+CAEGLSALL        LNG  + + CP +
Subjt:  MSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTL

Query:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL
        THLFFADDS+LF KAN+   E +K+IL++YEA SGQ VN DKS+   S N +  L   + + LG         YLG+PS  GR K  VF +IKE +   L
Subjt:  THLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTL

Query:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN
         GWK +  S+GGKE+LIKAVAQAIPTYTMSCF LP S+CD++ K+   FWWG    +++  W +WRK+C  K  GGLGFR +  FN A+LAKQ+WRIL N
Subjt:  QGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN

Query:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTES--RVCELI-ADSGDWDEQ
        P SL A+IL+ KYF +   L+ASLG NPS TWRSI    E+ K+G RWRVGNG  I I DD W+P   +YK +     +TE    V  LI  D+  W   
Subjt:  PNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTES--RVCELI-ADSGDWDEQ

Query:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINII
         IR+  L  DAEAIL IP+      D +IW  +KKG FSVKSAY +A N ++SA +   S G      WK  WK+    K KI AW++ LN LP+  N+ 
Subjt:  KIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASN-IKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINII

Query:  NKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHF----FPILLSVSD-------------------FFREGWNAMDRWDKLLRLFKDHEASTAVNIV
         + ++ N  C +C ++ E + H L TC FA  VW+ +      +LL   D                   FF   W     W        D +  + +   
Subjt:  NKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHF----FPILLSVSD-------------------FFREGWNAMDRWDKLLRLFKDHEASTAVNIV

Query:  GSLWNFRNQV------KIKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCL
           W    ++       IK +FP     +R   S    G +  +       +P     G+G + RD  G +I A  K L   +  +  E+ A+ + L   
Subjt:  GSLWNFRNQV------KIKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCL

Query:  LENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF
         E      P+I++ESD++  +  +N ++ +  E   + E       L    +FS+  R+ N+VAH++A+F
Subjt:  LENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF

A0A803QC75 Uncharacterized protein8.0e-21638.47Show/hide
Query:  AIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYFK
        +++ K+ ++N   S+S  +    +  AE  L+ LLE+EE YW+ R+R DWL+ GDRNTK+FH+KAS R   N +  L ++ G  V  +  I  V   ++ 
Subjt:  AIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYFK

Query:  KLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLI
         LFSSA+ D   +N+ L+ I   ++ + N  L +PF+  ++  AL  + P K+PG DG  A F+Q+ W IVG  ++KV L ILN++     LN T I+LI
Subjt:  KLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLI

Query:  PKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKI
        PKV  P  ++ +RPISLC V+ K+++K L +R K  L  VIS  Q+AF+P RLITDNVL+ FE +HAI NK  G++G  + KLDMSKAFDRVEW +I ++
Subjt:  PKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKI

Query:  MEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILFIKAN
        M K+GF +RWI+++M C+ +  ++  ING       P RG+RQG PLSPYLFLIC+EG S LL  E++ ++L GF++ +  P +THLFFADDS+LF +AN
Subjt:  MEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILFIKAN

Query:  KRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQGWKERFFSTGGKEVL
        +R+   +K++L  Y   SGQ +NLDKS    S N +          L + + +    YLG+PS +GRDK ++F  IKE I K +  W E+ FS GGKE+L
Subjt:  KRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQGWKERFFSTGGKEVL

Query:  IKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRH
        +KAV Q+IPTY MSCF+LP   C  +  + A FWWG   + +R HW +W  LC SK  GG+GFR    FNQA+LAKQ+WRI + P+SLL +IL+ +YF +
Subjt:  IKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRH

Query:  NSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGDWDEQKIRSFLLQDDAEAILAIPI
        N+FL+ASLG +PSLTW+ I W REL  +G+RW+VG+G HI+   D WIP   ++ P       +   V  LI D   WD   ++ F    D E IL++P+
Subjt:  NSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGDWDEQKIRSFLLQDDAEAILAIPI

Query:  GRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRKKAESV
            ++D LIW     GL++VKS YHLA+++     +S S  +    +WK FW ++  PK KI AW++I + LP   +++ +++ T+  C +C++  ESV
Subjt:  GRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRKKAESV

Query:  EHVLWTCAFAKKVWSHFFPI--------------LLSVSDFFREG------WNAMDRWDKLLRLFKDHEASTAVNI----VGSLWNFRN-QVKIK-----
         H L++C +AK VW H   +              +L +S  + +        +    W +  R+   H+A  A ++    V  L N+RN Q +++     
Subjt:  EHVLWTCAFAKKVWSHFFPI--------------LLSVSDFFREG------WNAMDRWDKLLRLFKDHEASTAVNI----VGSLWNFRN-QVKIK-----

Query:  -AEFPDPTKII----REINSKSREGAYLQSRSRVIPGEPG-------------QKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLL
              P +I+     +I S S +   + +  +  P E               QK  GIG I RDS G+++ A  K    +++   +E +A+ +SL  + 
Subjt:  -AEFPDPTKII----REINSKSREGAYLQSRSRVIPGEPG-------------QKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLL

Query:  ENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPL
          I +  P  +VESD++ +V+ L       S    +  +     + +  V  +   RE N+ AH +A+F L
Subjt:  ENISRPFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.5e-4126.09Show/hide
Query:  KLQEINVLAST-SQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDW----LNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYF
        +L+E+     T S+ +    I K   EL + +E ++   +I     W    +N  DR       K   + +KN +D +K+  G    D   I     EY+
Subjt:  KLQEINVLAST-SQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDW----LNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYF

Query:  KKLFSSANPDPNLINTAL-AGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWIS
        K L+++   +   ++T L       + +++   L RP +  +I   +  +   K+PGPDG  A+F+QR+ + +   + K+   I     +        I 
Subjt:  KKLFSSANPDPNLINTAL-AGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWIS

Query:  LIPKVPHPD--KMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIY
        LIPK P  D  K E+FRPISL  +  KI++K LANR+++ + K+I   Q  FIPG     N+      I  IN  R     ++ + +D  KAFD+++  +
Subjt:  LIPKVPHPD--KMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIY

Query:  IRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILF
        + K + KLG    ++ I+    +     + +NG     F  + G RQG PLSP LF I  E L+  + +E+    + G ++ K    L+   FADD I++
Subjt:  IRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILF

Query:  IKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWK----IKESIQKTLQGWKERFF
        ++    + + + K++  +  VSG  +N+ KS   +  N +R   +++   L   +      YLG+  Q  RD   +F +    + + I++    WK    
Subjt:  IKANKRNVETVKKILKEYEAVSGQTVNLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWK----IKESIQKTLQGWKERFF

Query:  STGGKEVLIK--AVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN
        S  G+  ++K   + + I  +     KLPM+   ++ K   KF W    ++ RA       L    + GG+   +  L+ +A + K +W   +N
Subjt:  STGGKEVLIK--AVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKN

P0C2F6 Putative ribonuclease H protein At1g657501.4e-3928.84Show/hide
Query:  MPSQTGRDKHKVFWKIKESIQKTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGG
        MP    R     F +I E +   + GW+E+  S  G+  L KAV  ++P ++MS   LP SI + +++L   F WGS  +K + H   W K+C+ K+ GG
Subjt:  MPSQTGRDKHKVFWKIKESIQKTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGG

Query:  LGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLDASLGPNP--SLTWRSILWG-RELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL
        LG R     N+A+++K  WR+L+  NSL   +L+ KY          L P    S TWRSI  G R++   G+ W  G+G  IR   D W+    S KPL
Subjt:  LGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLDASLGPNP--SLTWRSILWG-RELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL

Query:  WVRNELTESRVCELIA------DSGDWDEQKIRSFLLQDDAEAILAIPIGR-KDAKDTLIWGPDKKGLFSVKSAYHL--ASNIKSAAMASPSDGSHTKRF
           +       C+ +           WD  KI  +   +    + A+ +     A+D L W   + G FSV+SAY +     +    MAS         F
Subjt:  WVRNELTESRVCELIA------DSGDWDEQKIRSFLLQDDAEAILAIPIGR-KDAKDTLIWGPDKKGLFSVKSAYHL--ASNIKSAAMASPSDGSHTKRF

Query:  WKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFP
        +   WK++   + K   W      + ++     + +  +++C +C+   ES+ HVL  C     +W    P
Subjt:  WKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFP

P11369 LINE-1 retrotransposable element ORF2 protein7.8e-4325.81Show/hide
Query:  ASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNK-----------KNTLDRLKDANGAWVDDEEGIGVVAVEYFKK
        A++ +R+    I K   E++Q +E      RI          ++   WF  K ++ +K           K  ++++++  G    D E I      ++K+
Subjt:  ASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNK-----------KNTLDRLKDANGAWVDDEEGIGVVAVEYFKK

Query:  LFSSANPDPNLINTALAGIKA-CITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTW----
        L+S+   + + ++  L   +   + +DQ   L  P S  +IE  +  +   K+PGPDG  A+F+Q F +    D+  +   + +   V G L  ++    
Subjt:  LFSSANPDPNLINTALAGIKA-CITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTW----

Query:  ISLIPK-VPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWI
        I+LIPK    P K+E+FRPISL  +  KI++K LANR+++ +  +I P Q  FIPG     N+      IH IN  ++    ++ + LD  KAFD+++  
Subjt:  ISLIPK-VPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWI

Query:  YIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSIL
        ++ K++E+ G    ++N++          +K+NG        + G RQG PLSPYLF I  E L+  + +++    + G +I K    ++ L  ADD I+
Subjt:  YIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSIL

Query:  FIKANKRNVETVKKILKEYEAVSGQTVNLDKS-ACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMP-SQTGRDKH-KVFWKIKESIQKTLQGWKERFF
        +I   K +   +  ++  +  V G  +N +KS A + +KN  +    E+       +V +   YLG+  ++  +D + K F  +K+ I++ L+ WK+   
Subjt:  FIKANKRNVETVKKILKEYEAVSGQTVNLDKS-ACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMP-SQTGRDKH-KVFWKIKESIQKTLQGWKERFF

Query:  STGGKEVLIK--AVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSW
        S  G+  ++K   + +AI  +     K+P    +++     KF W      N+        L   +  GG+   ++ L+ +A++ K +W
Subjt:  STGGKEVLIK--AVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.3e-4026.92Show/hide
Query:  IRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIER
        +R+R   L   DR +++F++   ++  +  +  L   +G  ++D E I   A  +++ LFS     P+       G+   ++E +   LE P +  ++ +
Subjt:  IRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIGVVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIER

Query:  ALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISP
        AL+ +  +K+PG DG   +FFQ FWD +G D  +V  +      +        +SL+PK      ++++RP+SL    YKI++KA++ RLK VL +VI P
Subjt:  ALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISP

Query:  SQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQ
         Q+  +PGR I DNV +  + +H    +R G      + LD  KAFDRV+  Y+   ++   FG +++  +     S +  VKIN   T      RG+RQ
Subjt:  SQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWINIVMKCVESVKYAVKINGFPTPEFFPERGIRQ

Query:  GDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMM--SKNVSRVLAT
        G PLS  L+ +  E    LL +      L G  + +    +    +ADD IL +  +  ++E  ++  + Y A S   +N  KS+ ++  S  V  +   
Subjt:  GDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTVNLDKSACMM--SKNVSRVLAT

Query:  ELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQGWK--ERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIG
                K++  LG YL   S       + F +++E +   L  WK   +  S  G+ ++I  +  +   Y + C          I +    F W  IG
Subjt:  ELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQGWK--ERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAKFWWGSIG

Query:  DKNRAHWTN
             HW +
Subjt:  DKNRAHWTN

P93295 Uncharacterized mitochondrial protein AtMg003104.4e-3846.05Show/hide
Query:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKE-FGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLD
        A+P Y MSCF+L   +C  +     +FWW S  +K +  W  W+KLC SKE  GGLGFR++  FNQA+LAKQS+RI+  P++LL+++LR +YF H+S ++
Subjt:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKE-FGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLD

Query:  ASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL
         S+G  PS  WRSI+ GREL  +G+   +G+G H ++  D WI  E    PL
Subjt:  ASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.8e-1627.82Show/hide
Query:  GAIERKLQE-INVLASTSQRNLDSNIDK-------AEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGI
        G I+ K +E ++ L S   + L +  D        A K+ +      E ++R ++R  WL  GD NT++FH        KN +  L+  +   V++   +
Subjt:  GAIERKLQE-INVLASTSQRNLDSNIDK-------AEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGI

Query:  GVVAVEYFKKLFSS----ANPD---------PNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKV
          + V Y+  L  S      PD         P   N  LA   + +  D+           +I  A+  +  +KAPGPD   A+FF   W +V       
Subjt:  GVVAVEYFKKLFSS----ANPD---------PNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKV

Query:  CLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIIS
          +      ++ + N T I+LIPKV   D++  FRP+S C VVYKII+
Subjt:  CLDILNNDGVIGQLNCTWISLIPKVPHPDKMESFRPISLCCVVYKIIS

AT3G09510.1 Ribonuclease H-like superfamily protein1.3e-2923.49Show/hide
Query:  LRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGD---WDEQKIRSFLLQ
        ++ +YF+  S LDA +    S  W S+L G  L K+G R  +G+G +IRI  D+ +      +PL       E  +  L    G    WD+ KI  F+ Q
Subjt:  LRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTESRVCELIADSGD---WDEQKIRSFLLQ

Query:  DDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHL
         D   I  I + +    D +IW  +  G ++V+S Y L ++  S  + + +    +       W +   PK K   W+++   L +   +  + +  +  
Subjt:  DDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHL

Query:  CFLCRKKAESVEHVLWTCAFAKKVW--------------SHFFPILLSVSDFFREGWNAMDRWDKLLRL-----------------FKDHEASTAVNIVG
        C  C ++ ES+ H L+TC FA   W              + F   + ++ +F ++    M  + KLL +                 F++  + T ++   
Subjt:  CFLCRKKAESVEHVLWTCAFAKKVW--------------SHFFPILLSVSDFFREGWNAMDRWDKLLRL-----------------FKDHEASTAVNIVG

Query:  SLWNFRNQVKIKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQK-KGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISR
           ++ N  +   + P PT+ I E   + R       +     G   QK +   GWI R+  G+ I  G   L    N    E  A+   L  L +   R
Subjt:  SLWNFRNQVKIKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQK-KGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISR

Query:  PFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF
         + ++ +E D   +++++N  S   S +A   E+   +A    S+ F F  R+ N +AH +A++
Subjt:  PFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARF

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.7e-1338.55Show/hide
Query:  LANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWI
        +  RLK ++  +I P+QA+FIPGR+ TDN++   E++H++  K+ G  G++ +KLD+ KA+DR+ W Y+   +   GF + W+
Subjt:  LANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWI

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-5627.89Show/hide
Query:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLDA
        A+PTYTM+CF LP ++C  I  + A FWW +  +    HW  W  L   K  GG+GF++I  FN A+L KQ WR+L  P SL+AK+ + +YF  +  L+A
Subjt:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLDA

Query:  SLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTES--------RVCELIADSG-DWDEQKIRSFLLQDDAEAILA
         LG  PS  W+SI   +E+ +QG R  VGNG  I I    W+  + +   L ++    +         +V +LI +SG +W +  I     + + + I  
Subjt:  SLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPLWVRNELTES--------RVCELIADSG-DWDEQKIRSFLLQDDAEAILA

Query:  IPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSH--TKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRK
        +  G +   D+  W     G ++VKS Y + + I +   +SP + S       +++ WK + +PK +   WK + N LP    +  + +     C  C  
Subjt:  IPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSH--TKRFWKEFWKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRK

Query:  KAESVEHVLWTCAFAKKVWS-HFFPILL----SVSDFFREGW-----NAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIKAEFPDPTKIIREINS
          E+V H+L+ C FA+  W+    PI L    + S +    W     N   +W+K  +L           ++  LW  RN++  +    +  +++R    
Subjt:  KAESVEHVLWTCAFAKKVWS-HFFPILL----SVSDFFREGW-----NAMDRWDKLLRLFKDHEASTAVNIVGSLWNFRNQVKIKAEFPDPTKIIREINS

Query:  KSRE----------GAYLQ------SRSRVIPGE-----------PGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISR-
           E          G   Q       R R  P +              ++ GIGW+ R+ KG +   G +AL +  +V    + A  E+++  + ++SR 
Subjt:  KSRE----------GAYLQ------SRSRVIPGE-----------PGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISR-

Query:  PFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIAR
         +  +I ESDS  ++ ILN   E W  +    ++ Q   +    V F F  RE N +A  +AR
Subjt:  PFPKIIVESDSVDIVSILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIAR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-3946.05Show/hide
Query:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKE-FGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLD
        A+P Y MSCF+L   +C  +     +FWW S  +K +  W  W+KLC SKE  GGLGFR++  FNQA+LAKQS+RI+  P++LL+++LR +YF H+S ++
Subjt:  AIPTYTMSCFKLPMSICDDINKLCAKFWWGSIGDKNRAHWTNWRKLCASKE-FGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLD

Query:  ASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL
         S+G  PS  WRSI+ GREL  +G+   +G+G H ++  D WI  E    PL
Subjt:  ASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRIKDDHWIPREGSYKPL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAAAGTTAGCGGCCTGGAATAGTAAAAGACTTAAAGGCACTCTCAAAGGTGCGATTGAAAGGAAGCTTCAAGAGATAAATGTGTTGGCTTCCACAAGCCAAAGGAA
TCTAGACAGCAATATTGACAAGGCAGAAAAGGAATTAGACCAGTTGCTTGAAGAAGAAGAGCAGTATTGGAGGATTAGAGCTAGAGAAGATTGGCTCAATTGGGGGGATA
GAAACACTAAGTGGTTCCATTCCAAAGCAAGTCAAAGAAACAAGAAAAACACCTTGGATAGGTTGAAAGATGCTAATGGAGCTTGGGTGGATGATGAAGAGGGAATTGGG
GTGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCCTCAGCCAATCCCGACCCTAATTTGATTAACACAGCCTTGGCAGGTATCAAGGCCTGCATCACTGAGGACCAAAA
CCGGGACCTTGAGAGGCCGTTTTCGAGATGTGACATAGAAAGGGCGTTGAAAGACATTAACCCGTCCAAGGCTCCAGGTCCTGATGGTGCCCATGCCAAGTTCTTCCAGC
GGTTTTGGGATATTGTGGGTGGCGATATCTCTAAAGTGTGCTTAGACATTCTTAACAATGATGGTGTTATTGGGCAGCTAAATTGCACGTGGATTTCCTTAATCCCCAAG
GTTCCCCACCCCGATAAAATGGAAAGTTTTCGACCTATCAGCCTATGCTGCGTGGTTTACAAAATTATATCCAAAGCCCTGGCAAATAGATTGAAAAAAGTTCTCGACAA
AGTCATTTCTCCTTCTCAGGCGGCTTTCATTCCTGGAAGGCTAATTACTGATAATGTCCTTATTGGCTTTGAGAGCATCCACGCGATAAACAACAAAAGAGTGGGGAAGG
ATGGCTACATTGCTGTGAAGCTGGATATGAGTAAAGCCTTCGACCGGGTGGAATGGATCTATATCCGTAAAATCATGGAGAAGCTTGGTTTTGGTAAAAGATGGATCAAT
ATCGTGATGAAGTGTGTGGAATCTGTCAAATATGCGGTCAAGATCAACGGCTTCCCTACTCCCGAGTTCTTCCCGGAAAGAGGAATTCGTCAAGGGGACCCGTTATCGCC
TTATCTGTTCCTGATATGTGCTGAAGGTTTATCTGCGCTTCTCACCAGGGAGGAAAATCTCTCTTCTCTTAATGGTTTTAGAATCAATAAATTTTGTCCTACTCTAACCC
ATCTTTTTTTCGCAGATGACAGCATCCTCTTCATCAAAGCCAACAAAAGGAATGTTGAAACGGTCAAGAAAATTCTGAAAGAATATGAAGCCGTTTCGGGCCAAACTGTC
AACTTGGATAAGTCGGCGTGCATGATGAGCAAAAACGTATCTAGGGTGTTGGCAACCGAGCTTAGTAGCTGCTTGGGGATCAAGCTGGTGGATTCTCTGGGATACTACCT
CGGAATGCCCTCCCAAACGGGTCGAGATAAACACAAGGTGTTTTGGAAGATCAAAGAGAGCATCCAAAAGACCCTCCAAGGTTGGAAGGAGCGATTTTTTTCCACCGGAG
GAAAGGAAGTGTTGATCAAGGCAGTGGCCCAAGCTATTCCAACTTACACAATGAGCTGCTTCAAGCTTCCTATGAGTATTTGCGATGATATCAATAAACTGTGTGCCAAG
TTCTGGTGGGGATCTATAGGTGATAAAAATAGGGCTCATTGGACCAATTGGAGGAAGTTGTGTGCTAGCAAGGAGTTTGGAGGTCTTGGCTTCAGAGAGATTAGCTTGTT
CAATCAAGCCATGTTAGCGAAGCAAAGCTGGCGAATCTTAAAAAATCCCAATAGTCTCCTTGCAAAAATCTTGAGAGGAAAATATTTTAGACACAACAGCTTCCTCGATG
CCTCTCTTGGGCCGAACCCTTCACTCACTTGGAGGAGCATCTTGTGGGGTAGGGAGCTCTTCAAGCAAGGCATGAGATGGAGAGTGGGGAATGGAGCTCACATTAGAATC
AAGGATGACCATTGGATCCCCAGGGAAGGAAGTTATAAACCTCTTTGGGTTAGGAATGAGCTCACCGAATCTAGAGTTTGTGAGCTCATTGCGGACTCAGGGGATTGGGA
TGAGCAAAAAATTAGATCTTTCCTCCTTCAGGACGATGCGGAAGCCATTCTAGCCATTCCCATCGGAAGAAAGGATGCTAAGGACACGCTCATTTGGGGTCCGGACAAGA
AAGGTCTTTTTTCAGTCAAAAGTGCTTATCACCTAGCTTCCAACATTAAATCGGCAGCAATGGCCTCTCCCTCTGATGGCTCTCACACGAAACGATTTTGGAAAGAATTT
TGGAAGATTAAGGGAGCCCCTAAGGAGAAAATTTGTGCTTGGAAATCTATTCTTAACATTCTTCCCTCTCAAATCAATATTATCAACAAGAGGATTGATACTAACCACCT
ATGTTTTCTATGTAGGAAAAAGGCCGAATCGGTGGAGCATGTGCTTTGGACCTGCGCTTTTGCTAAAAAGGTTTGGAGTCATTTCTTCCCAATCTTACTTTCGGTCTCGG
ATTTTTTCAGGGAAGGCTGGAATGCTATGGATAGGTGGGATAAGCTTTTGAGGCTGTTTAAAGACCATGAAGCTTCTACTGCAGTCAATATTGTGGGGAGCTTATGGAAT
TTCAGAAATCAAGTCAAAATCAAAGCAGAATTTCCAGACCCAACCAAAATTATCAGGGAAATCAATAGCAAATCAAGAGAGGGGGCTTACCTGCAATCCAGAAGCAGAGT
CATCCCTGGCGAGCCCGGACAAAAAAAGGGCGGGATTGGCTGGATTGCTCGTGACTCCAAAGGTTCTTTAATCCTCGCTGGTTTCAAAGCTCTGAAGCGAAATTGGAATG
TGAAATGCCTGGAGATGCTTGCTGTCGCGGAAAGTCTAAAATGCCTTCTCGAGAATATCAGTAGGCCGTTCCCGAAAATCATCGTCGAATCTGATTCCGTCGATATCGTT
TCCATCCTTAATCGGACCTCTGAAGACTGGTCAGAGGTGGCGACGGTAGCGGAAGAAGCCCAGCATTATGCTACGCTTGTGGGATCGGTTGCTTTCTCCTTCTGTTTCAG
GGAGAAGAACATTGTAGCGCATGAAATCGCGCGTTTTCCTCTCTTCGGTAGTCAATCGGAGACTGAGGGGGATTGTTTTGTGGATTTTTCCATTTCGGAAGAATCATCTT
TTTGTTTGGCGTCTGTGCCTGGGTGGTTGAGCTCCTCTTTAATTAAGGAGCAGAATTGTTATTCTGTTAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAAAGTTAGCGGCCTGGAATAGTAAAAGACTTAAAGGCACTCTCAAAGGTGCGATTGAAAGGAAGCTTCAAGAGATAAATGTGTTGGCTTCCACAAGCCAAAGGAA
TCTAGACAGCAATATTGACAAGGCAGAAAAGGAATTAGACCAGTTGCTTGAAGAAGAAGAGCAGTATTGGAGGATTAGAGCTAGAGAAGATTGGCTCAATTGGGGGGATA
GAAACACTAAGTGGTTCCATTCCAAAGCAAGTCAAAGAAACAAGAAAAACACCTTGGATAGGTTGAAAGATGCTAATGGAGCTTGGGTGGATGATGAAGAGGGAATTGGG
GTGGTGGCGGTGGAGTACTTCAAAAAGCTTTTCTCCTCAGCCAATCCCGACCCTAATTTGATTAACACAGCCTTGGCAGGTATCAAGGCCTGCATCACTGAGGACCAAAA
CCGGGACCTTGAGAGGCCGTTTTCGAGATGTGACATAGAAAGGGCGTTGAAAGACATTAACCCGTCCAAGGCTCCAGGTCCTGATGGTGCCCATGCCAAGTTCTTCCAGC
GGTTTTGGGATATTGTGGGTGGCGATATCTCTAAAGTGTGCTTAGACATTCTTAACAATGATGGTGTTATTGGGCAGCTAAATTGCACGTGGATTTCCTTAATCCCCAAG
GTTCCCCACCCCGATAAAATGGAAAGTTTTCGACCTATCAGCCTATGCTGCGTGGTTTACAAAATTATATCCAAAGCCCTGGCAAATAGATTGAAAAAAGTTCTCGACAA
AGTCATTTCTCCTTCTCAGGCGGCTTTCATTCCTGGAAGGCTAATTACTGATAATGTCCTTATTGGCTTTGAGAGCATCCACGCGATAAACAACAAAAGAGTGGGGAAGG
ATGGCTACATTGCTGTGAAGCTGGATATGAGTAAAGCCTTCGACCGGGTGGAATGGATCTATATCCGTAAAATCATGGAGAAGCTTGGTTTTGGTAAAAGATGGATCAAT
ATCGTGATGAAGTGTGTGGAATCTGTCAAATATGCGGTCAAGATCAACGGCTTCCCTACTCCCGAGTTCTTCCCGGAAAGAGGAATTCGTCAAGGGGACCCGTTATCGCC
TTATCTGTTCCTGATATGTGCTGAAGGTTTATCTGCGCTTCTCACCAGGGAGGAAAATCTCTCTTCTCTTAATGGTTTTAGAATCAATAAATTTTGTCCTACTCTAACCC
ATCTTTTTTTCGCAGATGACAGCATCCTCTTCATCAAAGCCAACAAAAGGAATGTTGAAACGGTCAAGAAAATTCTGAAAGAATATGAAGCCGTTTCGGGCCAAACTGTC
AACTTGGATAAGTCGGCGTGCATGATGAGCAAAAACGTATCTAGGGTGTTGGCAACCGAGCTTAGTAGCTGCTTGGGGATCAAGCTGGTGGATTCTCTGGGATACTACCT
CGGAATGCCCTCCCAAACGGGTCGAGATAAACACAAGGTGTTTTGGAAGATCAAAGAGAGCATCCAAAAGACCCTCCAAGGTTGGAAGGAGCGATTTTTTTCCACCGGAG
GAAAGGAAGTGTTGATCAAGGCAGTGGCCCAAGCTATTCCAACTTACACAATGAGCTGCTTCAAGCTTCCTATGAGTATTTGCGATGATATCAATAAACTGTGTGCCAAG
TTCTGGTGGGGATCTATAGGTGATAAAAATAGGGCTCATTGGACCAATTGGAGGAAGTTGTGTGCTAGCAAGGAGTTTGGAGGTCTTGGCTTCAGAGAGATTAGCTTGTT
CAATCAAGCCATGTTAGCGAAGCAAAGCTGGCGAATCTTAAAAAATCCCAATAGTCTCCTTGCAAAAATCTTGAGAGGAAAATATTTTAGACACAACAGCTTCCTCGATG
CCTCTCTTGGGCCGAACCCTTCACTCACTTGGAGGAGCATCTTGTGGGGTAGGGAGCTCTTCAAGCAAGGCATGAGATGGAGAGTGGGGAATGGAGCTCACATTAGAATC
AAGGATGACCATTGGATCCCCAGGGAAGGAAGTTATAAACCTCTTTGGGTTAGGAATGAGCTCACCGAATCTAGAGTTTGTGAGCTCATTGCGGACTCAGGGGATTGGGA
TGAGCAAAAAATTAGATCTTTCCTCCTTCAGGACGATGCGGAAGCCATTCTAGCCATTCCCATCGGAAGAAAGGATGCTAAGGACACGCTCATTTGGGGTCCGGACAAGA
AAGGTCTTTTTTCAGTCAAAAGTGCTTATCACCTAGCTTCCAACATTAAATCGGCAGCAATGGCCTCTCCCTCTGATGGCTCTCACACGAAACGATTTTGGAAAGAATTT
TGGAAGATTAAGGGAGCCCCTAAGGAGAAAATTTGTGCTTGGAAATCTATTCTTAACATTCTTCCCTCTCAAATCAATATTATCAACAAGAGGATTGATACTAACCACCT
ATGTTTTCTATGTAGGAAAAAGGCCGAATCGGTGGAGCATGTGCTTTGGACCTGCGCTTTTGCTAAAAAGGTTTGGAGTCATTTCTTCCCAATCTTACTTTCGGTCTCGG
ATTTTTTCAGGGAAGGCTGGAATGCTATGGATAGGTGGGATAAGCTTTTGAGGCTGTTTAAAGACCATGAAGCTTCTACTGCAGTCAATATTGTGGGGAGCTTATGGAAT
TTCAGAAATCAAGTCAAAATCAAAGCAGAATTTCCAGACCCAACCAAAATTATCAGGGAAATCAATAGCAAATCAAGAGAGGGGGCTTACCTGCAATCCAGAAGCAGAGT
CATCCCTGGCGAGCCCGGACAAAAAAAGGGCGGGATTGGCTGGATTGCTCGTGACTCCAAAGGTTCTTTAATCCTCGCTGGTTTCAAAGCTCTGAAGCGAAATTGGAATG
TGAAATGCCTGGAGATGCTTGCTGTCGCGGAAAGTCTAAAATGCCTTCTCGAGAATATCAGTAGGCCGTTCCCGAAAATCATCGTCGAATCTGATTCCGTCGATATCGTT
TCCATCCTTAATCGGACCTCTGAAGACTGGTCAGAGGTGGCGACGGTAGCGGAAGAAGCCCAGCATTATGCTACGCTTGTGGGATCGGTTGCTTTCTCCTTCTGTTTCAG
GGAGAAGAACATTGTAGCGCATGAAATCGCGCGTTTTCCTCTCTTCGGTAGTCAATCGGAGACTGAGGGGGATTGTTTTGTGGATTTTTCCATTTCGGAAGAATCATCTT
TTTGTTTGGCGTCTGTGCCTGGGTGGTTGAGCTCCTCTTTAATTAAGGAGCAGAATTGTTATTCTGTTAGTTGA
Protein sequenceShow/hide protein sequence
MRKLAAWNSKRLKGTLKGAIERKLQEINVLASTSQRNLDSNIDKAEKELDQLLEEEEQYWRIRAREDWLNWGDRNTKWFHSKASQRNKKNTLDRLKDANGAWVDDEEGIG
VVAVEYFKKLFSSANPDPNLINTALAGIKACITEDQNRDLERPFSRCDIERALKDINPSKAPGPDGAHAKFFQRFWDIVGGDISKVCLDILNNDGVIGQLNCTWISLIPK
VPHPDKMESFRPISLCCVVYKIISKALANRLKKVLDKVISPSQAAFIPGRLITDNVLIGFESIHAINNKRVGKDGYIAVKLDMSKAFDRVEWIYIRKIMEKLGFGKRWIN
IVMKCVESVKYAVKINGFPTPEFFPERGIRQGDPLSPYLFLICAEGLSALLTREENLSSLNGFRINKFCPTLTHLFFADDSILFIKANKRNVETVKKILKEYEAVSGQTV
NLDKSACMMSKNVSRVLATELSSCLGIKLVDSLGYYLGMPSQTGRDKHKVFWKIKESIQKTLQGWKERFFSTGGKEVLIKAVAQAIPTYTMSCFKLPMSICDDINKLCAK
FWWGSIGDKNRAHWTNWRKLCASKEFGGLGFREISLFNQAMLAKQSWRILKNPNSLLAKILRGKYFRHNSFLDASLGPNPSLTWRSILWGRELFKQGMRWRVGNGAHIRI
KDDHWIPREGSYKPLWVRNELTESRVCELIADSGDWDEQKIRSFLLQDDAEAILAIPIGRKDAKDTLIWGPDKKGLFSVKSAYHLASNIKSAAMASPSDGSHTKRFWKEF
WKIKGAPKEKICAWKSILNILPSQINIINKRIDTNHLCFLCRKKAESVEHVLWTCAFAKKVWSHFFPILLSVSDFFREGWNAMDRWDKLLRLFKDHEASTAVNIVGSLWN
FRNQVKIKAEFPDPTKIIREINSKSREGAYLQSRSRVIPGEPGQKKGGIGWIARDSKGSLILAGFKALKRNWNVKCLEMLAVAESLKCLLENISRPFPKIIVESDSVDIV
SILNRTSEDWSEVATVAEEAQHYATLVGSVAFSFCFREKNIVAHEIARFPLFGSQSETEGDCFVDFSISEESSFCLASVPGWLSSSLIKEQNCYSVS