; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006274 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006274
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr6:40378297..40381609
RNA-Seq ExpressionLag0006274
SyntenyLag0006274
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023874626.1 uncharacterized protein LOC111987155 [Quercus suber]1.0e-17934.49Show/hide
Query:  RWKDRISKCASELALWGKSKKGNYGRRINEARNRLQ------AKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNR
        R  DR+  C   L  W +S  GN  + + + +++LQ        ++    +   +  +   + +EEI W+QR+R  W++ GDRN+++FH  A+QRRR NR
Subjt:  RWKDRISKCASELALWGKSKKGNYGRRINEARNRLQ------AKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNR

Query:  VEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVR
        + GL + +G WVE++EG++ +I  YF  +F S N  S    ++    I+  VS + N  L   +  +++ V+L+Q+ P KAPG DG   +FYQ+YW IV 
Subjt:  VEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVR

Query:  NEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKR
         +V    L V+N G     +N T I LIPK  + ++++EFR ISLCNV+YKIISK +ANRLK VL +VI  +QSAFVPGR+I DN ++ FE++HY+ Q++
Subjt:  NEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKR

Query:  RGKIGWAALKLDMSKTYDRVECVVL--------LYAEW-------------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKL
        +GK    A+KLDMSK YDRVE   L         + +W                   + K + +   GLRQGDPISPYLFL CAEGL  ML   E    L
Subjt:  RGKIGWAALKLDMSKTYDRVECVVL--------LYAEW-------------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKL

Query:  TGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKA
         GV ++RG+P +SHL FADD + F +A + +   V+ +L  Y   + QK+N  K+ +  S N  E++   +       ++  HE+ LGLP   G GK KA
Subjt:  TGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKA

Query:  LKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNK
          + KD++  +I  WK    S   G E+LIK++ QA PTYTMS FKLP+ L KE NSM++ FWWG +E  RK+ W SW K+CV K  GGLGFRDL+ FN 
Subjt:  LKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNK

Query:  ALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVS-
        ALLAKQ WRL + P+SL+ RV+K KYF + SF  A+     S+ W+S++  R +++ G RW +G+G+ + I  D W+P+  S ++     +     +VS 
Subjt:  ALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVS-

Query:  LLDN-DGWWNKELIRAVFEEDDAMAILGI---------------------------------LRPIVRCPD--------------KIMWNYEKDGRLYH-
        L+D     WN EL+   F   +A ++  I                                 L+   R P+              K++W      ++ H 
Subjt:  LLDN-DGWWNKELIRAVFEEDDAMAILGI---------------------------------LRPIVRCPD--------------KIMWNYEKDGRLYH-

Query:  ------DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-----
              D LPT+  L  RG+  + GC+ C   E S  H +W C  +  +WRD+     + P++     D++W   +      +  F V + W LW     
Subjt:  ------DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-----

Query:  ---------------ICSEGVEAV-----------------C-WIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAM
                          E VE +                 C W+PP    YK+NTD AV  ++    IG ++RNE+G +M  + + ++  L    +EA 
Subjt:  ---------------ICSEGVEAV-----------------C-WIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAM

Query:  AVREGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFC-----EVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPPVV
        AV EG+    D G  ++V+E+DS SVV   +    + S +  +VE  R     L  C     ++    R+ N   H +A  A  +    +W+E+ PP++
Subjt:  AVREGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFC-----EVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPPVV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.5e-18636.89Show/hide
Query:  KDRISKCASELALWGKSKKGNYGRRINEARNRL----QAKVENGGNVNEARLC--LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVE
        +++I  C  EL  WG S        I E + +L    + ++          L   +++L+ ++EIYW QR+R+ W++ GDRN+++FH +A+QRRR N + 
Subjt:  KDRISKCASELALWGKSKKGNYGRRINEARNRL----QAKVENGGNVNEARLC--LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVE

Query:  GLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNE
        G+ + +G+WVE  E +  V   YF  LF +    +  ++EE    +   V+    E L   +T +++  +L Q+GP KAPG DG   LFYQ++W IV + 
Subjt:  GLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNE

Query:  VTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRG
        V S  L+ +N G  +  +NHT I LIPK +N  R+SEFR ISLCNVIYKIISK +ANRLK+VL Q+IS TQSAFVPGR I DN ++ +E+LH M  +++G
Subjt:  VTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRG

Query:  KIGWAALKLDMSKTYDRVECVVL--------LYAEWKEK------RENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTG
        K G  ALKLD+SK YDRVE   L          A W E+        + SI              G+RQGDPISPYLFL CAEGL  +L+  E++  +TG
Subjt:  KIGWAALKLDMSKTYDRVECVVL--------LYAEWKEK------RENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTG

Query:  VRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALK
        V I RG+P I++L FADD L F +A   E   +  +L+ Y   + Q IN  KS    S N SE    +I  +L V  V    + LGLP   G  K     
Subjt:  VRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALK

Query:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKAL
        + KDR+W ++Q WK M  S   G+E+LIK++ QAIPTYTMS F++P +L  E  ++ ARFWWG     RK+HW SW K+   K  GG+GFRDL  FN A+
Subjt:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKAL

Query:  LAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLLD
        LAKQGWRL++  DSLL R FK +YF + SFL+A+   N S++W+SL+  + +L+ G  WRVG+G SI    D W+P  P+ +++          +V+ L 
Subjt:  LAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLLD

Query:  NDGW--WNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG--------------------------------------------------RLYH
        N  +  WN E IRA+F  D+A AI  I       PD I W Y   G                                                  R  H
Subjt:  NDGW--WNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG--------------------------------------------------RLYH

Query:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSP---FFPRLGPSTIV------------NPADLLW------WCWKEMSVD----
        + LPT  NL  R +     C  C +  EST+HA+W+C   + +W  S       R G + +V            +  DL W      WC +   +     
Subjt:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSP---FFPRLGPSTIV------------NPADLLW------WCWKEMSVD----

Query:  --------KFEEFV-----VMSWWELWICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGL
                + EE++       +  ++    + +  + W PP    +KLN D AV  DL     GAI+RNEKGEVM  +  S   + + D  E +A R+ L
Subjt:  --------KFEEFV-----VMSWWELWICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGL

Query:  AAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALR-LQMEGVWLEEVPP
           VDAGFSR++VE D+ +V     S   N S  G+++++I  + R L+   VG   R  N + H +A  A   L+ +  W+E+ PP
Subjt:  AAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALR-LQMEGVWLEEVPP

XP_030940187.1 uncharacterized protein LOC115965136 [Quercus lobata]2.5e-18636.95Show/hide
Query:  EARNRLQAKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNS
        EA N L  + +   N+ +    +   +  EE+ W QR+R  WM+WGDRN+++FH  A QRR  N + GL D  G W E+   MEG+   YF+ +F SNN 
Subjt:  EARNRLQAKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNS

Query:  QSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNAR
         S    +   R I P V+ + N  L   +   ++  +L Q+ P KAPG DG   +F+Q+YW +V   V    LE++N G     +N T I LIPK K  R
Subjt:  QSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNAR

Query:  RVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKIGWAALKLDMSKTYDRVECVVL--------LY
        ++SE+R ISLCNVIYKI+SK +ANRLK +L +VI  +QSAFVPGR I DN ++ FE++H +  +++G+    ALKLDMSK YDRVE   L         +
Subjt:  RVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKIGWAALKLDMSKTYDRVECVVL--------LY

Query:  AEW-------------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMV
         +W                   + K + +   GLRQGDPISPYLFL C EGLF  L   E++  + GV + RG+P ISHLFFADD + F +A + E + V
Subjt:  AEW-------------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMV

Query:  LNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQ
        LNVL  Y   + QK+N  K+ +  S N S D+  ++  L    ++  HE+ LGLP   G GK KA  + KD++  +I  WK    S   G E+LIK + Q
Subjt:  LNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQ

Query:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDA
        A PTYTMSCFKLP  L KE N+M+++FWWG ++  RK+ W +W K+C SK  GG+GFRDL+ FN ALLAKQGWR+ + P++L+ +V K KYF   +F +A
Subjt:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDA

Query:  RPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLLDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRC
        +     SY+W+SL+  + ++  G RW +G+G +++I  D WIP P S  ++     L    V  L+D +   W+ + +R VF   +A AILG+       
Subjt:  RPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLLDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRC

Query:  PDKIMWNYEKDGRL-----YHDFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEE
         D ++W +  +G+      Y + LPT+  L+ RG+D++ GC  C +  ES  H +W CK +  +W +S     L P       +++W          +E 
Subjt:  PDKIMWNYEKDGRL-----YHDFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEE

Query:  FVVMSWWELW-----------------ICSEGVEAVCWI----PPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAV
        FV  + W LW                 +C +  E V       PP    YK N D AV ++     +G ++RN +G++M  + + +   L     EAMA+
Subjt:  FVVMSWWELW-----------------ICSEGVEAVCWI----PPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAV

Query:  REGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCS----RARNILEHEVAMSALRLQMEGVWLEEVPPVV
        + G+    D G   VV E+DSS+V+    SD    +   + ++++   +++   C   W +    R+ N+  H +A +A+ +    VW+E++PPV+
Subjt:  REGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCS----RARNILEHEVAMSALRLQMEGVWLEEVPPVV

XP_030941688.1 uncharacterized protein LOC115966628 [Quercus lobata]6.8e-19237.23Show/hide
Query:  KDRISKCASELALWGKSKKGNYGRRINEARNRLQ------AKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVE
        K +I  C +EL  WG SK       I   + R++        VEN     E   CL+NL+ ++EIYW QR+R+ W++ GD+NS++FH +A+QRRR N ++
Subjt:  KDRISKCASELALWGKSKKGNYGRRINEARNRLQ------AKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVE

Query:  GLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNE
        G+ D E  WVEE+E +  V T YF +LFS+      G   E    +   V+    + L   +T  ++  +L Q+GP KAPG DG   LFYQ++W IV + 
Subjt:  GLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNE

Query:  VTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRG
        V S  L+  N G     +NHT I LIPK     ++S+FR ISLCNVIYKIISK +ANRLK+VL  +IS TQSAFVPG  I DN ++  ++LH M+ +R+G
Subjt:  VTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRG

Query:  KIGWAALKLDMSKTYDRVECVVLL-----------YAEW----------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTG
        K G  ALKLD+SK YDRVE   L            +  W                K         GLRQGDP+SPYLFL CAEG   +L   EV+ +L G
Subjt:  KIGWAALKLDMSKTYDRVECVVLL-----------YAEW----------------KEKRENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTG

Query:  VRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALK
        V I + +P ISHL FADD L F +A  +E   V  +L+TY+  + Q IN  KS I  S N  E         L V  V   E  LGLP   G  K +   
Subjt:  VRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALK

Query:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKAL
          KDR+W ++Q WK    S   G+EVLIK++ Q+IPTYTM  F+LP +L  E ++M ARFWWG  E  RK+HW SWG +  +K  GG+GFRDL  F  AL
Subjt:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKAL

Query:  LAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLL
        LAKQGWRLM+  DSLL + FK +YF +C+FL+A    NSS++WKSL+    +LK G  WRVGDG+SIR+  D WI   P+ +++ P        RV  L+
Subjt:  LAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLL

Query:  DNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG--------------------------------------------------RLYH
        D     W++EL+ + F  DDA AI  I        D ++W + KDG                                                  R   
Subjt:  DNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG--------------------------------------------------RLYH

Query:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--W------------
        D LPT  NL +R +   + C  CK   E+ +HA+WEC  ++ +W  S    +       +   L       +S ++FE F+ +SW  W            
Subjt:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--W------------

Query:  ----------------------ELWICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAA
                              +L I S       W PP    YKLN D A+  +L  +  GAIVRNE+GEVM  L      + + +  EA+A R  +  
Subjt:  ----------------------ELWICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAA

Query:  VVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPPVVDDVYVAELRS
         + AGF+ +++E D+ +V+K   S+  + S +G + ++I+ +   LR+  V    R  N + H +A  A  +  E +WLEE PP+  D    +L S
Subjt:  VVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPPVVDDVYVAELRS

XP_030964568.1 uncharacterized protein LOC115985808 [Quercus lobata]1.0e-17934.68Show/hide
Query:  MRRWKDRISKCASELALWGKSKKGNYGRRINEARNRLQAKVE-------NGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRR
        M R+  ++  C S L  W +   GN  RR+ + + +L A+VE       N   V   R  + +LM +EE  W QR+RV W++ GD+N+ +FH RA QR R
Subjt:  MRRWKDRISKCASELALWGKSKKGNYGRRINEARNRLQAKVE-------NGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRR

Query:  TNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWG
         N +  L    G  VEE+  +   +  YF+ LF+S+N  +    E   + I   V+   N  L + YT  ++  +LKQ+    +PG DG P LFY+ YW 
Subjt:  TNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWG

Query:  IVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMK
         V  +VTS  L V+N G     +NHT I+LIPK K+  +  +FR ISLCNVIYK+ISK+IANRLK++L +++S +QSAF+  R I DN ++ FE+LH++K
Subjt:  IVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMK

Query:  QKRRGKIGWAALKLDMSKTYDRVECVVL--------LYAEWKE------KRENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVD
         KR+GK G+ ALKLDMSK YD+VE   L            W        +  + S+              GLRQGDP+SPYLFL CAEGL  ++   +  
Subjt:  QKRRGKIGWAALKLDMSKTYDRVECVVL--------LYAEWKE------KRENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVD

Query:  RKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGK
         ++ GV I++  P +SHLFFADDCL F KA   + + +L  L  Y   T Q+IN  K+ +  S N S  +   I  LL V  +  +E+ LGLP   G  K
Subjt:  RKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGK

Query:  MKALKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLEL
         ++    ++RIW +IQ WK    S   G EVLIK +LQA+PT+ M+CFKLPK L ++  SM+ +FWWG +  TRK+HW SW  +C+ K  GGLGFRD+E 
Subjt:  MKALKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLEL

Query:  FNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGAR
        FN ALL KQ WR +   DSL  +VFK KYF   + +D   K+  SY W+S+L  R ++++G  WR+ DG  + I  D+W+P P S ++I P N F   A 
Subjt:  FNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGAR

Query:  VVSLLDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------
        V +L+D +G  W +E +   F   +A AIL I     R  D ++W     G                                                 
Subjt:  VVSLLDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------

Query:  RLYHDFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-------
        R   D LPT+++L  R +     C  C+   E T+HA+W C+  + +W +     +       +  D L+   K   +    E   M  W +W       
Subjt:  RLYHDFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-------

Query:  ----------ICSEGVE----------------------AVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMA
                  I ++ +E                      A  W+PPE P +K+N D A+ RD +   IGA+VR+  G ++  L   ++    V+ +EA+A
Subjt:  ----------ICSEGVE----------------------AVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMA

Query:  VREGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVP
         R  +    + G   VVVE DS  + K   SD   ++  G ++ +IR  +     C      R  N +  ++A  A  L    +WLE++P
Subjt:  VREGLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVP

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein7.8e-19436.49Show/hide
Query:  DRISKCASELALWGKSKKGNYGRRINEARNRLQAKVENGGNVNE------ARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEG
        D+I  C   L  W     G+  ++I+EA  RL+    N     +       +  L  L+G+EE  WRQR+R+EW+Q GD+N+R+FH RATQRRR NR+  
Subjt:  DRISKCASELALWGKSKKGNYGRRINEARNRLQAKVENGGNVNE------ARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEG

Query:  LFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEV
        L D  G W+  +  +  V   ++ +LF+S N     ++E+    I   V+ + N  L + +   +++ ++KQ+ P+K+PG DGFP +FYQ+YW I+  +V
Subjt:  LFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEV

Query:  TSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK
        +   L  +N G  ++A+NHT I+LIPK KN   V +FR ISLCNVIYKIISK + NRLK +L Q++S +QSAFVPGR I DN ++ FE+LH+M Q+RRGK
Subjt:  TSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK

Query:  IGWAALKLDMSKTYDRVECVVL--------LYAEW-KEKRENLSI------------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGV
         G  ALKLDMSK YDRVE   L         + +W K   E +S                    GLRQGDP+SPYLFLFCAEGL  +L   + +  + GV
Subjt:  IGWAALKLDMSKTYDRVECVVL--------LYAEW-KEKRENLSI------------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGV

Query:  RIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQ
         I+R  P ++HLFFADD L F KA   E R + ++L  Y   + Q+IN  K+ +  S +    I   I  +L V ++  +E+ LGLP   G  K  +  Q
Subjt:  RIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQ

Query:  TKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALL
         K+R+WS+++ WK    S   G+E+LIKS+ QAIPTY MSCF+LP+RL+KE   ++ RFWWG E    KMHW  W  +C +K  GG+GFR+L  FN+ALL
Subjt:  TKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALL

Query:  AKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGA---RVVSL
        AKQ WRLM   +SL  +VFK KYF QCS LDA+  + SSY WKS++  R +++ G  WR+GDG +I+I  D W+P      II     LP A    VV L
Subjt:  AKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGA---RVVSL

Query:  LDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYH
        +D D   WN  +++A F   +A  ILGI     R  D ++W   + G                                                 R  H
Subjt:  LDNDG-WWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYH

Query:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-----------
        D LPT++NL  R +     C  C    ESTLHA+ +CK  + +W+ +P+   L  ++ V+  +L     + +   +   F  M  W +W           
Subjt:  DFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW-----------

Query:  -------------ICSEGVEA----------------VCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVRE
                     + +E  +A                + W PP    YK+N D AV R+ N   IGAIVRN +GEVM  L + + +   ++ +EA A + 
Subjt:  -------------ICSEGVEA----------------VCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVRE

Query:  GLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP
         +  V+D G   V +E DS  ++   Q      +  G L+ +   +++ L   +     R  N + H +A  A   +   +W+E VPP
Subjt:  GLAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP

A0A2N9FN80 Uncharacterized protein2.3e-18536.53Show/hide
Query:  RISKCASELALWGKSKKGNYGRRINEARNRL-QAKVEN--GGNVNEARLC---LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL
        +I  C  EL +W K   GN   +I     RL QA+  +  GG  +   L    L +L+ +EE  WRQR+R EW++ GDRN+R+FH RATQR+R N V  L
Subjt:  RISKCASELALWGKSKKGNYGRRINEARNRL-QAKVEN--GGNVNEARLC---LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL

Query:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT
            G W      +  +   Y+  LF + N    G++E+   +I P V+   NE L R +T  ++ V+LKQ+ PLKAPG DG P LFY +YW ++ +EVT
Subjt:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT

Query:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI
           L  +N G  ++A NHT I+LIPK +N   V +FR ISLCNVIYK+ISK +ANRLK +L  ++S +QSAFVPGR I DN ++ FE+LH+M+ +R  + 
Subjt:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI

Query:  GWAALKLDMSKTYDRVECV----VLLYAEWKEKRENLSIE-----------------------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR
           ALKLDMSK YD+VE      V+    +  K   L +E                       GLRQGDP+SPYLFL CAEG   +L   ++   L GV 
Subjt:  GWAALKLDMSKTYDRVECV----VLLYAEWKEKRENLSIE-----------------------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR

Query:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQT
        I+RG P I+HLFFADD L F +A   +   +  +L  Y   + Q+IN  K+ I  S +        I N+L V ++  +ER LGLP   G  K  +  Q 
Subjt:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQT

Query:  KDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLA
        K+R+WS+++ WK    S   G E+LIKS+ QAIP Y MSCF+LP RL+KE   ++ RFWWG      KMHW  W  +C SK +GG+GFR+L  FN+ALLA
Subjt:  KDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLA

Query:  KQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLLD-
        KQ WRL+    SL  +VFK KYF +CS L+A+  S SSY WKS++  R L+K G  WRVG G++I+I  D W+P P    I  P+      ++V  L+D 
Subjt:  KQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLLD-

Query:  NDGWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFL
            W +ELIR +F   DA AILG+   +    D ++W   K+G                                                 R  H+ L
Subjt:  NDGWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFL

Query:  PTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW--------------
        PT  NL  R +     C  C    E+ LHA+W+CK  +  W+   +  RL  +  ++  DLL  C + +S  + + F ++SW  +W              
Subjt:  PTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW--------------

Query:  --------------------------ICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLA
                                  I     EA+ W PP    YK+N D A+  + N   +G I+RN++GEVM +L + + F   V+ +EA A R  + 
Subjt:  --------------------------ICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLA

Query:  AVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNIL
           D G  ++ +E DS  +V      G   +  G ++E+I+ I++     +     R R +L
Subjt:  AVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNIL

A0A2N9G258 Uncharacterized protein2.4e-19037.26Show/hide
Query:  RISKCASELALWGKSKKGNYGRRINEARNRL-QAKVEN--GGNVNEARLC---LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL
        +I  C  EL +W K   GN   +I E  NRL QA+  +  GG  +   L    L +L+ +EE  WRQR+R EW++ GDRN+R+FH RATQR+R N V  L
Subjt:  RISKCASELALWGKSKKGNYGRRINEARNRL-QAKVEN--GGNVNEARLC---LENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL

Query:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT
            G W      +  +   Y+  LF + N     ++E+   +I P V+   NE L R +   ++VV+LKQ+ PLKAPG DG P LFY +YW ++ +EVT
Subjt:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT

Query:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI
           L  +N G  ++A NHT I+LIPK +N   V +FR ISLCNVIYK+ISK +ANRLK +L  ++S +QSAFVPGR I DN ++ FE+LH+M+ +R  + 
Subjt:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI

Query:  GWAALKLDMSKTYDRVECVVLLYAEWKEKRENLSIEGL--RQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADL
        G  ALKLDMSK YDRV        EWK  +  +   G   + GDP+SPYLFL CAEG   +L   ++   L GV I+RG P I+HLFFADD L F KA  
Subjt:  GWAALKLDMSKTYDRVECVVLLYAEWKEKRENLSIEGL--RQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADL

Query:  REARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGEEVL
         +   +  +L  Y   + Q+IN  K+ +  S +        I N+L V  +  +ER LGLP   G  K  +  Q K+R+WS+++ WK    S   G+E+L
Subjt:  REARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGEEVL

Query:  IKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQ
        IKS+ QAIP Y MSCF+LP RL+KE   ++ RFWWG      KMHW  W  +C SK +GG+GFR+L  FN+ALLAKQ WRL+    SL  +VFK KYF +
Subjt:  IKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQ

Query:  CSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPG-ARVVSLLD-NDGWWNKELIRAVFEEDDAMAILGI
        CS L+A+  S SSY WKS++  R L+K G  WRVG  ++I+I  D W+P P    I       P  ++V  L+D     W +ELIR  F   DA AILG+
Subjt:  CSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPG-ARVVSLLD-NDGWWNKELIRAVFEEDDAMAILGI

Query:  LRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFLPTERNLRKRGLDVQKGCVRCKKYEE
           +    D ++W   K+G                                                 R  H  LPT  NL  R +     CV C    E
Subjt:  LRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFLPTERNLRKRGLDVQKGCVRCKKYEE

Query:  STLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW---------------------------------------
        + LHA+W+CK  + +W+   +  RL  +  ++  DLL  C + +S  + + F ++SW  +W                                       
Subjt:  STLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELW---------------------------------------

Query:  -ICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAAVVDAGFSRVVVETDSSSVVKQFQS
         I     EA+ W PP     K+N D A+  + N   +G I+RN++GEVM +L + V F   V+ +EA A R  +    D G  ++ +E DS  +V     
Subjt:  -ICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAAVVDAGFSRVVVETDSSSVVKQFQS

Query:  DGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP
         G   +  G ++E+I+ I++   F +     R  N L H +A  A       +W+E VPP
Subjt:  DGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP

A0A2N9HYE3 Reverse transcriptase domain-containing protein2.7e-18635.52Show/hide
Query:  RISKCASELALWGKSKKGNYGRRINEARNRLQAKVENG------GNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL
        +I  C   L LW ++  GN   RI E    L+   EN         VN+ +  L +L+ +EE  WRQR+R EW+  GDRN+R+FH RATQR+R N V  L
Subjt:  RISKCASELALWGKSKKGNYGRRINEARNRLQAKVENG------GNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGL

Query:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT
          ++G+W   +  +  +   Y++ LF + N     ++E+   +I   V+ + N +L   +T  ++ ++LKQ+ PLKAPG D  P +FYQ+YW ++  +VT
Subjt:  FDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVT

Query:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI
        +  L  +N G  ++A+NHT I+LIPK +N   V EFR ISLCNVIYK+ISK +ANRLK +L  ++  +QSAF+PGR I DN ++ FE+LH+M+ ++ GK 
Subjt:  SLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI

Query:  GWAALKLDMSKTYDRVECV----VLLYAEWKEKRENLSIE-----------------------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR
        G  ALKLDMSK YDRVE      V+    +  K   L +E                       GLRQGDP+SPYLFL CAEGL  ++   ++   L GV 
Subjt:  GWAALKLDMSKTYDRVECV----VLLYAEWKEKRENLSIE-----------------------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR

Query:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQT
        I+R  P I+HLFFADD L F KA   +   +  +L  Y   + Q++N  K+ +  S +      L I N+L V  +  +ER LGLP   G  K  +  Q 
Subjt:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQT

Query:  KDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLA
        K+R+WS+++ WK    S   G E+LIKS+ QAIP Y MSCF+LP RL+KE   ++ RFWWG      KMHW  W  +C SK +GG+G RDL  FN+ALLA
Subjt:  KDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLA

Query:  KQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLLDN
        KQ WRL+  P SL ++VFK KYF  CS L+A+ +S  SY WKS++  R L+  G  WRVG G+ IRI  D W+P   S  I+ P    +  + V  L+D+
Subjt:  KQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQII-PKNGFLPGARVVSLLDN

Query:  D-GWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFL
        +   W  EL++ +F   +A  ILGI   I    D ++W   K+G                                                 R  H+ L
Subjt:  D-GWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDFL

Query:  PTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--W---------------
        PT  NL  R +     C  C    EST+HA+W+CK  + +W+  P+  +L   +     DL++ C++ +S ++ + F + SW  W               
Subjt:  PTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--W---------------

Query:  ------------ELWICSEG----------VEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAA
                    E                  ++  W PPE   YK+N D AV  + N   +G I+RN +GEVM +L   + +   V+ +EA A    +  
Subjt:  ------------ELWICSEG----------VEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAA

Query:  VVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP
          D GF  + +E DS  +V+         +  G ++E+I+  ++ L+       +R  N + H +A  A   +   VW+E VPP
Subjt:  VVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP

A0A2N9J809 Uncharacterized protein9.3e-18736.19Show/hide
Query:  DRISKCASELALWGKSKKGNYGRRINEARNRLQAKVENG--GNVNEARLCLEN----LMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEG
        ++I +C   L  W K   G+  ++I E  ++L+A       G  +   + L+N    L+ ++E  WRQR+RVEW++ GD+N+R+FH +AT RRR N V  
Subjt:  DRISKCASELALWGKSKKGNYGRRINEARNRLQAKVENG--GNVNEARLCLEN----LMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEG

Query:  LFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEV
        L D  G W   ++ +  +   Y+  LF++ N      +E+    IAP V+ + N  L R +T  +++ +LKQ+ PLKAPG DG P +FYQ+YW ++  +V
Subjt:  LFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEV

Query:  TSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK
        T   L  +N G  ++A+NHT I+LIPK KN   V EFR ISLCNVIYK+I+K + NRLK +L +V+S +QSAFVPGR I DN ++ FE+LH+M  ++ GK
Subjt:  TSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK

Query:  IGWAALKLDMSKTYDRVE-----CVVL---LYAEWKE------KRENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGV
         G  ALKLDMSK YDRVE     CV+     +++W           + SI              GLRQGDP+SPYLFLFCAEG   ++   +++ +L GV
Subjt:  IGWAALKLDMSKTYDRVE-----CVVL---LYAEWKE------KRENLSI-------------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGV

Query:  RIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQ
         I+RG P ++HLFFADD L F KA   +   V ++L+ Y   + Q+IN  K+ I  S + S+     I  +L V  +  +E+ LGLP   G  K  +  Q
Subjt:  RIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQ

Query:  TKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALL
         K+R+WS+++ WK    S   G E+LIK++ QAIPTY MSCF+LP RL+KE   ++ RFWWG      KMHW  W  +C SK  GGLG RDL +FN+ALL
Subjt:  TKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALL

Query:  AKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIP-RPPSLQIIPKNGFLPGARVVSLLD
        AKQ WRLM    SL  RVFK KYF + S L+A+P S  S  WKS+L    L+  G+ WRVG G+ IRI  D W+P R  +  + P+    P   V  L+D
Subjt:  AKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIP-RPPSLQIIPKNGFLPGARVVSLLD

Query:  -NDGWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDF
         +   W + +I + F   +A AILGI        D  +W   +DG                                                 R  H+ 
Subjt:  -NDGWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDG-------------------------------------------------RLYHDF

Query:  LPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--WELWIC---------
        LPT  NL  R +     C  C    E+ +HA+W+CK    +W   P+  RL  +  +N  +L   C   +S  + + F + +W  W    C         
Subjt:  LPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSW--WELWIC---------

Query:  ----------------------------SEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLA
                                    S+    + W PP    YK+N D A   + N   IG ++RN  GEVM +L + V F   V+ +EA A R  + 
Subjt:  ----------------------------SEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLA

Query:  AVVDAGFSRVVVETDSSSVVKQF-QSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP
           D GF    +E DS +VV      + CN +  G LV + + I++           R  N+L H +A  A   +   VW+E VPP
Subjt:  AVVDAGFSRVVVETDSSSVVKQF-QSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.5e-2821.62Show/hide
Query:  EARNRLQAKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNS
        E + +  +K      + + R  L+ +  ++ +     +R  + +  ++  R   R   ++R  N+++ + + +G    +   ++  I  Y++ L+ +N  
Subjt:  EARNRLQAKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNS

Query:  QSSGRLEEFRRNIA-PCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPK-CKN
        ++   ++ F      P ++ +  E L RP T  ++V  +  +   K+PG DGF   FYQRY   +   +  L   +  EG    +     I LIPK  ++
Subjt:  QSSGRLEEFRRNIA-PCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPK-CKN

Query:  ARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK-IGWAALKLDMSKTYDRVE-----------
          +   FR ISL N+  KI++K +ANR+++ + ++I   Q  F+PG     N     +S++ ++   R K      + +D  K +D+++           
Subjt:  ARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGK-IGWAALKLDMSKTYDRVE-----------

Query:  -----CVVLLYAEWKEKRENLSIE-----------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREA
              + ++ A + +   N+ +            G RQG P+SP LF    E L R    +  ++++ G+++ +    +S   FADD + + +  +  A
Subjt:  -----CVVLLYAEWKEKRENLSIE-----------GLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREA

Query:  RMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVG------FGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGE
        + +L ++  +S ++  KIN  KS   L  N++     +I   L  ++     + LG+ +       F       LK+ K+       +WK++  S  G  
Subjt:  RMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVG------FGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGE

Query:  EVLIKSIL-QAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYH--GGLGFRDLELFNKALLAKQGWRLMEKPD
         ++  +IL + I  +     KLP     E      +F W  +        A   K  +S+ +  GG+   D +L+ KA + K  W   +  D
Subjt:  EVLIKSIL-QAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYH--GGLGFRDLELFNKALLAKQGWRLMEKPD

P0C2F6 Putative ribonuclease H protein At1g657503.4e-2926.39Show/hide
Query:  DRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAK
        +R+ S++  W+    S   G   L K++L ++P ++MS   LP+ ++   + +   F WG     +K H   W KVC  K  GGLG R  +  N+AL++K
Subjt:  DRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAK

Query:  QGWRLMEKPDSLLARVFKGKYFNQCSFLDAR---PKSNSSYLWKSLLWG-RSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLL
         GWRL+++ +SL   V + KY +     D+R   PK + S  W+S+  G R ++  GV W  GDG  IR   D W+   P L++   NG  P     +++
Subjt:  QGWRLMEKPDSLLARVFKGKYFNQCSFLDAR---PKSNSSYLWKSLLWG-RSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGFLPGARVVSLL

Query:  DNDGW-----WNKELIRAVFEEDDAMAILGILRPIVR-CPDKIMWNYEKDGR---------LYHDFLP--------------------------------
          D W     W+   I      +  + +  ++  +V    D++ W + +DG+         L  D +P                                
Subjt:  DNDGW-----WNKELIRAVFEEDDAMAILGILRPIVR-CPDKIMWNYEKDGR---------LYHDFLP--------------------------------

Query:  -TERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLW
         TE    +R L     C  CK   ES LH + +C     +W
Subjt:  -TERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLW

P11369 LINE-1 retrotransposable element ORF2 protein1.5e-3223.67Show/hide
Query:  DREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTS
        + +G    + E ++  I  +++ L+S+         +   R   P ++    + L  P + K++   +  +   K+PG DGF   FYQ +   +   +  
Subjt:  DREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTS

Query:  LCLEVMNEGGSVEALNHTVISLIPK-CKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI
        L  ++  EG    +     I+LIPK  K+  ++  FR ISL N+  KI++K +ANR++E +  +I P Q  F+PG     N       +HY+  K + K 
Subjt:  LCLEVMNEGGSVEALNHTVISLIPK-CKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKI

Query:  GWAALKLDMSKTYDRVE----------------CVVLLYAEWKEKRENLSI-----------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR
            + LD  K +D+++                 + ++ A + +   N+ +            G RQG P+SPYLF    E L R    +   +++ G++
Subjt:  GWAALKLDMSKTYDRVE----------------CVVLLYAEWKEKRENLSI-----------EGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVR

Query:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKM--KALK
        I +    IS L  ADD + +        R +LN++ ++  +   KIN  KS   L   + +     I      S+V ++ + LG+ +      +  K  K
Subjt:  IARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKM--KALK

Query:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSIL-QAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKA
          K  I   ++RWK +  S  G   ++  +IL +AI  +     K+P +   E    + +F W       K    +   +   +  GG+   DL+L+ +A
Subjt:  QTKDRIWSQIQRWKHMWFSVGGGEEVLIKSIL-QAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKA

Query:  LLAKQGW
        ++ K  W
Subjt:  LLAKQGW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.9e-3225.05Show/hide
Query:  RARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVV
        R+R++ +   DR SR+F+    ++    ++  LF  +G  +E+ E +      ++Q LFS  +  S    EE    + P VS +  ERL  P T  +L  
Subjt:  RARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVV

Query:  SLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISP
        +L+ +   K+PG DG    F+Q +W  +  +   +  E   +G    +    V+SL+PK  + R +  +R +SL +  YKI++K+I+ RLK VL +VI P
Subjt:  SLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISP

Query:  TQSAFVPGRNICDNAMLGFESLHYMKQKRRGKIGWAALKLDMSKTYDRV-----------------------------ECVVLLYAEWKEKRENLSIEGL
         QS  VPGR I DN  L  + LH+    RR  +  A L LD  K +DRV                             EC+V +   W          G+
Subjt:  TQSAFVPGRNICDNAMLGFESLHYMKQKRRGKIGWAALKLDMSKTYDRV-----------------------------ECVVLLYAEWKEKRENLSIEGL

Query:  RQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLS--------P
        RQG P+S  L+    E    +L      ++LTG+ +      +    +ADD +   + DL +          Y+  +  +IN+ KS   L         P
Subjt:  RQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLS--------P

Query:  NDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVG-GGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLA
            DI+     +  + +  S E     PV       +   + ++ + +++ +WK     +   G  ++I  ++ +   Y + C    +  + +    L 
Subjt:  NDSEDINLRIANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVG-GGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLA

Query:  RFWWGVEEGTRKMHWASWGKVCVSKYHGGLG
         F W         HW S G   +    GG G
Subjt:  RFWWGVEEGTRKMHWASWGKVCVSKYHGGLG

P93295 Uncharacterized mitochondrial protein AtMg003103.3e-3243.71Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLD
        A+P Y MSCF+L K L K+  S +  FWW   E  RK+ W +W K+C SK   GGLGFRDL  FN+ALLAKQ +R++ +P +LL+R+ + +YF   S ++
Subjt:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLD

Query:  ARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPP
            +  SY W+S++ GR LL  G+   +GDG   ++  D WI    P PP
Subjt:  ARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.5e-1628.27Show/hide
Query:  EIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAP--CVSVKSNERLGRP
        E ++RQ++R++W+Q GD N+R+FH+     +  N ++ L   +   VE    ++ +I  Y+  L  S++   +    +  ++I P  C    ++ RL   
Subjt:  EIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEEEEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAP--CVSVKSNERLGRP

Query:  YTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIIS
         ++K++  ++  +   KAPG D F   F+   W +V++   +   E    G  ++  N T I+LIPK     ++S FR +S C V+YKII+
Subjt:  YTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTVISLIPKCKNARRVSEFRSISLCNVIYKIIS

AT3G09510.1 Ribonuclease H-like superfamily protein1.8e-1728.38Show/hide
Query:  KGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPPSLQIIPKNGFLPGARVVSLLDNDG---WWNKELIRAV
        K +YF   S LDA+ +   SY W SLL G +LLK G R  +GDG +IRIG DN +    PRP + +   K        + +L +  G   +W+   I   
Subjt:  KGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPPSLQIIPKNGFLPGARVVSLLDNDG---WWNKELIRAV

Query:  FEEDDAMAILGILRPIVRCPDKIMWNYEKDGR---------LYHD----------------------------------------FLPTERNLRKRGLDV
         ++ D   I  I     + PDKI+WNY   G          L HD                                         L T   L  RG+ +
Subjt:  FEEDDAMAILGILRPIVRCPDKIMWNYEKDGR---------LYHD----------------------------------------FLPTERNLRKRGLDV

Query:  QKGCVRCKKYEESTLHAIWECKRSRHLWR
           C RC +  ES  HA++ C  +   WR
Subjt:  QKGCVRCKKYEESTLHAIWECKRSRHLWR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-1226.4Show/hide
Query:  LPVGFGGGKMKALKQTK-------DRIWSQIQRW--KHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWG
        LPV + G  +   K T        ++I  +I +W  +H+ F+   G   LI S++ ++  + MS F+LP   +KE +S+ + F W   E   K    +W 
Subjt:  LPVGFGGGKMKALKQTK-------DRIWSQIQRW--KHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWG

Query:  KVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNW
         VC  K  GGLG R L+  NK       W +                        +   +  S++WK +L  R+L    V+  + +G++    FDNW
Subjt:  KVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNW

AT4G29090.1 Ribonuclease H-like superfamily protein5.8e-4827.5Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDA
        A+PTYTM+CF LPK + K+  S+LA FWW  ++  + MHW +W  +   K  GG+GF+D+E FN ALL KQ WR++ +P+SL+A+VFK +YF++   L+A
Subjt:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDA

Query:  RPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPS-----LQIIPKNGFLPGA---RVVSLLDNDG-WWNKELIRAVFEEDDAMAILG
           S  S++WKS+   + +L+ G R  VG+G  I I    W+   P+     +Q +P   +   +   +V  L+D  G  W K++I  +F E +   ++G
Subjt:  RPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPS-----LQIIPKNGFLPGA---RVVSLLDNDG-WWNKELIRAVFEEDDAMAILG

Query:  ILRP-IVRCPDKIMWNYEKDG-------------------------------------------RLYH-------DFLPTERNLRKRGLDVQKGCVRCKK
         LRP   R  D   W+Y   G                                           ++ H       + LP    L  R L  +  C+RC  
Subjt:  ILRP-IVRCPDKIMWNYEKDG-------------------------------------------RLYH-------DFLPTERNLRKRGLDVQKGCVRCKK

Query:  YEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWK----EMSVDKFEEFVVMSWWELW-----ICSEGVE--------------------
         +E+  H +++C  +R  W  S     LG     +    L+W +         +K  + V    W LW     +   G E                    
Subjt:  YEESTLHAIWECKRSRHLWRDSPFFPRLGPSTIVNPADLLWWCWK----EMSVDKFEEFVVMSWWELW-----ICSEGVE--------------------

Query:  --------------AVC--WIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAAVVDAGFSRVVVETDSS
                      + C  W PP     K NTD   NRD     IG ++RNEKGEV     R++  +  V   E  A+R  + ++    ++ V+ E+DS 
Subjt:  --------------AVC--WIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREGLAAVVDAGFSRVVVETDSS

Query:  SVVKQFQSD
         +++   +D
Subjt:  SVVKQFQSD

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-3343.71Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLD
        A+P Y MSCF+L K L K+  S +  FWW   E  RK+ W +W K+C SK   GGLGFRDL  FN+ALLAKQ +R++ +P +LL+R+ + +YF   S ++
Subjt:  AIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLD

Query:  ARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPP
            +  SY W+S++ GR LL  G+   +GDG   ++  D WI    P PP
Subjt:  ARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWI----PRPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAGGTGGAAGGATAGGATTAGCAAGTGTGCGAGTGAGCTGGCTCTTTGGGGTAAATCGAAGAAGGGGAATTATGGGAGAAGGATAAATGAGGCCCGAAATAGGCT
CCAAGCGAAGGTTGAGAATGGAGGAAATGTGAATGAGGCGAGGTTGTGTCTTGAGAATTTGATGGGGGAAGAGGAGATTTATTGGAGGCAGAGAGCTAGAGTTGAATGGA
TGCAGTGGGGTGATAGGAACTCGAGGTGGTTTCATCGTAGAGCCACTCAGCGACGCAGAACTAATAGGGTGGAAGGTTTGTTTGATAGGGAAGGGAGGTGGGTGGAGGAG
GAGGAAGGGATGGAGGGGGTTATCACTTGGTATTTTCAGGAGCTATTTAGTTCGAATAATAGTCAGAGTAGTGGTAGGCTTGAAGAGTTCCGCAGGAACATTGCCCCTTG
TGTGAGCGTGAAATCAAATGAGAGGCTGGGAAGGCCATATACGGAGAAGGACTTGGTGGTATCTCTTAAGCAAATAGGCCCCCTGAAGGCTCCTGGTGAGGATGGATTCC
CAACTTTGTTTTACCAAAGGTACTGGGGAATTGTAAGGAACGAGGTAACCTCACTCTGTCTAGAAGTGATGAATGAGGGTGGGTCGGTGGAGGCGCTGAACCATACGGTC
ATATCCTTGATTCCAAAGTGTAAAAATGCTCGAAGGGTCTCAGAGTTTAGATCAATAAGCCTTTGCAATGTGATTTATAAAATAATATCAAAGTCTATTGCTAACAGGCT
AAAAGAAGTCTTGGATCAAGTGATCTCTCCTACTCAGAGTGCGTTTGTTCCGGGTAGAAATATCTGTGATAATGCCATGTTGGGTTTTGAAAGTCTCCATTATATGAAGC
AGAAGAGGAGGGGGAAGATTGGGTGGGCGGCTTTGAAGTTGGACATGAGCAAAACTTATGACCGGGTTGAATGTGTCGTACTCCTTTATGCTGAATGGAAAGAGAAGAGG
GAGAATCTCTCCATCGAGGGGCTTCGCCAAGGTGATCCTATCTCGCCTTACCTATTTTTGTTCTGTGCTGAAGGCTTATTTAGGATGCTTTCATGGCTAGAGGTGGATAG
AAAGTTGACTGGGGTTCGAATAGCTCGTGGTTCCCCAGCCATATCCCATTTGTTTTTTGCTGATGACTGTCTTTTTTTCTTTAAGGCAGATTTGAGGGAAGCAAGGATGG
TGTTGAATGTGCTGAGGACCTATTCTGTTATCACTGAACAGAAGATTAATTATGGTAAGTCGGGGATTTGTTTAAGTCCGAATGATAGTGAGGATATAAATTTGAGAATT
GCCAATCTTCTTCAGGTGTCTATGGTGGGTTCTCATGAGCGTTGTCTAGGACTTCCTGTGGGTTTTGGTGGAGGGAAGATGAAGGCGTTGAAGCAAACTAAGGACCGAAT
TTGGTCCCAGATTCAGAGATGGAAACATATGTGGTTTTCAGTGGGGGGGGGGGAGGAGGTTTTGATAAAGTCTATTCTGCAGGCCATCCCTACCTATACAATGTCATGTT
TTAAGCTCCCGAAGAGGTTGGTTAAGGAGTGTAATAGCATGTTGGCCAGATTCTGGTGGGGAGTAGAGGAGGGGACAAGGAAGATGCACTGGGCATCATGGGGGAAGGTT
TGTGTCTCAAAATATCATGGAGGGTTGGGCTTTCGGGACCTTGAACTGTTCAATAAAGCATTGCTTGCGAAGCAAGGATGGAGGTTGATGGAAAAGCCGGATTCCTTATT
GGCTAGGGTCTTTAAAGGGAAGTACTTCAATCAATGTTCCTTTTTGGATGCTAGGCCAAAGAGTAATAGCTCTTATCTATGGAAGAGTCTGTTGTGGGGTAGAAGTTTGC
TAAAAGTAGGGGTGAGATGGAGGGTGGGGGATGGGAACTCTATTAGAATTGGTTTCGACAATTGGATTCCTAGGCCTCCATCTTTACAGATTATCCCTAAGAATGGTTTT
CTCCCTGGTGCACGAGTGGTCAGTTTGTTGGATAATGATGGTTGGTGGAATAAGGAGCTTATTCGGGCTGTGTTTGAAGAGGATGATGCTATGGCTATTCTGGGCATTCT
GAGGCCTATAGTGCGATGTCCAGATAAGATTATGTGGAATTATGAGAAGGATGGTAGGTTGTACCATGACTTTCTTCCTACTGAGCGTAACTTGAGGAAGAGGGGATTGG
ATGTGCAAAAGGGATGTGTTCGGTGTAAGAAGTATGAGGAATCGACCCTACACGCTATATGGGAATGTAAAAGATCGAGACACCTTTGGAGAGACTCTCCTTTCTTTCCT
CGGCTCGGGCCCTCGACGATTGTGAATCCTGCTGACCTGTTATGGTGGTGTTGGAAAGAGATGTCAGTGGATAAATTTGAAGAGTTTGTGGTGATGAGTTGGTGGGAATT
GTGGATCTGTAGTGAGGGGGTGGAAGCTGTGTGTTGGATTCCTCCTGAATTTCCTAATTATAAGCTAAATACAGATATTGCAGTGAATAGGGATCTCAATCTCAACAGTA
TTGGGGCTATTGTTAGAAATGAGAAAGGGGAAGTGATGCTTACCTTGATGAGATCGGTGGAGTTTATGCTTGATGTGGATGTGTTGGAAGCAATGGCAGTTCGCGAAGGG
CTGGCAGCTGTAGTGGATGCCGGCTTCTCGCGGGTGGTGGTGGAGACGGACTCGTCAAGCGTGGTGAAGCAGTTTCAGTCCGACGGGTGTAATCTTTCGGAGATGGGCTT
TCTGGTGGAGGAAATAAGAGGCATCTCTCGGGAGTTGAGGTTCTGTGAGGTCGGGTGGTGTAGTCGAGCAAGGAATATTTTGGAGCATGAAGTGGCGATGTCAGCTTTAA
GGTTGCAAATGGAAGGCGTTTGGCTGGAGGAGGTGCCGCCGGTTGTCGACGACGTTTATGTCGCTGAGCTGAGGTCTTCGGGTTTGAGTTTTAGGGCAGCTTGTAATGTT
CCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGAGGTGGAAGGATAGGATTAGCAAGTGTGCGAGTGAGCTGGCTCTTTGGGGTAAATCGAAGAAGGGGAATTATGGGAGAAGGATAAATGAGGCCCGAAATAGGCT
CCAAGCGAAGGTTGAGAATGGAGGAAATGTGAATGAGGCGAGGTTGTGTCTTGAGAATTTGATGGGGGAAGAGGAGATTTATTGGAGGCAGAGAGCTAGAGTTGAATGGA
TGCAGTGGGGTGATAGGAACTCGAGGTGGTTTCATCGTAGAGCCACTCAGCGACGCAGAACTAATAGGGTGGAAGGTTTGTTTGATAGGGAAGGGAGGTGGGTGGAGGAG
GAGGAAGGGATGGAGGGGGTTATCACTTGGTATTTTCAGGAGCTATTTAGTTCGAATAATAGTCAGAGTAGTGGTAGGCTTGAAGAGTTCCGCAGGAACATTGCCCCTTG
TGTGAGCGTGAAATCAAATGAGAGGCTGGGAAGGCCATATACGGAGAAGGACTTGGTGGTATCTCTTAAGCAAATAGGCCCCCTGAAGGCTCCTGGTGAGGATGGATTCC
CAACTTTGTTTTACCAAAGGTACTGGGGAATTGTAAGGAACGAGGTAACCTCACTCTGTCTAGAAGTGATGAATGAGGGTGGGTCGGTGGAGGCGCTGAACCATACGGTC
ATATCCTTGATTCCAAAGTGTAAAAATGCTCGAAGGGTCTCAGAGTTTAGATCAATAAGCCTTTGCAATGTGATTTATAAAATAATATCAAAGTCTATTGCTAACAGGCT
AAAAGAAGTCTTGGATCAAGTGATCTCTCCTACTCAGAGTGCGTTTGTTCCGGGTAGAAATATCTGTGATAATGCCATGTTGGGTTTTGAAAGTCTCCATTATATGAAGC
AGAAGAGGAGGGGGAAGATTGGGTGGGCGGCTTTGAAGTTGGACATGAGCAAAACTTATGACCGGGTTGAATGTGTCGTACTCCTTTATGCTGAATGGAAAGAGAAGAGG
GAGAATCTCTCCATCGAGGGGCTTCGCCAAGGTGATCCTATCTCGCCTTACCTATTTTTGTTCTGTGCTGAAGGCTTATTTAGGATGCTTTCATGGCTAGAGGTGGATAG
AAAGTTGACTGGGGTTCGAATAGCTCGTGGTTCCCCAGCCATATCCCATTTGTTTTTTGCTGATGACTGTCTTTTTTTCTTTAAGGCAGATTTGAGGGAAGCAAGGATGG
TGTTGAATGTGCTGAGGACCTATTCTGTTATCACTGAACAGAAGATTAATTATGGTAAGTCGGGGATTTGTTTAAGTCCGAATGATAGTGAGGATATAAATTTGAGAATT
GCCAATCTTCTTCAGGTGTCTATGGTGGGTTCTCATGAGCGTTGTCTAGGACTTCCTGTGGGTTTTGGTGGAGGGAAGATGAAGGCGTTGAAGCAAACTAAGGACCGAAT
TTGGTCCCAGATTCAGAGATGGAAACATATGTGGTTTTCAGTGGGGGGGGGGGAGGAGGTTTTGATAAAGTCTATTCTGCAGGCCATCCCTACCTATACAATGTCATGTT
TTAAGCTCCCGAAGAGGTTGGTTAAGGAGTGTAATAGCATGTTGGCCAGATTCTGGTGGGGAGTAGAGGAGGGGACAAGGAAGATGCACTGGGCATCATGGGGGAAGGTT
TGTGTCTCAAAATATCATGGAGGGTTGGGCTTTCGGGACCTTGAACTGTTCAATAAAGCATTGCTTGCGAAGCAAGGATGGAGGTTGATGGAAAAGCCGGATTCCTTATT
GGCTAGGGTCTTTAAAGGGAAGTACTTCAATCAATGTTCCTTTTTGGATGCTAGGCCAAAGAGTAATAGCTCTTATCTATGGAAGAGTCTGTTGTGGGGTAGAAGTTTGC
TAAAAGTAGGGGTGAGATGGAGGGTGGGGGATGGGAACTCTATTAGAATTGGTTTCGACAATTGGATTCCTAGGCCTCCATCTTTACAGATTATCCCTAAGAATGGTTTT
CTCCCTGGTGCACGAGTGGTCAGTTTGTTGGATAATGATGGTTGGTGGAATAAGGAGCTTATTCGGGCTGTGTTTGAAGAGGATGATGCTATGGCTATTCTGGGCATTCT
GAGGCCTATAGTGCGATGTCCAGATAAGATTATGTGGAATTATGAGAAGGATGGTAGGTTGTACCATGACTTTCTTCCTACTGAGCGTAACTTGAGGAAGAGGGGATTGG
ATGTGCAAAAGGGATGTGTTCGGTGTAAGAAGTATGAGGAATCGACCCTACACGCTATATGGGAATGTAAAAGATCGAGACACCTTTGGAGAGACTCTCCTTTCTTTCCT
CGGCTCGGGCCCTCGACGATTGTGAATCCTGCTGACCTGTTATGGTGGTGTTGGAAAGAGATGTCAGTGGATAAATTTGAAGAGTTTGTGGTGATGAGTTGGTGGGAATT
GTGGATCTGTAGTGAGGGGGTGGAAGCTGTGTGTTGGATTCCTCCTGAATTTCCTAATTATAAGCTAAATACAGATATTGCAGTGAATAGGGATCTCAATCTCAACAGTA
TTGGGGCTATTGTTAGAAATGAGAAAGGGGAAGTGATGCTTACCTTGATGAGATCGGTGGAGTTTATGCTTGATGTGGATGTGTTGGAAGCAATGGCAGTTCGCGAAGGG
CTGGCAGCTGTAGTGGATGCCGGCTTCTCGCGGGTGGTGGTGGAGACGGACTCGTCAAGCGTGGTGAAGCAGTTTCAGTCCGACGGGTGTAATCTTTCGGAGATGGGCTT
TCTGGTGGAGGAAATAAGAGGCATCTCTCGGGAGTTGAGGTTCTGTGAGGTCGGGTGGTGTAGTCGAGCAAGGAATATTTTGGAGCATGAAGTGGCGATGTCAGCTTTAA
GGTTGCAAATGGAAGGCGTTTGGCTGGAGGAGGTGCCGCCGGTTGTCGACGACGTTTATGTCGCTGAGCTGAGGTCTTCGGGTTTGAGTTTTAGGGCAGCTTGTAATGTT
CCTTAG
Protein sequenceShow/hide protein sequence
MRRWKDRISKCASELALWGKSKKGNYGRRINEARNRLQAKVENGGNVNEARLCLENLMGEEEIYWRQRARVEWMQWGDRNSRWFHRRATQRRRTNRVEGLFDREGRWVEE
EEGMEGVITWYFQELFSSNNSQSSGRLEEFRRNIAPCVSVKSNERLGRPYTEKDLVVSLKQIGPLKAPGEDGFPTLFYQRYWGIVRNEVTSLCLEVMNEGGSVEALNHTV
ISLIPKCKNARRVSEFRSISLCNVIYKIISKSIANRLKEVLDQVISPTQSAFVPGRNICDNAMLGFESLHYMKQKRRGKIGWAALKLDMSKTYDRVECVVLLYAEWKEKR
ENLSIEGLRQGDPISPYLFLFCAEGLFRMLSWLEVDRKLTGVRIARGSPAISHLFFADDCLFFFKADLREARMVLNVLRTYSVITEQKINYGKSGICLSPNDSEDINLRI
ANLLQVSMVGSHERCLGLPVGFGGGKMKALKQTKDRIWSQIQRWKHMWFSVGGGEEVLIKSILQAIPTYTMSCFKLPKRLVKECNSMLARFWWGVEEGTRKMHWASWGKV
CVSKYHGGLGFRDLELFNKALLAKQGWRLMEKPDSLLARVFKGKYFNQCSFLDARPKSNSSYLWKSLLWGRSLLKVGVRWRVGDGNSIRIGFDNWIPRPPSLQIIPKNGF
LPGARVVSLLDNDGWWNKELIRAVFEEDDAMAILGILRPIVRCPDKIMWNYEKDGRLYHDFLPTERNLRKRGLDVQKGCVRCKKYEESTLHAIWECKRSRHLWRDSPFFP
RLGPSTIVNPADLLWWCWKEMSVDKFEEFVVMSWWELWICSEGVEAVCWIPPEFPNYKLNTDIAVNRDLNLNSIGAIVRNEKGEVMLTLMRSVEFMLDVDVLEAMAVREG
LAAVVDAGFSRVVVETDSSSVVKQFQSDGCNLSEMGFLVEEIRGISRELRFCEVGWCSRARNILEHEVAMSALRLQMEGVWLEEVPPVVDDVYVAELRSSGLSFRAACNV
P