; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0009393 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0009393
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:38832287..38838686
RNA-Seq ExpressionLag0009393
SyntenyLag0009393
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.3e-14631.02Show/hide
Query:  FEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQR
        FEEMW K  +   ++   W     +   SV +   +  + L  W+   +  ++R  +     E+  L  R   D   +  + E+ +  LLE  EI W+QR
Subjt:  FEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQR

Query:  AR---EERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWIVWLILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDI
        +R    E  + GD    +F    +        R L + R            P IT  +N+ LL+ FT+EE+   +  M PTKAPG DG+ A+F+QKYW I
Subjt:  AR---EERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWIVWLILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDI

Query:  VGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKII-----------------------------
        VG+     CL++LN    + +FN T I LIPKVK P T+ +FRPISLC+ +YK+IAKT+ANRLK VL  +I                             
Subjt:  VGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKII-----------------------------

Query:  -----------------------------------------------------------SPI------------------------KGLSCLLNGAERNK
                                                                   +P+                        +G SCLL GAER  
Subjt:  -----------------------------------------------------------SPI------------------------KGLSCLLNGAERNK

Query:  D----------------------ITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQ
        D                      I   KA+N+ CR+++ +  +YE+ +GQ +NY KS F +SPN +      IK  L V       +YLGLP+   + ++
Subjt:  D----------------------ITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQ

Query:  EIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFN
        ++F H+KD++WK + GWK K  S AG+E+L+KAV Q IP Y+MSCF+ P  LC ELN + A FWW  ++++  IHW  W  LC++K  GG+ F+DL  FN
Subjt:  EIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFN

Query:  QAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVAN
        QA+LAKQ WR+L+ P SL++R+ R RY+ +  +L+AE+G NPS++WRSL WG+EL  KG RWRVG GVSI+  +D W+P       + P Q   +  V +
Subjt:  QAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVAN

Query:  FINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRL----EMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGW
            +G WN  L++D+F  Q+ D  + IPL      D +IWHY+  G++SVKS YRL    + ++ G+    V  ++    FWKK W + +P+KIK   W
Subjt:  FINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRL----EMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGW

Query:  RIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSM--------------------ILCWHIWTYRNQI
        R   D LP    L NR I    +C  C  K E   H   W+  +    +WR+S  G   ++    S                      LCW +W  RN  
Subjt:  RIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSM--------------------ILCWHIWTYRNQI

Query:  LHNKHHPDISTLQSQIDRYMEELLEKDDTHQDI---DHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVL
        +        + L  ++ +  +E    ++    I     S   PL    W  PP+G++K+N D A        G+G  +R +N   +    R I   +   
Subjt:  LHNKHHPDISTLQSQIDRYMEELLEKDDTHQDI---DHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVL

Query:  WLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK
          E MA  EGL+ A+ +      +E DA   ++ +    E      + I+E   L+            R+ N++AH LA+
Subjt:  WLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]9.5e-15331.49Show/hide
Query:  FEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDK----SRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREK-------------E
        +E+MW  YE C +IVR  W     N+ +S + K    ++  L  L  WS+             +E+E R+      +D ++MT+++             E
Subjt:  FEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDK----SRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREK-------------E

Query:  KELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWI------------VWLILF-----------------LPSITED
         ++  +L D+E+YWKQR+R + LK GD+NTK+FH +A+ RR+ N+I G+ D++   W+             +  LF                 LP ++++
Subjt:  KELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWI------------VWLILF-----------------LPSITED

Query:  LNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAK
        +N  L + FT E++T  +  M PTKAPGPDG+ A F+QK+W IVGE     CL +LN+   +   N T+I LIPKV+ P+ + +FRPISLC+V+Y+++AK
Subjt:  LNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAK

Query:  TLANRLKKVLDKIISP---------------IKGLSC---------------------------------------------------------------
         +ANRLK +L+ IISP               I G  C                                                               
Subjt:  TLANRLKKVLDKIISP---------------IKGLSC---------------------------------------------------------------

Query:  ----------------------------------LLNGAERN---------KDITV------------YKASNKDCRSVKEIIDSYEKATGQTVNYEKSM
                                          LLN AER          +DIT+             KAS  DC+ +K I D Y KA+GQ  N+EKS 
Subjt:  ----------------------------------LLNGAERN---------KDITV------------YKASNKDCRSVKEIIDSYEKATGQTVNYEKSM

Query:  FIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNS
           S   S E I  IK+   +       +YLGLP    RNK   F  +K +V   +  W  K FS  G+E+LIKAVAQ +P YAMS FK P  LC ++  
Subjt:  FIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNS

Query:  LCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKK
          A FWWG+ +++H IHW  W  + + K  GG+ F+DL  FNQA++AKQ WRL+++PNSLM+RV++ RYYK   +  A++G+NPS++WRS+LWG ++ KK
Subjt:  LCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKK

Query:  GFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLE
        G RWR+G G  +    D WIPR     PI P        VA+ I+    W    +   F+ +D + I+ I L   +++D V+WH+D KG +SVKS Y+L 
Subjt:  GFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLE

Query:  MQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHL--------KEW--IPM---------
          +   +     + N     WK  W + LP K+KI  WR  K+ILPT  NL  R      +C  C+ + E  SH+        K W   P+         
Subjt:  MQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHL--------KEW--IPM---------

Query:  -DYLDGI---WRDSISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYM---EELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKL
         D+   I   W  S S   E + M   ++ CW IW+ RN+ +      D   L ++ D  +   + + +  + H   D      +    W  P   + KL
Subjt:  -DYLDGI---WRDSISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYM---EELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKL

Query:  NCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKR
        N DAA S + ++ G+G  +R +   +L VG +      +V   E  A+  GL+ A  I S  + +ESD  +VV LLN      TE+   + + +    + 
Subjt:  NCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKR

Query:  SIEGVYHVTRNRNQLAHNLAK
               + R  N  AH LAK
Subjt:  SIEGVYHVTRNRNQLAHNLAK

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]6.8e-15132.28Show/hide
Query:  NTQRKYGAVRPRRFEEMWVKYEECRDIVRRVW-MDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRR-DVD-SMRMTREKEK
        N+ R    V+   FE  W K E+C+ I+   W      +  + + +  RI    L  WS   Y G I   I  K   +  L  R  D D S+ + R +E 
Subjt:  NTQRKYGAVRPRRFEEMWVKYEECRDIVRRVW-MDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRR-DVD-SMRMTREKEK

Query:  ELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRW--------------------------IVWLILFLP-SITEDLNN
        E+  LL+D+E YW QRA+   LK GDRNTK+FH +A++RRK N I G++DE+  RW                          I  +   +P  +TE++N 
Subjt:  ELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRW--------------------------IVWLILFLP-SITEDLNN

Query:  DLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLA
         L++EFTKEEV   +K + P KAPGPDG+ A+F+QKYW IVG +  +  L VLN  + + + N T I LIPK  +PK M DFRPISLC+V+YK+I+K LA
Subjt:  DLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLA

Query:  NRLKKVLDKIISP---------------------------------------------------------------------------------------
        NRLK +L  IIS                                                                                        
Subjt:  NRLKKVLDKIISP---------------------------------------------------------------------------------------

Query:  -------------------------IKGLSCLLNGAERNKDIT----------------------VYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFI
                                  +GLS L+N A RNK IT                        KA+ ++C  ++ I+  YE+A+GQ +N +KS   
Subjt:  -------------------------IKGLSCLLNGAERNKDIT----------------------VYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFI

Query:  VSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLC
         SPNT+ E    I   LG        +YLGLPS   R+K ++F  +K++V   L GWKGK  S+ G+E+LIKAVAQ IP Y MSCF  P  LC+++  + 
Subjt:  VSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLC

Query:  AGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGF
          FWWG   +  K+ W SW+++C +KA GG+ F++L  FN AMLAKQ+WR+L  PNSL+ RVL+ RY+ TG+ L A++G++PSY WRS+    E+ ++G 
Subjt:  AGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGF

Query:  RWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT--WNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLE
        RWRVG G  I    D W+P       I P        + + +    T  W    +R +F+  + + I+ IPL  +  +D +IW  + KG FSVKSAY + 
Subjt:  RWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT--WNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLE

Query:  MQ-IQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDG-IWRDSISGAREQ
           I        SN + +   WKK W + LP KIKI  WR   D LPT  N++ RGI  ++ C +C    E  +H   +     L    W D     +  
Subjt:  MQ-IQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDG-IWRDSISGAREQ

Query:  D--------KMCRS---------MILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNV-WVKPPSGLWKLNCDAAW
        +         +C S          +L W IW  RN+I+HN    D     SQ+       LE       +D   + P  S + W  PP G++K+N D A 
Subjt:  D--------KMCRS---------MILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNV-WVKPPSGLWKLNCDAAW

Query:  SEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEG-LKALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVY
        S++ +   IG  +R SN  ++    + +   +    +E +A+ +G L A  ++   V +E DAL V+  LN       E+   +   +S+          
Subjt:  SEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEG-LKALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVY

Query:  HVTRNRNQLAHNLAK---------KWGPVPQPMVFDL
        HV R  N +AH LA+          W  VP+P V  L
Subjt:  HVTRNRNQLAHNLAK---------KWGPVPQPMVFDL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]1.7e-14631.27Show/hide
Query:  LEDKNTQRKYGAVRPRRFEEMWVKYEECRDIVRRVW-MDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSM----RM
        + D    +K+   R  +FE MW + E+C+DI++ VW   H  N+   +  + R   E L  W++  + G+I   I  K+  +  LV+     S+     M
Subjt:  LEDKNTQRKYGAVRPRRFEEMWVKYEECRDIVRRVW-MDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSM----RM

Query:  TREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRW--------------------------IVWLILFLP-SI
         R   KE+  LL+ +EI W+QR+R + L  GDRNTK+FH +A+ RR+ N I G+ DE N  W                          I  ++  +P ++
Subjt:  TREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRW--------------------------IVWLILFLP-SI

Query:  TEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKV
        TE++N+ L++EFT+EE+   +  M PTKAPGPDG+ AIF+QKYW+IVG D     L VLN  + M + N T I L+PK+K+P  M DFRPISLC+V+YK+
Subjt:  TEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKV

Query:  IAKTLANRLKKVLDKIIS----------------------------------------------------------------------------------
        I+K LANRLK +L +IIS                                                                                  
Subjt:  IAKTLANRLKKVLDKIIS----------------------------------------------------------------------------------

Query:  ------------PIKGL------------------SCLLNGAER----------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTVNY
                    P +GL                  S LLN   R                      +  +   KA++++C+++ +I+  YE A+GQ +N 
Subjt:  ------------PIKGL------------------SCLLNGAER----------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTVNY

Query:  EKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCN
        +KS    S NT  E    +   LG        +YLGLPS   ++K EIF  +K+RV + L GWK K  SV GRE+LIKAVAQ IP Y MSCF+ P TLC 
Subjt:  EKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCN

Query:  ELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRE
        E+ ++   FWWG   +  KI W SW+KLC+ K  GG+ F++L  FN AMLAKQ WRL+  PNSL++++ + RYY  G+  +A++G +PSY WRS+  G E
Subjt:  ELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRE

Query:  LFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQ--DTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVK
        + ++G RWRVG G  I    D W+P       I P +  D      A        W + +VRDLF+  +A  I++IPL  +  +D +IW  + KG FSVK
Subjt:  LFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQ--DTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVK

Query:  SAYRLEM-QIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSIS
        SAY + +  I    +   S+ +     W+K W + +P K++I  W++  + LPT  NL  +G+++  +C  C  + E   H+  ++  +    +WR  + 
Subjt:  SAYRLEM-QIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSIS

Query:  GARE-----QDKMCRSM---------------ILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWK
           +      D +  +M               ++ W IW  RN+I+          +     +Y+ E      T+  I      P     W+ PP G++K
Subjt:  GARE-----QDKMCRSM---------------ILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWK

Query:  LNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALSIESPLVQI--ESDALQVVHLLNGRVEDETEVKIFIDETKSLMA
        +N D A SE  +   +G  +R +   +       +   + V  +E +A+  GL  L+ E  L QI  ESDAL VV  +N   E    +        SL++
Subjt:  LNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALSIESPLVQI--ESDALQVVHLLNGRVEDETEVKIFIDETKSLMA

Query:  KRSIEGVYHVTRNRNQLAHNLA---------KKWGPVPQPMVFDL
              + HV R  N+ AH LA         + W  V  PMV  +
Subjt:  KRSIEGVYHVTRNRNQLAHNLA---------KKWGPVPQPMVFDL

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]6.0e-14731.84Show/hide
Query:  DSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWI----------------------------VWLI
        D+    +  E  L+ LLE +E+YW+QR+R + ++ GD+NTK+FH +A+ RR  N+I  L  E    +                               L 
Subjt:  DSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWI----------------------------VWLI

Query:  LFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLC
          L S+T D+N  LLK F+K++V   + SM P K+PG DG+ A+FYQK WDIVG       L VLNDG + S  NST I LIPK+K  K MKDFRPISLC
Subjt:  LFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLC

Query:  SVLYKVIAKTLANRLKKVLDKIISP---------------------------------------------------------------------------
        +V+ K+I K L  R K+VL  +IS                                                                            
Subjt:  SVLYKVIAKTLANRLKKVLDKIISP---------------------------------------------------------------------------

Query:  -------------------------------------IKGLSCLLNGAERNKDITVYKASNKDCR----------------------SVKEIIDSYEKAT
                                             ++GLSCLL   ER  ++ VY  + +  R                      ++K  +D Y +A+
Subjt:  -------------------------------------IKGLSCLLNGAERNKDITVYKASNKDCR----------------------SVKEIIDSYEKAT

Query:  GQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKF
        GQ +N +KS+   SPNT++         L +  ++   +YLGLP+   R+KQ++F+ IK++VW+ ++ W  K FS+ G+E+L+KAV Q IP Y MSCFK 
Subjt:  GQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKF

Query:  PITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRS
        PITLCN+L ++ + FWWGS+E   KIHW+SW+ LC+ K  GG+ F+    +NQA+LAKQ+WRLL  P+SL+SR+L+ RY+   ++L+A  G++PS  W+ 
Subjt:  PITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRS

Query:  LLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGL
        +L GREL + G RW+VG G  I  ++DPWIP     +PI  +    N  VAN I  +  WN++L+ + F   D D I+ IPL      D +IWHY   G+
Subjt:  LLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGL

Query:  FSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRD
        + V S Y     ++   ++S SN +   ++WK FWK+ LP KIKI  WR++ D LP  ++L  R I  ++ C +CR   E   H        Y   +WR 
Subjt:  FSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRD

Query:  S--------------------ISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSA-----LTPLQSNVWV
        +                    +S    + +M   + + W IWT RN+I+H K       L S    Y+         ++  + +A       P+  + W 
Subjt:  S--------------------ISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSA-----LTPLQSNVWV

Query:  KPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFID
         P  G  K+N DAA +    + GIG  +R     ++    + +   +    +E  AV   L  A  +E  L QIE+DA+ V + L GR    +     I+
Subjt:  KPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFID

Query:  ETKSLMAKRSIEGVYHVTRNRNQLAHNLAK
        +   L++      V H  R  N+ AH LA+
Subjt:  ETKSLMAKRSIEGVYHVTRNRNQLAHNLAK

TrEMBL top hitse value%identityAlignment
A0A7N2L6Z9 Reverse transcriptase domain-containing protein5.6e-15132.46Show/hide
Query:  RPR----RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKS-RIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNR-RDVDSMRMTREKEKELEMLL
        RPR     FE  W K E+CR I+  VW      NT   M    ++    L +W+       I   I  K   + +L  + RD           +EL  LL
Subjt:  RPR----RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKS-RIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNR-RDVDSMRMTREKEKELEMLL

Query:  EDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDE-------------------------RNMRWIVWLILFLPS-ITEDLNNDLLKEFT
        +D+EI+W QR++   LK GDRNTK+FH RA++RRK N I G++D+                          N   +  +   +P+ ITE++N +L + FT
Subjt:  EDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDE-------------------------RNMRWIVWLILFLPS-ITEDLNNDLLKEFT

Query:  KEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLK---
        +EE+   +K + PTK+PGPDG+ AIF+QKYWDIVG +  N  L VLN G+ +   N T IVLIPK  +PK M DFRPISLC+V+YK+I+KTLANRLK   
Subjt:  KEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLK---

Query:  -------------------------------------------------KVLDKI---------------------------------------------
                                                         K  D++                                             
Subjt:  -------------------------------------------------KVLDKI---------------------------------------------

Query:  ---------ISP------IKGLSCLLNGAERNK----------------------DITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSM
                 +SP       +GLS LL+ A RN+                       +   KA+ ++C  +KEI++ YE A+GQ VN +KS    SPNT+ 
Subjt:  ---------ISP------IKGLSCLLNGAERNK----------------------DITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSM

Query:  EMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGS
        E+   I   LG        +YLGLPS   R+K+ +F  IK+RV   L GWKGK  S  G+E+LIKAVAQ IP Y MSCF  P +LC+EL  +   FWWG 
Subjt:  EMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGS

Query:  SEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRG
          +  K+ W SWRK+C+ K+LGG+ F++L  FN A+LAKQ+WR+L  P SL +R+L+ +Y+  G+ L A +G+NPSY WRS+    E+ KKG RWRVG G
Subjt:  SEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRG

Query:  VSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT--WNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQ-IQGQ
          I    D W+P       I P + T +  + + +    T  W    +R LF+  DA+ I+ IPL  +   D +IW  + KG FSVKSAY + +  ++  
Subjt:  VSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT--WNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQ-IQGQ

Query:  YMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDY---LDGIWRDSISGAREQDKMCR
         +   S+ +  +  WK  WK+ LP K+KI  WR   + LPT++N++ RG++ N  C +C ++ E  +H    +  D+   +  +W+D   G   + +  +
Subjt:  YMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDY---LDGIWRDSISGAREQDKMCR

Query:  SMIL-----------------CWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNV--WVKPPSGLWKLNCDAAWS-EE
        ++ L                  W IW  RN  +H++    +S LQ+      + L+  DD H  I      P+QS +  W  PP G++K+N D A S + 
Subjt:  SMIL-----------------CWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNV--WVKPPSGLWKLNCDAAWS-EE

Query:  LKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEG-LKALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVT
        +   G+G  +R  +  ++    +++   +   W E  A+ +G L A  ++ P + +ESDAL  +  +N     E E    ++      A  S     ++ 
Subjt:  LKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEG-LKALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVT

Query:  RNRNQLAHNLAK
        R+ N +AH LA+
Subjt:  RNRNQLAHNLAK

A0A803PV25 Uncharacterized protein3.3e-15129.8Show/hide
Query:  TWCNNHLNSEVI--WERPIMA-EWLED------------KNTQR-------------KYGAVRPR---RFEEMWVKYEECRDIVRRVWMDHGYN-NTDSV
        TWCN H NS ++   +R +   EWL+             ++  R             K G  + +    FEE W + EEC +IV R+W +   +    S 
Subjt:  TWCNNHLNSEVI--WERPIMA-EWLED------------KNTQR-------------KYGAVRPR---RFEEMWVKYEECRDIVRRVWMDHGYN-NTDSV

Query:  MDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIR
          K     + L  W+R +    +   +   +  + +L   +        ++ E +L  LLE DE YW+QR+R   L+WGDRNTK+FH +A+ RRK N I+
Subjt:  MDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIR

Query:  GLFDERNMRWIVW--------------------------------LILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIV
        GL D       VW                                L +  P ++  +NNDLL EF +EEV   VK M PTKAPG DG+ A+FYQK+W  +
Subjt:  GLFDERNMRWIVW--------------------------------LILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIV

Query:  GEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS-----------------------------
          D     L VLN+G D+   N T + LIPKV  P+ +++FRPISLC+V+YK+++K LANR++  L  ++S                             
Subjt:  GEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS-----------------------------

Query:  -----------------------------------------------------------------PIKGL------------------SCLLNGAER---
                                                                         P +GL                  SCL+  AE+   
Subjt:  -----------------------------------------------------------------PIKGL------------------SCLLNGAER---

Query:  -------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQE
                           +  +    A+  +CR  +E+++ Y  A+GQ VN+ KS      + S  +  H+   +GV   D  G+YLGLPS   R K++
Subjt:  -------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQE

Query:  IFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQ
         F  I ++VW  L+GWKG FFS AG+EVLIKA+ Q IP Y MSCF+ P    N ++S+ A FWWGSSE+  KIHW  W  LC++K  GG+ F+DL +FNQ
Subjt:  IFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQ

Query:  AMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPR----KGPCIPIVPHQDTRNLT
        A+LAKQ WR +++PNSL S+VL+  YY     L+A+ GN+ S+VWRSL+WG+++ + G+RWR+G G S+    DPW+PR    K    P +P     NL 
Subjt:  AMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPR----KGPCIPIVPHQDTRNLT

Query:  VANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWR
        V +   GNG W+E  VR +F   DA+LI+ +       +D ++WHY   G +SV+S YR+   ++ + + S +   +   +W++ WK+ +P K+K   W+
Subjt:  VANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWR

Query:  IYKDILPTLSNLNNRGIDLNTLCVLCRDKREK---------------------TSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSMILCWHIWTYRNQI
        +    +PT S L +R + +   C  C     +                     +S +K     D L  + R S S A+E  +    ++L W++W  RN +
Subjt:  IYKDILPTLSNLNNRGIDLNTLCVLCRDKREK---------------------TSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSMILCWHIWTYRNQI

Query:  LHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLE
         H  H P  + +     +++ E  +      ++   A     +  WV P  G + +N DA          +   +R     +     RV+ +    L  E
Subjt:  LHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLE

Query:  TMAVSEGLKA-LSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAKK
          A+++G+KA L  + P   +E+D LQ V+L+        +V   + + + L+    + G+  V R  NQ AH LA +
Subjt:  TMAVSEGLKA-LSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAKK

A0A803PYN5 Uncharacterized protein1.1e-15132.48Show/hide
Query:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVN--RRDVDSMRMTREKEKELEMLLEDDEIYW
        RFE++W+   +   I+R  W D      D  +       + L  W   R  G++R  IS  + E+ +L N   R   +M   +  E  L+ LLE +E YW
Subjt:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVN--RRDVDSMRMTREKEKELEMLLEDDEIYW

Query:  KQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWIVWLILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDI
         Q +R + L+ GD+NTK+FH  A+ R+  N I+ L D  ++     L     +IT ++N  L   FT  EV A ++SM+P K+PG DG+ A+FYQ YW+I
Subjt:  KQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWIVWLILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDI

Query:  VGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS----------------------------
        VG    +  L VLNDG DM+  N + I LIPKV DP  M ++RPISLC+V+YK+I+KT+  R KKVL  +IS                            
Subjt:  VGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS----------------------------

Query:  ------------------------------------------------------PI---------KGLSCLLNGAER----------------------N
                                                              P+         +GLS LL   E                       +
Subjt:  ------------------------------------------------------PI---------KGLSCLLNGAER----------------------N

Query:  KDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKF
          +   + + +   S+K  +D+Y KA+GQ +N +KS+   SPNT  +       TL +  T+   +YLGLPS + R+KQE+F++IK++VWK L  W  K 
Subjt:  KDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKF

Query:  FSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSR
        FSV G+EVL+KAV Q IP YAMSCFK     C++L S+ A FWWGS++   KIHW+ W+ LC++K  GG+ F+    FNQA+LAKQ+WR+   P+SL+SR
Subjt:  FSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSR

Query:  VLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQD
        +L+ RY+   ++L A IG++PS  W+ + WGREL  KG R+++G G  ++   DPWIP      P V    + +L V++FI  +  W+ +L+   F   D
Subjt:  VLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQD

Query:  ADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCV
         D I+ IPL      D +IWH+   G++SVKS + L   ++ Q+ +S S  N    +WK FW + LP KI+I  W+++++ILPT + L  R +  +  C 
Subjt:  ADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCV

Query:  LCRDKREKTSHL--------KEW----IPMDY------LDGIWRDSISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLE-K
        LC    E   H           W      +DY       +G +   +S   +Q +    + + W IWT RN+++H +     S +     R+ E+    K
Subjt:  LCRDKREKTSHL--------KEW----IPMDY------LDGIWRDSISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLE-K

Query:  DDTHQDIDHSALTPLQ--------SNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPL
        +      D S + P Q        +  W  P S  +KL  DAA   E K  GIG  LR     ++    +V+   +K   +E  A+   +  +S  + PL
Subjt:  DDTHQDIDHSALTPLQ--------SNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPL

Query:  VQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK
           E+DA +V   LN    D       I + + L++      V HV R  NQ AH + K
Subjt:  VQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK

A0A803Q7D4 Uncharacterized protein6.9e-14935.49Show/hide
Query:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDK--SRIFL--ERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTR--EKEKELEMLLEDD
        RFE+ W+K  EC D++ + W +    +TD  M      IF   E + +W   +Y G ++  I+     +  L N R   +   T+  + +K L+ LLE +
Subjt:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDK--SRIFL--ERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTR--EKEKELEMLLEDD

Query:  EIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLF--DERNMR--------------------------WIVWLILFLPSITEDLNNDLLKEFTK
        E+YW+QRAR + LK GD NTK+FH RA  R   N+I+ L   D R +R                            + +     SIT+++N  L + FT+
Subjt:  EIYWKQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLF--DERNMR--------------------------WIVWLILFLPSITEDLNNDLLKEFTK

Query:  EEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLD
         +V   + SMAP K+PG DG+ A+F+QK+WD+VG    N  L +LN+G DMS+ NS  I LIPKV+ P+ + D++PISLCSVLYK+++K + NR KKVL 
Subjt:  EEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLD

Query:  KIIS----------------------------PIKGLSCLLNGAER----------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTV
          IS                              +GLS LL   ER                      +  +   +A+   C+++  I++ Y +A+GQ +
Subjt:  KIIS----------------------------PIKGLSCLLNGAER----------------------NKDITVYKASNKDCRSVKEIIDSYEKATGQTV

Query:  NYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITL
        N EK++   S NT   +     + LG+   D   +YLGLPS   R+K ++F++IKDRVWK +  W  K FS+ GREVL+KAV Q IP YAMSCF FP   
Subjt:  NYEKSMFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITL

Query:  CNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWG
        C +L S  A FWWGSS  + KIHW+ W  LC+ KA GGI F      NQA+LAKQ+WR+L+ P+SL++RVL+ RY+K G +L A  G  PS  W+S+  G
Subjt:  CNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWG

Query:  RELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVK
        R+L  KG RW++G G  ++  SDPW+P     +P     D    TV ++I+ +  W+  L++  F   D D I+++ L P+Q++D +IWH+   G ++VK
Subjt:  RELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVK

Query:  SAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISG
        S Y L   ++    +S SN     S+W + W + LP K+KI  WR+  D LPT  NL +R I  +T C LC   RE + H        +L    +  I G
Subjt:  SAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISG

Query:  AREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIG
         + +     + I+C     Y  Q +++   P      SQ      +L           H+   P+    W  PP+G +K+N DAA+ E  +  G+G
Subjt:  AREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIG

A0A803QH07 Uncharacterized protein3.1e-14931.04Show/hide
Query:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMT--REKEKELEMLLEDDEIYW
        RFE+MW+K EE   +++  W      +            + L  W  +++ G ++  IS  +  + QL N  D     +T  ++ EK L+ LL ++E YW
Subjt:  RFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMT--REKEKELEMLLEDDEIYW

Query:  KQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDE--------RNMRWIV-------------------WLILFLPS-ITEDLNNDLLKEFTKEEVT
         QR+R + L+ GD+NTK+FH +A+ R+  N I+ L ++         N+  ++                    +I  +PS I   +N  L   F+  EV 
Subjt:  KQRAREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDE--------RNMRWIV-------------------WLILFLPS-ITEDLNNDLLKEFTKEEVT

Query:  AVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS
        A ++SM+P K+PG DG+ A+FYQ YWDIVG       L +LNDG +M+Q N++ I LIPKV +P  M D+RPISLC+V+YK+I+KT+  R +KVL  +IS
Subjt:  AVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIIS

Query:  PIKG--------------------------------------------------------------------LSCL--------LNG-------------
          +                                                                     +SC+        LNG             
Subjt:  PIKG--------------------------------------------------------------------LSCL--------LNG-------------

Query:  -------------------------AE--------------------RNKDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHH
                                 AE                     +  +   +A+ +   ++K  + +Y KA+GQ +N +KS+   SPNTS+     
Subjt:  -------------------------AE--------------------RNKDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKSMFIVSPNTSMEMIHH

Query:  IKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERH
          ATL +  TD   +YLGLPS + R+KQE+F++IK++VWK L  W  K FSV G+EVL+KAV Q IP YAMSCFK     CN+L S+ A FWWG+++   
Subjt:  IKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERH

Query:  KIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEA
        KIHW+ W+ LC++K  GG+ F+    FNQA+LAKQ+WR+   P+SL+SR+L+ RY+ T ++L A IG++PSY W+S+ WGREL  KG R++VG G  I+ 
Subjt:  KIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEA

Query:  SSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNH
        S DPWIP      P V +     L V+  IN N  WN  L+ D F   D + I++IPL     +D +IWH+   G ++VKS + L   ++ Q  +S S+ 
Subjt:  SSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNH

Query:  NHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDS--------------------ISG
        N    +WK FW + LP KI+I  W++ + ILP  + L  R +  +  C LC    E   H        +   IW++S                    +S 
Subjt:  NHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDS--------------------ISG

Query:  AREQDKMCRSMILCWHIWTYRNQILH--------------NKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAW
           Q+     + L W IWT RN++ H                 H D S  +    +      E   THQ  + +  T  Q   W  P    +KLN DAA 
Subjt:  AREQDKMCRSMILCWHIWTYRNQILH--------------NKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAW

Query:  SEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVY
        + + ++ GIG  +R  +  ++    +V+   ++   +E  A+   L  +S  +  +  IE+DAL+V   LN    D +     I + + L++      V 
Subjt:  SEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVY

Query:  HVTRNRNQLAHNLAK---------KW-GPVPQPM
        H  R+ NQ AH LA+          W G +P P+
Subjt:  HVTRNRNQLAHNLAK---------KW-GPVPQPM

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.6e-1232.79Show/hide
Query:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKV-KDPKTMKDFRPISLCS
        LP + ++    L +  T  E+ A++ S+   K+PGPDG  A FYQ+Y + +          +  +G+  + F    I+LIPK  +D    ++FRPISL +
Subjt:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKV-KDPKTMKDFRPISLCS

Query:  VLYKVIAKTLANRLKKVLDKII
        +  K++ K LANR+++ + K+I
Subjt:  VLYKVIAKTLANRLKKVLDKII

P0C2F6 Putative ribonuclease H protein At1g657502.4e-3724.22Show/hide
Query:  LPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGG
        +P    R  ++ F  I +RV   + GW+ K  S AGR  L KAV   +P ++MS    P ++ N L+ L   F WGS+ E+ K H   W K+C  K  GG
Subjt:  LPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGG

Query:  ISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYY----KTGNYLKAEIGNNPSYVWRSLLWG-RELFKKGFRWRVGRGVSIEASSDPWIPRKGPCI
        +  +     N+A+++K  WRLL+  NSL + VL+ +Y+    +   +L  +   + S  WRS+  G R++   G  W  G G  I   +D W+  K    
Subjt:  ISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYY----KTGNYLKAEIGNNPSYVWRSLLWG-RELFKKGFRWRVGRGVSIEASSDPWIPRKGPCI

Query:  PIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQ--------QKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVS
        P++   +    T  + +     W      D F   D     N  L             +D + W +   G FSV+SAY +                +  S
Subjt:  PIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQ--------QKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVS

Query:  FWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSM--------------
        F+   WK+ +P ++K   W +    + T    + R +  + +C +C+   E   H+    P     GIW   +   R+Q    +S+              
Subjt:  FWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSM--------------

Query:  ----------ILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALT-PLQSNV--WVKPPSGLWKLNCDAAWSEELKRGGIGWA
                  ++ W  W +R   +  ++    +  + ++    E  +E    H       +T P    +  WV P  G  K+N D A          G  
Subjt:  ----------ILCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDDTHQDIDHSALT-PLQSNV--WVKPPSGLWKLNCDAAWSEELKRGGIGWA

Query:  LRQ-----SNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRN
        LR           L +G    S P   LW     V  GL  A   + P V++E D+  +V  L   + D   +   +      + K  +  + HV R  N
Subjt:  LRQ-----SNDHLLYVGYRVISRPWKVLWLETMAVSEGLK-ALSIESPLVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRN

Query:  QLAHNLA
        +LA  LA
Subjt:  QLAHNLA

P11369 LINE-1 retrotransposable element ORF2 protein9.9e-1233.87Show/hide
Query:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPK-VKDPKTMKDFRPISLCS
        +P + +D  + L    + +E+ AV+ S+   K+PGPDG  A FYQ + + +         ++  +G   + F    I LIPK  KDP  +++FRPISL +
Subjt:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPK-VKDPKTMKDFRPISLCS

Query:  VLYKVIAKTLANRLKKVLDKIISP
        +  K++ K LANR+++ +  II P
Subjt:  VLYKVIAKTLANRLKKVLDKIISP

P14381 Transposon TX1 uncharacterized 149 kDa protein3.4e-1232.52Show/hide
Query:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSV
        LP ++E     L    T +E++  ++ M   K+PG DG+   F+Q +WD +G D           G          + L+PK  D + +K++RP+SL S 
Subjt:  LPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSV

Query:  LYKVIAKTLANRLKKVLDKIISP
         YK++AK ++ RLK VL ++I P
Subjt:  LYKVIAKTLANRLKKVLDKIISP

P93295 Uncharacterized mitochondrial protein AtMg003102.4e-3443.71Show/hide
Query:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKA-LGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKA
        +P YAMSCF+    LC +L S    FWW S E + KI W +W+KLC++K   GG+ F+DL  FNQA+LAKQS+R++  P++L+SR+LR RY+   + ++ 
Subjt:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKA-LGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKA

Query:  EIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI
         +G  PSY WRS++ GREL  +G    +G G+  +   D WI  + P  P+
Subjt:  EIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.5e-1227.19Show/hide
Query:  QLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFH--MRANQRRKVNRIRGLFDE---RNMRWIVWLIL-----FLPSITEDL
        QL+        R+     K+        E +++Q++R + L+ GD NT++FH  + ANQ + + +   + D+    N+  +  +I+      L S ++ L
Subjt:  QLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQRAREERLKWGDRNTKWFH--MRANQRRKVNRIRGLFDE---RNMRWIVWLIL-----FLPSITEDL

Query:  NNDLLKEF--------------------TKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKT
          D ++                      + +E+TA V +M   KAPGPD   A F+ + W +V + T          G  + +FN+T I LIPKV     
Subjt:  NNDLLKEF--------------------TKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVLNDGVDMSQFNSTYIVLIPKVKDPKT

Query:  MKDFRPISLCSVLYKVI
        +  FRP+S C+V+YK+I
Subjt:  MKDFRPISLCSVLYKVI

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.7e-1229.82Show/hide
Query:  CVLCRDKREKTSHL--------KEW----IPMDYLDGIWRDSISG--------AREQDKMCR--SMI--LCWHIWTYRNQILHNKHHPDISTLQSQIDRY
        CV C D RE  +HL          W    IP  Y +G W DS+            E  K+ +  +++  L W +W  RN+++      D   +   + R 
Subjt:  CVLCRDKREKTSHL--------KEW----IPMDYLDGIWRDSISG--------AREQDKMCR--SMI--LCWHIWTYRNQILHNKHHPDISTLQSQIDRY

Query:  MEELLEKDDTHQDIDHSALTP-LQSNV---WVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IE
        ME+  E+  T ++++  A  P ++ N+   W  PP    K N DA W  E  R GIGW LR  +  +L++G R + R   VL  E  A+   +  +S   
Subjt:  MEELLEKDDTHQDIDHSALTP-LQSNV---WVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IE

Query:  SPLVQIESDALQVVHLLN
           +  ESDA  +V+LLN
Subjt:  SPLVQIESDALQVVHLLN

AT3G09510.1 Ribonuclease H-like superfamily protein8.3e-3022.83Show/hide
Query:  LRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT---WNERLVRDLFIG
        ++ RY+K  + L A++    SY W SLL G  L KKG R  +G G +I    D  I    P  P+   +  + +T+ N     G+   W++  +      
Subjt:  LRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPIVPHQDTRNLTVANFINGNGT---WNERLVRDLFIG

Query:  QDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTL
         D   I  I L   ++ D +IW+Y+  G ++V+S Y L        + +++  +  +    + W +P+  K+K   WR     L T   L  RG+ ++  
Subjt:  QDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTL

Query:  CVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRS-------------------------MILCWHIWTYRNQILHNKHHPDISTLQSQIDR
        C  C  + E  +H     P  +    WR S S       M                            + L W IW  RN ++ NK     S        
Subjt:  CVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRS-------------------------MILCWHIWTYRNQILHNKHHPDISTLQSQIDR

Query:  YMEELLEKDDTHQDIDHSALTPLQSNV-WVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALSIES-P
           + L    +H+          ++ + W  PP+   K N DA +  +      GW +R      +  G   ++     L  ET A+   L+   I    
Subjt:  YMEELLEKDDTHQDIDHSALTPLQSNV-WVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALSIES-P

Query:  LVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK
         V +E D   +++L+NG +   + +   +++      K +      + R  N+LAH LAK
Subjt:  LVQIESDALQVVHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAK

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-5926.55Show/hide
Query:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAE
        +P Y M+CF  P T+C ++ S+ A FWW + +E   +HW++W  L   KA GGI FKD+  FN A+L KQ WR+L  P SLM++V + RY+   + L A 
Subjt:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAE

Query:  IGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI----VPHQDTRN----LTVANFINGNG-TWNERLVRDLFIGQDADLIMNI
        +G+ PS+VW+S+   +E+ ++G R  VG G  I      W+  K     +    VP Q+  +    L V++ I+ +G  W + ++  LF   +  LI  +
Subjt:  IGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI----VPHQDTRN----LTVANFINGNG-TWNERLVRDLFIGQDADLIMNI

Query:  PLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVS-FWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKR
             +  D   W Y   G ++VKS Y +  QI  +  +        ++  ++K WK     KI+   W+   + LP    L  R +   + C+ C   +
Subjt:  PLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVS-FWKKFWKIPLPSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKR

Query:  EKTSHL---------------------KEWIPMDYLDGIWRDSISGAREQDKMCRSMI--LCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDD
        E  +HL                      EW    Y++  W  ++     Q +    ++  L W +W  RN+++      +   +  + +  +EE   + +
Subjt:  EKTSHL---------------------KEWIPMDYLDGIWRDSISGAREQDKMCRSMI--LCWHIWTYRNQILHNKHHPDISTLQSQIDRYMEELLEKDD

Query:  THQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPLVQIESDALQV
                 +       W  PP    K N DA W+ + +R GIGW LR     + ++G R + +   VL  E  A+   + +LS  +   V  ESD+  +
Subjt:  THQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALS-IESPLVQIESDALQV

Query:  VHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAKK
        + +LN   E    +K  I + + L+++ +      + R  N LA  +A++
Subjt:  VHLLNGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAKK

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-3543.71Show/hide
Query:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKA-LGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKA
        +P YAMSCF+    LC +L S    FWW S E + KI W +W+KLC++K   GG+ F+DL  FNQA+LAKQS+R++  P++L+SR+LR RY+   + ++ 
Subjt:  IPNYAMSCFKFPITLCNELNSLCAGFWWGSSEERHKIHWQSWRKLCENKA-LGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKA

Query:  EIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI
         +G  PSY WRS++ GREL  +G    +G G+  +   D WI  + P  P+
Subjt:  EIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPWIPRKGPCIPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAAAAGGTTCTGATTCCGCCATTGTTGAGGTTGTTGGAGAAATTGAAGGATTGGTACTTGCAGGTTTGGGGGAGGCCGACGTCGGAGCCGATGAGCTTTCTGA
TTTCGGATCTGAACTTGAAGAAGACGTGTCTCCAACTACCTTTGTTTTGAGATCCGAGGATTTTGGACTCCTCTTCGTTCATCTCGAGATCTTTGGCGAAGATAGGCTTG
CGAGAGAGGATCCAGGCCCAGCCCCAGTCCCACCACTTGCGGACGGAGCCGCCGGCGAGATCGCCGATCGAGGACATGGATTTGGATTTTCGGATCGCCCAGGCGGTGGC
AGGGGAGGAGATTGGCTGGATTCGGAGGGAGAGAGAGGAGAGACGGCGGCGGAGGGTTTGCTCGACGCCGGAGTCGGTGTGGGAGACCGGGAGGGCGTCGGGGAGAGGCC
TGGGCGGGAGGAAGGTGTGGACGACGGAGGACATGGTGGCGTGACGGCGGAGGGAAATTGGGGCGATGGGATTTGGGGGAAATGGAGAAGGAAAGTGGGAAGAGGTGGAA
ATGAAGAGAGTGTCTACACAACCTTTACATGGTGTAACAATCATTTGAACAGCGAGGTGATTTGGGAGAGGCCGATTATGGCTGAATGGCTTGAGGATAAAAATACCCAA
AGAAAATATGGGGCAGTCCGACCTAGAAGATTTGAAGAGATGTGGGTTAAGTATGAGGAGTGTAGGGATATTGTCCGTAGGGTTTGGATGGATCATGGATACAACAATAC
TGATTCAGTGATGGATAAATCTAGGATTTTCCTAGAGAGGCTAGGAAACTGGAGTCGATATAGGTATGGAGGATCGATTAGGGGGGCAATCTCTTGGAAGGAATATGAGA
TCCGACAATTAGTCAACAGAAGAGATGTGGATAGTATGAGAATGACCCGGGAAAAGGAAAAAGAATTGGAAATGCTTCTAGAAGACGACGAAATCTACTGGAAACAGAGA
GCTCGTGAGGAGCGGTTGAAATGGGGGGATAGAAACACGAAATGGTTCCACATGAGAGCTAACCAAAGAAGGAAAGTGAACAGGATCAGAGGCCTGTTTGATGAAAGGAA
TATGAGATGGATTGTGTGGTTGATTCTATTCCTACCAAGTATCACTGAGGACCTGAATAATGATCTTTTAAAAGAGTTCACTAAAGAAGAGGTCACTGCAGTAGTAAAGA
GTATGGCTCCTACAAAAGCTCCGGGTCCAGATGGGATTCAGGCGATTTTCTATCAGAAGTACTGGGATATAGTAGGTGAAGATACTTGCAATTTCTGTTTAAGGGTCCTT
AATGATGGGGTTGATATGAGCCAGTTTAATAGTACCTATATCGTGTTAATTCCAAAAGTTAAAGATCCAAAAACTATGAAGGACTTTAGGCCCATTAGCCTATGTTCAGT
ATTGTACAAAGTCATTGCTAAGACGTTAGCTAATAGATTGAAGAAGGTTCTTGACAAGATTATATCCCCAATCAAAGGGCTTTCATGCCTTTTAAATGGTGCAGAGAGAA
ACAAGGATATCACAGTCTATAAAGCTTCTAATAAGGATTGCAGGTCAGTTAAAGAGATCATTGACAGTTATGAAAAGGCAACGGGACAAACGGTAAACTATGAGAAATCT
ATGTTCATAGTGAGCCCAAACACAAGTATGGAAATGATTCACCACATAAAGGCCACGCTAGGGGTGACTCATACGGACACCTTAGGGCAATATCTAGGCCTTCCTAGCCA
AAATGCTCGAAACAAGCAGGAGATTTTCAACCATATCAAAGATCGGGTATGGAAAACGCTTCAAGGCTGGAAAGGGAAATTTTTCTCCGTGGCTGGAAGGGAAGTTCTCA
TTAAAGCAGTAGCCCAAGTCATTCCTAATTATGCTATGAGTTGTTTTAAGTTTCCGATTACCCTTTGTAATGAACTTAATTCTTTGTGTGCCGGGTTTTGGTGGGGCAGC
TCAGAGGAAAGACATAAGATCCATTGGCAAAGCTGGAGGAAACTTTGTGAAAATAAAGCTCTAGGCGGCATAAGTTTCAAGGATCTCACCATTTTCAATCAAGCAATGCT
GGCAAAGCAAAGCTGGAGACTTCTTAAGTTCCCTAACAGCCTCATGTCTAGGGTTCTTAGAGGTCGATATTATAAAACAGGGAACTATTTGAAAGCAGAGATTGGAAACA
ACCCTTCGTATGTCTGGAGGAGCTTACTCTGGGGTCGGGAGTTATTTAAGAAGGGCTTCCGTTGGAGAGTGGGAAGAGGGGTCAGTATTGAGGCCTCATCTGACCCTTGG
ATTCCAAGAAAGGGTCCATGTATTCCGATCGTCCCTCATCAAGATACCAGAAACCTTACGGTGGCTAATTTTATAAATGGGAATGGGACTTGGAATGAACGATTGGTTAG
AGATCTCTTTATCGGCCAGGATGCTGATTTGATTATGAATATTCCCCTCCACCCGCACCAACAGAAGGATGGTGTTATCTGGCACTATGATCCTAAAGGGCTCTTCTCAG
TAAAAAGTGCGTATCGTTTGGAGATGCAAATTCAGGGTCAATATATGGCATCTGTTTCAAATCACAATCACCATGTATCTTTTTGGAAGAAGTTTTGGAAAATTCCCCTT
CCTTCTAAGATCAAAATTTGTGGTTGGAGAATTTACAAGGATATTCTACCTACTTTATCTAATCTGAATAATAGAGGAATTGATCTTAACACTTTGTGTGTTTTGTGCAG
GGATAAAAGGGAAAAGACCTCGCATCTGAAAGAATGGATTCCTATGGACTATCTAGATGGCATTTGGAGGGATTCAATTTCGGGAGCGAGGGAGCAGGACAAAATGTGCA
GAAGCATGATACTATGTTGGCATATATGGACTTACAGAAACCAAATACTCCATAACAAGCATCACCCCGACATCAGTACACTTCAGTCTCAAATTGACAGATACATGGAA
GAGTTGTTAGAGAAAGATGATACGCACCAGGATATTGATCATTCTGCCTTAACCCCATTGCAGTCGAACGTATGGGTCAAACCGCCGAGTGGGTTATGGAAACTCAATTG
TGATGCAGCTTGGTCTGAGGAACTTAAACGCGGCGGGATCGGTTGGGCGCTCCGGCAATCGAACGATCATCTTCTATATGTTGGCTATCGTGTGATTTCTAGGCCGTGGA
AGGTACTGTGGCTGGAAACGATGGCGGTAAGTGAAGGCCTCAAGGCCCTATCGATCGAGTCTCCTCTGGTACAGATTGAATCGGACGCCCTCCAGGTGGTTCATCTTCTA
AACGGTCGAGTGGAAGACGAAACAGAGGTGAAAATTTTCATCGATGAAACCAAATCCCTAATGGCTAAACGGTCGATCGAAGGGGTGTATCATGTGACCAGAAATCGTAA
CCAACTGGCTCACAACTTGGCCAAAAAGTGGGGTCCTGTCCCACAACCAATGGTGTTCGACCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTAAAAGGTTCTGATTCCGCCATTGTTGAGGTTGTTGGAGAAATTGAAGGATTGGTACTTGCAGGTTTGGGGGAGGCCGACGTCGGAGCCGATGAGCTTTCTGA
TTTCGGATCTGAACTTGAAGAAGACGTGTCTCCAACTACCTTTGTTTTGAGATCCGAGGATTTTGGACTCCTCTTCGTTCATCTCGAGATCTTTGGCGAAGATAGGCTTG
CGAGAGAGGATCCAGGCCCAGCCCCAGTCCCACCACTTGCGGACGGAGCCGCCGGCGAGATCGCCGATCGAGGACATGGATTTGGATTTTCGGATCGCCCAGGCGGTGGC
AGGGGAGGAGATTGGCTGGATTCGGAGGGAGAGAGAGGAGAGACGGCGGCGGAGGGTTTGCTCGACGCCGGAGTCGGTGTGGGAGACCGGGAGGGCGTCGGGGAGAGGCC
TGGGCGGGAGGAAGGTGTGGACGACGGAGGACATGGTGGCGTGACGGCGGAGGGAAATTGGGGCGATGGGATTTGGGGGAAATGGAGAAGGAAAGTGGGAAGAGGTGGAA
ATGAAGAGAGTGTCTACACAACCTTTACATGGTGTAACAATCATTTGAACAGCGAGGTGATTTGGGAGAGGCCGATTATGGCTGAATGGCTTGAGGATAAAAATACCCAA
AGAAAATATGGGGCAGTCCGACCTAGAAGATTTGAAGAGATGTGGGTTAAGTATGAGGAGTGTAGGGATATTGTCCGTAGGGTTTGGATGGATCATGGATACAACAATAC
TGATTCAGTGATGGATAAATCTAGGATTTTCCTAGAGAGGCTAGGAAACTGGAGTCGATATAGGTATGGAGGATCGATTAGGGGGGCAATCTCTTGGAAGGAATATGAGA
TCCGACAATTAGTCAACAGAAGAGATGTGGATAGTATGAGAATGACCCGGGAAAAGGAAAAAGAATTGGAAATGCTTCTAGAAGACGACGAAATCTACTGGAAACAGAGA
GCTCGTGAGGAGCGGTTGAAATGGGGGGATAGAAACACGAAATGGTTCCACATGAGAGCTAACCAAAGAAGGAAAGTGAACAGGATCAGAGGCCTGTTTGATGAAAGGAA
TATGAGATGGATTGTGTGGTTGATTCTATTCCTACCAAGTATCACTGAGGACCTGAATAATGATCTTTTAAAAGAGTTCACTAAAGAAGAGGTCACTGCAGTAGTAAAGA
GTATGGCTCCTACAAAAGCTCCGGGTCCAGATGGGATTCAGGCGATTTTCTATCAGAAGTACTGGGATATAGTAGGTGAAGATACTTGCAATTTCTGTTTAAGGGTCCTT
AATGATGGGGTTGATATGAGCCAGTTTAATAGTACCTATATCGTGTTAATTCCAAAAGTTAAAGATCCAAAAACTATGAAGGACTTTAGGCCCATTAGCCTATGTTCAGT
ATTGTACAAAGTCATTGCTAAGACGTTAGCTAATAGATTGAAGAAGGTTCTTGACAAGATTATATCCCCAATCAAAGGGCTTTCATGCCTTTTAAATGGTGCAGAGAGAA
ACAAGGATATCACAGTCTATAAAGCTTCTAATAAGGATTGCAGGTCAGTTAAAGAGATCATTGACAGTTATGAAAAGGCAACGGGACAAACGGTAAACTATGAGAAATCT
ATGTTCATAGTGAGCCCAAACACAAGTATGGAAATGATTCACCACATAAAGGCCACGCTAGGGGTGACTCATACGGACACCTTAGGGCAATATCTAGGCCTTCCTAGCCA
AAATGCTCGAAACAAGCAGGAGATTTTCAACCATATCAAAGATCGGGTATGGAAAACGCTTCAAGGCTGGAAAGGGAAATTTTTCTCCGTGGCTGGAAGGGAAGTTCTCA
TTAAAGCAGTAGCCCAAGTCATTCCTAATTATGCTATGAGTTGTTTTAAGTTTCCGATTACCCTTTGTAATGAACTTAATTCTTTGTGTGCCGGGTTTTGGTGGGGCAGC
TCAGAGGAAAGACATAAGATCCATTGGCAAAGCTGGAGGAAACTTTGTGAAAATAAAGCTCTAGGCGGCATAAGTTTCAAGGATCTCACCATTTTCAATCAAGCAATGCT
GGCAAAGCAAAGCTGGAGACTTCTTAAGTTCCCTAACAGCCTCATGTCTAGGGTTCTTAGAGGTCGATATTATAAAACAGGGAACTATTTGAAAGCAGAGATTGGAAACA
ACCCTTCGTATGTCTGGAGGAGCTTACTCTGGGGTCGGGAGTTATTTAAGAAGGGCTTCCGTTGGAGAGTGGGAAGAGGGGTCAGTATTGAGGCCTCATCTGACCCTTGG
ATTCCAAGAAAGGGTCCATGTATTCCGATCGTCCCTCATCAAGATACCAGAAACCTTACGGTGGCTAATTTTATAAATGGGAATGGGACTTGGAATGAACGATTGGTTAG
AGATCTCTTTATCGGCCAGGATGCTGATTTGATTATGAATATTCCCCTCCACCCGCACCAACAGAAGGATGGTGTTATCTGGCACTATGATCCTAAAGGGCTCTTCTCAG
TAAAAAGTGCGTATCGTTTGGAGATGCAAATTCAGGGTCAATATATGGCATCTGTTTCAAATCACAATCACCATGTATCTTTTTGGAAGAAGTTTTGGAAAATTCCCCTT
CCTTCTAAGATCAAAATTTGTGGTTGGAGAATTTACAAGGATATTCTACCTACTTTATCTAATCTGAATAATAGAGGAATTGATCTTAACACTTTGTGTGTTTTGTGCAG
GGATAAAAGGGAAAAGACCTCGCATCTGAAAGAATGGATTCCTATGGACTATCTAGATGGCATTTGGAGGGATTCAATTTCGGGAGCGAGGGAGCAGGACAAAATGTGCA
GAAGCATGATACTATGTTGGCATATATGGACTTACAGAAACCAAATACTCCATAACAAGCATCACCCCGACATCAGTACACTTCAGTCTCAAATTGACAGATACATGGAA
GAGTTGTTAGAGAAAGATGATACGCACCAGGATATTGATCATTCTGCCTTAACCCCATTGCAGTCGAACGTATGGGTCAAACCGCCGAGTGGGTTATGGAAACTCAATTG
TGATGCAGCTTGGTCTGAGGAACTTAAACGCGGCGGGATCGGTTGGGCGCTCCGGCAATCGAACGATCATCTTCTATATGTTGGCTATCGTGTGATTTCTAGGCCGTGGA
AGGTACTGTGGCTGGAAACGATGGCGGTAAGTGAAGGCCTCAAGGCCCTATCGATCGAGTCTCCTCTGGTACAGATTGAATCGGACGCCCTCCAGGTGGTTCATCTTCTA
AACGGTCGAGTGGAAGACGAAACAGAGGTGAAAATTTTCATCGATGAAACCAAATCCCTAATGGCTAAACGGTCGATCGAAGGGGTGTATCATGTGACCAGAAATCGTAA
CCAACTGGCTCACAACTTGGCCAAAAAGTGGGGTCCTGTCCCACAACCAATGGTGTTCGACCTTTAG
Protein sequenceShow/hide protein sequence
MGLKGSDSAIVEVVGEIEGLVLAGLGEADVGADELSDFGSELEEDVSPTTFVLRSEDFGLLFVHLEIFGEDRLAREDPGPAPVPPLADGAAGEIADRGHGFGFSDRPGGG
RGGDWLDSEGERGETAAEGLLDAGVGVGDREGVGERPGREEGVDDGGHGGVTAEGNWGDGIWGKWRRKVGRGGNEESVYTTFTWCNNHLNSEVIWERPIMAEWLEDKNTQ
RKYGAVRPRRFEEMWVKYEECRDIVRRVWMDHGYNNTDSVMDKSRIFLERLGNWSRYRYGGSIRGAISWKEYEIRQLVNRRDVDSMRMTREKEKELEMLLEDDEIYWKQR
AREERLKWGDRNTKWFHMRANQRRKVNRIRGLFDERNMRWIVWLILFLPSITEDLNNDLLKEFTKEEVTAVVKSMAPTKAPGPDGIQAIFYQKYWDIVGEDTCNFCLRVL
NDGVDMSQFNSTYIVLIPKVKDPKTMKDFRPISLCSVLYKVIAKTLANRLKKVLDKIISPIKGLSCLLNGAERNKDITVYKASNKDCRSVKEIIDSYEKATGQTVNYEKS
MFIVSPNTSMEMIHHIKATLGVTHTDTLGQYLGLPSQNARNKQEIFNHIKDRVWKTLQGWKGKFFSVAGREVLIKAVAQVIPNYAMSCFKFPITLCNELNSLCAGFWWGS
SEERHKIHWQSWRKLCENKALGGISFKDLTIFNQAMLAKQSWRLLKFPNSLMSRVLRGRYYKTGNYLKAEIGNNPSYVWRSLLWGRELFKKGFRWRVGRGVSIEASSDPW
IPRKGPCIPIVPHQDTRNLTVANFINGNGTWNERLVRDLFIGQDADLIMNIPLHPHQQKDGVIWHYDPKGLFSVKSAYRLEMQIQGQYMASVSNHNHHVSFWKKFWKIPL
PSKIKICGWRIYKDILPTLSNLNNRGIDLNTLCVLCRDKREKTSHLKEWIPMDYLDGIWRDSISGAREQDKMCRSMILCWHIWTYRNQILHNKHHPDISTLQSQIDRYME
ELLEKDDTHQDIDHSALTPLQSNVWVKPPSGLWKLNCDAAWSEELKRGGIGWALRQSNDHLLYVGYRVISRPWKVLWLETMAVSEGLKALSIESPLVQIESDALQVVHLL
NGRVEDETEVKIFIDETKSLMAKRSIEGVYHVTRNRNQLAHNLAKKWGPVPQPMVFDL