; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035376 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035376
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:20614725..20619823
RNA-Seq ExpressionLag0035376
SyntenyLag0035376
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2711776.1 hypothetical protein I3760_04G092800 [Carya illinoinensis]8.4e-16128.96Show/hide
Query:  MHKIWNRDHTI-ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL
        M KIW     +   ++     L +F + R K  +M  GPW FDK L++LKE  G      IE  +  FW+  H +P +  ++ +   +G  LG+V +VDL
Subjt:  MHKIWNRDHTI-ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL

Query:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR
        D   + +WG  +R+++ I++ KPL R   +    G   W+  SYE+  + C+ CG +GH +KECE    +   S+ LPYG WL     F   R +   A 
Subjt:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR

Query:  LNATMVGRGRGRTWEGGRSDWRKRFS--DDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDS----GINEGVSAK
          AT        T      D  +  S  D N K     +     GE N  PD  G   P    ++  P   P+   L    AI E S    G+N+ VS  
Subjt:  LNATMVGRGRGRTWEGGRSDWRKRFS--DDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDS----GINEGVSAK

Query:  GKNEISQK----------------------ASGSSSNNGP-VIGGALNEQMEVDDVG--LGPHADVILQPEIRDN---YGMAKQNTQVKEGL---GNPRT
           ++ Q+                       SG S N  P  +GG+ +        G  +G  A   +Q E  +     G+A+ +    EGL    + R 
Subjt:  GKNEISQK----------------------ASGSSSNNGP-VIGGALNEQMEVDDVG--LGPHADVILQPEIRDN---YGMAKQNTQVKEGL---GNPRT

Query:  LRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYD------------------CSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDFM
         +G+  I+ +   +  FL     +++   R R  L                        S  +VG+SGG  L W+    + + SYS+ HI   I+  D +
Subjt:  LRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYD------------------CSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDFM

Query:  -WRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEP--------
         W  TG+YG+P      E W L+K L    ++PW++  DFNE+   SEK GG  R ES M++FR V+  C L D GY G  FTW N +  E         
Subjt:  -WRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEP--------

Query:  -----------INLRL---VAP--------------------RRPRRFEEAWCKYEKCRDIVQQVW-QSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCS
                   +NLR+   VA                     +R  RFE  W   ++C  I+++ W Q  G+I +   + +   C   L  W++  +G  
Subjt:  -----------INLRL---VAP--------------------RRPRRFEEAWCKYEKCRDIVQQVW-QSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCS

Query:  IRGAIARIEKEIQGLSNLEGGTYQNMLLEKE----TELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEA
        ++  +A  ++++Q L   + G   ++ LE+      E+   LE DE+ WKQR+R  WL+ GDCN+++FH +A+ RR++N I +L D+ G W + D +M+A
Subjt:  IRGAIARIEKEIQGLSNLEGGTYQNMLLEKE----TELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEA

Query:  IANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDIN
        +   YF  LF +++     +  IL      +  + N +LL+ +  EE+   ++ MHP+KA GPDG+  +F++K+W ++G+      L  LN G+  + +N
Subjt:  IANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDIN

Query:  KTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-----
         TFI LIPK   P  + DFRPISL                  +L  +IS +QSAFVP R I+DN ++ +E +H + +++KG+ G ++LKLDMS A     
Subjt:  KTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-----

Query:  ---------AVGL----------------------------------LNQAD---------------------------------------NNRVLTD--
                 ++G                                   L Q D                                       N+ +  D  
Subjt:  ---------AVGL----------------------------------LNQAD---------------------------------------NNRVLTD--

Query:  ---CRA-------IQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFF
           C+A       IQ +L  Y +ASGQ IN +K + + S N K+++   I ++    FT+   +YLGLP    R+KK+ FS IK RVW+ LQ WK  L  
Subjt:  ---CRA-------IQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFF

Query:  SIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR----------------------------
        S  G+E+LIK++A +IP YAMSCF FP TL +EL  + A+FWWG    + KIHW SW+KLC +K  GGMGFR                            
Subjt:  SIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR----------------------------

Query:  -------------------------------------YRWRIGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLN-GLWNENIIRESFLDTD
                                              RWR+GNG ++ +  +PW L EG +   L+  D  +L V  L D N G WN +++R  F    
Subjt:  -------------------------------------YRWRIGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLN-GLWNENIIRESFLDTD

Query:  AEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHT---SSSNYKAQERMWKSYWKINLPQKIKI-------------------------
         + IL +   S   ED   W+++  G +SVKSAYR    ++Q  H     SS+  ++   WKS W + LP+K+KI                         
Subjt:  AEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHT---SSSNYKAQERMWKSYWKINLPQKIKI-------------------------

Query:  ----CGEKPEDLRHLFWECKFTKEVW
            C +  ED  H  + C   + VW
Subjt:  ----CGEKPEDLRHLFWECKFTKEVW

XP_010673168.1 PREDICTED: uncharacterized protein LOC104889608 [Beta vulgaris subsp. vulgaris]8.4e-16128.14Show/hide
Query:  MHKIWN-RDHTIISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL
        +++IW+     +   +   +F+ +F N R K  +M   PW FD+ L++  E +G     +I   +  FW+  + LP    ++     IGS +G V +VD 
Subjt:  MHKIWN-RDHTIISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL

Query:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR
        D  +   W  S R+K+ +DV KPL+R   + S  G    + + YE+  NFCY CG +GHI ++C   +P    +EE  +G WL   P             
Subjt:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR

Query:  LNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELL--EMVTAICEDSGINEGVSA-----
               RGR +  E  + ++R      N        ++            + F    +      P TV   ++   E  +  CE + +N  VS      
Subjt:  LNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELL--EMVTAICEDSGINEGVSA-----

Query:  --KG----KNEISQKASGSSSNNGP-----VIGGALNEQMEVDDVGL--GPHADVILQPEIRDNYGMAKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVF
          KG    K     K      N+ P     V    LN  +  ++ GL      D+IL   +    G  K  T++ +GL N  +L      +      LV 
Subjt:  --KG----KNEISQKASGSSSNNGP-----VIGGALNEQMEVDDVGL--GPHADVILQPEIRDNYGMAKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVF

Query:  LMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKE-TDVTVRSYSKGHI--DVIIQKKDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIP
                   + I+  L YDC+  + SVG+SGG  + W+    D ++ S+S  HI  DV++      WRF GIYG P     ++TW L++ L  D + P
Subjt:  LMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKE-TDVTVRSYSKGHI--DVIIQKKDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIP

Query:  WVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTW-------------------------CNNQF-----------NEPINLRL
         + G DFNE+ S SE +GG       M DFR+V+D   L D G+SG  +TW                         C  Q            + PI ++L
Subjt:  WVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTW-------------------------CNNQF-----------NEPINLRL

Query:  VAPRRPR--------RFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKET
           +R R        RF  AW   + C  +V+  W     +    F  +       L  WS+      +   I  +E+EI+ L +      Q  L+E  +
Subjt:  VAPRRPR--------RFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKET

Query:  ELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQN
        +LD LLE  E YW  R+R   +K GD NTK+FH +A+ R++RN I  L D+  +W ++D ++E +   Y++ LF SS P+ EA+  +L++    I  + N
Subjt:  ELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQN

Query:  NELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTL
          L R   KEE+   +R MHP+KA GPDG+ AVFY++FW IVGDD   +    ++G      +N T I LIPKV+ P  + +FRPISLCNVI+K++ K L
Subjt:  NELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTL

Query:  ANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------AVGL-----------------------
        ANRLK +L  ++S NQSAFVP RLITDNA++  E  H++  R KG  G V +KLDMSKA               +G                        
Subjt:  ANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------AVGL-----------------------

Query:  -----------LNQAD-------------------------NNRVLTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSL
                   L Q D                         +   + +C  I DIL+ Y  ASGQ IN++K     S          + E L ++     
Subjt:  -----------LNQAD-------------------------NNRVLTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSL

Query:  GQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCH
         +YLG+P+   R+K+ +FSGI DRVWK LQGWK KL  S  GKE+L+K++ Q IP Y M  ++FP+ +   +++  A+FWWG+   ++ ++W+SW+ +C+
Subjt:  GQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCH

Query:  NKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGSA
         K  GGMGFR                                                                   WR+GNG ++++  +PW+L  G +
Subjt:  NKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGSA

Query:  KPILVCPDICHLTVAHLKDLNGL-WNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYW
        + I          V  L D   + W+ N++ E F + D +AIL +P S  +  D + W +   G+YSVK+AY +G          S N     R W + W
Subjt:  KPILVCPDICHLTVAHLKDLNGL-WNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYW

Query:  KINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWHKYFPISNY------LFQNLIIHNKQIPKTECLQ-----
         + +  K++                             +C E PE + H    C   +EVW      S         + +     +++ K   +      
Subjt:  KINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWHKYFPISNY------LFQNLIIHNKQIPKTECLQ-----

Query:  ----QHINKVISELIGEEDMYLPSELIAPL-----------ESHRNGSWSSNEMTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSAG
               NKV+      ED   P+E +A L             H  GS +         W   P G  K+N DAS  D+    G+G + R+  G  L A 
Subjt:  ----QHINKVISELIGEEDMYLPSELIAPL-----------ESHRNGSWSSNEMTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSAG

Query:  YHIFNRRWNV
               W V
Subjt:  YHIFNRRWNV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]9.0e-17130.31Show/hide
Query:  MHKIWN-RDHTIISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL
        + + W   D   I  VG N+F  KF++      IM  GPW FD  L++L+  K G    +I F   S WI     PF   S  VA E+GS LGKV++V+ 
Subjt:  MHKIWN-RDHTIISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL

Query:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSE--ELPYGDWLYEPPFFFKGRETTSP
         ++ D      +R+++ + + KP++RG F+  + G   W+   YE+   FC+ CG +GH ++ C          E  E  YGD+L        GR     
Subjt:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSE--ELPYGDWLYEPPFFFKGRETTSP

Query:  ARLNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVP--TGELLEMVTAICEDSGIN-----EGV
        A+ +          T E        R  D   K P   T    + +D+  P +      V +  A +P TV   T   +E + +     G        G 
Subjt:  ARLNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVP--TGELLEMVTAICEDSGIN-----EGV

Query:  SAKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGMAKQNTQVKEGLGNPRTL--RGLWHIIRSHKSQLVFLMETRCQYN
          K K ++S++ +G   + G       N  +           DV++Q +     G     + +  G+  P  +  +G W  +      L     T     
Subjt:  SAKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGMAKQNTQVKEGLGNPRTL--RGLWHIIRSHKSQLVFLMETRCQYN

Query:  GGDRIRYDLQYD----CSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKD-FMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDF
         G R   +   +     +   P  G  GG   LW+ +  + V +++  H+   + ++D F+W  TG YG P  +    +W L+K L+     PWV+  DF
Subjt:  GGDRIRYDLQYD----CSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKD-FMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDF

Query:  NEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNE------------------------------------PINLRLVAPRRPR
        N     SEK      Q S ++ FR+ +  C L D G+ G  +TW N +  E                                    P+ L + +  +PR
Subjt:  NEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNE------------------------------------PINLRLVAPRRPR

Query:  -------RFEEAWCKYEKCRDIVQQVW-QSRGNID-VSNFVVKTQLCIDRLSHWSRLKYGCSI----RGAIARIEKEIQGLSNLE-GGTYQNMLLEKETE
               +FEE+W   ++C  ++Q+ W    GN D ++    K + C   L  W     G SI     GAI  I+K++  L+  E     +   L    +
Subjt:  -------RFEEAWCKYEKCRDIVQQVW-QSRGNID-VSNFVVKTQLCIDRLSHWSRLKYGCSI----RGAIARIEKEIQGLSNLE-GGTYQNMLLEKETE

Query:  LDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNN
        +D+LL+  EIYW QR+R +WL+ GD NTK+FH +A+ RR++N IR + +  G W E   E+  +A  YF  LFQ+     + +++ L++  T +  D   
Subjt:  LDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNN

Query:  ELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLA
         L   FT EE+   +  M PTKA GPDG+ A+FY+KFW IVGD      L +LN G  + +IN T IVLIPKV++P+ M +FRPISLCNVIYKII+K LA
Subjt:  ELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLA

Query:  NRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA------------------------------------------
        NRLKQ+L +IIS  QSAFVP RLITDN ++ +E +H +++RKKGK G V LKLD+SKA                                          
Subjt:  NRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA------------------------------------------

Query:  ----------------------------AVGLLNQADNNRVLTD---CR--------------------------AIQDILDTYAKASGQTINLDKYNFM
                                       LLN+A+ N ++T    CR                           I +IL  Y +ASGQ+INL+K +  
Subjt:  ----------------------------AVGLLNQADNNRVLTD---CR--------------------------AIQDILDTYAKASGQTINLDKYNFM

Query:  VSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNAL
         S NT E     I E+L V+      +YLGLP+   R K   FS +KDRVWK LQGWK  L  S  GKEILIK++AQ IP Y MS F+ PL L +EL AL
Subjt:  VSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNAL

Query:  CARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------
        CARFWWG   +++KIHW+SWDKL   K  GGMGFR                                                                 
Subjt:  CARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------

Query:  YRWRIGNGLSVDVVYEPWILKEGSAKPI-LVCPDICHLTVAHLKDLN-GLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL
        Y WR+GNG S++ V + W+    + K +  V  D   + VA L +    +WN   IR  F   +AEAI  +P S     D I W Y  +G +SVKSAY +
Subjt:  YRWRIGNGLSVDVVYEPWILKEGSAKPI-LVCPDICHLTVAHLKDLN-GLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL

Query:  GFNV-KQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVW
           +   +    +S     + +W + WK+ LP K+K                             IC  + E   H  W+C   +++W
Subjt:  GFNV-KQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVW

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]6.5e-16131.47Show/hide
Query:  LVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDI
        +VFLMET+ + N  +RI   LQ+     +P V   GG  LLW+    V V+S+S  HID I+    D  WRFTG YG+P       +W+L++ L     +
Subjt:  LVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDI

Query:  PWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQF------------------------------------NEPINL-
        PW+   DFNE+  G EK G + R E  MQ FRD +D C L D G++G  FTWCN +                                     ++PI L 
Subjt:  PWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQF------------------------------------NEPINL-

Query:  ------RLVAPRRPRRFEEAWCKYEKCRDIVQQVW-QSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTY---QNMLL
              R     RP RFE  W K + C  ++Q  W  +R    +  F  K +     L  W+R  +G  +R ++A+  KE++ +   E G Y      + 
Subjt:  ------RLVAPRRPRRFEEAWCKYEKCRDIVQQVW-QSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTY---QNMLL

Query:  EKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIM
           TE++ L   +E  WKQR+R  WLK GD NT +FH RA  R KRN I  L D+ G W +++ ++  +   YF+ +F SS P+    ++IL S    I 
Subjt:  EKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIM

Query:  GDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKII
         + +  L R +  EE+L  ++ M P  A GPDG+  +FYK +W IVG+D   I L  LN G+    +N TFI LIPKV++PK + +FRPISLCNV+YK+I
Subjt:  GDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKII

Query:  AKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-------------------------------------
        AK + NRLK++L  +I  +QSAF+  RLITDN ++ FE +H +  + +G+ G + LKLDMSKA                                     
Subjt:  AKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-------------------------------------

Query:  ----AVGLLNQADNN-------------RV----------------LTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKS
              GL+ +A+NN             RV                + +C+ I D+L  Y K SGQ IN +K N   S NT  +M   I+ +L V   + 
Subjt:  ----AVGLLNQADNN-------------RV----------------LTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKS

Query:  LGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLC
          +YLGLP+   R KK+ F+ IK+RVW+ LQGWK KL  S  G+E+LIKS+ Q IP Y MSCFK P  L  EL  L  +FWW  +   +K+HW  W++LC
Subjt:  LGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLC

Query:  HNKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGS
         +K  GGMGF+                                                                   WRIGNG SV +  + W+  + +
Subjt:  HNKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGS

Query:  AKPILVCPDIC-HLTVAHL--KDLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWK
           I   P +     V+ L   DL G WN  ++ + FL   A  I  MP S+ +  D IIW+    G ++ KSAY+L      + H  +S+   Q + W+
Subjt:  AKPILVCPDIC-HLTVAHL--KDLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWK

Query:  SYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVW------HKYFPISNYLFQNLIIHNKQIPKTECLQQH
          W++ +P K+K                              C  +PED  H  W C   + VW        Y       FQ L+  +K +   E  +  
Subjt:  SYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVW------HKYFPISNYLFQNLIIHNKQIPKTECLQQH

Query:  INKVISELIGEEDMYLPSEL-IAPLE--SHRNGSWSS-------------NEMTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA
        I   I+ ++      L   L + P+   S   GS+               +  +P+  W       +K N DA+    +   G+G I+RDWRG  + A
Subjt:  INKVISELIGEEDMYLPSEL-IAPLE--SHRNGSWSS-------------NEMTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA

XP_030939698.1 uncharacterized protein LOC115964550 [Quercus lobata]3.4e-16227.81Show/hide
Query:  MHKIWNRDHTI-ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL
        +  +W   + + I +VG  +F  +FK       ++E GPW FD  L++L+  + G   + + F  +  W+    LPF   +++   EIG  LG+V +VD 
Subjt:  MHKIWNRDHTI-ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDL

Query:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR
           +  +    +R+++ I + KP++RG ++ +  G+   +   YE+    CY CG +GH +K  +C +   S   E PYGDWL     F +       A+
Subjt:  DEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPAR

Query:  LNATMVGRGRGRTWEGGRSDWRKRFSDD---NSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDSGINEGVS-----
         NA           E  +S      S D    S    D  ++  +G + +CP S        S        V T  L        E   IN GVS     
Subjt:  LNATMVGRGRGRTWEGGRSDWRKRFSDD---NSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDSGINEGVS-----

Query:  AKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHAD---VILQPEIRDNYGMAKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVFLMETRCQYN
         +G      + +   + N  +I   +N +          HAD    I QP  R+   + + +  V            L H++R    +++FLMET+    
Subjt:  AKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHAD---VILQPEIRDNYGMAKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVFLMETRCQYN

Query:  GGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMT
            I+ DL Y C   +PSV +SGG  LLW +E D+ +++++  HID +I       WR TG YG P  +   E+W L+K L     +PW+   DFNE+ 
Subjt:  GGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMT

Query:  SGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRL------------------------------------------VAPRR
           EKQGG+P+  + M +FR+ +  CGL+D GY G  FTW N + ++ +  RL                                          +  + 
Subjt:  SGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRL------------------------------------------VAPRR

Query:  PRRFEEAWCKYEKCRDIVQQVWQS--RGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEI
        P RFEE W  +  C +I+Q  W S       ++    K + C   L  WSR+ +G S +  +   +K ++ L           +   + E+ N++  DE+
Subjt:  PRRFEEAWCKYEKCRDIVQQVWQS--RGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEI

Query:  YWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEE
        +W+QR+R  WL  GD NTK+FH RA+ RR++N I  + D    W   D ++  +A  YF+ LF ++ P  + ++ +L+S    +    N  L R +T +E
Subjt:  YWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEE

Query:  ILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKI
        +   +  MHP+K+ GPDG+   F++K+W I+G+D  +  L +L     ++ +N T IVLIPK +DPK + D+RPISL NV+ +II+K +ANRLK +L  +
Subjt:  ILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKI

Query:  ISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA----------------------------------------------------
        IS +QSAFVP+RLITDN  + +E +H + +R++GK G + +KLD+SKA                                                    
Subjt:  ISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA----------------------------------------------------

Query:  ------------------AVGLLNQADNNRVLT-----------------------------DCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMA
                             LLN+A  NR++                              +C+ +  IL  Y  ASGQ IN  K +   S NT  E+ 
Subjt:  ------------------AVGLLNQADNNRVLT-----------------------------DCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMA

Query:  TIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTE
         +I++ +  +   +  +YLGLP    ++K   F  +++++ K + GWK K F S  G+E+LIK++AQ IP YAMS FK P T+ + +N+L A++WWG  +
Subjt:  TIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTE

Query:  SKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLS
         ++KIHW +W KLC +K  GGMGFR                                                                  +WR+G+G +
Subjt:  SKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------YRWRIGNGLS

Query:  VDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNG-LWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTS
        V V    W+    S  PI +      + V  L D+    W+   I   F     + IL +P       D ++W  +   K++VK+AY+L   + +     
Subjt:  VDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNG-LWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTS

Query:  SSNYKAQERMWKSYWKINLPQKIK
         S  +A + +W   W +N+P K++
Subjt:  SSNYKAQERMWKSYWKINLPQKIK

TrEMBL top hitse value%identityAlignment
A0A2N9F6L9 Reverse transcriptase domain-containing protein1.7e-18328.74Show/hide
Query:  ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSL
        I +VG NM L +F++      ++   PW +DK L+       G   +++ F  V FWI  H LP +   +KVAV +G  +GKV      E+ +   G  +
Subjt:  ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSL

Query:  RMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECE--CDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPARLNATMVGRGR
        R+++++D+ KPL RG  +    G + W+   YE+  NFCY CG   H  ++CE     P+      + YG WL       +        ++  T+   G 
Subjt:  RMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECE--CDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPARLNATMVGRGR

Query:  GRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSP--AAPHPPTVPTGELLEMVTAICEDSGINEGVSAKGKNEISQKASGSSS
         R   GG+  W      +++  PK              P     QPP  SP  A  H   +   E         + +G N     +   EI    + +  
Subjt:  GRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSP--AAPHPPTVPTGELLEMVTAICEDSGINEGVSAKGKNEISQKASGSSS

Query:  NNGPVIGGALNEQMEVDDVGLGPHADVILQP------------EIRDNYGMAKQNTQVKEG-------------------LGNPRT--------------
        N+G  +   L +   V   G        +QP            ++ + +   ++ ++ K G                   L   RT              
Subjt:  NNGPVIGGALNEQMEVDDVGLGPHADVILQP------------EIRDNYGMAKQNTQVKEG-------------------LGNPRT--------------

Query:  ----LRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRE
            L+ L  ++R      VFLMET    +  + +R +L +   + + S  + GG  L W  + DV+++SYS  HID +I   K   WR TG+YG P   
Subjt:  ----LRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRE

Query:  LHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRLV----------------
          H+TW LI+RL   S + W    DFNE+   +E  G   R +  MQ FR+V+D CGL+D G++G  FTWCNN+  +P N   V                
Subjt:  LHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRLV----------------

Query:  ----------------------------APRRPRRFEEAWCKYEKCRDIVQQVWQS-RGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEI
                                      R+P RFEE W     C   +++ W S +    +     K + C  +L  WSR  +G ++   I   + E+
Subjt:  ----------------------------APRRPRRFEEAWCKYEKCRDIVQQVWQS-RGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEI

Query:  QGLSNL--EGGTYQNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSE
        +    L  +G +++N+ + +   L++L E +E  W+QR+R  WL  GD NTK+FH RA  R +RN+I  L D+ G+  +    M ++   Y+ +LF + +
Subjt:  QGLSNL--EGGTYQNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSE

Query:  PNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPK
        P  + I+ ++      +  D N  L+R FT  E+   ++ M PTKA GPDG+  VFY+KFW +VG D  K  L  LN G  ++ IN TFI LIPK ++P+
Subjt:  PNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPK

Query:  SMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-------------------
         + +FRPISLCNVIYK+I+K LANRLK +L +I+S +QSAFVP RLITDN ++ FE +H ++  K G+DG + LKLDMSKA                   
Subjt:  SMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA-------------------

Query:  ------------------------------------------------------------------AVGLLNQ---------ADNNRVLTD-----CRAI
                                                                           + L  Q         AD++ + +      C  I
Subjt:  ------------------------------------------------------------------AVGLLNQ---------ADNNRVLTD-----CRAI

Query:  QDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQ
        QDIL  Y KASGQ +N DK     S  T      IIK  L+V   +   +YLGLPS   RN+ + FS IK+RVW+ L+GWK KL  S  G+EILIK++AQ
Subjt:  QDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQ

Query:  TIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-------------------------YR---------------
         IP Y+MSCF+ P  L N+L A+  RFWW     ++KIHW SW KLC +K  GG+GFR                         YR               
Subjt:  TIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-------------------------YR---------------

Query:  -------------------------WRIGNGLSVDVVYEPWILKEGSAKPILVCPDI-CHLTVAHL-KDLNGLWNENIIRESFLDTDAEAILNMPSSSTM
                                 WR+GNG S+++  + W+L++   K +   P+I  H TV  L      +W+  +I   F   DAEAI ++P S+  
Subjt:  -------------------------WRIGNGLSVDVVYEPWILKEGSAKPILVCPDI-CHLTVAHL-KDLNGLWNENIIRESFLDTDAEAILNMPSSSTM

Query:  GEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQK-----------------------------IKICGEKPEDLRHLF
          D++IW  +  G YSV+S YRL  + ++      S     +  W+S W +N+P+K                              +ICG   ED  H  
Subjt:  GEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQK-----------------------------IKICGEKPEDLRHLF

Query:  WECKFTKEVWHKYFPISNYL------FQNL---IIHNKQIPKTE-------CLQQHINKV--------ISELIGEEDMYLPSELIAPLESHRNGSWSSNE
        W CK  ++VW +     +        F +L   +IH  +  + E        L Q  NK+        ++++  +   YL   L A  +   N     ++
Subjt:  WECKFTKEVWHKYFPISNYL------FQNL---IIHNKQIPKTE-------CLQQHINKV--------ISELIGEEDMYLPSELIAPLESHRNGSWSSNE

Query:  MTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA
          P   W+   +   KIN D +   +  + G+G I+RD  G  L++
Subjt:  MTPRAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA

A0A2N9G3W9 Uncharacterized protein1.5e-17929.45Show/hide
Query:  NVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSLRM
        N+G N  +  F+N      +M  GPW FDK L++LK          + F   +FW+  H LP    +  V  +I S LG+V+ V+ D       G  +R 
Subjt:  NVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSLRM

Query:  KIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSS--SEELPYGDWLY-EPPFFFKGRETTSPARLNATMVGRGRG
        ++ +DV +PL RG  +     ++ W+   +E+   FCY CG V H  ++C   + S  +  +E   YG WL  +     +   +T   R  A   G   G
Subjt:  KIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGSS--SEELPYGDWLY-EPPFFFKGRETTSPARLNATMVGRGRG

Query:  RTWEGGRSDWRKRFSD----DNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAP---HPPTVPTGELLEMVT---AICEDSGINEGVSAKGKNEISQ
         T   G +  ++  S+        D   +T+  K+ +         FQ  V+ P  P   +  +VP      M T      E S       A+G      
Subjt:  RTWEGGRSDWRKRFSD----DNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAP---HPPTVPTGELLEMVT---AICEDSGINEGVSAKGKNEISQ

Query:  KASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGM--AKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQ
        K         P+   + +++          H  +I    +RD       + N   +    +  T+  L  ++R     ++FL ET+   N  + +R   +
Subjt:  KASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGM--AKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQ

Query:  YDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVP
        +     +PS G SGG ++ W     V++ SYS  HID I+  +    WRFTG YG+P  E     W +++ LR+  D+PW+   DFNE+ S  EK G   
Subjt:  YDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVP

Query:  RQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRLV-------------------------------------APRRPR----RFEEAWCKY
        R E+ M+ FR V+D CG +D G SGP FTWCNNQ  +   L  +                                     + R+P+    RFEE W   
Subjt:  RQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRLV-------------------------------------APRRPR----RFEEAWCKY

Query:  EKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDR--LSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNM--LLEKETELDNLLEDDEIYWKQRARED
         +C D VQQ W +  +  +  F V  ++   R  L+ WSR  +G S+R +I    K++Q  +       QN+  +     EL NL   +E  WKQR+R  
Subjt:  EKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDR--LSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNM--LLEKETELDNLLEDDEIYWKQRARED

Query:  WLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMH
        WL+  DCNT++FH +A  R++RN I+ + D+ G+W E + ++ +    Y++ LF SS P   A++++L+     +  + N++LL+ FT  E+   +  M 
Subjt:  WLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMH

Query:  PTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFV
        P KA GPDG+  +FY+K+W IVG D     L  L  G+ ++ IN T I LIPK+ +P S KDFRPISLCNVIYKI+AK LANRLK++L  +IS  QSAFV
Subjt:  PTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFV

Query:  PSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------------AVGL-------------------
        P R I+DN ++ FE +H ++  ++G+ G + LKL+MSKA                                      A GL                   
Subjt:  PSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------------AVGL-------------------

Query:  --------LNQADNN-----RVLTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKD
                L  AD++       L +C  I +IL TY KASGQ +N DK     S +T  E    IK  LQ+   K    YLGLPS   R+K   F+ +K+
Subjt:  --------LNQADNN-----RVLTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKD

Query:  RVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCAR-FWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR------------
        R+W+ +QGW+ +L     GKE++IK++ Q IP Y M CFK P  L ++L  +  R FWWG     +++HW  W  LC  K  GGMGFR            
Subjt:  RVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCAR-FWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR------------

Query:  -----------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGSAKPIL----VCPDICHLTVAHLK
                                                              RWRIGNG S  + ++ W+    S KP+     +  D C   V+HL 
Subjt:  -----------------------------------------------------YRWRIGNGLSVDVVYEPWILKEGSAKPIL----VCPDICHLTVAHLK

Query:  DLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK----------
           G+WN+ +I + F   DA+ I  +  S    +D++IW+ +  G YSV+SAYRL      +     S+     + WK  W I +P K++          
Subjt:  DLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK----------

Query:  -------------------ICGEKPEDLRHLFWECKFTKEVWHKYFPISNYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHR
                            C    ED+RH  W C     VW      SN      ++ +K   +  C    I   +  L   E + +   ++  L + R
Subjt:  -------------------ICGEKPEDLRHLFWECKFTKEVWHKYFPISNYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHR

Query:  NGSWSSNEMTP--------RAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA
        N +   NE  P        R  W       +K+N DA+   + Q  GVG I+RD +G P++A
Subjt:  NGSWSSNEMTP--------RAGWLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSA

A0A2N9GLU2 Reverse transcriptase domain-containing protein3.0e-18029.41Show/hide
Query:  NVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQV-DLDEKIDQKWGGSLR
        N+G N  +  F++      +M   PW FDK L++L+          + F   +FW+  H LP    +  V  +I   LG+V+ V DLDE   +  G  +R
Subjt:  NVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQV-DLDEKIDQKWGGSLR

Query:  MKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGS--SSEELPYGDWLYEP-----------PFFFKGRETTSP-A
         +I ++V +PL RG  +      + W+   +E+  NFCY CG V H  ++C   + S    S E   YG WL               F    R+ T+P A
Subjt:  MKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSGS--SSEELPYGDWLYEP-----------PFFFKGRETTSP-A

Query:  RLNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDSGINEGV------SA
           AT +   +  T   G +    + +D    D   + +  K   D     S G     KSPA    P   T   L + T       +   V      SA
Subjt:  RLNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDSGINEGV------SA

Query:  KGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQP--EIRDNYGMAKQNTQVKEGLGNP-----RTLRGLWHIIRSHKSQLVFLMETRC
        +  N+  +K +GSS     V+   ++    V        +    QP  ++     + K    V  G+ +P      T+  L  ++R     ++FL ET+ 
Subjt:  KGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQP--EIRDNYGMAKQNTQVKEGLGNP-----RTLRGLWHIIRSHKSQLVFLMETRC

Query:  QYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQ-KKDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFN
             + +R   ++     +PS GQSGG ++ W     V++ SYS+ HID ++  +    WRFTG YG+P  E     W++++ LR   ++PW+   DFN
Subjt:  QYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQ-KKDFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFN

Query:  EMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNE----------------------------------------PINLRLVAP-
        E+ SGSEK G  PR E  M  FR V+D CG +D G++GP +TWCNNQ  +                                         +N  +  P 
Subjt:  EMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNE----------------------------------------PINLRLVAP-

Query:  --RRPRRFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDR--LSHWSRLKYGCSIRGAI----ARIEKEIQGLSNLEGGTYQNMLLEKETELD
          RR  RFEE W    +C D +QQ W++        F V  ++   R  L  WS+  +G S+R +I     ++E+E   +  L+      ++     EL 
Subjt:  --RRPRRFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDR--LSHWSRLKYGCSIRGAI----ARIEKEIQGLSNLEGGTYQNMLLEKETELD

Query:  NLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNEL
        +L   +E  WKQR+R  WL+ GD NT++FH +A  R++RN I+ +MD+ GIW +E++++E     Y+++LF S+ P    +D++L      +  + N++L
Subjt:  NLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNEL

Query:  LRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANR
        +R FT  E+   +  M P KA GPDG+  +FY+K+W IVG D     L  L  G+ ++ IN T I LIPK+++P S KDFRPISLCNVIYKI+AK LANR
Subjt:  LRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANR

Query:  LKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------------------
        LKQ+L  +IS  QSAFVP RLI+DN ++ FE +H ++  ++G+ G + LKLDMSKA                                            
Subjt:  LKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------------------

Query:  -----------------------AVG---LLNQADNNR-----------------------------VLTDCRAIQDILDTYAKASGQTINLDKYNFMVS
                               A G   LL++A  +R                              L +C  I +IL  Y +ASGQ IN DK     S
Subjt:  -----------------------AVG---LLNQADNNR-----------------------------VLTDCRAIQDILDTYAKASGQTINLDKYNFMVS

Query:  PNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCA
         +T       IK+ LQ+   K    YLGLPS   R+K   FS +K+ +W+ +QGWK KL   + GKE+LIK++ Q IP Y M CFK P  L ++L  +  
Subjt:  PNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCA

Query:  RFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------YR
         FWWG  ++ +K+HW  W  LC  K  GGMGFR                                                                  R
Subjt:  RFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-----------------------------------------------------------------YR

Query:  WRIGNGLSVDVVYEPWILKEGSAKPI---LVCP-DICHLTVAHLKDLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL
        WRIGNGLS  + ++ W+    S KP+    V P D C  ++  + D +GLWN ++I   F   DA+ I  +  SS  GED++IW+ +  G YSV+SAYRL
Subjt:  WRIGNGLSVDVVYEPWILKEGSAKPI---LVCP-DICHLTVAHLKDLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL

Query:  GFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWH------KYFPIS
              +      +    ++ WK  W + +P K++                              C  + ED+RH+ W C     VW       K F   
Subjt:  GFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWH------KYFPIS

Query:  NYLFQNLIIHNKQIPKTECLQQHI----------NKVI-------SELIGEEDMYLPSELIAPLESHRNGSWSSNEMTPRAGWLAIPNGIWKINCDASWS
        ++ F +++     + K++ + + I          NK++        +LI    + L SE +    S +    +S +  PR  W      ++K+N DA+  
Subjt:  NYLFQNLIIHNKQIPKTECLQQHI----------NKVI-------SELIGEEDMYLPSELIAPLESHRNGSWSSNEMTPRAGWLAIPNGIWKINCDASWS

Query:  DENQQGGVGWILRDWRGYPLSA
         E Q  GVG I+RD +G P++A
Subjt:  DENQQGGVGWILRDWRGYPLSA

A0A2N9IIR5 Uncharacterized protein2.2e-17829.29Show/hide
Query:  ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSL
        + ++G N  +  F++G   + ++   PW +DK L+IL+        D + F  + FW+  H LP    +  VA  IGS LG+V++  +D +     G  +
Subjt:  ISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSL

Query:  RMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSG--SSSEELPYGDWLYEPPFFFKGRETTSPARLNATMVGRGR
        R++I +D  KPL RG  +    G+  W+   +E+  NFCY CG + H +K+C+  + SG     E   YGDWL                R  A  +G+ R
Subjt:  RMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEK-SNFCYGCGYVGHIIKECECDMPSG--SSSEELPYGDWLYEPPFFFKGRETTSPARLNATMVGRGR

Query:  -----GRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPD--SSGFQPPVKSPA----------APHPPTVPTGELLEMVTAICEDSGINEGVSA
             G   E  R+DW+      ++K   +++      +D   PD  +   + P K  A          +  PP V   E L  +     D  I +  +A
Subjt:  -----GRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPD--SSGFQPPVKSPA----------APHPPTVPTGELLEMVTAICEDSGINEGVSA

Query:  KGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHA---DVILQPEIRDNYGMAKQNTQVK------------------EGLGNPRTLRGLWHIIR
          + E  + +S    + G V  G+   Q ++             +  +  +     +   N+ +                    GLGNPRT++ L  ++ 
Subjt:  KGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHA---DVILQPEIRDNYGMAKQNTQVK------------------EGLGNPRTLRGLWHIIR

Query:  SHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLR
        +   + VF++ET    +  ++IR  L +   + +P   + GG +L W++   +T++S+S  HID II +     WRFTG YG P     H +W +++ L 
Subjt:  SHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKKDF-MWRFTGIYGNPRRELHHETWTLIKRLR

Query:  DDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFN-------------------------------------
            +PW    DFNE+ S SEK+GG PR ++ MQ FRDV+D CG  D G+ GP+FTWCNN+ N                                     
Subjt:  DDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFN-------------------------------------

Query:  ----EPINLRLVAPRRPRRFEEAWCKYEKCRDIVQQVWQSRGN----IDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQ--GLSNLEGGTY
             P ++ +   ++  RFE  W   E CR+ V+  W+S  +    + V N   +  LC  RLS+WSR  +G S+R  +A    ++Q   L +++GG +
Subjt:  ----EPINLRLVAPRRPRRFEEAWCKYEKCRDIVQQVWQSRGN----IDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQ--GLSNLEGGTY

Query:  QNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILEST
         N ++    EL  LL  +E  W QR+R  WLK GD NT++FH RA+ RR+RN I  L D+ G W ++ +++  +A  YF+ LF+++ P    I ++    
Subjt:  QNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILEST

Query:  PTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNV
           +  D NN L + +T EE+   ++ M P  A GPDG+  +FY+ FW +VG+D     L  LN G  ++ IN T+I LIPKV+ P+++ +FRPISLCNV
Subjt:  PTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNV

Query:  IYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------
        +YKII+K +ANRLK ++ ++IS  QSAFVP RLITDN ++ FE +H +     G+ G + LKLDMSKA                                
Subjt:  IYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------------------------

Query:  --------------------------------------AVGLLNQADN-----------------------------NRVLTDCRAIQDILDTYAKASGQ
                                                GLL  A +                                + +C+ + DIL TY  ASGQ
Subjt:  --------------------------------------AVGLLNQADN-----------------------------NRVLTDCRAIQDILDTYAKASGQ

Query:  TINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFP
         IN +K     S +T       IK  LQV   +S   YLGLPS   R+K + F+ IK+RVW  LQGWK +L  S  G+E+LIK++ Q +P ++MSCF+ P
Subjt:  TINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFP

Query:  LTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-------------------------------------------------------
          L ++L  L  RFWWG  + K+KI W SW KLC  K  GGMGFR                                                       
Subjt:  LTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR-------------------------------------------------------

Query:  ----------YRWRIGNGLSVDVVYEPWILKEGSAKPILVCPD-ICHLTVAHLKD-LNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKG
                   RWRIGNG  V +  + W+        I  CP+ +   TV+ + D    +WN + I+   L  + EAI ++P S     D I W     G
Subjt:  ----------YRWRIGNGLSVDVVYEPWILKEGSAKPILVCPD-ICHLTVAHLKD-LNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKG

Query:  KYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWHK
         +SV+SAYRL    + S   SSS+      +WK+ W + +P  ++                              C +  ED  H  W C     +W +
Subjt:  KYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-----------------------------ICGEKPEDLRHLFWECKFTKEVWHK

A0A2N9IWN7 Uncharacterized protein1.8e-18030.96Show/hide
Query:  WINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPIS--YEK-SNFCYGCGYVGHIIKECEC
        W+  H LP        A +IG  +GKV Q + DE+ +  WG  +R+++ IDV KPL RG   +  +G+D+ I +S  YEK  NFCY CG + H  K+C  
Subjt:  WINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGSLRMKIQIDVGKPLKRGIFLESNMGEDRWIPIS--YEK-SNFCYGCGYVGHIIKECEC

Query:  DMPSGSSSE--ELPYGDWLYEPPFFFKGRETTSPAR--LNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPA
         + +  + E  +  YG WL  PP     R+T S       +          +  G+SD  K      S    D+         +  P+SS F P +    
Subjt:  DMPSGSSSE--ELPYGDWLYEPPFFFKGRETTSPAR--LNATMVGRGRGRTWEGGRSDWRKRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPA

Query:  APHPPTVPTGELLEMVTAICEDSGINEGVSAKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGMAKQNTQVKEGLGNPR
          +PP            +I  ++ +  G  A  +  +  + + S ++  P++   +  Q+       G     +  P +  +Y M         GLG P 
Subjt:  APHPPTVPTGELLEMVTAICEDSGINEGVSAKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPHADVILQPEIRDNYGMAKQNTQVKEGLGNPR

Query:  TLRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRELHH
        T++ L  ++R     ++F++E+       + +R    +   + +P   Q GG  + W++E  V+++S+S  HID II + +   WRFTG YG       H
Subjt:  TLRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQK-KDFMWRFTGIYGNPRRELHH

Query:  ETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNN------------------------------
        E+W+L++ L   S +PW    DFNE+ S  EK+GG  R    MQDFRD ID CG  D GY G  FTWCNN                              
Subjt:  ETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNN------------------------------

Query:  --------------QFNEPINLRLVAPRRPRRFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVV--KTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQG
                      QF+ P+N R   P R  RF+E W  +  C++ +   WQ++ +   + F V  K + C + L  WSR  +G   R     ++K+   
Subjt:  --------------QFNEPINLRLVAPRRPRRFEEAWCKYEKCRDIVQQVWQSRGNIDVSNFVV--KTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQG

Query:  LSNLEGGTYQNMLLEK----ETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSE
        L   E  + +    EK    + E+  LL  +E  W+QR+R+ WL+WGD +T +FH  A  RR+RN I  + D  G     D ++      +F  LF SS 
Subjt:  LSNLEGGTYQNMLLEK----ETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSE

Query:  PNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPK
        P     D  L      +  + N EL+R FT EE+   ++ M P+ A GPDG+  +FY+  W +VG D  +  L  LN G  ++ +N T+I LIPK + P+
Subjt:  PNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPK

Query:  SMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------AVGLL
         + DFRPISLCNVIYKI++K + NRLK +L KIIS  QSAFVP RLITDN ++ FE +H + + + G+ G + LKLDMSKA               +G  
Subjt:  SMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA--------------AVGLL

Query:  NQ----------------------ADNNRVL-----TDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQN
         Q                       DNN +       DC AI  IL  Y  ASGQ +N DK     S +T   ++T +  +L V   +   +YLGLPS  
Subjt:  NQ----------------------ADNNRVL-----TDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQN

Query:  ARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR
         R++K+ F  IK+R+W  L GWK KL  S  GKEILIK++ Q IP Y+MSCFK P+TL +E+  L  +FWW      KKIHW +W+ LC  K  GGMGFR
Subjt:  ARNKKKIFSGIKDRVWKALQGWKWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFR

Query:  ----------------------YRWR----------------IGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNGL----WNENIIRESF
                              Y W+                IGNG SV +  + W+ +    +  ++ P       A +  L  L    W E +I   F
Subjt:  ----------------------YRWR----------------IGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNGL----WNENIIRESF

Query:  LDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-------------------------
        L  +A  IL +P S    +D+IIW +   G +S++SAY     V Q    SSSN      +W   W + +P KI+                         
Subjt:  LDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIK-------------------------

Query:  -ICGE---KPEDLRHLFWECKFTKEVWHKYFPISNYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHRNGSWSSNEMTPRAGW
         IC E     ED+ H+ W C    ++W+     ++ +F   +I      K       +  +I +   EE   + +     L  HRN   + N +     W
Subjt:  -ICGE---KPEDLRHLFWECKFTKEVWHKYFPISNYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHRNGSWSSNEMTPRAGW

Query:  LAIPNGIWKINCDASWSDENQQGGVGWILRDWRG
             G WKIN + ++S+ +Q  G+G ++ D  G
Subjt:  LAIPNGIWKINCDASWSDENQQGGVGWILRDWRG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.0e-2126.96Show/hide
Query:  WSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEIYWKQRAR------EDWLKWGDCNTKWFHMRAN-----------NRRKR
        W   K  C  +       K  Q  S ++  T Q   LEK+ +  +     +   K RA       +  L+  + +  WF  R N            +R++
Subjt:  WSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEIYWKQRAR------EDWLKWGDCNTKWFHMRAN-----------NRRKR

Query:  NKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILES-TPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDI
        N+I  + +D G  T +  E++     Y++ L+ +   NLE +D  L++ T   +  ++   L R  T  EI+ ++  +   K+ GPDG  A FY+++ + 
Subjt:  NKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILES-TPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDI

Query:  VGDDTYKICLQYLNGGLSIRDINKTFIVLIPKV-RDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVN
        +     K+       G+      +  I+LIPK  RD    ++FRPISL N+  KI+ K LANR++Q + K+I  +Q  F+P      N       I  +N
Subjt:  VGDDTYKICLQYLNGGLSIRDINKTFIVLIPKV-RDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVN

Query:  SRKKGKDGVVTLKLDMSKA
         R K K+ V+ + +D  KA
Subjt:  SRKKGKDGVVTLKLDMSKA

P08548 LINE-1 reverse transcriptase homolog1.8e-1729.39Show/hide
Query:  WFHMRAN---------NRRKRNK-----IRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELL-RSFTKEEILGV
        WF  + N          R+KR K     IR   D++   T + +E++ I N Y++ L+     NL+ ID+ LE+     +  +  E+L R  +  EI   
Subjt:  WFHMRAN---------NRRKRNK-----IRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELL-RSFTKEEILGV

Query:  VRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKV-RDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISP
        ++ +   K+ GPDG  + FY+ F + +      +       G+      +  I LIPK  +DP   +++RPISL N+  KI+ K L NR++Q + KII  
Subjt:  VRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKV-RDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISP

Query:  NQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA
        +Q  F+P      N       I  +N + K KD ++ L +D  KA
Subjt:  NQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA

P11369 LINE-1 retrotransposable element ORF2 protein3.7e-1829.28Show/hide
Query:  RKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPT-SIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKF
        R +  I ++ ++ G  T +  E++     +++ L+ +   NL+ +DK L+      +  DQ + L    + +EI  V+  +   K+ GPDG  A FY+ F
Subjt:  RKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPT-SIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKF

Query:  WDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPK-VRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIH
         + +    +K+  +    G       +  I LIPK  +DP  +++FRPISL N+  KI+ K LANR+++ +  II P+Q  F+P      N       IH
Subjt:  WDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPK-VRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIH

Query:  AVNSRKKGKDGVVTLKLDMSKA
         +N + K K+ ++ + LD  KA
Subjt:  AVNSRKKGKDGVVTLKLDMSKA

P14381 Transposon TX1 uncharacterized 149 kDa protein4.2e-2227.65Show/hide
Query:  SIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIAN
        ++ G +  +E   Q LS  E    Q   LE++  L N+ +        R+R   L   D  +++F+     +  R +I  L  + G   E+   +   A 
Subjt:  SIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIAN

Query:  RYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRS-FTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKT
         +++ LF     + +A +++ +  P  ++ ++  E L +  T +E+   +R M   K+ G DGL   F++ FWD +G D +++  +    G       + 
Subjt:  RYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRS-FTKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKT

Query:  FIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA
         + L+PK  D + +K++RP+SL +  YKI+AK ++ RLK +L ++I P+QS  VP R I DN  L  + +H   +R+ G   +  L LD  KA
Subjt:  FIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA

P93295 Uncharacterized mitochondrial protein AtMg003107.0e-0946.55Show/hide
Query:  IPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNK-NSGGMGFR
        +P YAMSCF+    L  +L +    FWW + E+K+KI W +W KLC +K + GG+GFR
Subjt:  IPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNK-NSGGMGFR

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.9e-1830.16Show/hide
Query:  EIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNL--EAIDKILESTPTSIMGDQNNELLRSF
        E +++Q++R  WL+ GD NT++FH      + +N I+ L  D  +  E   +++ +   Y+  L  S    L  +++ +I +  P        + L    
Subjt:  EIYWKQRAREDWLKWGDCNTKWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNL--EAIDKILESTPTSIMGDQNNELLRSF

Query:  TKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKII
        + +EI   V  M   KA GPD   A F+ + W +V D T     ++   G  ++  N T I LIPKV     +  FRP+S C V+YKII
Subjt:  TKEEILGVVRGMHPTKALGPDGLQAVFYKKFWDIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKII

AT3G09510.1 Ribonuclease H-like superfamily protein1.4e-0419.94Show/hide
Query:  RWRIGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNG---LWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL
        R  IG+G ++ +  +  I+     +P+        +T+ +L +  G    W+++ I +    +D   I  +  + +   D+IIWNY+  G+Y+V+S Y L
Subjt:  RWRIGNGLSVDVVYEPWILKEGSAKPILVCPDICHLTVAHLKDLNG---LWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRL

Query:  GFNVKQSGHTSSSNYKA----------QERMWK---------SYWK----------------INLPQKIKICGEKPEDLRHLFWECKFTKEVWHKYFPIS
                H  S+N  A          + R+W            W+                + +      C  + E + H  + C F    W       
Subjt:  GFNVKQSGHTSSSNYKA----------QERMWK---------SYWK----------------INLPQKIKICGEKPEDLRHLFWECKFTKEVWHKYFPIS

Query:  NYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHRNG----------------------SW----SSNEMTP---------RAG
        + L +N ++ N      E +   +N V    + +    LP  LI  +   RN                        W     S++ TP         +  
Subjt:  NYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHRNG----------------------SW----SSNEMTP---------RAG

Query:  WLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSAG
        W   P    K N DA +  +  +   GWI+R+  G P+S G
Subjt:  WLAIPNGIWKINCDASWSDENQQGGVGWILRDWRGYPLSAG

AT4G20520.1 RNA binding;RNA-directed DNA polymerases8.8e-0743.33Show/hide
Query:  LANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA
        +  RLK L+  +I P Q++F+P R+ TDN +   E +H++  RKKG  G + LKLD+ KA
Subjt:  LANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGVVTLKLDMSKA

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.0e-1046.55Show/hide
Query:  IPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNK-NSGGMGFR
        +P YAMSCF+    L  +L +    FWW + E+K+KI W +W KLC +K + GG+GFR
Subjt:  IPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNK-NSGGMGFR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCATAAGATCTGGAATCGGGACCACACTATCATAAGCAATGTGGGCTTCAATATGTTCCTCTGCAAATTCAAGAACGGACGTATTAAGAACCATATCATGGAAGCTGG
ACCTTGGTTCTTTGATAAAGCACTTATGATACTCAAGGAACCTAAAGGCGGAAATTTTGGAGACGATATTGAATTCAGGTATGTTTCATTTTGGATTAATTTTCATAAAT
TGCCATTTGTGTGCTTTTCCAAGAAGGTGGCAGTGGAGATTGGGAGCCTTCTTGGCAAAGTGGATCAAGTGGATTTAGATGAGAAGATTGACCAAAAATGGGGTGGCTCC
TTGCGAATGAAGATCCAAATTGATGTGGGAAAACCTCTGAAACGTGGAATATTTCTGGAATCAAACATGGGTGAGGATAGGTGGATCCCAATCTCGTATGAAAAATCGAA
TTTCTGCTACGGATGTGGATACGTTGGTCATATCATCAAAGAGTGTGAATGTGATATGCCGTCTGGATCATCCTCAGAGGAACTCCCTTACGGTGATTGGCTCTATGAGC
CTCCCTTCTTCTTCAAAGGGCGAGAGACAACATCTCCAGCTAGATTAAATGCTACAATGGTTGGTCGTGGAAGGGGTAGAACTTGGGAGGGAGGAAGAAGTGATTGGAGG
AAACGGTTTTCTGATGATAATAGCAAAGATCCAAAAGATTATACTCAGCAGAACAAAGATGGGGAGGACAATAGTTGCCCAGATTCCAGCGGATTTCAACCTCCGGTGAA
GTCTCCGGCAGCTCCACATCCTCCGACAGTTCCAACGGGTGAATTACTAGAAATGGTAACTGCGATTTGCGAGGATTCCGGAATTAATGAGGGGGTTTCAGCAAAAGGAA
AAAATGAAATCTCACAGAAGGCAAGCGGTAGCAGCTCAAATAACGGTCCAGTAATTGGAGGTGCATTGAATGAGCAAATGGAAGTGGATGATGTTGGGCTTGGGCCTCAT
GCTGATGTGATTCTACAACCTGAGATAAGGGATAATTATGGGATGGCAAAGCAAAACACTCAAGTAAAAGAGGGGCTGGGGAACCCTCGGACATTAAGAGGGTTGTGGCA
TATCATCCGCAGTCATAAGTCCCAACTGGTTTTCTTAATGGAAACTAGATGTCAGTACAATGGAGGTGATAGAATTCGTTATGATTTACAATATGACTGTAGTATATCAA
TTCCAAGTGTTGGCCAAAGTGGAGGGTTCATGTTGCTTTGGCAGAAAGAGACTGATGTTACGGTTCGTTCATACTCTAAGGGGCATATTGATGTTATTATTCAAAAGAAG
GATTTTATGTGGAGATTCACGGGTATCTATGGCAACCCTCGTAGGGAGCTCCATCATGAAACTTGGACTCTGATTAAAAGACTTCGGGATGACAGTGATATTCCATGGGT
CCTGGGGGTAGACTTTAATGAGATGACAAGTGGCTCAGAAAAGCAAGGAGGAGTGCCCCGACAGGAGAGTGTTATGCAAGATTTCAGGGATGTGATTGACGTATGTGGCC
TCATTGATCCCGGCTACTCTGGCCCTGATTTCACATGGTGCAACAATCAGTTTAATGAGCCCATAAACCTAAGATTGGTAGCTCCAAGACGGCCAAGGAGATTCGAAGAA
GCCTGGTGTAAATATGAGAAATGCAGAGATATAGTTCAACAGGTTTGGCAATCCCGGGGGAATATTGATGTTTCTAATTTTGTTGTTAAAACCCAACTGTGTATTGATCG
TTTATCTCATTGGAGCCGACTTAAATATGGGTGTTCTATCAGAGGTGCTATAGCCAGAATTGAAAAGGAGATTCAAGGGCTTTCTAATTTGGAAGGAGGCACATATCAGA
ATATGCTTTTGGAGAAAGAGACAGAACTTGATAACCTCTTGGAAGATGATGAAATCTACTGGAAGCAACGAGCTCGAGAGGACTGGTTGAAATGGGGGGACTGCAATACA
AAGTGGTTTCACATGAGAGCAAATAACAGGAGGAAAAGAAATAAAATCAGAAGGTTGATGGATGATCTAGGGATTTGGACTGAAGAGGATAACGAAATGGAGGCTATTGC
AAACCGGTACTTTAGAACTCTCTTCCAATCCTCTGAACCAAATTTGGAGGCTATAGACAAGATCTTGGAATCAACTCCAACATCGATCATGGGTGACCAAAATAATGAAC
TTCTGAGAAGTTTCACCAAGGAAGAAATTTTGGGAGTTGTCAGAGGTATGCATCCTACGAAAGCCCTTGGCCCTGATGGGCTTCAAGCGGTATTTTATAAAAAATTTTGG
GATATTGTAGGGGACGATACATACAAGATTTGTCTGCAGTACTTGAATGGAGGTCTCTCTATAAGAGACATCAACAAAACGTTCATAGTCCTAATTCCTAAAGTGAGGGA
TCCTAAATCCATGAAGGACTTTCGACCCATTAGTCTGTGCAATGTAATTTATAAGATCATTGCTAAGACGCTTGCCAACAGACTGAAACAGTTGCTCGACAAGATTATCT
CGCCAAACCAATCAGCTTTTGTGCCAAGCCGCCTTATCACAGACAATGCCATATTGGGCTTTGAATGTATTCATGCTGTTAATAGCAGAAAAAAGGGAAAGGATGGAGTT
GTGACGCTTAAACTTGATATGAGTAAGGCGGCTGTAGGCCTTTTGAACCAAGCTGATAATAACAGGGTGTTGACAGATTGCAGGGCTATTCAGGACATCCTCGATACATA
TGCAAAAGCTTCTGGCCAAACAATTAATCTTGATAAATATAACTTTATGGTTAGCCCTAACACAAAAGAAGAGATGGCGACGATCATCAAGGAGGTCCTACAAGTCCAAT
TTACCAAAAGCTTAGGCCAATATTTAGGCCTACCCTCGCAGAATGCTAGAAACAAGAAGAAGATCTTCAGTGGCATAAAGGATCGGGTTTGGAAAGCTCTTCAAGGATGG
AAATGGAAGCTTTTTTTTTCAATTGATGGCAAAGAAATTCTTATTAAATCTATTGCCCAGACTATTCCTAATTATGCAATGAGCTGCTTTAAGTTTCCTCTAACACTTGG
TAATGAATTAAATGCCTTATGTGCTAGGTTTTGGTGGGGGACGACAGAATCGAAAAAGAAAATCCATTGGCGTAGCTGGGACAAACTTTGCCACAACAAGAATTCAGGAG
GAATGGGCTTTCGATATAGATGGAGAATTGGAAATGGCTTAAGTGTGGATGTTGTTTATGAACCTTGGATTCTGAAAGAGGGTTCTGCTAAGCCAATTCTAGTCTGTCCT
GATATATGTCATCTTACTGTGGCTCATCTCAAAGATTTGAACGGGTTGTGGAATGAGAATATAATTAGAGAGAGTTTTTTAGACACAGATGCTGAGGCAATTTTGAATAT
GCCATCTAGTTCCACCATGGGCGAGGACGAGATCATTTGGAATTATGATGTTAAAGGGAAGTATTCGGTTAAAAGCGCGTATCGCTTAGGCTTTAATGTTAAACAGAGTG
GTCACACTTCGTCCTCTAACTATAAAGCTCAAGAGCGAATGTGGAAAAGCTACTGGAAGATCAATCTACCTCAGAAAATAAAAATATGCGGGGAGAAGCCTGAAGATTTG
CGGCATCTCTTTTGGGAGTGCAAGTTCACAAAAGAGGTGTGGCACAAATATTTCCCTATTTCTAATTACTTGTTTCAGAATCTCATTATCCACAACAAACAAATTCCCAA
AACAGAGTGTTTACAGCAGCATATCAACAAAGTAATATCCGAGCTTATAGGGGAAGAAGATATGTACCTGCCAAGTGAATTGATTGCGCCTCTAGAATCTCACCGAAATG
GCAGCTGGAGTAGCAACGAAATGACGCCTCGGGCTGGATGGCTGGCGATCCCCAACGGCATCTGGAAAATCAACTGTGATGCCTCATGGAGTGACGAGAACCAACAAGGA
GGAGTTGGATGGATTTTGCGTGATTGGAGAGGATACCCCCTGTCGGCTGGTTACCACATTTTTAATCGTCGATGGAATGTTACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCATAAGATCTGGAATCGGGACCACACTATCATAAGCAATGTGGGCTTCAATATGTTCCTCTGCAAATTCAAGAACGGACGTATTAAGAACCATATCATGGAAGCTGG
ACCTTGGTTCTTTGATAAAGCACTTATGATACTCAAGGAACCTAAAGGCGGAAATTTTGGAGACGATATTGAATTCAGGTATGTTTCATTTTGGATTAATTTTCATAAAT
TGCCATTTGTGTGCTTTTCCAAGAAGGTGGCAGTGGAGATTGGGAGCCTTCTTGGCAAAGTGGATCAAGTGGATTTAGATGAGAAGATTGACCAAAAATGGGGTGGCTCC
TTGCGAATGAAGATCCAAATTGATGTGGGAAAACCTCTGAAACGTGGAATATTTCTGGAATCAAACATGGGTGAGGATAGGTGGATCCCAATCTCGTATGAAAAATCGAA
TTTCTGCTACGGATGTGGATACGTTGGTCATATCATCAAAGAGTGTGAATGTGATATGCCGTCTGGATCATCCTCAGAGGAACTCCCTTACGGTGATTGGCTCTATGAGC
CTCCCTTCTTCTTCAAAGGGCGAGAGACAACATCTCCAGCTAGATTAAATGCTACAATGGTTGGTCGTGGAAGGGGTAGAACTTGGGAGGGAGGAAGAAGTGATTGGAGG
AAACGGTTTTCTGATGATAATAGCAAAGATCCAAAAGATTATACTCAGCAGAACAAAGATGGGGAGGACAATAGTTGCCCAGATTCCAGCGGATTTCAACCTCCGGTGAA
GTCTCCGGCAGCTCCACATCCTCCGACAGTTCCAACGGGTGAATTACTAGAAATGGTAACTGCGATTTGCGAGGATTCCGGAATTAATGAGGGGGTTTCAGCAAAAGGAA
AAAATGAAATCTCACAGAAGGCAAGCGGTAGCAGCTCAAATAACGGTCCAGTAATTGGAGGTGCATTGAATGAGCAAATGGAAGTGGATGATGTTGGGCTTGGGCCTCAT
GCTGATGTGATTCTACAACCTGAGATAAGGGATAATTATGGGATGGCAAAGCAAAACACTCAAGTAAAAGAGGGGCTGGGGAACCCTCGGACATTAAGAGGGTTGTGGCA
TATCATCCGCAGTCATAAGTCCCAACTGGTTTTCTTAATGGAAACTAGATGTCAGTACAATGGAGGTGATAGAATTCGTTATGATTTACAATATGACTGTAGTATATCAA
TTCCAAGTGTTGGCCAAAGTGGAGGGTTCATGTTGCTTTGGCAGAAAGAGACTGATGTTACGGTTCGTTCATACTCTAAGGGGCATATTGATGTTATTATTCAAAAGAAG
GATTTTATGTGGAGATTCACGGGTATCTATGGCAACCCTCGTAGGGAGCTCCATCATGAAACTTGGACTCTGATTAAAAGACTTCGGGATGACAGTGATATTCCATGGGT
CCTGGGGGTAGACTTTAATGAGATGACAAGTGGCTCAGAAAAGCAAGGAGGAGTGCCCCGACAGGAGAGTGTTATGCAAGATTTCAGGGATGTGATTGACGTATGTGGCC
TCATTGATCCCGGCTACTCTGGCCCTGATTTCACATGGTGCAACAATCAGTTTAATGAGCCCATAAACCTAAGATTGGTAGCTCCAAGACGGCCAAGGAGATTCGAAGAA
GCCTGGTGTAAATATGAGAAATGCAGAGATATAGTTCAACAGGTTTGGCAATCCCGGGGGAATATTGATGTTTCTAATTTTGTTGTTAAAACCCAACTGTGTATTGATCG
TTTATCTCATTGGAGCCGACTTAAATATGGGTGTTCTATCAGAGGTGCTATAGCCAGAATTGAAAAGGAGATTCAAGGGCTTTCTAATTTGGAAGGAGGCACATATCAGA
ATATGCTTTTGGAGAAAGAGACAGAACTTGATAACCTCTTGGAAGATGATGAAATCTACTGGAAGCAACGAGCTCGAGAGGACTGGTTGAAATGGGGGGACTGCAATACA
AAGTGGTTTCACATGAGAGCAAATAACAGGAGGAAAAGAAATAAAATCAGAAGGTTGATGGATGATCTAGGGATTTGGACTGAAGAGGATAACGAAATGGAGGCTATTGC
AAACCGGTACTTTAGAACTCTCTTCCAATCCTCTGAACCAAATTTGGAGGCTATAGACAAGATCTTGGAATCAACTCCAACATCGATCATGGGTGACCAAAATAATGAAC
TTCTGAGAAGTTTCACCAAGGAAGAAATTTTGGGAGTTGTCAGAGGTATGCATCCTACGAAAGCCCTTGGCCCTGATGGGCTTCAAGCGGTATTTTATAAAAAATTTTGG
GATATTGTAGGGGACGATACATACAAGATTTGTCTGCAGTACTTGAATGGAGGTCTCTCTATAAGAGACATCAACAAAACGTTCATAGTCCTAATTCCTAAAGTGAGGGA
TCCTAAATCCATGAAGGACTTTCGACCCATTAGTCTGTGCAATGTAATTTATAAGATCATTGCTAAGACGCTTGCCAACAGACTGAAACAGTTGCTCGACAAGATTATCT
CGCCAAACCAATCAGCTTTTGTGCCAAGCCGCCTTATCACAGACAATGCCATATTGGGCTTTGAATGTATTCATGCTGTTAATAGCAGAAAAAAGGGAAAGGATGGAGTT
GTGACGCTTAAACTTGATATGAGTAAGGCGGCTGTAGGCCTTTTGAACCAAGCTGATAATAACAGGGTGTTGACAGATTGCAGGGCTATTCAGGACATCCTCGATACATA
TGCAAAAGCTTCTGGCCAAACAATTAATCTTGATAAATATAACTTTATGGTTAGCCCTAACACAAAAGAAGAGATGGCGACGATCATCAAGGAGGTCCTACAAGTCCAAT
TTACCAAAAGCTTAGGCCAATATTTAGGCCTACCCTCGCAGAATGCTAGAAACAAGAAGAAGATCTTCAGTGGCATAAAGGATCGGGTTTGGAAAGCTCTTCAAGGATGG
AAATGGAAGCTTTTTTTTTCAATTGATGGCAAAGAAATTCTTATTAAATCTATTGCCCAGACTATTCCTAATTATGCAATGAGCTGCTTTAAGTTTCCTCTAACACTTGG
TAATGAATTAAATGCCTTATGTGCTAGGTTTTGGTGGGGGACGACAGAATCGAAAAAGAAAATCCATTGGCGTAGCTGGGACAAACTTTGCCACAACAAGAATTCAGGAG
GAATGGGCTTTCGATATAGATGGAGAATTGGAAATGGCTTAAGTGTGGATGTTGTTTATGAACCTTGGATTCTGAAAGAGGGTTCTGCTAAGCCAATTCTAGTCTGTCCT
GATATATGTCATCTTACTGTGGCTCATCTCAAAGATTTGAACGGGTTGTGGAATGAGAATATAATTAGAGAGAGTTTTTTAGACACAGATGCTGAGGCAATTTTGAATAT
GCCATCTAGTTCCACCATGGGCGAGGACGAGATCATTTGGAATTATGATGTTAAAGGGAAGTATTCGGTTAAAAGCGCGTATCGCTTAGGCTTTAATGTTAAACAGAGTG
GTCACACTTCGTCCTCTAACTATAAAGCTCAAGAGCGAATGTGGAAAAGCTACTGGAAGATCAATCTACCTCAGAAAATAAAAATATGCGGGGAGAAGCCTGAAGATTTG
CGGCATCTCTTTTGGGAGTGCAAGTTCACAAAAGAGGTGTGGCACAAATATTTCCCTATTTCTAATTACTTGTTTCAGAATCTCATTATCCACAACAAACAAATTCCCAA
AACAGAGTGTTTACAGCAGCATATCAACAAAGTAATATCCGAGCTTATAGGGGAAGAAGATATGTACCTGCCAAGTGAATTGATTGCGCCTCTAGAATCTCACCGAAATG
GCAGCTGGAGTAGCAACGAAATGACGCCTCGGGCTGGATGGCTGGCGATCCCCAACGGCATCTGGAAAATCAACTGTGATGCCTCATGGAGTGACGAGAACCAACAAGGA
GGAGTTGGATGGATTTTGCGTGATTGGAGAGGATACCCCCTGTCGGCTGGTTACCACATTTTTAATCGTCGATGGAATGTTACCTAG
Protein sequenceShow/hide protein sequence
MHKIWNRDHTIISNVGFNMFLCKFKNGRIKNHIMEAGPWFFDKALMILKEPKGGNFGDDIEFRYVSFWINFHKLPFVCFSKKVAVEIGSLLGKVDQVDLDEKIDQKWGGS
LRMKIQIDVGKPLKRGIFLESNMGEDRWIPISYEKSNFCYGCGYVGHIIKECECDMPSGSSSEELPYGDWLYEPPFFFKGRETTSPARLNATMVGRGRGRTWEGGRSDWR
KRFSDDNSKDPKDYTQQNKDGEDNSCPDSSGFQPPVKSPAAPHPPTVPTGELLEMVTAICEDSGINEGVSAKGKNEISQKASGSSSNNGPVIGGALNEQMEVDDVGLGPH
ADVILQPEIRDNYGMAKQNTQVKEGLGNPRTLRGLWHIIRSHKSQLVFLMETRCQYNGGDRIRYDLQYDCSISIPSVGQSGGFMLLWQKETDVTVRSYSKGHIDVIIQKK
DFMWRFTGIYGNPRRELHHETWTLIKRLRDDSDIPWVLGVDFNEMTSGSEKQGGVPRQESVMQDFRDVIDVCGLIDPGYSGPDFTWCNNQFNEPINLRLVAPRRPRRFEE
AWCKYEKCRDIVQQVWQSRGNIDVSNFVVKTQLCIDRLSHWSRLKYGCSIRGAIARIEKEIQGLSNLEGGTYQNMLLEKETELDNLLEDDEIYWKQRAREDWLKWGDCNT
KWFHMRANNRRKRNKIRRLMDDLGIWTEEDNEMEAIANRYFRTLFQSSEPNLEAIDKILESTPTSIMGDQNNELLRSFTKEEILGVVRGMHPTKALGPDGLQAVFYKKFW
DIVGDDTYKICLQYLNGGLSIRDINKTFIVLIPKVRDPKSMKDFRPISLCNVIYKIIAKTLANRLKQLLDKIISPNQSAFVPSRLITDNAILGFECIHAVNSRKKGKDGV
VTLKLDMSKAAVGLLNQADNNRVLTDCRAIQDILDTYAKASGQTINLDKYNFMVSPNTKEEMATIIKEVLQVQFTKSLGQYLGLPSQNARNKKKIFSGIKDRVWKALQGW
KWKLFFSIDGKEILIKSIAQTIPNYAMSCFKFPLTLGNELNALCARFWWGTTESKKKIHWRSWDKLCHNKNSGGMGFRYRWRIGNGLSVDVVYEPWILKEGSAKPILVCP
DICHLTVAHLKDLNGLWNENIIRESFLDTDAEAILNMPSSSTMGEDEIIWNYDVKGKYSVKSAYRLGFNVKQSGHTSSSNYKAQERMWKSYWKINLPQKIKICGEKPEDL
RHLFWECKFTKEVWHKYFPISNYLFQNLIIHNKQIPKTECLQQHINKVISELIGEEDMYLPSELIAPLESHRNGSWSSNEMTPRAGWLAIPNGIWKINCDASWSDENQQG
GVGWILRDWRGYPLSAGYHIFNRRWNVT