; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy07g006100 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy07g006100
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr07:31471000..31473288
RNA-Seq ExpressionLcy07g006100
SyntenyLcy07g006100
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.3e-18444.04Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M  FRD ++  G  D G+ G  +TW N     + I  RLDR L   D   +   +KV HL     DH  +L     DN  +   R KR   FE  W K E
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVW---EQARNQEGMT---------LIQKTSTYQQSI----------------------IDLETN--QRKLESLLEDEEVYWKQRSREEWLNW
        +C+ I+   W         EG++         L + +ST    I                      + LE N  + ++ +LL+DEE YW QR++  WL  
Subjt:  ECRNIVNQVW---EQARNQEGMT---------LIQKTSTYQQSI----------------------IDLETN--QRKLESLLEDEEVYWKQRSREEWLNW

Query:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA
        GDRNTK+FH +AS+RR  N I  + +  G W +++  + +    YF  ++ SS+P    I+ + E IP  ++E  N  L++EFT+EEV   + ++HP+KA
Subjt:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA

Query:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL
        PGPDG+ A+F+QK+W  VG++V  + L +L     I  LN+T ISLIPK  +P  M DFRPISLC+V+YK+I+K+LANRLK +L  IIS +QSAF   RL
Subjt:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL

Query:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL
        I+DN ++ FE +H + +K  GKEG  A+KLDMSKA+DRVEW +I K++E+MGF NRW +++M+C+ SV + +++NG+      P+RGLRQGDPLSP LFL
Subjt:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL

Query:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK
        +CAEGLS ++N   ++K  TG+ IN+ CP +THLF+ADDS+LF KA+ ++C  ++ +L  YE ASGQ IN +KSS    PNT     + I N LG     
Subjt:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK

Query:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC
           +YLGLPS + R K +VF  +K++V   L GWKGK  S GGKEILIK+VAQAIP Y MSCF  P  LC+++  +   FWWG  ++  K+ W  W  +C
Subjt:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC

Query:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
         +K  GGLGFR+L  FN AML+KQ+WRI+ NP SL+ +VL+ RYF  G+ + A LG++PSY+W
Subjt:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]7.7e-18543.96Show/hide
Query:  NKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYEECRNIVNQVW---EQARNQEGMTL--
        N       ++ RLDR L   D +    D+KV HL    SDH  +L     D    ++   +R  +FE  W + E+C++I+  VW    +  +  G+    
Subjt:  NKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYEECRNIVNQVW---EQARNQEGMTL--

Query:  ---------------------IQKTSTYQQSIIDLETN----------QRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRN
                             IQ+      +++  + N          ++++  LL+ EE+ W+QRSR +WL  GDRNTK+FH KAS RR  N I  + +
Subjt:  ---------------------IQKTSTYQQSIIDLETN----------QRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRN

Query:  AMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLC
          GNW +   G+ ++   YFQT++ SS P    I  +L+ IPT+++E  N  L++EFTREE++  +++MHP+KAPGPDG+ A+F+QK+W+ VG+D+  + 
Subjt:  AMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLC

Query:  LGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVA
        L +L    S+  +N+T I+L+PKIK+P  M DFRPISLC+V+YK+I+KVLANRLK +L  IIS +QSAF+ GRLI+DN ++ FE +H +++K+ GKEG A
Subjt:  LGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVA

Query:  ALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINK
        A+KLDMSKAYDRVEW +I++++EKMGF  +WI ++M C+ SV + +++NG      +P RGLRQGDP+SPY+FL+CA+G S +LN   +    +G+ I +
Subjt:  ALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINK

Query:  YCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVFDNIKDR
         CP ITHLF+ADDSLLF KA+ ++C+ +  +L +YE ASGQ IN +KSS     NT   +   +   LG        +YLGLPS + + K E+F  +K+R
Subjt:  YCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVFDNIKDR

Query:  VWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSW
        V + L GWK K  S GG+EILIK+VAQAIP Y MSCF+ P +LC E+ ++  RFWWG   +  KI W  W  LC  KK GG+GFR+L  FN AML+KQ W
Subjt:  VWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSW

Query:  RIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
        R+I NP SL+AQ+ + RY+ +G+  +A LG +PSYTW
Subjt:  RIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

XP_030923017.1 uncharacterized protein LOC115949892 [Quercus lobata]6.5e-18442.6Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M++FRDC+D  GL D GFSGL FTWCN+ Y   L+W RLDR +  +D +++    ++ HLP  +SDH+P+  V   D+   R  R K+P RFE  W+  E
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVWEQ------------------------------------ARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNW
         C  +V+ VW++                                    ARN++ + + +  S   ++   ++    ++  L++ EE  W QRS+ EWL +
Subjt:  ECRNIVNQVWEQ------------------------------------ARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNW

Query:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA
        GD+NTK+FH +A++R   N I  L NA G W+E +  + E++  Y+  LF + NP    ++ +L  +   +SE  N EL+K F  EEV   + +M P  A
Subjt:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA

Query:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL
        PGPDG   +FY+ FW+ VG +V +  L +L       HLN T+ISLIPK K P  + +FRPISLC+VLYK+IAKVLANRLK +L  +IS SQSAF+  RL
Subjt:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL

Query:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL
        I+DN +I  E LH +K KR GK G  ++KLDMSKAYDRVEW Y+ KI+E+MGF+ +WIN+I  C+ SV + ++LNG P    +P RGLRQGDPLSPYLFL
Subjt:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL

Query:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK
        +  EGL  +    + S D  G+ +    P I+HL +ADDSL+F +A+  +   I+ +L  YE ASGQ IN  K++     NT+      IK  LG+    
Subjt:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK

Query:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC
           QYLGLPS + R KK+ F  IK+R+WK L+GWK +  S  G+EIL+K+V QAIP Y MSCF+ P  L +++ ++  +FWWG   + +KIHW  W  LC
Subjt:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC

Query:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
          KK+GG+GF++L  FN +ML+KQ WR+  N E L  +V + ++F NG+  ++ +    SY W
Subjt:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]9.1e-18644.76Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        ++ FR+ +    L D GF G  +TW NK         RLDR + N +   R    +VVHL   ASDH P+L      + SQ      R  +FEE W+  +
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVWEQA-RNQEGMTLIQ--------KTSTYQQSIIDLETN----------------------------QRKLESLLEDEEVYWKQRSREEWLN
        EC  ++ + W     N++G+  +Q        +   +  SI D +T                              +K++ LL+ +E+YW QRSR  WL 
Subjt:  ECRNIVNQVWEQA-RNQEGMTLIQ--------KTSTYQQSIIDLETN----------------------------QRKLESLLEDEEVYWKQRSREEWLN

Query:  WGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSK
         GDRNTK+FH KAS+RR  N I+ +RN+ G WVE+   + ++   YF  LFQ+   D   ++  L+ + T ++E     L  +FT EEV+  + +M P+K
Subjt:  WGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSK

Query:  APGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGR
        APGPDG+ A+FYQKFW  VGD V    L  L     +  +N T I LIPK+++P  M +FRPISLC+V+YKII+KVLANRLK+VL  IIS +QSAFVPGR
Subjt:  APGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGR

Query:  LISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLF
        LI+DN ++ +E LH +  +++GK+G  ALKLD+SKAYDRVEW +++ I+EKMGF   WI  +M CV +  F +++NG P     P+RG+RQGDP+SPYLF
Subjt:  LISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLF

Query:  LICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ
        L+CAEGL+ +LN  E +   TG+ I +  P IT+L +ADDSLLF +A+  +   I  +L IYERASGQ+IN EKSS     NT+  Q   I   LGV+  
Subjt:  LICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ

Query:  KSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSL
            +YLGLP+ + R K   F  +KDRVWK LQGWKG   S  GKEILIK+VAQAIP Y MS F+ P+ LC+EL ++CARFWWG     RKIHW+ W  L
Subjt:  KSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSL

Query:  CLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
           KK+GG+GFRDL  FN AML+KQ WR+++  +SLL +  + RYF   +F+EA    N S+ W
Subjt:  CLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata]2.9e-18443.59Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVW-SEDNGSQRENRKKRPRRFEECWVKY
        M+ FRDC+D  G  D GF+GL FTWCN  +   L+W RLDR L +++ +++   +++ HL   +SDH+PI   W   D+  +R  R ++P RFEE W+K 
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVW-SEDNGSQRENRKKRPRRFEECWVKY

Query:  EECRNIVNQVWEQARNQEGM---------------------------TLIQKTSTYQQSIIDLETNQ---------RKLESLLEDEEVYWKQRSREEWLN
        E C  +VN  W+ +     M                           TL+QK     ++ I     Q          ++  LL+ EE  W QR++ +WL 
Subjt:  EECRNIVNQVWEQARNQEGM---------------------------TLIQKTSTYQQSIIDLETNQ---------RKLESLLEDEEVYWKQRSREEWLN

Query:  WGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSK
        +GDRN+K+FH +AS R   N I  L +  G WV+++  + E++N+Y+ +LF SSNP     D  L  +   ++   N  L + F   EV+  +++M  + 
Subjt:  WGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSK

Query:  APGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGR
        +PGPDG   +FY+++W  +G +V    +G+L      H LN T+++LIPKIK P  + DFRPISL +VLYK+IAKVLANRLKK L  +IS +QSAFVPGR
Subjt:  APGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGR

Query:  LISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLF
        LI+DN +I  E LH +K+KR GK G+ ALKLDMSKAYDRVEW ++ KI+E +GF  RWI++I  C+ SV + ++LNG P    SP+RGLRQGDPLSPYLF
Subjt:  LISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLF

Query:  LICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ
        L+  EGL G+L   E S    G+ +    P I+HL +ADDSL+F +AS  DC+ I+ +L IYE ASGQ IN  K++     NT     E IKN L V   
Subjt:  LICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ

Query:  KSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSL
        +   QYLGLPS + R KK  F  IK+R+WK L+GWK K  S  G+EILIK+V QAIP Y MSCFK P  L NE+  +  +FWWG   + ++IHW  W  L
Subjt:  KSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSL

Query:  CLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
        CL K  GG+GFR+L  FN ++L+KQ WR+  N  SL   V + ++F   +  EA      S+ W
Subjt:  CLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

TrEMBL top hitse value%identityAlignment
A0A2N9GIC4 Reverse transcriptase domain-containing protein9.5e-18942.75Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M+ FR+ +D  GL D GF+G  FTWCN     +  W RLDR +  +D L+R    +V H+ +I SDH+    +W         N K+RP RFEE W+   
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVWEQARNQEGMTLIQKTSTYQQSIID--------------------------------------LETNQRKLESLLEDEEVYWKQRSREEWL
         C   + + W   R   G  + Q T   ++                                          L++ ++KL SL E EE  W+QRSR  WL
Subjt:  ECRNIVNQVWEQARNQEGMTLIQKTSTYQQSIID--------------------------------------LETNQRKLESLLEDEEVYWKQRSREEWL

Query:  NWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPS
          GDRNTK+FH +A++R   N I  L++ M  W + + GM  ++ +Y+ +LF +S+PD   I  ++ H+P  ++E  N+ L+++FT  EV+  + +M P+
Subjt:  NWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPS

Query:  KAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPG
        KAPGPDG+  +FYQKFW  VG DV K  L  L     +  +N T+ISLIPK K+P  + +FRPISLC+V+YK+I+KVLANRLK +L  ++S SQSAFVPG
Subjt:  KAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPG

Query:  RLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYL
        RLI+DN ++ FE LH + + + G++G  ALKLDMSKAYDRVEW ++ KI+ +MGF  +WI+I++ C+ +V + +++NG P     P+RGLRQGDPLSPYL
Subjt:  RLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYL

Query:  FLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRH
        FL+CAEGL  ++       D  G+ + +  P I+HLF+ADDSLLF KA+   C  I+ +L  YE+ASGQ +N +K++     NT  A   +IK  L V  
Subjt:  FLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRH

Query:  QKSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSS
         +   +YLGLPS + R +   F  IK+RVW+ L+GWK K  S  G+EILIK+VAQAIP Y+MSCFK P  LCNEL ++  RFWW ++ + RKIHW  W  
Subjt:  QKSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSS

Query:  LCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
        LC  K +GG+GFRD+  FN A+L+KQ WR++ +  SL  +V + ++F +G+ ++ P     SY+W
Subjt:  LCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

A0A2N9H567 Reverse transcriptase domain-containing protein1.5e-19447.1Show/hide
Query:  FRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYEECR
        FR  +    L D GF G  FTW N  Y +D ++ERLDR    +D      + ++ H+P   SDH  +  V + +    R N   +  RFE  W++ + C 
Subjt:  FRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYEECR

Query:  NIVNQVWEQARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGME
         ++   W Q   Q G  + Q     ++ I++  T +++L  LL  EE YW+QRSR  W+  GDRNT++FH  AS+RR  N I  L +  G        + 
Subjt:  NIVNQVWEQARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGME

Query:  EMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHL
         +  QYF+ +F +S+P  + +D +   +   ++ + N +L K FTREE++  + +M+P+KAPGPDG+ A+FYQKFW  VG DV    L   K    +  +
Subjt:  EMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHL

Query:  NRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRV
        N T+ISLIPK K P+ M  FRPISLC+VLYKII+KVLANRLKKVL+ +IS +QSAFVPGRLI+DN ++ FE LH +K KR+GK    A+KLDMSKAYDRV
Subjt:  NRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRV

Query:  EWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADD
        EW +IR ++ KMGFD++W+++IM+C++SV + +++NG P     P+RG+RQGDPLSPYLFLICAEGL+ +L + E S    GL I +  P I HLF+ADD
Subjt:  EWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADD

Query:  SLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFF
        SLLF++A+  + + +  +L IYE+ASGQ +NYEK+S     NT+      I   L       LG+YLGLP  + R+KK+ F  IK +V K LQGWKGK  
Subjt:  SLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFF

Query:  SFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQV
        S  G+EILIKSVAQAIP + MSCF+ P SLC E+NS+  RFWWG  +  RKIHW++WSSLC  K  GG+GFRDL  FNQA+L+KQ WRI++N  +LL +V
Subjt:  SFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQV

Query:  LRGRYFKNGNFIEAPLGNNPSYTW
        L+ +YF + +F+EA + ++ S+TW
Subjt:  LRGRYFKNGNFIEAPLGNNPSYTW

A0A2N9HWM9 Reverse transcriptase domain-containing protein1.1e-18944.3Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M  FR+ ++   L D GF G  FTW N   Q + + ERLDR +  +  +       + H     SDH  +L + ++        RKKR   FE  W++ E
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVWE-QARNQEGMTLIQKTSTYQQSIIDLETNQ-----------------------------------RKLESLLEDEEVYWKQRSREEWLNW
         C  ++ Q WE Q        L+QK    + +++    +Q                                   R L  LL  EE+YW+QRSR  WL  
Subjt:  ECRNIVNQVWE-QARNQEGMTLIQKTSTYQQSIIDLETNQ-----------------------------------RKLESLLEDEEVYWKQRSREEWLNW

Query:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA
        GDRNT +FH  A++R+  N I  +R++   W  DDVG+E +V+ YF  ++ SSNP   +ID + + +   +S   N +LM  FTREEV+  + +M PSKA
Subjt:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA

Query:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL
        PGPDG+ A+F+QKFW  VG DV    L  L     +  LN T+I+LIPK+K P  M  FRPISLC+VLYKII+KVL NR+K +L  ++S SQSAFVPGR+
Subjt:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL

Query:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL
        ISDN +I FE LH +KNKR GK    A+KLDMSKAYDRVEW Y++K++ K+GF  RW+ +IM CV SV + +++NG P+    P+RGLRQGDPLSPYLFL
Subjt:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL

Query:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK
        ICAEGL+ +L   E+     G+ I +  P ++HLF+ADDSL+F +A++ +C+ ++ +L +YE ASGQ IN  K++     N +      I N  G     
Subjt:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK

Query:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC
           +YLGLP  + R KK+ F  IKDR+W+ LQGWK K  S  GK +LIK+V QAIP YAMSCFKFP  LC E++S+  RFWWG  + GRKIHW     LC
Subjt:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC

Query:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
          K++GG+GFRDL  FNQA+L++Q WR+++NP+SL+ + L+ +YF + +F+EA +  N SY W
Subjt:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

A0A7N2L6Z9 Reverse transcriptase domain-containing protein4.3e-18945.35Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M  FR+ I+  G  D G+SG  +TWCN    +  I+ RLDR L   D +    ++KV HL    SDH  +       N    +  + R   FE  W K E
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVW---EQARNQEGMT-----------------------LIQK--------TSTYQQSIIDLETN--QRKLESLLEDEEVYWKQRSREEWLNW
        +CR I+  VW         EGM                        LIQ+        T   +   + +E N  +R+L  LL+DEE++W QRS+  WL  
Subjt:  ECRNIVNQVW---EQARNQEGMT-----------------------LIQK--------TSTYQQSIIDLETN--QRKLESLLEDEEVYWKQRSREEWLNW

Query:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA
        GDRNTK+FH +AS+RR  N I  + +  G W ED   +      YF+ ++ +SNP    +D +   IPT I+E  N EL + FTREE+   + ++HP+K+
Subjt:  GDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKA

Query:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL
        PGPDG+ A+F+QK+WD VG +V  + L +L    S+  +N+T I LIPK  +P  M DFRPISLC+V+YK+I+K LANRLK  L  II+ +QSAF   RL
Subjt:  PGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRL

Query:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL
        I+DN +I +E +H +K+K+ GK+   A KLDMSKA+DRVEW +I +++ KMGF+  WI++IMRC+ SV + VI+NG       P RGLRQGDPLSPYLFL
Subjt:  ISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFL

Query:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK
        +CAEGLS +L+   +++   G+ + + CP ITHLF+ADDSLLF KA+ ++C  +K +L  YE ASGQ +N +KSS    PNT     E I N LG     
Subjt:  ICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQK

Query:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC
           +YLGLPS + R KK VF  IK+RV   L GWKGK  S GGKEILIK+VAQAIP Y MSCF  P SLC+EL  +   FWWG  ++  K+ W  W  +C
Subjt:  SLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLC

Query:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
          K  GGLGFR+L+ FN A+L+KQ+WRI+ NP SL A++L+ +YF  G+ + A LG+NPSYTW
Subjt:  LNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

A0A7N2LIH6 Uncharacterized protein2.1e-18844.09Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE
        M  FR+ +   GL D GF G  FTWCN  +       RLDR + N    +   + KV H+ + ASDH  +LA++     +QR  +K+    FEE W + E
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYE

Query:  ECRNIVNQVWEQARNQEGMT--------------------------LIQKTSTYQQ---------SIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWG
        EC+ IV   W+  R    M                           + QK +  QQ         +  +++T ++++  L   EEV WKQRSR  WL +G
Subjt:  ECRNIVNQVWEQARNQEGMT--------------------------LIQKTSTYQQ---------SIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWG

Query:  DRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAP
        D+N+K+FH  AS+RR  NRI  L + +G W ED    E+++  YF+ ++ S+ P   S D  LE +   ++   N EL KEF   EV   + +MHP+KAP
Subjt:  DRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAP

Query:  GPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLI
        GPDG+  +FYQK+WD VG  V    L  L        +N+TYI LIPK K+P  + +FRPISLC+V+YKII+KVLANRLKKVL  +I  +QSAFVPGR+I
Subjt:  GPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLI

Query:  SDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLI
        +DN I+ FE +H++  +R+GKEG+ A+KLDMSKAYDRVEW Y+  +++KMGF +RWI++IM CV SV F V++NG P+  F+P+RGLRQGDP+SPYLFL+
Subjt:  SDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLI

Query:  CAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKS
        C EGLS M+   E+     G+   +  P I+HLF+ADDS++F +A+  +C  + ++L +YE  SGQ +N +K+S     NT     E  K   G +  + 
Subjt:  CAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKS

Query:  LGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCL
          +YLGLP  + R KK+ F+ IKD+V + + GWKGK  S  G+E+LIK+VAQA P Y M+ FK P SLC ELNS+   FWWG   + +K+ W  W +LC 
Subjt:  LGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCL

Query:  NKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW
         K  GG+GF+DL  FN A+L+KQ WR+ +NP SL  +VL+ +YF N +F+EA LG  PSY W
Subjt:  NKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.2e-4223.93Show/hide
Query:  KYEECRNIVNQVWEQARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVE
        K ++ R+ ++ +  Q +  E        ++ +Q I  +    +++E+    +++   +    E +N  DR          K+R  N+I  ++N  G+   
Subjt:  KYEECRNIVNQVWEQARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVE

Query:  DDVGMEEMVNQYFQTLFQSSNPDTESIDGILE-HIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKG
        D   ++  + +Y++ L+ +   + E +D  L+ +    +++ +   L +  T  E+  +I+ +   K+PGPDG  A FYQ++ + +   + KL   I K 
Subjt:  DDVGMEEMVNQYFQTLFQSSNPDTESIDGILE-HIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKG

Query:  EESIHHLNRTYISLIPKI-KDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLD
            +      I LIPK  +D    ++FRPISL ++  KI+ K+LANR+++ +  +I   Q  F+PG     N       +  + N+ + K  V  + +D
Subjt:  EESIHHLNRTYISLIPKI-KDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLD

Query:  MSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSI
          KA+D+++  ++ K + K+G D  ++ II    +     +ILNG     F    G RQG PLSP LF I  E L+  +    + K+  G+++ K    +
Subjt:  MSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSI

Query:  THLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVF-DNIK---DRV
            +ADD +++ +      +N+ +++  + + SG  IN +KS   +  N    + +++         K + +YLG+  Q+ R  K++F +N K     +
Subjt:  THLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVF-DNIK---DRV

Query:  WKALQGWKGKFFSFGGKEILIKS--VAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQS
         +    WK    S+ G+  ++K   + + I  +     K P++   EL     +F W  + K  +I     S L    K GG+   D  ++ +A ++K +
Subjt:  WKALQGWKGKFFSFGGKEILIKS--VAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQS

Query:  WRIIRN
        W   +N
Subjt:  WRIIRN

P0C2F6 Putative ribonuclease H protein At1g657505.6e-2135.77Show/hide
Query:  LPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGG
        +P   +R  K+ F  I +RV   + GW+ K  SF G+  L K+V  ++P ++MS    P S+ N L+ +   F WG + + +K H  +WS +C  KK+GG
Subjt:  LPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGG

Query:  LGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRY
        LG R     N+A++SK  WR+++   SL   VL+ +Y
Subjt:  LGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRY

P11369 LINE-1 retrotransposable element ORF2 protein4.7e-4426.46Show/hide
Query:  IQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPT-SISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVG
        I ++RN  G+   D   ++  +  +++ L+ +   + + +D  L+      +++ Q   L    + +E++ VI+ +   K+PGPDG  A FYQ F + + 
Subjt:  IQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPT-SISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVG

Query:  DDVCKLCLGILKGEESIHHLNRTYISLIPK-IKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNK
          + KL   I       +      I+LIPK  KDP  +++FRPISL ++  KI+ K+LANR+++ + +II P Q  F+PG     N       +H + NK
Subjt:  DDVCKLCLGILKGEESIHHLNRTYISLIPK-IKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNK

Query:  RRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKD
         + K  +  + LD  KA+D+++  ++ K++E+ G    ++N+I          + +NG          G RQG PLSPYLF I  E L+  +    + K+
Subjt:  RRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKD

Query:  FTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNT---LGVRHQKSLGQYLGLPSQMERK
          G++I K    I+ L  ADD +++        R +  ++  +    G  IN  KS   +      A+ E+ + T   +   + K LG  + L  +++  
Subjt:  FTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNT---LGVRHQKSLGQYLGLPSQMERK

Query:  KKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKS--VAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRD
          + F ++K  + + L+ WK    S+ G+  ++K   + +AI  +     K P    NEL     +F W +         R   SL  +K+  GG+   D
Subjt:  KKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKS--VAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRD

Query:  LYIFNQAMLSKQSW
        L ++ +A++ K +W
Subjt:  LYIFNQAMLSKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.0e-4626.91Show/hide
Query:  RLDRFLLNSDMLIRA----------SDIKVVHLPLIASDHRPILAVWSEDNG-SQRENRKKRPRRFEECWVKYEECRNIVNQVWE---------------
        R+DR  ++S ++ RA          SD   V L +  +   P  A W  +N   + E   K  R     W  +++    +NQ W+               
Subjt:  RLDRFLLNSDMLIRA----------SDIKVVHLPLIASDHRPILAVWSEDNG-SQRENRKKRPRRFEECWVKYEECRNIVNQVWE---------------

Query:  ----------QARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWK---QRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVED
                  +A N E + L Q+ S  +   +  E  +RK E+L   E+   +    RSR + L   DR +++F+    K+ +  +I  L    G  +ED
Subjt:  ----------QARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWK---QRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVED

Query:  DVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEE
           + +    ++Q LF       ++ + + + +P  +SE +   L    T +E+   +  M  +K+PG DG+   F+Q FWDT+G D  ++     K  E
Subjt:  DVGMEEMVNQYFQTLFQSSNPDTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEE

Query:  SIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSK
              R  +SL+PK  D   +K++RP+SL S  YKI+AK ++ RLK VL  +I P QS  VPGR I DN  +  + LH     RR    +A L LD  K
Subjt:  SIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSK

Query:  AYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHL
        A+DRV+  Y+   ++   F  +++  +     S    V +N    A  +  RG+RQG PLS  L+ +  E    +L      K  TGL + +    +   
Subjt:  AYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHL

Query:  FYADDSLLFFKASD-KDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ--KSLGQYLGLPSQMERKKKEVFDNIKDRVWKAL
         YADD +L   A D  D    +    +Y  AS   IN+ KSS ++  +  V  +      +    +  K LG YL   S  E    + F  +++ V   L
Subjt:  FYADDSLLFFKASD-KDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQIEVIKNTLGVRHQ--KSLGQYLGLPSQMERKKKEVFDNIKDRVWKAL

Query:  QGWKG--KFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLG
          WKG  K  S  G+ ++I  +  +   Y + C         ++      F W         HW       L  K+GG G
Subjt:  QGWKG--KFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLG

P93295 Uncharacterized mitochondrial protein AtMg003102.4e-2750.45Show/hide
Query:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIE
        A+P YAMSCF+    LC +L S    FWW   +  RKI W  W  LC +K+  GGLGFRDL  FNQA+L+KQS+RII  P +LL+++LR RYF + + +E
Subjt:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIE

Query:  APLGNNPSYTW
          +G  PSY W
Subjt:  APLGNNPSYTW

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.8e-2225.93Show/hide
Query:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIR-ASDIKVVHLPLIASDHRPILAVWSEDNGSQRENR--------KKRPRR
        ++EF++C+  + L D    G+ +TW N H   + I  +LDR + N D      S I V  L  + SDH P + +   +N  +R  +           P  
Subjt:  MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIR-ASDIKVVHLPLIASDHRPILAVWSEDNGSQRENR--------KKRPRR

Query:  FEECWVKYEECRNIVNQVWE------------QARNQEGMTLIQKTSTYQQSIIDLETNQRKL-----ESLLEDEEV--------------YWKQRSREE
             V +EE   + + ++             +  N++G   IQ  +  ++++  LE+ Q +L     +SL   E V              +++Q+SR +
Subjt:  FEECWVKYEECRNIVNQVWE------------QARNQEGMTLIQKTSTYQQSIIDLETNQRKL-----ESLLEDEEV--------------YWKQRSREE

Query:  WLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNP--DTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISR
        WL  GD NT++FH      ++ N I+ LR      VE+   ++EM+  Y+  L  S +     +S+  I +  P   ++     L    + +E+   +  
Subjt:  WLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNP--DTESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISR

Query:  MHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKII
        M  +KAPGPD   A F+ + W  V D          +    +   N T I+LIPK+   + +  FRP+S C+V+YKII
Subjt:  MHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCSVLYKII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-1328.22Show/hide
Query:  QYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNK
        +YLGLP   ++     +  + +++   +  W  +  SF G+  LI SV  ++ N+ MS F+ P +   E++SIC+ F W   +   K     WS +C  K
Subjt:  QYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNK

Query:  KQGGLGFRDLYIFNQAMLSKQSWRIIRNP---ESLLAQVLRGRYFKNGNFIEAPL--GNNPSY
         +GGLG R L   N+       W I  N      +  ++L+ R   +G F++  +  G+N S+
Subjt:  KQGGLGFRDLYIFNQAMLSKQSWRIIRNP---ESLLAQVLRGRYFKNGNFIEAPL--GNNPSY

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-2842.73Show/hide
Query:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEA
        A+P Y M+CF  P ++C ++ S+ A FWW +  + + +HW+ W  L   K +GG+GF+D+  FN A+L KQ WR++  PESL+A+V + RYF   + + A
Subjt:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEA

Query:  PLGNNPSYTW
        PLG+ PS+ W
Subjt:  PLGNNPSYTW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.7e-2850.45Show/hide
Query:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIE
        A+P YAMSCF+    LC +L S    FWW   +  RKI W  W  LC +K+  GGLGFRDL  FNQA+L+KQS+RII  P +LL+++LR RYF + + +E
Subjt:  AIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRWSSLCLNKK-QGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIE

Query:  APLGNNPSYTW
          +G  PSY W
Subjt:  APLGNNPSYTW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.8e-1548.53Show/hide
Query:  ILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDS
        I+NG P+   +P+RGLRQGDPLSPYLF++C E LSG+    ++     G+R++   P I HL +ADD+
Subjt:  ILNGIPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGAATTTAGAGATTGTATTGATGGTGCTGGCCTGTGTGATGCTGGATTTTCAGGTCTGGCTTTTACTTGGTGCAATAAACACTATCAATCTGATCTTATATGGGA
AAGGCTGGATAGATTCTTACTAAACTCTGATATGCTCATTCGGGCGAGTGATATTAAGGTAGTTCATCTTCCCCTTATAGCTTCAGATCACAGACCAATCCTAGCGGTTT
GGTCAGAGGATAATGGGTCTCAAAGAGAAAATAGGAAGAAGAGGCCACGGAGGTTTGAAGAATGTTGGGTGAAATATGAAGAGTGCAGAAACATTGTAAACCAAGTTTGG
GAGCAAGCGAGAAATCAAGAAGGAATGACTCTGATACAGAAAACATCGACGTATCAACAGAGCATTATTGATTTGGAAACAAATCAAAGAAAATTAGAAAGTCTTTTAGA
AGATGAGGAGGTCTACTGGAAGCAAAGATCAAGAGAAGAATGGTTGAATTGGGGAGATAGAAATACAAAATGGTTCCATTTAAAGGCCAGTAAAAGAAGGTCAATTAACA
GAATTCAAAGGCTTCGAAATGCTATGGGAAACTGGGTGGAAGATGATGTAGGGATGGAAGAGATGGTGAATCAATACTTTCAGACCTTGTTTCAATCTTCTAATCCTGAT
ACAGAGTCTATTGATGGCATTTTGGAACACATTCCAACAAGTATTTCAGAGATGCAGAATAGGGAGTTAATGAAAGAGTTCACTCGAGAAGAGGTCAAATATGTGATTAG
TAGGATGCATCCATCAAAAGCTCCAGGTCCAGATGGGATCCAAGCAATGTTTTATCAAAAGTTTTGGGATACTGTTGGGGATGATGTGTGTAAGCTTTGTTTAGGTATTC
TAAAGGGGGAGGAATCCATCCATCACCTAAATAGAACTTATATCTCTTTAATCCCCAAGATCAAAGACCCAAACACTATGAAAGACTTTAGGCCAATCAGTTTGTGTTCT
GTTCTGTATAAGATTATAGCTAAAGTTCTTGCAAATAGATTGAAAAAGGTCCTGGATTCGATCATATCCCCAAGTCAATCTGCGTTTGTGCCAGGTCGACTCATTTCAGA
TAATACCATTATTGGTTTCGAATGTCTACATGCAGTGAAGAATAAACGAAGGGGGAAAGAAGGAGTGGCAGCCCTCAAACTCGACATGAGCAAGGCTTATGATCGAGTGG
AGTGGATATACATTCGGAAGATCATAGAAAAGATGGGGTTTGATAACAGATGGATAAATATTATTATGAGATGTGTTGAATCAGTAAGATTTCAGGTTATCCTTAATGGG
ATTCCGAGAGCAGAATTTTCTCCTAATCGAGGCCTAAGGCAAGGAGATCCTCTATCTCCGTATTTATTTTTGATATGTGCAGAGGGTCTATCTGGAATGCTGAACCATTA
TGAGAAATCTAAAGATTTTACAGGTTTGCGTATCAATAAATACTGTCCGTCTATTACTCATCTCTTTTATGCTGATGATAGTCTCCTGTTTTTCAAAGCTTCTGATAAAG
ATTGCAGGAACATTAAGCGGATGCTTCTCATATATGAGCGTGCCTCAGGCCAAACCATAAATTATGAGAAATCATCATTTATGGTTGGCCCAAACACAAACGTAGCCCAG
ATTGAGGTGATCAAGAACACTTTGGGAGTGCGGCACCAAAAGAGCTTAGGCCAGTACCTAGGTTTACCTTCACAAATGGAAAGGAAAAAGAAGGAGGTTTTTGATAACAT
CAAGGATCGTGTTTGGAAGGCTTTACAAGGATGGAAAGGGAAGTTCTTCTCTTTTGGAGGGAAGGAAATTCTTATTAAATCAGTGGCTCAAGCCATTCCAAATTATGCGA
TGAGTTGTTTTAAATTTCCTATTTCTTTATGTAACGAGCTAAATTCTATTTGTGCTAGGTTCTGGTGGGGTGATTCAGACAAAGGAAGGAAAATACATTGGAGGAGGTGG
TCAAGTCTATGCCTTAATAAGAAGCAGGGAGGGCTAGGCTTCCGAGATTTATATATTTTCAACCAAGCTATGTTATCAAAGCAAAGCTGGAGAATTATTCGTAACCCTGA
AAGTCTTTTAGCACAGGTTCTACGAGGAAGATATTTTAAAAATGGCAACTTTATTGAGGCTCCTTTGGGCAATAATCCTTCATATACGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGAATTTAGAGATTGTATTGATGGTGCTGGCCTGTGTGATGCTGGATTTTCAGGTCTGGCTTTTACTTGGTGCAATAAACACTATCAATCTGATCTTATATGGGA
AAGGCTGGATAGATTCTTACTAAACTCTGATATGCTCATTCGGGCGAGTGATATTAAGGTAGTTCATCTTCCCCTTATAGCTTCAGATCACAGACCAATCCTAGCGGTTT
GGTCAGAGGATAATGGGTCTCAAAGAGAAAATAGGAAGAAGAGGCCACGGAGGTTTGAAGAATGTTGGGTGAAATATGAAGAGTGCAGAAACATTGTAAACCAAGTTTGG
GAGCAAGCGAGAAATCAAGAAGGAATGACTCTGATACAGAAAACATCGACGTATCAACAGAGCATTATTGATTTGGAAACAAATCAAAGAAAATTAGAAAGTCTTTTAGA
AGATGAGGAGGTCTACTGGAAGCAAAGATCAAGAGAAGAATGGTTGAATTGGGGAGATAGAAATACAAAATGGTTCCATTTAAAGGCCAGTAAAAGAAGGTCAATTAACA
GAATTCAAAGGCTTCGAAATGCTATGGGAAACTGGGTGGAAGATGATGTAGGGATGGAAGAGATGGTGAATCAATACTTTCAGACCTTGTTTCAATCTTCTAATCCTGAT
ACAGAGTCTATTGATGGCATTTTGGAACACATTCCAACAAGTATTTCAGAGATGCAGAATAGGGAGTTAATGAAAGAGTTCACTCGAGAAGAGGTCAAATATGTGATTAG
TAGGATGCATCCATCAAAAGCTCCAGGTCCAGATGGGATCCAAGCAATGTTTTATCAAAAGTTTTGGGATACTGTTGGGGATGATGTGTGTAAGCTTTGTTTAGGTATTC
TAAAGGGGGAGGAATCCATCCATCACCTAAATAGAACTTATATCTCTTTAATCCCCAAGATCAAAGACCCAAACACTATGAAAGACTTTAGGCCAATCAGTTTGTGTTCT
GTTCTGTATAAGATTATAGCTAAAGTTCTTGCAAATAGATTGAAAAAGGTCCTGGATTCGATCATATCCCCAAGTCAATCTGCGTTTGTGCCAGGTCGACTCATTTCAGA
TAATACCATTATTGGTTTCGAATGTCTACATGCAGTGAAGAATAAACGAAGGGGGAAAGAAGGAGTGGCAGCCCTCAAACTCGACATGAGCAAGGCTTATGATCGAGTGG
AGTGGATATACATTCGGAAGATCATAGAAAAGATGGGGTTTGATAACAGATGGATAAATATTATTATGAGATGTGTTGAATCAGTAAGATTTCAGGTTATCCTTAATGGG
ATTCCGAGAGCAGAATTTTCTCCTAATCGAGGCCTAAGGCAAGGAGATCCTCTATCTCCGTATTTATTTTTGATATGTGCAGAGGGTCTATCTGGAATGCTGAACCATTA
TGAGAAATCTAAAGATTTTACAGGTTTGCGTATCAATAAATACTGTCCGTCTATTACTCATCTCTTTTATGCTGATGATAGTCTCCTGTTTTTCAAAGCTTCTGATAAAG
ATTGCAGGAACATTAAGCGGATGCTTCTCATATATGAGCGTGCCTCAGGCCAAACCATAAATTATGAGAAATCATCATTTATGGTTGGCCCAAACACAAACGTAGCCCAG
ATTGAGGTGATCAAGAACACTTTGGGAGTGCGGCACCAAAAGAGCTTAGGCCAGTACCTAGGTTTACCTTCACAAATGGAAAGGAAAAAGAAGGAGGTTTTTGATAACAT
CAAGGATCGTGTTTGGAAGGCTTTACAAGGATGGAAAGGGAAGTTCTTCTCTTTTGGAGGGAAGGAAATTCTTATTAAATCAGTGGCTCAAGCCATTCCAAATTATGCGA
TGAGTTGTTTTAAATTTCCTATTTCTTTATGTAACGAGCTAAATTCTATTTGTGCTAGGTTCTGGTGGGGTGATTCAGACAAAGGAAGGAAAATACATTGGAGGAGGTGG
TCAAGTCTATGCCTTAATAAGAAGCAGGGAGGGCTAGGCTTCCGAGATTTATATATTTTCAACCAAGCTATGTTATCAAAGCAAAGCTGGAGAATTATTCGTAACCCTGA
AAGTCTTTTAGCACAGGTTCTACGAGGAAGATATTTTAAAAATGGCAACTTTATTGAGGCTCCTTTGGGCAATAATCCTTCATATACGTGGTGA
Protein sequenceShow/hide protein sequence
MKEFRDCIDGAGLCDAGFSGLAFTWCNKHYQSDLIWERLDRFLLNSDMLIRASDIKVVHLPLIASDHRPILAVWSEDNGSQRENRKKRPRRFEECWVKYEECRNIVNQVW
EQARNQEGMTLIQKTSTYQQSIIDLETNQRKLESLLEDEEVYWKQRSREEWLNWGDRNTKWFHLKASKRRSINRIQRLRNAMGNWVEDDVGMEEMVNQYFQTLFQSSNPD
TESIDGILEHIPTSISEMQNRELMKEFTREEVKYVISRMHPSKAPGPDGIQAMFYQKFWDTVGDDVCKLCLGILKGEESIHHLNRTYISLIPKIKDPNTMKDFRPISLCS
VLYKIIAKVLANRLKKVLDSIISPSQSAFVPGRLISDNTIIGFECLHAVKNKRRGKEGVAALKLDMSKAYDRVEWIYIRKIIEKMGFDNRWINIIMRCVESVRFQVILNG
IPRAEFSPNRGLRQGDPLSPYLFLICAEGLSGMLNHYEKSKDFTGLRINKYCPSITHLFYADDSLLFFKASDKDCRNIKRMLLIYERASGQTINYEKSSFMVGPNTNVAQ
IEVIKNTLGVRHQKSLGQYLGLPSQMERKKKEVFDNIKDRVWKALQGWKGKFFSFGGKEILIKSVAQAIPNYAMSCFKFPISLCNELNSICARFWWGDSDKGRKIHWRRW
SSLCLNKKQGGLGFRDLYIFNQAMLSKQSWRIIRNPESLLAQVLRGRYFKNGNFIEAPLGNNPSYTW