; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026516 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026516
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:38270065..38273179
RNA-Seq ExpressionLag0026516
SyntenyLag0026516
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CCA66054.1 hypothetical protein [Beta vulgaris subsp. vulgaris]6.3e-7337.01Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        KAS+R+K N V GLFD    W +E   +E +  +YF  +F S+NP   ++  ++      ++EEHNLKL+  F+++EI   ++ MHP KAPGPD +  IF
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        YQ++W ++G D   F    L+G  SP  +N T IALIPK+ NP    +FRPI+LC+VLYK++SK +  RLK  L  IIS +QSAFVPGRLITDNA++  E
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH
          H +KN+ + +   +++KLDMSKA+DR EW ++ K++  +GF  RW++L  E V                                        + ++ 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL--QRLDRFQ-----PS
         + V  K+L   + ++  P ++HLF+ DDSLLF  A+  +   I   LN YE A GQ INY+KS    S   +     E+ +IL  +++DR +     PS
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL--QRLDRFQ-----PS

Query:  YVSQNKARELFDIQI----VYLLGFSEAAISRQNR
         +S    + +FD  I      L G+ E  +SR  +
Subjt:  YVSQNKARELFDIQI----VYLLGFSEAAISRQNR

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.9e-7839.71Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        +AS+RRK N + G++D    W   ++ + + A +YF  ++ S++P    I ++ +A P  ++EE N  LI  FT+EE+   +K +HP KAPGPD + A+F
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        +QKYW ++G +  D  L  LN       +NKT I+LIPK NNPK M DFRPISLC+V+YK+ISK +ANRLK +L +IIS +QSAF   RLITDN ++ FE
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLY-----------------HEKVY-----------------------QELH
         +H + +K  GK+  +++KLDMSKA DR EW +I K+M ++GFCNRW DL                  H  +Y                         L 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLY-----------------HEKVY-----------------------QELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQRLDRFQ-------PS
        NQ+   K ++ + IN+ CP +THLF+ DDS+LF  A+  +   ++  L  YE+A GQ IN DKS+   SPNT ++   EI +IL  +   +       PS
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQRLDRFQ-------PS

Query:  YVSQNKAR
         + ++K++
Subjt:  YVSQNKAR

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]9.3e-7742.74Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        KAS RR+ N + G+ D N  W    + + +VA +YFQ ++ S+ P    I+++LDA P  ++EE N  LI  FTREEI   +  MHPTKAPGPD + AIF
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        +QKYW ++G D     L  LN   S   INKT I L+PKI NP  M DFRPISLC+V+YK+ISK +ANRLK +L  IIS +QSAF+ GRLITDN ++ FE
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH
         +H +++K++GK+   ++KLDMSKA+DR EW +I ++M K+GF  +WI L    +                                        +  L 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNT
        N    + ++S + I + CP +THLF+ DDSLLF  A+S +  ++   L +YE A GQ IN DKS+   S NT
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNT

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber]2.8e-7337.68Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        +AS+RRK N +  L++ +  W    + +   A +YF+ ++ S++P    IN++++A P  +++E N +L  +FT EE+   +K +HPTKAPGPD + A F
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        +  YW+++G    +  L  LN       INKT I+LIPK N P  M +FRPISLC+  YKIISK +ANR K +L NIIS +QSAF P RLITDN ++ FE
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----YQELHN-----------------------------------
         +H + +K +GK+  +S+KLDMSKA DR EW +I  +M KLGF  +WI L    V    Y  L N                                   
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----YQELHN-----------------------------------

Query:  -QSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQRLDRFQ-------PS
         ++   ++++ + I + CP++THLF+ DDSLLF  A   +  ++   LN YE+A GQ IN DKS+   SPNT+++L   I +IL  +   +       PS
Subjt:  -QSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQRLDRFQ-------PS

Query:  YVSQNKARELFDIQ
         + ++KA+   +++
Subjt:  YVSQNKARELFDIQ

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata]4.3e-7439.06Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        +AS+R++ N + GL++ +  W +  + +   A +YF++++ +++P    IN+++ A P  ++EE N++L  +FT EE+   ++ +HP KAPGPD + AIF
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        +  YW+++G +  +  L  LN       INKT I+LIPK N P  M +F PISLC+  YKIISK +ANRLK +L NIIS +QSAF P RLITDN ++ FE
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH
         +H + +K +GK+  +S+KLDMSKA DR EW +I  +M KLGF ++WIDL    V                                        +  L 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL
        +++   ++++ + I + CP++TH F+ DDSLLF  A   +  ++   LN YE+A GQ IN DKS+   SPNT +DL   I DIL
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL

TrEMBL top hitse value%identityAlignment
A0A2K3MPI1 Ribonuclease H2.0e-7240.96Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNP-QFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI
        KAS+R KVN++  + D +  W + DQ +ERV  NYF++LF S+NP   EA  +++      +SEEH       F+REEI   +  MHP KAPGPD + A+
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNP-QFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI

Query:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF
        FYQKYW ++G +  +  L  LN    PR INKTF+ LIPK  NP S KDFRPISLC+V+ KI++K +ANRLK+ L ++I   QSAFV GRLITDNA++  
Subjt:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF

Query:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----YQELHN----------------------------------
        EC H +K KR+GK  V++LKLDMSKA+DR EW ++  ++  +G+  R ++L    +    YQ L N                                  
Subjt:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----YQELHN----------------------------------

Query:  --QSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKD
          ++   K++  +++ +  P L+HLF+ DDSLLF  A+S +   I   L +Y++A GQ +N DKS    S N   +
Subjt:  --QSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKD

A0A2N9GM07 Reverse transcriptase domain-containing protein6.1e-7439.79Show/hide
Query:  MKASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI
        +KAS+RR+ N++ GLF T   W+    +++     YF+++F ++ P    + + + A  + ++   N +L  +FT EE+H  ++ MHPTKAPGPD + A+
Subjt:  MKASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI

Query:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF
        F+QKYW ++GK+  +  LQ LN   S    NKT IALIPK   P+ M +FRPISLC+V YK+ISK +ANRLK VL+++IS +QSAFVPGR ITDNA++ F
Subjt:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF

Query:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQEL
        E +H  + KR GKD  ++LKLDMSKA+DR EW++I ++M KLGFC +WI L    +                                        +  L
Subjt:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQEL

Query:  HNQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIE
          ++     +  + + +  P +THLF+ DDS+LF  A++TD   + +    YEKA GQ IN DKS+   S NT+     EI+
Subjt:  HNQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIE

A0A2N9I335 Reverse transcriptase domain-containing protein5.2e-7340.16Show/hide
Query:  RRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKY
        RR++N + GL D       +  +M  +A +YFQ +F S+NP  E IN  LD     ++EE N  L+  F  EE+   +K M+PTKAPGPD + A+FYQ Y
Subjt:  RRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKY

Query:  WEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHV
        W+++G +     L  L+       IN T IALIPK+ NP+ + DFRPISLC+V+YKI+SK +ANRLK VL  +IS SQSAFVPGRLITDN ++ FE +H 
Subjt:  WEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHV

Query:  VKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDL---------YHEKVYQELH-------------------------------NQSV
        +  KR G+   ++LKLDMSKA+DR EW+++  IM +LGF   WI+L         Y   +  E H                                ++V
Subjt:  VKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDL---------YHEKVYQELH-------------------------------NQSV

Query:  ARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQ
          KK+S +  ++  P LTHLF+ DDSLLF  A+  +  ++ H L  YE   GQ +N  K++   + NT+ D+  +I+++ Q
Subjt:  ARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDILQ

A0A7N2L6Z9 Reverse transcriptase domain-containing protein2.0e-7239.84Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        +AS+RRK N + G++D    W ++   +   A  YF++++ ++NP    ++++  A P  I+EE N +L   FTREEI   +K +HPTK+PGPD + AIF
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        +QKYW+++G +  +  L  LN   S   INKT I LIPK +NPK M DFRPISLC+V+YK+ISK +ANRLK  L  II+ +QSAF   RLITDN ++ +E
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH
         +H +K+K+ GKD  ++ KLDMSKA DR EW +I ++M K+GF   WI L    +                                           L 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL
        + +   + L+ + + + CP +THLF+ DDSLLF  A+  +   +K  L  YE A GQ +N DKS+   SPNT  +L   I +IL
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL

F4NCJ6 Reverse transcriptase domain-containing protein3.0e-7337.01Show/hide
Query:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF
        KAS+R+K N V GLFD    W +E   +E +  +YF  +F S+NP   ++  ++      ++EEHNLKL+  F+++EI   ++ MHP KAPGPD +  IF
Subjt:  KASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIF

Query:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE
        YQ++W ++G D   F    L+G  SP  +N T IALIPK+ NP    +FRPI+LC+VLYK++SK +  RLK  L  IIS +QSAFVPGRLITDNA++  E
Subjt:  YQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFE

Query:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH
          H +KN+ + +   +++KLDMSKA+DR EW ++ K++  +GF  RW++L  E V                                        + ++ 
Subjt:  CIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKV----------------------------------------YQELH

Query:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL--QRLDRFQ-----PS
         + V  K+L   + ++  P ++HLF+ DDSLLF  A+  +   I   LN YE A GQ INY+KS    S   +     E+ +IL  +++DR +     PS
Subjt:  NQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTSPNTNKDLVREIEDIL--QRLDRFQ-----PS

Query:  YVSQNKARELFDIQI----VYLLGFSEAAISRQNR
         +S    + +FD  I      L G+ E  +SR  +
Subjt:  YVSQNKARELFDIQI----VYLLGFSEAAISRQNR

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-1726Show/hide
Query:  KRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDA-TPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQ
        K+R+ N++  + +       +  E++     Y++ L+ +     E ++  LD  T   +++E    L    T  EI  ++  +   K+PGPD   A FYQ
Subjt:  KRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDA-TPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQ

Query:  KYWEVMGKDACDFCLQ-FLNGEKS---PRHINKTFIALIPKINNPKSMKD-FRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAIL
        +Y E    +   F L+ F + EK    P    +  I LIPK     + K+ FRPISL ++  KI++K +ANR++  +  +I   Q  F+PG     N   
Subjt:  KYWEVMGKDACDFCLQ-FLNGEKS---PRHINKTFIALIPKINNPKSMKD-FRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAIL

Query:  GFECIHVVKNKRQGKDR-VVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKVYQELHNQSVARKKLSSLRI---------NKHCPILTHLFYV
          + I+V+++  + KD+  V + +D  KA D+ +  ++ K + KLG     ID  + K+ + ++++  A   L+  ++          + CP+   LF +
Subjt:  GFECIHVVKNKRQGKDR-VVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKVYQELHNQSVARKKLSSLRI---------NKHCPILTHLFYV

P11369 LINE-1 retrotransposable element ORF2 protein3.5e-1824.44Show/hide
Query:  EDQEMERVANNYFQELFQSTNPQFEAINQILDATPV-CISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKYWEVMGKDACDFCLQFLNG
        + +E++    ++++ L+ +     + +++ LD   V  ++++    L +  + +EI  V+  +   K+PGPD   A FYQ + E +         +    
Subjt:  EDQEMERVANNYFQELFQSTNPQFEAINQILDATPV-CISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKYWEVMGKDACDFCLQFLNG

Query:  EKSPRHINKTFIALIPK-INNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHVVKNKRQGKDRVVSLKLD
           P    +  I LIPK   +P  +++FRPISL ++  KI++K +ANR++  +  II P Q  F+PG     N       IH + NK + K+ ++ + LD
Subjt:  EKSPRHINKTFIALIPK-INNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHVVKNKRQGKDRVVSLKLD

Query:  MSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKVYQELHNQSVARKKLSSLRI----NKHCPILTHLFYV
          KA D+ +  ++ K++ + G    ++++      + + N  V  +KL ++ +     + CP+  +LF +
Subjt:  MSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKVYQELHNQSVARKKLSSLRI----NKHCPILTHLFYV

P14381 Transposon TX1 uncharacterized 149 kDa protein3.6e-2330.21Show/hide
Query:  ASKRRKVNK--VCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI
        A +++K N+  +  LF  +   L++ + +   A +++Q LF       +A  ++ D  PV +SE    +L    T +E+   ++ M   K+PG D +   
Subjt:  ASKRRKVNK--VCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAI

Query:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF
        F+Q +W+ +G D      +     + P    +  ++L+PK  + + +K++RP+SL S  YKI++K ++ RLK VL  +I P QS  VPGR I DN  L  
Subjt:  FYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGF

Query:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYI
        + +H    +R G   +  L LD  KA DR +  Y+
Subjt:  ECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.2e-1031.9Show/hide
Query:  EAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMK
        +++ +I D  P   ++    +L A  + +EI   V  M   KAPGPD   A F+ + W V+         +F       +  N T I LIPK+     + 
Subjt:  EAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKYWEVMGKDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMK

Query:  DFRPISLCSVLYKIIS
         FRP+S C+V+YKII+
Subjt:  DFRPISLCSVLYKIIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.3e-1137.35Show/hide
Query:  VANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWI
        +  RLK ++ N+I P+Q++F+PGR+ TDN +   E +H ++ K+  K  ++ LKLD+ KA+DR  W Y+   +   GF   W+
Subjt:  VANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHVVKNKRQGKDRVVSLKLDMSKAHDRAEWIYIWKIMGKLGFCNRWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGCTAGTAAGAGGAGGAAAGTGAATAAGGTATGTGGCTTGTTTGATACGAATAGTGCGTGGTTAAAGGAGGATCAAGAGATGGAGAGGGTAGCTAATAATTATTT
CCAAGAGCTTTTCCAATCCACCAATCCACAGTTTGAGGCCATTAATCAAATCCTGGATGCTACGCCTGTTTGTATTTCAGAGGAGCATAACTTAAAACTTATTGCCTCTT
TTACTAGAGAAGAAATTCATGGTGTGGTTAAAGGCATGCATCCTACTAAAGCTCCGGGCCCGGATGACGTTCAAGCCATTTTCTACCAGAAATATTGGGAGGTGATGGGA
AAGGATGCATGTGACTTCTGTCTTCAGTTTTTGAATGGGGAGAAAAGCCCAAGGCATATTAATAAGACTTTCATCGCACTGATCCCGAAGATCAATAATCCTAAATCCAT
GAAGGATTTCAGGCCTATTAGTTTATGCTCAGTGCTATACAAAATCATTTCGAAATTTGTCGCCAACAGACTGAAAGTTGTTTTGAATAATATCATCTCCCCTAGTCAAT
CGGCATTTGTTCCAGGCAGACTGATCACAGACAATGCTATCCTTGGATTTGAGTGCATTCACGTAGTTAAAAACAAAAGACAAGGGAAAGACAGAGTGGTTTCTCTCAAG
TTAGATATGAGCAAAGCTCATGATCGAGCGGAATGGATTTACATTTGGAAAATCATGGGGAAGTTGGGGTTCTGCAACCGATGGATTGATCTTTATCATGAGAAGGTCTA
TCAAGAACTCCACAATCAATCAGTAGCTAGAAAGAAATTGTCAAGTTTGCGCATCAATAAACATTGCCCTATTTTAACTCATTTATTTTACGTCGATGATAGCCTTTTGT
TCTTTAATGCATCCAGTACTGATTATTGGTCTATAAAACATACTCTTAATATCTATGAGAAAGCCTTTGGACAAACAATTAATTATGACAAATCTAACTTTATGACTAGT
CCCAATACTAACAAGGATCTAGTCCGAGAGATCGAGGATATTTTGCAGAGACTTGACCGTTTTCAACCAAGCTATGTTAGCCAAAACAAAGCTAGAGAATTATTCGATAT
CCAGATAGTTTACTTGCTAGGGTTTTCAGAGGCCGCTATTTCAAGACAGAACAGAATGGATGAGATTATCTGGAATTATGATCCAAAAGGACCAAATGAGAGACAACTAA
ACATCTATTCTGGAAGTGCAAGGTTACTAAAGTCAAAAAGTGGCGACATTGGGTGGGTTCTTCGACGATGGGACGGTACACTTGTGACTGTCGACTTTAGATTCATCCAC
CGTCAATGGCGAGTCAGTTGGCTAAAAGCCCTTGCGGTAGTGGAAGGGTTGAAATCGATTCCTCAAACATCTCCCAAGTTGATTTTCGAGCTTGACTCCATTCAGGTGGT
ACATCTTCTTGAGGGGAAGGAGGATGACGTTACCGAGCTTGCCTACTTTATCAAGGAAGCCAATTCGTTTATCTCTGGTCTTCAAGTGCACACGATAAGCCATATTTCTA
GAAGACATAATAACTTGGCCCATCAGTTGGCCCGAAAAGCCTGTAATAAAGCTTTTGACAGTTGGTACAATGTATTTCCTCTTTGGTTTTTAGATATTGGAACTGTGAAT
ACCTTTAGTGGGAGTGCCTCTCCCACAAATGATTGCCCTATGGGAGTAATTGCTCAGTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAAGCTAGTAAGAGGAGGAAAGTGAATAAGGTATGTGGCTTGTTTGATACGAATAGTGCGTGGTTAAAGGAGGATCAAGAGATGGAGAGGGTAGCTAATAATTATTT
CCAAGAGCTTTTCCAATCCACCAATCCACAGTTTGAGGCCATTAATCAAATCCTGGATGCTACGCCTGTTTGTATTTCAGAGGAGCATAACTTAAAACTTATTGCCTCTT
TTACTAGAGAAGAAATTCATGGTGTGGTTAAAGGCATGCATCCTACTAAAGCTCCGGGCCCGGATGACGTTCAAGCCATTTTCTACCAGAAATATTGGGAGGTGATGGGA
AAGGATGCATGTGACTTCTGTCTTCAGTTTTTGAATGGGGAGAAAAGCCCAAGGCATATTAATAAGACTTTCATCGCACTGATCCCGAAGATCAATAATCCTAAATCCAT
GAAGGATTTCAGGCCTATTAGTTTATGCTCAGTGCTATACAAAATCATTTCGAAATTTGTCGCCAACAGACTGAAAGTTGTTTTGAATAATATCATCTCCCCTAGTCAAT
CGGCATTTGTTCCAGGCAGACTGATCACAGACAATGCTATCCTTGGATTTGAGTGCATTCACGTAGTTAAAAACAAAAGACAAGGGAAAGACAGAGTGGTTTCTCTCAAG
TTAGATATGAGCAAAGCTCATGATCGAGCGGAATGGATTTACATTTGGAAAATCATGGGGAAGTTGGGGTTCTGCAACCGATGGATTGATCTTTATCATGAGAAGGTCTA
TCAAGAACTCCACAATCAATCAGTAGCTAGAAAGAAATTGTCAAGTTTGCGCATCAATAAACATTGCCCTATTTTAACTCATTTATTTTACGTCGATGATAGCCTTTTGT
TCTTTAATGCATCCAGTACTGATTATTGGTCTATAAAACATACTCTTAATATCTATGAGAAAGCCTTTGGACAAACAATTAATTATGACAAATCTAACTTTATGACTAGT
CCCAATACTAACAAGGATCTAGTCCGAGAGATCGAGGATATTTTGCAGAGACTTGACCGTTTTCAACCAAGCTATGTTAGCCAAAACAAAGCTAGAGAATTATTCGATAT
CCAGATAGTTTACTTGCTAGGGTTTTCAGAGGCCGCTATTTCAAGACAGAACAGAATGGATGAGATTATCTGGAATTATGATCCAAAAGGACCAAATGAGAGACAACTAA
ACATCTATTCTGGAAGTGCAAGGTTACTAAAGTCAAAAAGTGGCGACATTGGGTGGGTTCTTCGACGATGGGACGGTACACTTGTGACTGTCGACTTTAGATTCATCCAC
CGTCAATGGCGAGTCAGTTGGCTAAAAGCCCTTGCGGTAGTGGAAGGGTTGAAATCGATTCCTCAAACATCTCCCAAGTTGATTTTCGAGCTTGACTCCATTCAGGTGGT
ACATCTTCTTGAGGGGAAGGAGGATGACGTTACCGAGCTTGCCTACTTTATCAAGGAAGCCAATTCGTTTATCTCTGGTCTTCAAGTGCACACGATAAGCCATATTTCTA
GAAGACATAATAACTTGGCCCATCAGTTGGCCCGAAAAGCCTGTAATAAAGCTTTTGACAGTTGGTACAATGTATTTCCTCTTTGGTTTTTAGATATTGGAACTGTGAAT
ACCTTTAGTGGGAGTGCCTCTCCCACAAATGATTGCCCTATGGGAGTAATTGCTCAGTCTTAA
Protein sequenceShow/hide protein sequence
MKASKRRKVNKVCGLFDTNSAWLKEDQEMERVANNYFQELFQSTNPQFEAINQILDATPVCISEEHNLKLIASFTREEIHGVVKGMHPTKAPGPDDVQAIFYQKYWEVMG
KDACDFCLQFLNGEKSPRHINKTFIALIPKINNPKSMKDFRPISLCSVLYKIISKFVANRLKVVLNNIISPSQSAFVPGRLITDNAILGFECIHVVKNKRQGKDRVVSLK
LDMSKAHDRAEWIYIWKIMGKLGFCNRWIDLYHEKVYQELHNQSVARKKLSSLRINKHCPILTHLFYVDDSLLFFNASSTDYWSIKHTLNIYEKAFGQTINYDKSNFMTS
PNTNKDLVREIEDILQRLDRFQPSYVSQNKARELFDIQIVYLLGFSEAAISRQNRMDEIIWNYDPKGPNERQLNIYSGSARLLKSKSGDIGWVLRRWDGTLVTVDFRFIH
RQWRVSWLKALAVVEGLKSIPQTSPKLIFELDSIQVVHLLEGKEDDVTELAYFIKEANSFISGLQVHTISHISRRHNNLAHQLARKACNKAFDSWYNVFPLWFLDIGTVN
TFSGSASPTNDCPMGVIAQS