CuGenDBv2

Lag0023235 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0023235
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:46313849..46314865
RNA-Seq Expression: Lag0023235
Synteny: Lag0023235
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
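
As a quick consistency check on the genome location above, the following is a minimal sketch (not part of CuGenDB) that parses a location string of the form chrom:start..end and computes its inclusive length. The helper names parse_location and span_length are hypothetical, and the sketch assumes 1-based inclusive coordinates.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval (assumed convention)."""
    return end - start + 1


chrom, start, end = parse_location("chr7:46313849..46314865")
length = span_length(start, end)
print(chrom, length, length % 3)
```

Under these assumptions the span is 1017 bp, that is, 339 codons, which would be consistent with the 338-residue protein listed in this record plus a stop codon.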


Homology
GenBank top hits (e value, %identity, alignment)
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]  e value: 6.7e-99  %identity: 52.07
Query:  MVVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIIS
        + +K + P+KA GPDG+ A+F+QKYW IVG +V D  L  LN    +  +NKT I+LIPK  +P  M DFRPISLC+V+YK+I+K LANRLK +L  IIS
Subjt:  MVVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIIS

Query:  PSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLR
         +QSAF   RLI+DN ++ FE +H + +K  GKEG +A+KLDMSKA+DRVEW +I K+ME+MGF +RW   +M C+ SV++ +L+NG+      P+RGLR
Subjt:  PSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLR

Query:  QGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIK
        QGDPLSP LFL+CAEGLS+ +N++ R +  TG+ IN  CP + HLF+ADDS+LF KA  ++C  +  IL  YE ASGQ IN +KS+   SPNT+++   +
Subjt:  QGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIK

Query:  IKETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        I   LG        +YLGLPS IGRSK +VF ++K++V
Subjt:  IKETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]  e value: 1.6e-100  %identity: 52.98
Query:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS
        +  M P+KA GPDG+ AIF+QKYW+IVG D+    L+ LN   S+  INKT I L+PKIK+P+ M DFRPISLC+V+YK+I+K LANRLK++L +IIS +
Subjt:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS

Query:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG
        QSAF+ GRLI+DN ++ FE +H +++K++GKEG  A+KLDMSKAYDRVEW +I+++MEKMGF+ +WI  +M C+ SV++ +L+NG       P RGLRQG
Subjt:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG

Query:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK
        DP+SPY+FL+CA+G SS LN   R    +G+ I   CP I HLF+ADDSLLF KA  ++C+ +  IL  YE ASGQ IN +KS+   S NT  ++  ++ 
Subjt:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK

Query:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
          LG        +YLGLPS IG+SK E+F  +K+RV
Subjt:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]  e value: 4.2e-101  %identity: 56.16
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA
        M P+KA GPDG+ A+FYQK+W IVG  V    L+FLN    L  IN T I LIPK+++P  M +FRPISLC+VIYKII+K LANRLK VL +IIS +QSA
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA

Query:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL
        FVPGRLI+DN ++ +E +H +  ++KGK+G VALKLD+SKAYDRVEW ++Q IMEKMGF + WI ++MSCV + +F +L+NG P E  +P+RG+RQGDP+
Subjt:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL

Query:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL
        SPYLFL+CAEGL++ LNK+E     TG+ I    P I +L +ADDSLLF +A   +   I  IL  YERASGQ+IN EKS+   S NTS  Q  +I E L
Subjt:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL

Query:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        GV+  +   +YLGLP+ IGR+K   F  +KDRV
Subjt:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

XP_030939568.1 uncharacterized protein LOC115964386 [Quercus lobata]  e value: 5.0e-102  %identity: 54.46
Query:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS
        +K M+P  A GPDG+  IFY+  W+ +  DV    L  LN       +N TYIALIPK K P   KDFRPISLC+V+YKII+K +ANRLK +L K++  S
Subjt:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS

Query:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG
        QSAF+  RLISDN ++ FE +H IKNKRKGK G +ALKLDMSKAYDRVEW++++K+MEK+GF SRWI  I SC+ +V+F VL+NG P   F PNRGLRQG
Subjt:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG

Query:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK
        DPLSPYLFL+CAEGL S + ++E      G+ +    P ++HLF+ADDSLLF +A +KD   I  ILH YERASGQ IN EK+    SPNT       IK
Subjt:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK

Query:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
          +GV     +  YLGLP+ +GR KK+ F  I++R+
Subjt:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

XP_030964568.1 uncharacterized protein LOC115985808 [Quercus lobata]  e value: 2.3e-99  %identity: 52.98
Query:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS
        +K M  + + GPDG+  +FY+ YW+ VG DV    L  LN       IN T+IALIPKIK P   KDFRPISLC+VIYK+I+K +ANRLK +L K++S S
Subjt:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS

Query:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG
        QSAF+  RLISDN ++ FE +H +KNKRKGK G +ALKLDMSKAYD+VEW +++ +MEK+GF+ RWI  + SC+ +V+F V++NG P   F+PNRGLRQG
Subjt:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG

Query:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK
        DPLSPYLFL+CAEGL S + +++   +  G+ I+   P ++HLF+ADD LLF KA   DC+AI   L  YE+A+GQ IN +K+    S NTS      IK
Subjt:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK

Query:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        E LGV       +YLGLPS +GR+KK+ F  I++R+
Subjt:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

TrEMBL top hits (e value, %identity, alignment)
A0A2N9EWI8 Uncharacterized protein  e value: 1.6e-101  %identity: 55.86
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA
        M PSKA GPDG+ A+F+QK+W IVG DV +  L+FLN    L  +N T+IALIPK+K P +M  FRPISLC+V+YKII+K L NR+K +L  ++S SQSA
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA

Query:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL
        FVPGR+ISDN +I FE IH +KNKR GK   +A KLDMSKAY+RVEW Y++KIM K+GF+ +W+  IM CV SV++ +L+NG P+   +P+RGLRQGDPL
Subjt:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL

Query:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL
        SPYLFLICAEGLS+ L K+ER     G+ ++   P ++HLF+ADDSL+F +A E DC A+ +IL  YERASGQ IN +K+A   S N S      I    
Subjt:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL

Query:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        G        +YLGLP  IGRSKK+ F  IKDR+
Subjt:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

A0A2N9HWM9 Reverse transcriptase domain-containing protein  e value: 1.0e-100  %identity: 55.26
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA
        M PSKA GPDG+ A+F+QK+W IVG DV D  L+FLN    L  +N T+IALIPK+K P  M  FRPISLC+V+YKII+K L NR+K++L  ++S SQSA
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA

Query:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL
        FVPGR+ISDN +I FE +H +KNKR GK   +A+KLDMSKAYDRVEW Y++K+M K+GF +RW+  IM CV SV++ +L+NG P+   +P+RGLRQGDPL
Subjt:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL

Query:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL
        SPYLFLICAEGL++ L K+ER     G+ I    P ++HLF+ADDSL+F +A   +C+A+  IL  YE ASGQ IN  K+A   S N S      I    
Subjt:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL

Query:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        G        +YLGLP  IGRSKK+ F  IKDR+
Subjt:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

A0A2N9I4C9 Uncharacterized protein  e value: 1.4e-102  %identity: 55.86
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA
        M PSKA GPDG+ A+FYQK+W I+G DV +  LEFL+    L  IN T+IALIPK+K P TM  FRPISLC+V+YKII+K LANRLK+VL+ +IS +QSA
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSA

Query:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL
        FVPGRLISDN ++ FE +H +K+KRKG+   +A+KLDMSKAYDRVEW +I K+M K+GFN RW+  IM C++SV++ V+LNG P    RP RG+ QGDPL
Subjt:  FVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPL

Query:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL
        SPYLFLICAEGL++ LN +   R  +GL +    P I+HLF+ADDSLLF +A  ++C  +  +L TYE+ASGQ +N+EK++   S NT +     I   L
Subjt:  SPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETL

Query:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
               LG+YLGLP  IGR KK+ F  IK +V
Subjt:  GVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

A0A5B7BN08 Reverse transcriptase domain-containing protein  e value: 3.0e-105  %identity: 56.59
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLN-GEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQS
        M P+KA GPDG+  +F+QK+WD+VG D+    L+FLN G GSL  IN TYIALIPK+  P  + +FRPISLC+V+YKII+K LANRLK++L  II+ SQS
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLN-GEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQS

Query:  AFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDP
        AFVPGRLI+DN ++ FE IH +KNKRKGK G  ALKLDMSKAYDRVEW +++ +M +MGF+ +W+  IM CV +V+F VL+NG PR   +P RGLRQGDP
Subjt:  AFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDP

Query:  LSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKET
        LSPYLF++CAE  S+ L KSE      G+ +    P ++HLF+ADDSLLF  A E     I+RI+  Y  ASGQ +NFEKSA   S N + D+  +IK+ 
Subjt:  LSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKET

Query:  LGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
        LGV       +YLGLPS IGRSK + F++I+DRV
Subjt:  LGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

A0A7N2LIH6 Uncharacterized protein  e value: 4.5e-101  %identity: 53.57
Query:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS
        ++ M P+KA GPDG+  IFYQKYWDIVG+ V +  L+ LN       INKTYI LIPK K+P  + +FRPISLC+VIYKII+K LANRLK VL  +I  +
Subjt:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPS

Query:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG
        QSAFVPGR+I+DN ++ FE +H+I  +RKGKEG++A+KLDMSKAYDRVEW Y++ +M+KMGF  RWI  IM CV SV+F VL+NG P+  F P+RGLRQG
Subjt:  QSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQG

Query:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK
        DP+SPYLFL+C EGLS+ + K ER     G+      P I+HLF+ADDS++F +A   +C  + ++L  YE  SGQ +N +K++   S NT  +     K
Subjt:  DPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIK

Query:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV
           G Q  +   +YLGLP  IGR+KK+ F+ IKD+V
Subjt:  ETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKDRV

SwissProt top hits (e value, %identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein  e value: 8.1e-31  %identity: 28.57
Query:  VVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEF---LNGEGSL-TRINKTYIALIPKI-KDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLD
        ++ ++   K+ GPDG  A FYQ+Y +    ++  F L+    +  EG L     +  I LIPK  +D +  ++FRPISL ++  KI+ K LANR++  + 
Subjt:  VVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEF---LNGEGSL-TRINKTYIALIPKI-KDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLD

Query:  KIISPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPN
        K+I   Q  F+PG     N       I  I N+ K K  V+ + +D  KA+D+++  ++ K + K+G +  ++  I +  +  T  ++LNG   E F   
Subjt:  KIISPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPN

Query:  RGLRQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSR
         G RQG PLSP LF I  E L+ ++ + + ++   G+++      +    +ADD +++ +      + + +++  + + SG  IN +KS AF+ + N   
Subjt:  RGLRQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSR

Query:  DQVIKIKETLGVQHKESLGQYLGLPSQIGRSKKEVF
        +  I  +    +  K    +YLG+  Q+ R  K++F
Subjt:  DQVIKIKETLGVQHKESLGQYLGLPSQIGRSKKEVF

P08548 LINE-1 reverse transcriptase homolog  e value: 1.1e-30  %identity: 28.91
Query:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSL-TRINKTYIALIPKI-KDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIIS
        ++N+   K+ GPDG  + FYQ + + +   + +   + +  EG L     +  I LIPK  KDP+  +++RPISL ++  KI+ K L NR++  + KII 
Subjt:  VKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSL-TRINKTYIALIPKI-KDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIIS

Query:  PSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLR
          Q  F+PG     N       I  I NK K K+ ++ L +D  KA+D ++  ++ + ++K+G    ++  I +     T  ++LNG+  + F    G R
Subjt:  PSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLR

Query:  QGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSRDQVI
        QG PLSP LF I  E L+ ++ + + ++   G+ I +    I    +ADD +++ +        +  ++  Y   SG  IN  KS AF+ + N   ++ +
Subjt:  QGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSRDQVI

Query:  K--IKETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKD
        K  I  T+  +  + LG YL          K+V D+ K+
Subjt:  K--IKETLGVQHKESLGQYLGLPSQIGRSKKEVFDIIKD

P11369 LINE-1 retrotransposable element ORF2 protein  e value: 4.3e-32  %identity: 29.64
Query:  VVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSL-TRINKTYIALIPK-IKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKII
        V+ ++   K+ GPDG  A FYQ + + +   +       +  EG+L     +  I LIPK  KDP+ +++FRPISL ++  KI+ K LANR++  +  II
Subjt:  VVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSL-TRINKTYIALIPK-IKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKII

Query:  SPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGL
         P Q  F+PG     N       IH I NK K K  ++ + LD  KA+D+++  ++ K++E+ G    ++  I +        + +NG   E      G 
Subjt:  SPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGL

Query:  RQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSRDQV
        RQG PLSPYLF I  E L+ ++ + + ++   G++I      I+ L  ADD +++    +   R +  +++++    G  IN  KS AF+ + N   ++ 
Subjt:  RQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKS-AFMVSPNTSRDQV

Query:  IKIKETLGVQHKESLGQYLGLPSQIGRSKKEVFD
         +I+ET       +  +YLG+   + +  K+++D
Subjt:  IKIKETLGVQHKESLGQYLGLPSQIGRSKKEVFD

P14381 Transposon TX1 uncharacterized 149 kDa protein  e value: 9.0e-30  %identity: 31.99
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLE-FLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQS
        M  +K+ G DG+   F+Q +WD +G D      E F  GE  L+   +  ++L+PK  D   +K++RP+SL S  YKI+AKA++ RLKSVL ++I P QS
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLE-FLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQS

Query:  AFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDP
          VPGR I DN  +  + +H     R+    +  L LD  KA+DRV+  Y+   ++   F  +++G + +   S    V +N          RG+RQG P
Subjt:  AFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDP

Query:  LSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFK-----AREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRD
        LS  L+ +  E     L K       TGL +      +    YADD +L  +      R ++C+ +      Y  AS   IN+ KS+ ++  +   D
Subjt:  LSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFK-----AREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRD

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM  e value: 4.8e-15  %identity: 29.08
Query:  SKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSAFVP
        S + GPDGI     ++    +   + +  L   N   S+ R+ +T    IPK       +DFRPIS+ SV+ + +   LA RL S ++    P Q  F+P
Subjt:  SKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSAFVP

Query:  GRLISDN-TVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPLSP
            +DN T++     H+ K+ R          LD+SKA+D +    I   +   G    ++  + +  E     +  +G   EEF P RG++QGDPLSP
Subjt:  GRLISDN-TVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPLSP

Query:  YLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAR
         LF +  + L  +L          G ++ N     N   +ADD +LF + R
Subjt:  YLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAR

Arabidopsis top hits (e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein  e value: 5.3e-09  %identity: 39.74
Query:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKII
        M  +KA GPD   A F+ + W +V         EF      L R N T I LIPK+     +  FRP+S C+V+YKII
Subjt:  MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKII
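
The %identity value for this hit can be reproduced from the alignment block itself. The sketch below is an illustration, not the database's own code: it assumes that letters on the middle line mark identical residues (with + marking similar ones and spaces marking mismatches) and that identity is identical positions divided by the number of alignment columns. The function name percent_identity is hypothetical.

```python
# Query and middle line copied from the AT1G43760.1 alignment above.
QUERY = "MQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKII"
MIDLINE = "M  +KA GPD   A F+ + W +V         EF      L R N T I LIPK+     +  FRP+S C+V+YKII"


def percent_identity(query: str, midline: str) -> float:
    """Identical positions (letters on the middle line) over alignment columns."""
    identical = sum(1 for ch in midline if ch.isalpha())
    # The query row spans every alignment column, including any gap characters.
    columns = len(query)
    return round(100.0 * identical / columns, 2)


print(percent_identity(QUERY, MIDLINE))
```

Counting letters rather than comparing positions keeps the sketch robust to whitespace lost when the alignment was copied as text.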

AT4G20520.1 RNA binding;RNA-directed DNA polymerases  e value: 2.4e-14  %identity: 39.53
Query:  LANRLKSVLDKIISPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKI
        +  RLK ++  +I P+Q++F+PGR+ +DN V   E +H+++ K KG +G + LKLD+ KAYDR+ W Y++  +   GF   W+ +I
Subjt:  LANRLKSVLDKIISPSQSAFVPGRLISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)  e value: 2.2e-15  %identity: 48.53
Query:  LLNGIPREEFRPNRGLRQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDS
        ++NG P+    P+RGLRQGDPLSPYLF++C E LS    +++      G+R++N  P INHL +ADD+
Subjt:  LLNGIPREEFRPNRGLRQGDPLSPYLFLICAEGLSSSLNKSERMRDFTGLRINNYCPSINHLFYADDS


Sequences
CDS sequence
ATGGTAGTTAAGAATATGCAGCCATCAAAGGCTCTGGGACCCGATGGAATTCAAGCAATTTTTTATCAAAAGTACTGGGACATAGTTGGGGCAGATGTGTGTGATTTCTG
TTTAGAGTTCCTAAATGGTGAAGGTTCTTTAACTCGAATCAACAAAACATATATTGCTTTGATTCCAAAGATTAAAGATCCTAGCACCATGAAAGACTTTCGTCCCATAA
GTCTATGCTCAGTTATTTACAAAATAATTGCTAAAGCTCTAGCCAACAGACTGAAATCGGTGCTAGACAAGATTATTTCTCCAAGCCAATCGGCATTCGTCCCAGGGAGA
CTCATATCAGATAATACAGTGATAGGGTTTGAGTGTATCCATGCTATTAAGAACAAAAGGAAAGGTAAAGAGGGAGTTGTTGCGCTCAAATTGGACATGAGCAAAGCATA
TGACAGAGTGGAGTGGATATATATTCAAAAGATCATGGAAAAGATGGGCTTCAACAGTCGATGGATTGGTAAAATCATGAGTTGTGTGGAATCAGTGACCTTCCAAGTTC
TGCTTAATGGCATTCCTCGAGAAGAATTCAGACCGAATAGAGGGCTAAGACAAGGGGACCCATTGTCTCCTTATCTGTTCTTGATTTGTGCTGAAGGTTTATCGAGTAGC
CTAAACAAATCAGAGCGTATGAGGGATTTCACAGGTTTGCGTATTAATAATTACTGCCCTTCTATTAATCATCTTTTTTATGCTGACGATAGTCTCTTGTTCTTTAAAGC
TAGAGAGAAAGATTGCAGGGCTATAACAAGGATTCTTCATACCTATGAGAGAGCATCGGGTCAAACCATTAATTTTGAGAAGTCAGCCTTCATGGTTAGCCCAAACACAA
GTAGAGATCAAGTGATTAAGATCAAGGAGACTCTGGGAGTGCAACATAAGGAAAGCTTGGGGCAGTACCTGGGCCTTCCTTCTCAAATTGGCAGAAGCAAAAAGGAAGTT
TTTGATATTATAAAAGATCGTGTTTGA
mRNA sequence
ATGGTAGTTAAGAATATGCAGCCATCAAAGGCTCTGGGACCCGATGGAATTCAAGCAATTTTTTATCAAAAGTACTGGGACATAGTTGGGGCAGATGTGTGTGATTTCTG
TTTAGAGTTCCTAAATGGTGAAGGTTCTTTAACTCGAATCAACAAAACATATATTGCTTTGATTCCAAAGATTAAAGATCCTAGCACCATGAAAGACTTTCGTCCCATAA
GTCTATGCTCAGTTATTTACAAAATAATTGCTAAAGCTCTAGCCAACAGACTGAAATCGGTGCTAGACAAGATTATTTCTCCAAGCCAATCGGCATTCGTCCCAGGGAGA
CTCATATCAGATAATACAGTGATAGGGTTTGAGTGTATCCATGCTATTAAGAACAAAAGGAAAGGTAAAGAGGGAGTTGTTGCGCTCAAATTGGACATGAGCAAAGCATA
TGACAGAGTGGAGTGGATATATATTCAAAAGATCATGGAAAAGATGGGCTTCAACAGTCGATGGATTGGTAAAATCATGAGTTGTGTGGAATCAGTGACCTTCCAAGTTC
TGCTTAATGGCATTCCTCGAGAAGAATTCAGACCGAATAGAGGGCTAAGACAAGGGGACCCATTGTCTCCTTATCTGTTCTTGATTTGTGCTGAAGGTTTATCGAGTAGC
CTAAACAAATCAGAGCGTATGAGGGATTTCACAGGTTTGCGTATTAATAATTACTGCCCTTCTATTAATCATCTTTTTTATGCTGACGATAGTCTCTTGTTCTTTAAAGC
TAGAGAGAAAGATTGCAGGGCTATAACAAGGATTCTTCATACCTATGAGAGAGCATCGGGTCAAACCATTAATTTTGAGAAGTCAGCCTTCATGGTTAGCCCAAACACAA
GTAGAGATCAAGTGATTAAGATCAAGGAGACTCTGGGAGTGCAACATAAGGAAAGCTTGGGGCAGTACCTGGGCCTTCCTTCTCAAATTGGCAGAAGCAAAAAGGAAGTT
TTTGATATTATAAAAGATCGTGTTTGA
Protein sequence
MVVKNMQPSKALGPDGIQAIFYQKYWDIVGADVCDFCLEFLNGEGSLTRINKTYIALIPKIKDPSTMKDFRPISLCSVIYKIIAKALANRLKSVLDKIISPSQSAFVPGR
LISDNTVIGFECIHAIKNKRKGKEGVVALKLDMSKAYDRVEWIYIQKIMEKMGFNSRWIGKIMSCVESVTFQVLLNGIPREEFRPNRGLRQGDPLSPYLFLICAEGLSSS
LNKSERMRDFTGLRINNYCPSINHLFYADDSLLFFKAREKDCRAITRILHTYERASGQTINFEKSAFMVSPNTSRDQVIKIKETLGVQHKESLGQYLGLPSQIGRSKKEV
FDIIKDRV
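
The relationship between the CDS and the protein sequence above can be checked with a short translation routine. This is a minimal sketch assuming the standard genetic code (the record does not state which table was used); translate is a hypothetical helper, and only the first and last stretches of the CDS are used as inputs to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalize case
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 33 nt and last 27 nt of the CDS listed in this record.
print(translate("ATGGTAGTTAAGAATATGCAGCCATCAAAGGCT"))
print(translate("TTTGATATTATAAAAGATCGTGTTTGA"))
```

Passing the full CDS (with or without line breaks) to translate should, under the same assumption, give the complete protein sequence.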