CuGenDBv2

Lag0036291 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0036291
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr3:43504445..43507210
RNA-Seq Expression: Lag0036291
Synteny: Lag0036291
Gene Ontology terms: NA
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043502 - DNA/RNA polymerase superfamily
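
As a quick sanity check on the genome location listed for this gene, the following minimal sketch parses a `chrom:start..end` string and reports the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `parse_location` and `span_length` are illustrative names and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(location):
    """Length of the interval in bp (assumption: 1-based, inclusive ends)."""
    _, start, end = parse_location(location)
    return end - start + 1


print(span_length("chr3:43504445..43507210"))
```

Under that assumption the gene model spans 2,766 bp on chr3.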


Homology
GenBank top hits (e value, % identity, alignment)
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 1.2e-77; % identity: 39.81)
Query:  EYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSK
        E D+     H  A+  RR+N+I G+ +  G+ +   E +E     +F  +FTSSNPS   I+ ALK +  KV+  MN  +  PFT  +I + +++M P+K
Subjt:  EYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSK

Query:  APGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGR
        APGP G PA F+QK+W  VG+     CL I N + ++ + N T+IA IPKV  P+ V +FRPISLCNV Y+I+AK + NR+K +L   IS NQSAF+P R
Subjt:  APGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGR

Query:  SIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCP------
         I DN+I+G+ECLH I+  +  RNG VALKLD+SKAYDRVEW FLEQ M  +GF   W+ LIM C+ T  FS+L+NG P G I  + G  + CP      
Subjt:  SIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCP------

Query:  -----------------------------EISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------VVN
                                      I+HL FADDSL+F KAS+     L+ +   Y  AS                                VV 
Subjt:  -----------------------------EISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------VVN

Query:  NLEKYLGVPSSFTRNRCDDFKAIKQRM
          EKYLG+P    RN+   FK +K ++
Subjt:  NLEKYLGVPSSFTRNRCDDFKAIKQRM

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia] (e value: 8.8e-76; % identity: 35.16)
Query:  ITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDF
        I A +  I  ++T  +N+++LAP+T+ EIE  I QM P+KA GP GFPALFYQ YW+ VG  T+  CL   N+   IK WN TYIA IPK+  P+ +SDF
Subjt:  ITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDF

Query:  RPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVP
        RPISLCNV YKII+K + NR+K V+   IS+ QSAFVP R+I DN+I+GHECLH+I S + G  G  ALKLD+SKA+DRVEW +LE IM  +GF+  W+ 
Subjt:  RPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVP

Query:  LIMECVQTAKFSILLNGVPTGNI-----ILKG-----------------------------GKHCPE----ISHLFFADDSLIFCKASIEQIWVLRTLLK
         I++C+ T +FSI LNG P G       I +G                             G H  E    I+HL FADDSLIF ++   +   LR LL 
Subjt:  LIMECVQTAKFSILLNGVPTGNI-----ILKG-----------------------------GKHCPE----ISHLFFADDSLIFCKASIEQIWVLRTLLK

Query:  TYEDAS--------------------------------VVNNLEKYLGVPSSFTRNRCDD----------------------------------------
        +Y  AS                                +V++   YLG+PS FTR R +                                         
Subjt:  TYEDAS--------------------------------VVNNLEKYLGVPSSFTRNRCDD----------------------------------------

Query:  --------FKAIKQRMLKEKC---TGRNGQTC-----------VVLKRYEEENRNGQSTRFFEDPWLPKAITFKPLQKQGSIVQGEMMVFEFFTPSLGWD
                 K +K +  K+        N ++            +++K       NG + + F DPWLP+  TFKPL+     +  +  V  F T    WD
Subjt:  --------FKAIKQRMLKEKC---TGRNGQTC-----------VVLKRYEEENRNGQSTRFFEDPWLPKAITFKPLQKQGSIVQGEMMVFEFFTPSLGWD

Query:  LQKLRTVVVEHDVKCIVAIPISMTNMDDS
        +  +       D   I+++PIS  N+ DS
Subjt:  LQKLRTVVVEHDVKCIVAIPISMTNMDDS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] (e value: 5.5e-78; % identity: 39.38)
Query:  HKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPA
        H  A+  RR+N I GI + NG+     E + + +V+YF  +++SS P+   I+  L  I   VT  MN  ++  FTR EIE  +NQMHP+KAPGP G  A
Subjt:  HKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPA

Query:  LFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVG
        +F+QKYWN VG+  +   L++ NS  S+   N T I  +PK+ +P  +SDFRPISLCNV YK+I+KV+ NR+K +L   ISENQSAF+ GR I DN++V 
Subjt:  LFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVG

Query:  HECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG---------------------
         E +H ++ K++G+ G+ A+KLDMSKAYDRVEW F++Q+M  +GFH  W+ L+M C+ +  +SIL+NG   G+I    G                     
Subjt:  HECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG---------------------

Query:  -----------------KHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS------------VVNNL--------------------EKYLGV
                         + CP+I+HLFFADDSL+FCKA+ ++   L  +L+ YEDAS              NN                     +KYLG+
Subjt:  -----------------KHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS------------VVNNL--------------------EKYLGV

Query:  PSSFTRNRCDDFKAIKQRM
        PS   +++ + F  +K+R+
Subjt:  PSSFTRNRCDDFKAIKQRM

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis] (e value: 4.5e-80; % identity: 41.19)
Query:  EYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSK
        E D+     H  A+  +R+N++ GI N  G  L +R+++E+   ++F  +FT+S+P+   I A L+ +  KVTP MN ++  PFT  ++E+ +  M P+K
Subjt:  EYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSK

Query:  APGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGR
        APGP G PA F+QK+W  V +  I  CL + N + +  T N T+IA IPK+++P+ VSD+RPISLCNV Y+++AK + NRMK +L   IS  QSAF+P R
Subjt:  APGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGR

Query:  SIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG-------------------
         I DN+I+G+ECLH I+  +  +NG VALKLD+SKAYDRVEW FLEQ ML +GF  + V LIM CV +  FS+L+NGVP G                   
Subjt:  SIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG-------------------

Query:  ----------------NIILKGGKHCPEIS--HLFFADDSLIFCKASIEQI-----WVLRTLLKTYEDASVVNNLEKYLGVPSSFTRNRCDDFKAIKQRM
                        N +++G K   ++S  HL FADDSLIF +AS+ +         R  ++   + +VV+  EKYLG+PS   R +   F  IK ++
Subjt:  ----------------NIILKGGKHCPEIS--HLFFADDSLIFCKASIEQI-----WVLRTLLKTYEDASVVNNLEKYLGVPSSFTRNRCDDFKAIKQRM

Query:  LKE
        L +
Subjt:  LKE

XP_030969743.1 uncharacterized protein LOC115990020 [Quercus lobata] (e value: 3.3e-75; % identity: 38.22)
Query:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP
        DQ     H  AT  +RQN IKG+ + NG   S+ +        ++  +F SSNP  + I   +  +Q  VT SMN  +  P++  E+E+ I  M P KAP
Subjt:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP

Query:  GPYGFPALFYQKYWNEVG-DIT--ILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPG
        GP G P LFYQ YW++V  DIT  +L+CL   NS   +K+ N T+I  IPKV +P+ V++FRPISLCNV YKI++K + NR+K +L   IS+ QSAF+  
Subjt:  GPYGFPALFYQKYWNEVG-DIT--ILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPG

Query:  RSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNII---------------
        R I DN+++  E LH +K+   G+ G++ALKLDMSKAYDRVEW FLE+++L +GF  +WV LIMEC+ T  +SIL+NG P G I                
Subjt:  RSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNII---------------

Query:  --------------------LKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------V
                            ++G   C   P+++HLFFADD L+FC++S+E+   ++ LL  YE+AS                                 
Subjt:  --------------------LKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------V

Query:  VNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCTGRNGQTCVV
        +++ EKYLG+PS   RN+   F  IK+R+       KEK   + G+  ++
Subjt:  VNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCTGRNGQTCVV

TrEMBL top hits (e value, % identity, alignment)
A0A2N9E147 Reverse transcriptase domain-containing protein (e value: 5.4e-79; % identity: 39.39)
Query:  IGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQ
        +G   E D+     H +A   RR NEI  + +  G++++  E +EQ S  YF N+FT+SNPS  +I + +  +   V+P MN+ +L P+T+ E++  + Q
Subjt:  IGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQ

Query:  MHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSA
        M PSKAPGP G  ALF+Q+YW+ +G       L+  +S+R +++ N T+I  IPKV++P  +S FRPISLCNV YK+I+KV  NR+K  L   IS++QSA
Subjt:  MHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSA

Query:  FVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI------------
        FV GR I DN+++  E LH +K+KRKG    +A KLDMSKAYDR+EW +L+ +ML +GF   WV L+MECV T  FSILLNG P G+I            
Subjt:  FVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI------------

Query:  -----------------------------ILKGGKHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------
                                     I +GG   P  SHLFFADDS++F KAS+E+  VL+ +LK YEDAS                          
Subjt:  -----------------------------ILKGGKHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS--------------------------

Query:  ------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCTGRNGQTCVV
                N LEKYLG+P    R++   F+ IK R+       KEK   + G+  ++
Subjt:  ------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCTGRNGQTCVV

A0A2N9EX83 Reverse transcriptase domain-containing protein (e value: 4.5e-78; % identity: 38.93)
Query:  IYRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPF
        +++++S+ L+L       D+     H  AT  +R+N I GI +  G   S  E++E + V Y+ ++FT+S P        L  +   +T  MN ++ A F
Subjt:  IYRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPF

Query:  TRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWV
        T  E+E  +NQM P KAPGP G   +FYQKYWN VG     + L        +K  N T+I  IPKV +P+ V DFRPISLCNV YKIIAKV+ NR+K +
Subjt:  TRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWV

Query:  LQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG---
        L   ISE+QSAFVPGR I DNI++  E LH +K  +  + G++ALKLDMSKAYDRVEW FLE+IML +GF  +WV +IMECV+T  +S+L+NG P G   
Subjt:  LQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG---

Query:  -----------------------NIIL---------------KGGKHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS---------------
                               N +L               +GG   P++SHLFFADDS++FC+AS+++   ++ +L+TYE AS               
Subjt:  -----------------------NIIL---------------KGGKHCPEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS---------------

Query:  -----------------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM
                         V+   E YLG+PS   R +   F  +K R+
Subjt:  -----------------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM

A0A2N9FNH6 Reverse transcriptase domain-containing protein (e value: 1.7e-80; % identity: 41.26)
Query:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP
        D+     H  A+  +++N I G+ + NG   + R  M      YF N+F +SNPS   I   +  +   VT  MND +LAPFT  EI   + QMHP+KAP
Subjt:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP

Query:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI
        GP G  A+FYQK+W+ VGD      LE  +S + +K+ N T+I  IPK+  P++++ FRPISLCNV YKII+KV+ NR+K VL   IS+NQSAFVPGR I
Subjt:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI

Query:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI-------------------
         DNI+V  E LH +K+KRKGR+  +A+KLDMSKAYDRVEW FLE +M+ +GF   WV LIM+C+ +  +S++LNG PTG I                   
Subjt:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI-------------------

Query:  ----------------ILKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDA--------------------------------SVVNN
                        I++G   C   P+ISHLFFADDSL+FC+A++ +   L  +L TYE A                                S   +
Subjt:  ----------------ILKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDA--------------------------------SVVNN

Query:  LEKYLGVPSSFTRNRCDDFKAIKQRMLKE
        L KYLG+P    R +   F  IKQ++ K+
Subjt:  LEKYLGVPSSFTRNRCDDFKAIKQRMLKE

A0A2N9HE46 Uncharacterized protein (e value: 2.0e-78; % identity: 39.85)
Query:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP
        DQ     H  AT  +R+N++  + + NG   + +  + Q  V Y++++FT+SNP+   +   +++I   VT +MN  +++ FT  E+ + + QM P KAP
Subjt:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP

Query:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI
        GP G P +FYQ YW+ +G+  I   L   NS + ++  N TY+  IPKV +P+VV++FRPISLCNV YKII+KV+ NR+K +L   +SE+QSAFVPGR I
Subjt:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI

Query:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGGKHCPEISHLFFADD
         DNI+V  E LH ++ ++ G+ G +ALKLDMSKAYDRVEW +L+ +M  +GFH  W+ +IMEC+ T  +SIL      G  I +GG   P+I+HLFFADD
Subjt:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGGKHCPEISHLFFADD

Query:  SLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCT
        SL+FCKA  + +  ++ +L  YE AS                                 +   EKYLG+PS   R +   F  IK+R+       KEK  
Subjt:  SLIFCKASIEQIWVLRTLLKTYEDAS--------------------------------VVNNLEKYLGVPSSFTRNRCDDFKAIKQRM------LKEKCT

Query:  GRNGQTCVV
         + G+  ++
Subjt:  GRNGQTCVV

A0A2N9J3U0 Reverse transcriptase domain-containing protein (e value: 1.7e-80; % identity: 41.26)
Query:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP
        D+     H  A+  +++N I G+ + NG   + R  M      YF N+F +SNPS   I   +  +   VT  MND +LAPFT  EI   + QMHP+KAP
Subjt:  DQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAPFTRCEIEKVINQMHPSKAP

Query:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI
        GP G  A+FYQK+W+ VGD      LE  +S + +K+ N T+I  IPK+  P++++ FRPISLCNV YKII+KV+ NR+K VL   IS+NQSAFVPGR I
Subjt:  GPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSI

Query:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI-------------------
         DNI+V  E LH +K+KRKGR+  +A+KLDMSKAYDRVEW FLE +M+ +GF   WV LIM+C+ +  +S++LNG PTG I                   
Subjt:  FDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNI-------------------

Query:  ----------------ILKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDA--------------------------------SVVNN
                        I++G   C   P+ISHLFFADDSL+FC+A++ +   L  +L TYE A                                S   +
Subjt:  ----------------ILKGGKHC---PEISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDA--------------------------------SVVNN

Query:  LEKYLGVPSSFTRNRCDDFKAIKQRMLKE
        L KYLG+P    R +   F  IKQ++ K+
Subjt:  LEKYLGVPSSFTRNRCDDFKAIKQRMLKE

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 3.2e-20; % identity: 27.49)
Query:  RRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKY
        R +N+I  I N  GD  +   +++ +   Y+ +++ +   + E +   L    + ++     + +  P T  EI  +IN +   K+PGP GF A FYQ+Y
Subjt:  RRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKY

Query:  WNEVGDITILNCLEIFNS--KRSI--KTWNDTYIAFIPKVS-DPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGH
          E+    +   L++F S  K  I   ++ +  I  IPK   D     +FRPISL N+  KI+ K++ NR++  ++  I  +Q  F+PG   + NI    
Subjt:  WNEVGDITILNCLEIFNS--KRSI--KTWNDTYIAFIPKVS-DPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGH

Query:  ECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCPEISHLFF
          +  I ++ K +N  V + +D  KA+D+++  F+ + +  +G    ++ +I         +I+LNG       LK G  + CP +S L F
Subjt:  ECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCPEISHLFF

P08548 LINE-1 reverse transcriptase homolog (e value: 3.2e-20; % identity: 23.85)
Query:  RRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKY
        R ++ I  I NGN +  +   ++++    Y+  +++    + + I   L+   + +++    + +  P +  EI   I  +   K+PGP GF + FYQ +
Subjt:  RRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKY

Query:  WNEVGDITILNCLEIFNSKRSI-KTWNDTYIAFIPKV-SDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECL
          E+  I +LN  +    +  +  T+ +  I  IPK   DP    ++RPISL N+  KI+ K++ NR++  ++  I  +Q  F+PG   + NI      +
Subjt:  WNEVGDITILNCLEIFNSKRSI-KTWNDTYIAFIPKV-SDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECL

Query:  HSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCP-------------------
          I +K K ++  + L +D  KA+D ++  F+ + +  IG    ++ LI         +I+LNGV   +  L+ G  + CP                   
Subjt:  HSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCP-------------------

Query:  ------------EISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS
                    EI    FADD +++ + + +    L  ++K Y + S
Subjt:  ------------EISHLFFADDSLIFCKASIEQIWVLRTLLKTYEDAS

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.5e-22; % identity: 28.98)
Query:  IKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVG
        I  I N  GD  +  E+++ +  +++  ++++   + + +   L   QV K+     D + +P +  EIE VIN +   K+PGP GF A FYQ +  ++ 
Subjt:  IKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQV-KVTPSMNDKILAPFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVG

Query:  DITILNCL--EIFNSKRSIKTWNDTYIAFIPK-VSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIK
         I IL+ L  +I        ++ +  I  IPK   DP  + +FRPISL N+  KI+ K++ NR++  ++  I  +Q  F+PG   + NI      +H I 
Subjt:  DITILNCL--EIFNSKRSIKTWNDTYIAFIPK-VSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIK

Query:  SKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCPEISHLF
        +K K +N  + + LD  KA+D+++  F+ +++   G    ++ +I         +I +NG     I LK G  + CP   +LF
Subjt:  SKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGG--KHCPEISHLF

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.2e-22; % identity: 26.33)
Query:  KGPIYRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKIL
        +G   R + Q L  + +G  +    Y + K    G R+ +I  +   +G  L   E +   + +++ N+F S +P +      L D    V+    +++ 
Subjt:  KGPIYRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKIL

Query:  APFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRM
         P T  E+ + +  M  +K+PG  G    F+Q +W+ +G        E F       +     ++ +PK  D +++ ++RP+SL +  YKI+AK +  R+
Subjt:  APFTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRM

Query:  KWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG
        K VL + I  +QS  VPGR+IFDN+ +  + LH   ++R G +    L LD  KA+DRV+  +L   +    F P +V  +     +A+  + +N   T 
Subjt:  KWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTG

Query:  NIILKGG--KHCPEISHLF
         +    G  + CP    L+
Subjt:  NIILKGG--KHCPEISHLF

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 6.0e-14; % identity: 31.05)
Query:  YRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTS-SNPSTETITAALKDIQ-VKVTPSMNDKILAP
        YR+KS+  +L     + D      HK     + +N IK +   +  R+     +++  V Y+T++  S S+  T      +KDI   +   ++  ++ A 
Subjt:  YRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTS-SNPSTETITAALKDIQ-VKVTPSMNDKILAP

Query:  FTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKII
         +  EI   +  M  +KAPGP  F A F+ + W  V D TI    E F +   +K +N T I  IPKV+    +S FRP+S C V YKII
Subjt:  FTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKII

AT4G20520.1 RNA binding; RNA-directed DNA polymerases (e value: 2.4e-15; % identity: 41.86)
Query:  VVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLI
        +V R+K ++ + I   Q++F+PGR   DNI+   E +HS++ ++KG  GW+ LKLD+ KAYDR+ W +LE  ++  GF   W+P I
Subjt:  VVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLI
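
The % identity values in these tables can be roughly cross-checked against the alignment blocks. The sketch below assumes the unlabeled middle line follows the BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is the number of identical positions divided by the number of aligned columns. `percent_identity` is an illustrative helper, not a CuGenDB function.

```python
def percent_identity(query, midline):
    """Identical positions divided by aligned columns, as a percentage."""
    columns = len(query.strip())  # gap characters '-' count as columns
    # Assumption: letters on the middle line mark identical residues.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / columns, 2)


# Query and middle line of the AT4G20520.1 alignment above.
query = (
    "VVNRMKWVLQDFISENQSAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWV"
    "ALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLI"
)
midline = (
    "+V R+K ++ + I   Q++F+PGR   DNI+   E +HS++ ++KG  GW+"
    " LKLD+ KAYDR+ W +LE  ++  GF   W+P I"
)

print(percent_identity(query, midline))
```

For this block the middle line carries 36 letters over 86 aligned columns, which gives the 41.86 reported for AT4G20520.1. Longer hits are split across several blocks, so the counts would need to be summed over all blocks before dividing.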


Sequences
CDS sequence
ATGGGATGTCTTTCACCCCACACCAAGGGTCCTATTTATAGGAAAAAAAGCCAAGACCTTTTCCTAATAGGAAAAGGTCATGAATATGATCAAATCTGTTATCTTATTCA
TAAGCATGCCACCATGGGGAGACGTCAGAATGAGATAAAAGGCATCAGTAATGGGAATGGGGATAGGCTTTCGAAAAGAGAGGATATGGAACAATCTTCAGTAACATATT
TTACTAATATGTTTACTTCGAGTAACCCCTCTACTGAAACTATCACTGCTGCTCTTAAGGATATTCAGGTGAAAGTAACTCCTAGCATGAATGACAAGATCCTTGCTCCA
TTCACTAGATGTGAGATTGAAAAAGTGATCAACCAAATGCATCCTTCAAAAGCTCCTGGACCTTATGGATTTCCTGCATTATTCTACCAAAAATATTGGAATGAGGTTGG
TGATATTACTATTCTTAATTGTTTGGAAATTTTTAATTCGAAACGGTCGATAAAAACATGGAATGATACATATATTGCTTTTATTCCTAAAGTGAGTGACCCCAAAGTAG
TATCTGATTTTCGACCCATTAGTTTATGTAATGTGAAATATAAGATTATTGCAAAAGTGGTTGTTAATCGTATGAAATGGGTTCTTCAAGATTTTATATCTGAAAATCAG
TCTGCTTTTGTACCTGGGAGATCCATATTTGATAACATTATTGTGGGTCATGAATGTTTACATTCCATTAAGTCGAAGCGAAAAGGTAGGAATGGTTGGGTTGCTTTGAA
ATTAGATATGAGTAAAGCGTATGATCGAGTCGAATGGTGTTTTCTGGAGCAAATTATGTTGATTATTGGTTTCCATCCAAATTGGGTGCCCTTAATTATGGAGTGTGTGC
AGACTGCGAAATTTTCCATTTTGCTTAATGGAGTTCCTACAGGAAATATCATCCTCAAAGGGGGTAAACATTGTCCAGAGATTTCTCATCTCTTCTTTGCAGATGATAGT
CTCATTTTCTGTAAAGCATCCATTGAACAAATATGGGTGTTGCGGACGTTGTTGAAGACATATGAGGATGCTTCAGTGGTGAATAATTTAGAAAAATATCTGGGTGTCCC
ATCATCGTTCACCAGGAATCGTTGTGATGACTTTAAAGCCATCAAGCAACGAATGCTAAAAGAAAAATGCACTGGAAGAAATGGGCAGACTTGTGTCGTCCTAAAGAGGT
ATGAGGAAGAGAATAGGAATGGTCAATCCACCAGGTTTTTTGAGGACCCTTGGCTCCCAAAGGCAATCACGTTTAAACCTTTGCAGAAACAGGGATCGATTGTGCAGGGG
GAGATGATGGTATTTGAGTTTTTTACTCCATCCTTAGGATGGGATTTGCAAAAACTCAGGACGGTGGTTGTAGAACATGATGTAAAGTGCATTGTTGCTATCCCTATTAG
TATGACTAATATGGATGATAGTGGATTTGGCGTTACACCTCCCATGGTTCTCACACGTCGATCAAAATATATTTGTGATAGCCTTTTTAGTCGAGTGGACAATAGCATAT
CAGTGGCTCATAATTTTGCAGATCGTATTATTTGGTTGGTTGCTCGGCTATGTAAGGAGGAATTTGAAAAAGCTTGTATAGCCTTTTGGGCTATATGGAATGATCACAAT
TCTTATACCAGGAAAATGACAGTAATGAAGTGGACCCAACGCTGTGAATGGATCCAATGTTGTTGGCTCGAAACCCGACCTAAAGTTGTGGAGATTATGGGGCGAAATGG
ATTAGAAGATGATAGACATGTTAGTCCTATTGCTTACCACGTATATACGGATGCTGCGGTGAATAGGAACAATGAAGGAACTGGTTTAGGAGTAGTTGTGCTTCAACCAT
ATGGGTCTCCTTCTCGCTGCCATGGAGATGAAGGATGA
mRNA sequence
ATGGGATGTCTTTCACCCCACACCAAGGGTCCTATTTATAGGAAAAAAAGCCAAGACCTTTTCCTAATAGGAAAAGGTCATGAATATGATCAAATCTGTTATCTTATTCA
TAAGCATGCCACCATGGGGAGACGTCAGAATGAGATAAAAGGCATCAGTAATGGGAATGGGGATAGGCTTTCGAAAAGAGAGGATATGGAACAATCTTCAGTAACATATT
TTACTAATATGTTTACTTCGAGTAACCCCTCTACTGAAACTATCACTGCTGCTCTTAAGGATATTCAGGTGAAAGTAACTCCTAGCATGAATGACAAGATCCTTGCTCCA
TTCACTAGATGTGAGATTGAAAAAGTGATCAACCAAATGCATCCTTCAAAAGCTCCTGGACCTTATGGATTTCCTGCATTATTCTACCAAAAATATTGGAATGAGGTTGG
TGATATTACTATTCTTAATTGTTTGGAAATTTTTAATTCGAAACGGTCGATAAAAACATGGAATGATACATATATTGCTTTTATTCCTAAAGTGAGTGACCCCAAAGTAG
TATCTGATTTTCGACCCATTAGTTTATGTAATGTGAAATATAAGATTATTGCAAAAGTGGTTGTTAATCGTATGAAATGGGTTCTTCAAGATTTTATATCTGAAAATCAG
TCTGCTTTTGTACCTGGGAGATCCATATTTGATAACATTATTGTGGGTCATGAATGTTTACATTCCATTAAGTCGAAGCGAAAAGGTAGGAATGGTTGGGTTGCTTTGAA
ATTAGATATGAGTAAAGCGTATGATCGAGTCGAATGGTGTTTTCTGGAGCAAATTATGTTGATTATTGGTTTCCATCCAAATTGGGTGCCCTTAATTATGGAGTGTGTGC
AGACTGCGAAATTTTCCATTTTGCTTAATGGAGTTCCTACAGGAAATATCATCCTCAAAGGGGGTAAACATTGTCCAGAGATTTCTCATCTCTTCTTTGCAGATGATAGT
CTCATTTTCTGTAAAGCATCCATTGAACAAATATGGGTGTTGCGGACGTTGTTGAAGACATATGAGGATGCTTCAGTGGTGAATAATTTAGAAAAATATCTGGGTGTCCC
ATCATCGTTCACCAGGAATCGTTGTGATGACTTTAAAGCCATCAAGCAACGAATGCTAAAAGAAAAATGCACTGGAAGAAATGGGCAGACTTGTGTCGTCCTAAAGAGGT
ATGAGGAAGAGAATAGGAATGGTCAATCCACCAGGTTTTTTGAGGACCCTTGGCTCCCAAAGGCAATCACGTTTAAACCTTTGCAGAAACAGGGATCGATTGTGCAGGGG
GAGATGATGGTATTTGAGTTTTTTACTCCATCCTTAGGATGGGATTTGCAAAAACTCAGGACGGTGGTTGTAGAACATGATGTAAAGTGCATTGTTGCTATCCCTATTAG
TATGACTAATATGGATGATAGTGGATTTGGCGTTACACCTCCCATGGTTCTCACACGTCGATCAAAATATATTTGTGATAGCCTTTTTAGTCGAGTGGACAATAGCATAT
CAGTGGCTCATAATTTTGCAGATCGTATTATTTGGTTGGTTGCTCGGCTATGTAAGGAGGAATTTGAAAAAGCTTGTATAGCCTTTTGGGCTATATGGAATGATCACAAT
TCTTATACCAGGAAAATGACAGTAATGAAGTGGACCCAACGCTGTGAATGGATCCAATGTTGTTGGCTCGAAACCCGACCTAAAGTTGTGGAGATTATGGGGCGAAATGG
ATTAGAAGATGATAGACATGTTAGTCCTATTGCTTACCACGTATATACGGATGCTGCGGTGAATAGGAACAATGAAGGAACTGGTTTAGGAGTAGTTGTGCTTCAACCAT
ATGGGTCTCCTTCTCGCTGCCATGGAGATGAAGGATGA
Protein sequence
MGCLSPHTKGPIYRKKSQDLFLIGKGHEYDQICYLIHKHATMGRRQNEIKGISNGNGDRLSKREDMEQSSVTYFTNMFTSSNPSTETITAALKDIQVKVTPSMNDKILAP
FTRCEIEKVINQMHPSKAPGPYGFPALFYQKYWNEVGDITILNCLEIFNSKRSIKTWNDTYIAFIPKVSDPKVVSDFRPISLCNVKYKIIAKVVVNRMKWVLQDFISENQ
SAFVPGRSIFDNIIVGHECLHSIKSKRKGRNGWVALKLDMSKAYDRVEWCFLEQIMLIIGFHPNWVPLIMECVQTAKFSILLNGVPTGNIILKGGKHCPEISHLFFADDS
LIFCKASIEQIWVLRTLLKTYEDASVVNNLEKYLGVPSSFTRNRCDDFKAIKQRMLKEKCTGRNGQTCVVLKRYEEENRNGQSTRFFEDPWLPKAITFKPLQKQGSIVQG
EMMVFEFFTPSLGWDLQKLRTVVVEHDVKCIVAIPISMTNMDDSGFGVTPPMVLTRRSKYICDSLFSRVDNSISVAHNFADRIIWLVARLCKEEFEKACIAFWAIWNDHN
SYTRKMTVMKWTQRCEWIQCCWLETRPKVVEIMGRNGLEDDRHVSPIAYHVYTDAAVNRNNEGTGLGVVVLQPYGSPSRCHGDEG
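
The protein sequence can be checked against the CDS by translating it codon by codon. The following minimal sketch assumes the standard nuclear genetic code (NCBI translation table 1) and stops at the first stop codon. `translate` and `CODON_TABLE` are illustrative names, and only short excerpts of the CDS are used here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGGATGTCTTTCACCCCACACCAAGGGT"))  # MGCLSPHTKG
print(translate("GATGAAGGATGA"))                    # DEG
```

Both excerpts agree with the start (`MGCLSPHTKG`) and end (`DEG`) of the protein sequence above. Passing the full CDS, with or without line breaks, should reproduce the whole protein under the same assumption.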