; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022012 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022012
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:15913550..15914796
RNA-Seq ExpressionLag0022012
SyntenyLag0022012
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EEC84982.1 hypothetical protein OsI_32248 [Oryza sativa Indica Group]7.8e-6544.44Show/hide
Query:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV
        N+T++ LIPK + P R+ + +PISLC V YKL SKVL NRLK +L ++IS NQSAF+P R + DN +L YE  H ++ +  GR  +A+LKLDMSK YDRV
Subjt:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV

Query:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD
        EW F+EK+M+++ +A GWVKL+  C+S+V +   VNG     ++PS                     S +L  AEE   + G+K+ Q  P++SH+ FA D
Subjt:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD

Query:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
         LLLF+  ER A+ ++++LN YE  SGQ VN DKS I FS +T+  DR  V +IL +   A + +YLGLP +M R++  +  ++K+RV
Subjt:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

XP_030505314.1 uncharacterized protein LOC115720302 [Cannabis sativa]4.3e-6344.52Show/hide
Query:  PALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKV
        P+ +N+T+I LIPK + P  V++Y+PISLCNV YKL+SK +V R+K VL+ VIS  QSAFI  R ++DN ++ YE LH LR ++RGR  +A+LKLDMSK 
Subjt:  PALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKV

Query:  YDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVP--------------------SLSGMLRGAEEAKSIKGLKIAQYGPAISHMF
        +DRVEW F+E+V+LK+ +  G V L+  CISS  FSF +NG   G + P                    +LS +L+  E+   + GL + +  P +SH+ 
Subjt:  YDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVP--------------------SLSGMLRGAEEAKSIKGLKIAQYGPAISHMF

Query:  FAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
        FA D LL  R   R   VV+  L+ Y   SGQ +N DKSV+SFS +T+  D+     +LG+ + +CH QYLGLPS+   NK    N IK+R+
Subjt:  FAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

XP_030923826.1 uncharacterized protein LOC115950728 [Quercus lobata]1.5e-6338.03Show/hide
Query:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLIE-----------------GVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLV
        G KNT++FH+ AS+RRR+N ++ +   D  W +D   + ++                   G     +N T IVLIPK + P ++S+Y+PISLCNV YK++
Subjt:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLIE-----------------GVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLV

Query:  SKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSF
        SKVL NRLK +L ++IS  QSAFIPGR + DN ++ YECLH +  R +G+    +LKLD+SK YDRVEW F++ +M K+ +   W+  V  C+S+  FS 
Subjt:  SKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSF

Query:  NVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLD
         +NG   G+++PS                     + +L  AE    I G+ I +  P IS++ FA D LL  +A + + +V+ ++L+ Y   SGQ +N +
Subjt:  NVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLD

Query:  KSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
        KS I F  +T A  + ++  +LGV+ VA    YLGLP+ + R+K  + +F+KDRV
Subjt:  KSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

XP_030940187.1 uncharacterized protein LOC115965136 [Quercus lobata]1.5e-6333.65Show/hide
Query:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGR---------------------------------------------------------------
        G +NT++FH  A+ RR +N +  L D +G+WQ+DPG                                                                
Subjt:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGR---------------------------------------------------------------

Query:  -----------------------VLRLIE-GVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV
                               VL ++  GV P  +NET I LIPK + PR++SEY+PISLCNV YK+VSK+L NRLK +L EVI  +QSAF+PGR + 
Subjt:  -----------------------VLRLIE-GVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV

Query:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------
        DN ++ +E +H + GR +GR    +LKLDMSK YDRVEW F+E +M K+ +   W+ L+ +CIS+V +S  +NG   G ++PS                 
Subjt:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------

Query:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH
           L   L+  E   +I+G+ + +  P ISH+FFA D ++  RA   + + V ++L  YE  SGQ +N +K+ + FS +T+   + QV Q+ G Q++  H
Subjt:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH

Query:  RQYLGLPSFMPRNKMSSLNFIKDRV
         +YLGLP  + + K  + + IKD+V
Subjt:  RQYLGLPSFMPRNKMSSLNFIKDRV

XP_030969964.1 uncharacterized protein LOC115990257 [Quercus lobata]4.3e-6337.99Show/hide
Query:  ASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI-------------------------------EGVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSY
        A+ R + N +  LED  GVW +D  ++ RL                                    PA +N T I L PK + P++VS+++PISLCNV Y
Subjt:  ASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI-------------------------------EGVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSY

Query:  KLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVK
        KL++KVLVNRLK +L  V+S +QSAF+ GR + DN ++ +E LH L+ +++G+  + +LKLD+SK YDRVEW F+E+ ML + +A  +V  +  CI S+ 
Subjt:  KLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVK

Query:  FSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTV
        +S  +NGV    + PS                    L G+L  AE    I+G+ I + GP +SH+FFA D +L  RAKE + +V+ D+L+ YERGSGQ +
Subjt:  FSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTV

Query:  NLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
        N DK+ I F+ +T+   + Q+  +LGV  +  +++YLGLP+F+ R K     +IK+R+
Subjt:  NLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

TrEMBL top hitse value%identityAlignment
A0A2N9EDY7 Reverse transcriptase domain-containing protein2.7e-6333.88Show/hide
Query:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI----------------------------------------------------------
        G +NTR+FH  AS RRR+N +  L+D  GVW+++      LI                                                          
Subjt:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI----------------------------------------------------------

Query:  -EGVSPAL----------------------------LNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV
         +G+ P                              +N T I LIPK + P R++E++PISLCNV+YKL+SKV+ NRLKG+L  +IS  QSAF+PGR + 
Subjt:  -EGVSPAL----------------------------LNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV

Query:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------
        DN ++ +E LH +     G+    ++KLDMSK YDRVEW F+EK+M K+ + P WV L+  CIS+V +S  VNG   G + PS                 
Subjt:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------

Query:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH
           L  ++  A+E  S++GL + + GP I+H+FFA D LL  +A  R+  ++++IL  YE+ SGQ VN DK+ + FS +T    +  +   LGV ++  +
Subjt:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH

Query:  RQYLGLPSFMPRNKMSSLNFIKDRV
         +YLGLPS + R +++S + IK+RV
Subjt:  RQYLGLPSFMPRNKMSSLNFIKDRV

A0A2N9IMR2 Reverse transcriptase domain-containing protein3.5e-6334.12Show/hide
Query:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI----------------------------------------------------------
        G +NT++FH  AS RRR+N ++ + D  G+WQ++  +V R+                                                           
Subjt:  GGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLI----------------------------------------------------------

Query:  -EGVSPAL----------------------------LNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV
         +G+ P                              +N+T I LIPK + P RV+E++PISLCNV YK++SKVLVNRLK +L  +IS  QSAF+PG  + 
Subjt:  -EGVSPAL----------------------------LNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVV

Query:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------
        DN ++ +E LH++     GR    +LKLDMSK YDRVEW ++EK+M K+ + P W+ L  +CIS V +S  +NG   G + PS                 
Subjt:  DNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS-----------------

Query:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH
           L  M++ AE  + ++G+ + +YGP I+H+FFA D LL  RA  +D E ++D+L  YER SGQ VN DK+ I FS  T    +  +   L V ++  +
Subjt:  ---LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACH

Query:  RQYLGLPSFMPRNKMSSLNFIKDRV
         +YLGLPS + RN+ +S + IK+RV
Subjt:  RQYLGLPSFMPRNKMSSLNFIKDRV

A0A5B7BN08 Reverse transcriptase domain-containing protein3.2e-6442.36Show/hide
Query:  WQQDPGRVLRLI-----EGV-SPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLH
        W    G + R+I     +GV S   +N T I LIPK   PR++SE++PISLCNV YK++SK+L NRLK +L  +I+ +QSAF+PGR + DN ++ +E +H
Subjt:  WQQDPGRVLRLI-----EGV-SPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLH

Query:  VLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVP--------------------SLSGMLRGA
         L+ + +G+   ++LKLDMSK YDRVEW F+E VML++ +   WV L+  C+S+V FS  +NG   G + P                    + S +LR +
Subjt:  VLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVP--------------------SLSGMLRGA

Query:  EEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMP
        E    I G+ +A+  P +SH+FFA D LL   A E  A  +  I++ Y   SGQ VN +KS ISFS +  A  R Q+ QILGV + + H +YLGLPS + 
Subjt:  EEAKSIKGLKIAQYGPAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMP

Query:  RNKMSSLNFIKDRV
        R+K+   N I+DRV
Subjt:  RNKMSSLNFIKDRV

B8BE31 Reverse transcriptase domain-containing protein3.8e-6544.44Show/hide
Query:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV
        N+T++ LIPK + P R+ + +PISLC V YKL SKVL NRLK +L ++IS NQSAF+P R + DN +L YE  H ++ +  GR  +A+LKLDMSK YDRV
Subjt:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV

Query:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD
        EW F+EK+M+++ +A GWVKL+  C+S+V +   VNG     ++PS                     S +L  AEE   + G+K+ Q  P++SH+ FA D
Subjt:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD

Query:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
         LLLF+  ER A+ ++++LN YE  SGQ VN DKS I FS +T+  DR  V +IL +   A + +YLGLP +M R++  +  ++K+RV
Subjt:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

B9FNE2 Reverse transcriptase domain-containing protein3.8e-6544.44Show/hide
Query:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV
        N+T++ LIPK + P R+ + +PISLC V YKL SKVL NRLK +L ++IS NQSAF+P R + DN +L YE  H ++ +  GR  +A+LKLDMSK YDRV
Subjt:  NETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRV

Query:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD
        EW F+EK+M+++ +A GWVKL+  C+S+V +   VNG     ++PS                     S +L  AEE   + G+K+ Q  P++SH+ FA D
Subjt:  EWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPS--------------------LSGMLRGAEEAKSIKGLKIAQYGPAISHMFFAYD

Query:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV
         LLLF+  ER A+ ++++LN YE  SGQ VN DKS I FS +T+  DR  V +IL +   A + +YLGLP +M R++  +  ++K+RV
Subjt:  RLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.1e-1623.47Show/hide
Query:  EGVSPALLNETMIVLIPKERRPRRVSE-YQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECL-HVLRGRSRGRTRWASLK
        EG+ P    E  I+LIPK  R     E ++PISL N+  K+++K+L NR++  + ++I H+Q  FIPG     N       + H+ R + +       + 
Subjt:  EGVSPALLNETMIVLIPKERRPRRVSE-YQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECL-HVLRGRSRGRTRWASLK

Query:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSG------------------MLRGAEEAKSIKGLKIAQYGPAI
        +D  K +D+++  FM K + K+     ++K++         +  +NG +  +  P  +G                  + R   + K IKG+++ +    +
Subjt:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSG------------------MLRGAEEAKSIKGLKIAQYGPAI

Query:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL
        S   FA D ++        A+ +  +++++ + SG  +N+ KS  +F  + N +  +Q+   L   + +   +YLG+
Subjt:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL

P08548 LINE-1 reverse transcriptase homolog7.4e-1826.35Show/hide
Query:  EGVSPALLNETMIVLIPKE-RRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECL-HVLRGRSRGRTRWASLK
        EG+ P    E  I LIPK  + P R   Y+PISL N+  K+++K+L NR++  + ++I H+Q  FIPG     N       + H+ + +++       L 
Subjt:  EGVSPALLNETMIVLIPKE-RRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECL-HVLRGRSRGRTRWASLK

Query:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSGMLRGA------------------EEAKSIKGLKIAQYGPAI
        +D  K +D ++  FM + + K+     ++KL+    S    +  +NGV+     P  SG  +G                    E K+IKG+ I      I
Subjt:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSGMLRGA------------------EEAKSIKGLKIAQYGPAI

Query:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL
            FA D ++           + +++  Y   SG  +N  KSV     + N  ++T V   +   VV    +YLG+
Subjt:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL

P11369 LINE-1 retrotransposable element ORF2 protein5.3e-1623.1Show/hide
Query:  IEGVSPALLNETMIVLIPK-ERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLK
        +EG  P    E  I LIPK ++ P ++  ++PISL N+  K+++K+L NR++  +  +I  +Q  FIPG     N       +H +  + + +     + 
Subjt:  IEGVSPALLNETMIVLIPK-ERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLK

Query:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSG------------------MLRGAEEAKSIKGLKIAQYGPAI
        LD  K +D+++  FM KV+ +      ++ ++    S    +  VNG +  + +P  SG                  + R   + K IKG++I +    I
Subjt:  LDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSG------------------MLRGAEEAKSIKGLKIAQYGPAI

Query:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL
        S    A D ++     +     + +++N +    G  +N +KS ++F  + N +   ++ +     +V  + +YLG+
Subjt:  SHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGL

P14381 Transposon TX1 uncharacterized 149 kDa protein5.0e-1428.51Show/hide
Query:  EGVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLD
        +G  P      ++ L+PK+   R +  ++P+SL +  YK+V+K +  RLK VL EVI  +QS  +PGR + DN  L  + LH  R   R     A L LD
Subjt:  EGVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLD

Query:  MSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVN-----------GVRCGDVVPSLSGMLRGAE-------EAKSIKGLKIAQYGPAISH
          K +DRV+  ++   +   S+ P +V  +    +S +    +N           GVR G     LSG L             K + GL + +  P +  
Subjt:  MSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVN-----------GVRCGDVVPSLSGMLRGAE-------EAKSIKGLKIAQYGPAISH

Query:  MFFAY-DRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKS
        +  AY D ++L      D E  ++    Y   S   +N  KS
Subjt:  MFFAY-DRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKS

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.1e-1136.14Show/hide
Query:  LVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWV
        +V RLK ++  +I   Q++FIPGR   DN +   E +H +R R +G   W  LKLD+ K YDR+ W ++E  ++   +   W+
Subjt:  LVNRLKGVLNEVISHNQSAFIPGRCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGGTAAGAATACCCGCTGGTTCCATACTCATGCCTCGAGTAGGAGGAGGAAGAATGAGGTGAGGGATTTGGAGGATGGGGATGGAGTTTGGCAGCAGGACCCGGG
TAGAGTCCTAAGGCTAATAGAGGGGGTTTCCCCAGCTCTTCTAAACGAAACGATGATTGTTTTGATACCGAAAGAGAGGAGGCCCAGACGTGTATCTGAGTACCAGCCTA
TTTCGCTTTGCAACGTCTCTTACAAGCTGGTTTCAAAGGTTTTAGTTAACCGGTTGAAGGGTGTTCTGAATGAGGTAATCTCCCACAACCAGAGTGCATTTATACCTGGG
CGATGTGTGGTGGATAATGCCATTTTGGGTTATGAATGCCTGCATGTCTTGAGAGGAAGGAGTAGGGGCAGAACGAGATGGGCCTCACTAAAGTTAGACATGAGCAAGGT
CTATGATAGGGTTGAGTGGGTCTTTATGGAAAAGGTGATGCTGAAAGTGAGCTATGCACCAGGTTGGGTAAAGCTGGTTTCGCTCTGCATATCCTCTGTCAAGTTTTCGT
TTAATGTGAATGGTGTAAGGTGCGGGGATGTTGTTCCAAGTTTGTCTGGTATGCTACGTGGGGCTGAGGAGGCCAAGTCCATTAAGGGATTAAAGATAGCTCAGTATGGC
CCTGCTATCTCACATATGTTCTTTGCATATGATAGATTGCTACTCTTTCGGGCTAAGGAGAGGGATGCAGAGGTTGTTCGGGACATCCTTAACCACTATGAACGAGGGTC
GGGGCAGACCGTCAACCTGGATAAGTCTGTCATCTCTTTCAGTTTGAGCACGAATGCAAGGGATAGGACTCAGGTTGGGCAGATCCTTGGGGTTCAGGTTGTGGCGTGCC
ATCGCCAATACCTTGGACTTCCGTCTTTCATGCCTCGGAACAAAATGAGCTCATTGAATTTCATTAAGGATCGAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGGTAAGAATACCCGCTGGTTCCATACTCATGCCTCGAGTAGGAGGAGGAAGAATGAGGTGAGGGATTTGGAGGATGGGGATGGAGTTTGGCAGCAGGACCCGGG
TAGAGTCCTAAGGCTAATAGAGGGGGTTTCCCCAGCTCTTCTAAACGAAACGATGATTGTTTTGATACCGAAAGAGAGGAGGCCCAGACGTGTATCTGAGTACCAGCCTA
TTTCGCTTTGCAACGTCTCTTACAAGCTGGTTTCAAAGGTTTTAGTTAACCGGTTGAAGGGTGTTCTGAATGAGGTAATCTCCCACAACCAGAGTGCATTTATACCTGGG
CGATGTGTGGTGGATAATGCCATTTTGGGTTATGAATGCCTGCATGTCTTGAGAGGAAGGAGTAGGGGCAGAACGAGATGGGCCTCACTAAAGTTAGACATGAGCAAGGT
CTATGATAGGGTTGAGTGGGTCTTTATGGAAAAGGTGATGCTGAAAGTGAGCTATGCACCAGGTTGGGTAAAGCTGGTTTCGCTCTGCATATCCTCTGTCAAGTTTTCGT
TTAATGTGAATGGTGTAAGGTGCGGGGATGTTGTTCCAAGTTTGTCTGGTATGCTACGTGGGGCTGAGGAGGCCAAGTCCATTAAGGGATTAAAGATAGCTCAGTATGGC
CCTGCTATCTCACATATGTTCTTTGCATATGATAGATTGCTACTCTTTCGGGCTAAGGAGAGGGATGCAGAGGTTGTTCGGGACATCCTTAACCACTATGAACGAGGGTC
GGGGCAGACCGTCAACCTGGATAAGTCTGTCATCTCTTTCAGTTTGAGCACGAATGCAAGGGATAGGACTCAGGTTGGGCAGATCCTTGGGGTTCAGGTTGTGGCGTGCC
ATCGCCAATACCTTGGACTTCCGTCTTTCATGCCTCGGAACAAAATGAGCTCATTGAATTTCATTAAGGATCGAGTCTAG
Protein sequenceShow/hide protein sequence
MGGKNTRWFHTHASSRRRKNEVRDLEDGDGVWQQDPGRVLRLIEGVSPALLNETMIVLIPKERRPRRVSEYQPISLCNVSYKLVSKVLVNRLKGVLNEVISHNQSAFIPG
RCVVDNAILGYECLHVLRGRSRGRTRWASLKLDMSKVYDRVEWVFMEKVMLKVSYAPGWVKLVSLCISSVKFSFNVNGVRCGDVVPSLSGMLRGAEEAKSIKGLKIAQYG
PAISHMFFAYDRLLLFRAKERDAEVVRDILNHYERGSGQTVNLDKSVISFSLSTNARDRTQVGQILGVQVVACHRQYLGLPSFMPRNKMSSLNFIKDRV