; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G29230 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G29230
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr3:26846982..26848824
RNA-Seq ExpressionCSPI03G29230
SyntenyCSPI03G29230
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.1e-9243.89Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI
        E +I N++  + LF+LASGL+INLNKSTI+P                                               +N+ L +    +L    + + I
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI

Query:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
         ++L             P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK +L     + G D P+W LN NG  +VAS K    Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

KAA0041367.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-9244.52Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P +WR+WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL
        E +I N++  + LF+LASGL+INLNKSTI+P               G++ +FL  +  G+ L G    K+   N                       TL+
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL

Query:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                         P    KNI                  +L+ W+ + SP  +GGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRLFAL++ K++SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQENVSI
        ELK +L     + G D P W LN NG  +VAS K A  Q +Q  + +
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQENVSI

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-9042.17Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KK +  KWR  I++CI+SVQY ILIN RPRG+IKPTRGIRQGDP+SPFIFVLAMDYLS LL  L +   I GV+     NLTH+LFADDIL+F+ED 
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP------GVNQFLDTSGLEL-----------LGNSKRKSAIGNTLLFPDEEK------------------
        E+ + N++  L LFE ASGLNINL+KSTI P        N  +D+ G+             LG     S   + +L   ++K                  
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP------GVNQFLDTSGLEL-----------LGNSKRKSAIGNTLLFPDEEK------------------

Query:  ----------------------------------------KNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                                                 NI+LI+W+ V+SP  KGGLGI+SV STNFALL K +W+F  EK PL KR+I +KY+Q  
Subjt:  ----------------------------------------KNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
        +G  P++ K+SS  +PW+++    +W    I W +  G+ +SFW    +  SP S   PRLFAL++ K+ S+ ++WN  + +W++   RPLR  E+ LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATLP-PLPDTGFDRPLWNLNRNGFSSVASTK
         +KA+LP PLPD G  +PLW LN N     AS K
Subjt:  ELKATLP-PLPDTGFDRPLWNLNRNGFSSVASTK

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.6e-9144.8Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL
        E +I N++  + LF+LASGL+INLNKSTI+P               G++ +FL  +  G+ L G    K+   N                       TL+
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL

Query:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                         P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWH   H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK ++     + G D P+W LN NG  +VAS K A  Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]2.4e-9243.89Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI
        E +I N++  + LF+LASGL+INLNKSTI+P                                               +N+ L +    +L    + + I
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI

Query:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
         ++L             P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK +L     + G D P+W LN NG  +VAS K    Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein1.2e-9243.89Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI
        E +I N++  + LF+LASGL+INLNKSTI+P                                               +N+ L +    +L    + + I
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI

Query:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
         ++L             P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK +L     + G D P+W LN NG  +VAS K    Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

A0A5A7TI93 LINE-1 retrotransposable element ORF2 protein5.2e-9344.52Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P +WR+WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL
        E +I N++  + LF+LASGL+INLNKSTI+P               G++ +FL  +  G+ L G    K+   N                       TL+
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL

Query:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                         P    KNI                  +L+ W+ + SP  +GGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRLFAL++ K++SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQENVSI
        ELK +L     + G D P W LN NG  +VAS K A  Q +Q  + +
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQENVSI

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein1.1e-9042.17Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KK +  KWR  I++CI+SVQY ILIN RPRG+IKPTRGIRQGDP+SPFIFVLAMDYLS LL  L +   I GV+     NLTH+LFADDIL+F+ED 
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP------GVNQFLDTSGLEL-----------LGNSKRKSAIGNTLLFPDEEK------------------
        E+ + N++  L LFE ASGLNINL+KSTI P        N  +D+ G+             LG     S   + +L   ++K                  
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP------GVNQFLDTSGLEL-----------LGNSKRKSAIGNTLLFPDEEK------------------

Query:  ----------------------------------------KNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                                                 NI+LI+W+ V+SP  KGGLGI+SV STNFALL K +W+F  EK PL KR+I +KY+Q  
Subjt:  ----------------------------------------KNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
        +G  P++ K+SS  +PW+++    +W    I W +  G+ +SFW    +  SP S   PRLFAL++ K+ S+ ++WN  + +W++   RPLR  E+ LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATLP-PLPDTGFDRPLWNLNRNGFSSVASTK
         +KA+LP PLPD G  +PLW LN N     AS K
Subjt:  ELKATLP-PLPDTGFDRPLWNLNRNGFSSVASTK

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein7.5e-9244.8Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL
        E +I N++  + LF+LASGL+INLNKSTI+P               G++ +FL  +  G+ L G    K+   N                       TL+
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP---------------GVN-QFLDTS--GLELLGNSKRKSAIGN-----------------------TLL

Query:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
                         P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  ----------------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWH   H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK ++     + G D P+W LN NG  +VAS K A  Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein1.5e-9243.89Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        ML+KKG+P KWR WI ACI+SVQY I+IN RPRGKI+P+RGIRQGDPISPFIFVLAMDY+S LLN +     IKGV + G  NLTHLLFADDILLF+EDD
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI
        E +I N++  + LF+LASGL+INLNKSTI+P                                               +N+ L +    +L    + + I
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTITP----------------------------------------------GVNQFLDTSGLELLGNSKRKSAI

Query:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY
         ++L             P    KNI                  +L+ W+ + S   KGGLGI+ ++ TNFALL+K +WR+  E +PL K+II AKY    
Subjt:  GNTLL-----------FPDEEKKNI------------------NLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTY

Query:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD
         G++P     SS ++PW SI KG  W    + W IK G + SFWHS  H  SP S   PRL+AL++ KE+SI +MWN  + +WDL PRR LR  E  LW 
Subjt:  LGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWD

Query:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ
        ELK +L     + G D P+W LN NG  +VAS K    Q  Q
Subjt:  ELKATL-PPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQ

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein2.2e-0824.91Show/hide
Query:  ILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDDEETIDNMRYALRLFELASGLNINLN
        I++N +         G RQG P+SP +F +    L +L   + ++  IKG+ + GK  +   LFADD+++++E+   +  N+   +  F   SG  IN+ 
Subjt:  ILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDDEETIDNMRYALRLFELASGLNINLN

Query:  KSTITPGVNQFLDTSGLELLGN------SKRKSAIGNTL------LFPDEEKKNINLI-----KWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEE
        KS      N     S  +++G       SKR   +G  L      LF +  K  +  I     KW ++  P S     +  +     A+L K I+RF   
Subjt:  KSTITPGVNQFLDTSGLELLGN------SKRKSAIGNTL------LFPDEEKKNINLI-----KWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEE

Query:  KNPLGKRIITAKYEQTYLGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWH-SKRHDRSPFSQTSP
           L     T + E+T L  +  + +    K+      K     LP  K   K   T + W+  +  D   +++T P
Subjt:  KNPLGKRIITAKYEQTYLGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWH-SKRHDRSPFSQTSP

P08548 LINE-1 reverse transcriptase homolog1.1e-0727.78Show/hide
Query:  LIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDDE
        L K G    + + I A  +     I++N           G RQG P+SP +F + M+ L+I + +   + +IKG+ I G   +   LFADD+++++E+  
Subjt:  LIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDDE

Query:  ETIDNMRYALRLFELASGLNINLNKS
        ++   +   ++ +   SG  IN +KS
Subjt:  ETIDNMRYALRLFELASGLNINLNKS

P0C2F6 Putative ribonuclease H protein At1g657503.3e-1228.87Show/hide
Query:  EKKNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTYLGELPTKSKFSSFKAPWRSIIKGANWVLPQ-IKWSIKR
        EKK  +L+KWS V SP  +GGLG+ + +S N AL+SK  WR  +EKN L   ++  KY    + +        S+ + WRSI  G   V+   + W    
Subjt:  EKKNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTYLGELPTKSKFSSFKAPWRSIIKGANWVLPQ-IKWSIKR

Query:  GDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWDELKATLPPLPDTGFDRPLWNLNRNGFSSVAS
        G  + FW  +     P  +         +  +  +A         WD     P  +    L  EL+A +  L     DR  W  +++G  SV S
Subjt:  GDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWDELKATLPPLPDTGFDRPLWNLNRNGFSSVAS

P11369 LINE-1 retrotransposable element ORF2 protein2.5e-0723.64Show/hide
Query:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD
        +L + G    +   I A  +     I +N      I    G RQG P+SP++F +    L +L   + +   IKG+ I GK  +   L ADD+++++ D 
Subjt:  MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDD

Query:  EETIDNMRYALRLFELASGLNINLNKSTI------TPGVNQFLDTSGLELLGNSKRKSAIGNTLLFPDEEKKNINLIK---------WSSVLSPISKGGL
        + +   +   +  F    G  IN NKS             +  +T+   ++ N+ +   +  T    D   KN   +K         W  +  P S    
Subjt:  EETIDNMRYALRLFELASGLNINLNKSTI------TPGVNQFLDTSGLELLGNSKRKSAIGNTLLFPDEEKKNINLIK---------WSSVLSPISKGGL

Query:  GINSVESTNFALLSKRIWRF
         I  +     A+L K I+RF
Subjt:  GINSVESTNFALLSKRIWRF

P92555 Uncharacterized mitochondrial protein AtMg012502.4e-1041.79Show/hide
Query:  LINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSI-NGKHNLTHLLFADD
        +IN  P+G + P+RG+RQGDP+SP++F+L  + LS L  + ++   + G+ + N    + HLLFADD
Subjt:  LINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSI-NGKHNLTHLLFADD

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-0422.37Show/hide
Query:  EEKKNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTYLGELPTKSKFSSFKA-PWRSIIKGANWVLPQIKWSIK
        +E K ++   W  +    ++GG+G   +E+ N ALL K++WR       L  ++  ++Y   +    P  +   S  +  W+SI      +    +  + 
Subjt:  EEKKNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAKYEQTYLGELPTKSKFSSFKA-PWRSIIKGANWVLPQIKWSIK

Query:  RGDTLSFWHSKRHDRSPFS------QTSPRLFALTSR--KENSI----ANMWNAEIANWDLFPRRPLRSVEEALWDELKATLPPLPDTGFDRPLWNLNRN
         G+ +  W  K  D  P S      +  P+ +A  S   K + +       W  ++    LFP      VE  L  EL+    P      D   W+   +
Subjt:  RGDTLSFWHSKRHDRSPFS------QTSPRLFALTSR--KENSI----ANMWNAEIANWDLFPRRPLRSVEEALWDELKATLPPLPDTGFDRPLWNLNRN

Query:  GFSSVAS-----TKIARTQNNQENVSIP
        G  +V S     T+I   +++ + VS P
Subjt:  GFSSVAS-----TKIARTQNNQENVSIP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.7e-1141.79Show/hide
Query:  LINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSI-NGKHNLTHLLFADD
        +IN  P+G + P+RG+RQGDP+SP++F+L  + LS L  + ++   + G+ + N    + HLLFADD
Subjt:  LINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSI-NGKHNLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCATTAAGAAAGGCTTCCCACAAAAATGGCGCCAATGGATCAGTGCATGCATCACAAGTGTTCAATACTTCATTCTCATTAATGACAGACCCAGAGGTAAAATCAA
GCCAACCAGAGGCATTCGACAAGGAGATCCTATCTCTCCTTTTATCTTTGTCCTCGCTATGGATTATCTCAGTATCCTCCTCAACCAACTGGAAAAAGATAACTCGATTA
AAGGTGTTAGTATAAATGGGAAACACAACCTCACCCACCTTCTGTTTGCAGATGATATCCTACTTTTTATGGAGGATGATGAAGAAACCATTGATAACATGAGATATGCC
CTTAGGCTTTTTGAGTTGGCCTCGGGTCTCAACATCAACCTCAACAAATCAACAATAACACCTGGGGTAAACCAATTTCTAGACACTTCTGGTCTGGAACTACTGGGAAA
CTCCAAAAGAAAATCAGCAATTGGAAATACGCTTCTCTTTCCAGATGAGGAAAAGAAAAATATCAACCTCATTAAATGGTCATCGGTTCTGTCTCCTATCAGTAAAGGTG
GCCTGGGCATCAACAGTGTTGAGAGTACAAATTTTGCTCTCCTGAGCAAACGGATCTGGAGATTCTTTGAAGAGAAAAATCCCCTAGGGAAACGAATTATCACCGCAAAA
TACGAGCAAACATACTTGGGAGAGCTTCCAACTAAGAGCAAATTCAGCAGCTTTAAAGCTCCTTGGAGGTCTATCATTAAAGGTGCAAATTGGGTTCTCCCTCAAATTAA
ATGGAGTATTAAAAGGGGTGACACATTATCATTTTGGCACAGTAAAAGGCACGACCGTAGTCCATTTTCACAGACAAGCCCGAGACTCTTTGCTCTTACTTCTAGAAAAG
AAAATTCCATTGCAAATATGTGGAATGCAGAAATTGCCAATTGGGACCTTTTCCCCCGAAGACCCTTAAGAAGTGTCGAGGAAGCCCTCTGGGATGAATTGAAAGCTACC
CTTCCTCCCTTGCCTGACACTGGATTCGACAGACCTCTCTGGAATTTAAACAGGAATGGCTTTTCCTCGGTGGCCTCCACCAAAATTGCAAGAACTCAAAACAACCAAGA
AAATGTATCAATACCATGGATATGCTGCAAAGAAGACTCCCTACTTGGAATATTCACCCCTCTTGGTGTATACTCTGCAAAGCTGCTGAGGAAGACAGACACCATCTATT
CTCCTCCTGCCCATTCTCAACAAATCTGTGGAAAAAAGTTGAAGAGATTCTGGACAAATCTTTCCCCCTTACATATCCCTCTGTGTTATGCAAAAAGCCTTTCAAAGGAA
AAGGAAAATAAAAAAGACAAACCATTGAACAGCATCTGGTGGCTGCCACCCTGTGGAACATCTGGAATGAGAGAAACAGAAGGACTTTTAAGGGAGAGGAAAAATCAGTT
GTCTCGGTTTGGGAAGACATTCAAGCCACAACCGGTCTATGGACTAGTCGTTCTTCCCTTTTCAAAAATTATTCGCCCAGCTCTATTGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTCATTAAGAAAGGCTTCCCACAAAAATGGCGCCAATGGATCAGTGCATGCATCACAAGTGTTCAATACTTCATTCTCATTAATGACAGACCCAGAGGTAAAATCAA
GCCAACCAGAGGCATTCGACAAGGAGATCCTATCTCTCCTTTTATCTTTGTCCTCGCTATGGATTATCTCAGTATCCTCCTCAACCAACTGGAAAAAGATAACTCGATTA
AAGGTGTTAGTATAAATGGGAAACACAACCTCACCCACCTTCTGTTTGCAGATGATATCCTACTTTTTATGGAGGATGATGAAGAAACCATTGATAACATGAGATATGCC
CTTAGGCTTTTTGAGTTGGCCTCGGGTCTCAACATCAACCTCAACAAATCAACAATAACACCTGGGGTAAACCAATTTCTAGACACTTCTGGTCTGGAACTACTGGGAAA
CTCCAAAAGAAAATCAGCAATTGGAAATACGCTTCTCTTTCCAGATGAGGAAAAGAAAAATATCAACCTCATTAAATGGTCATCGGTTCTGTCTCCTATCAGTAAAGGTG
GCCTGGGCATCAACAGTGTTGAGAGTACAAATTTTGCTCTCCTGAGCAAACGGATCTGGAGATTCTTTGAAGAGAAAAATCCCCTAGGGAAACGAATTATCACCGCAAAA
TACGAGCAAACATACTTGGGAGAGCTTCCAACTAAGAGCAAATTCAGCAGCTTTAAAGCTCCTTGGAGGTCTATCATTAAAGGTGCAAATTGGGTTCTCCCTCAAATTAA
ATGGAGTATTAAAAGGGGTGACACATTATCATTTTGGCACAGTAAAAGGCACGACCGTAGTCCATTTTCACAGACAAGCCCGAGACTCTTTGCTCTTACTTCTAGAAAAG
AAAATTCCATTGCAAATATGTGGAATGCAGAAATTGCCAATTGGGACCTTTTCCCCCGAAGACCCTTAAGAAGTGTCGAGGAAGCCCTCTGGGATGAATTGAAAGCTACC
CTTCCTCCCTTGCCTGACACTGGATTCGACAGACCTCTCTGGAATTTAAACAGGAATGGCTTTTCCTCGGTGGCCTCCACCAAAATTGCAAGAACTCAAAACAACCAAGA
AAATGTATCAATACCATGGATATGCTGCAAAGAAGACTCCCTACTTGGAATATTCACCCCTCTTGGTGTATACTCTGCAAAGCTGCTGAGGAAGACAGACACCATCTATT
CTCCTCCTGCCCATTCTCAACAAATCTGTGGAAAAAAGTTGAAGAGATTCTGGACAAATCTTTCCCCCTTACATATCCCTCTGTGTTATGCAAAAAGCCTTTCAAAGGAA
AAGGAAAATAAAAAAGACAAACCATTGAACAGCATCTGGTGGCTGCCACCCTGTGGAACATCTGGAATGAGAGAAACAGAAGGACTTTTAAGGGAGAGGAAAAATCAGTT
GTCTCGGTTTGGGAAGACATTCAAGCCACAACCGGTCTATGGACTAGTCGTTCTTCCCTTTTCAAAAATTATTCGCCCAGCTCTATTGCTTTAA
Protein sequenceShow/hide protein sequence
MLIKKGFPQKWRQWISACITSVQYFILINDRPRGKIKPTRGIRQGDPISPFIFVLAMDYLSILLNQLEKDNSIKGVSINGKHNLTHLLFADDILLFMEDDEETIDNMRYA
LRLFELASGLNINLNKSTITPGVNQFLDTSGLELLGNSKRKSAIGNTLLFPDEEKKNINLIKWSSVLSPISKGGLGINSVESTNFALLSKRIWRFFEEKNPLGKRIITAK
YEQTYLGELPTKSKFSSFKAPWRSIIKGANWVLPQIKWSIKRGDTLSFWHSKRHDRSPFSQTSPRLFALTSRKENSIANMWNAEIANWDLFPRRPLRSVEEALWDELKAT
LPPLPDTGFDRPLWNLNRNGFSSVASTKIARTQNNQENVSIPWICCKEDSLLGIFTPLGVYSAKLLRKTDTIYSPPAHSQQICGKKLKRFWTNLSPLHIPLCYAKSLSKE
KENKKDKPLNSIWWLPPCGTSGMRETEGLLRERKNQLSRFGKTFKPQPVYGLVVLPFSKIIRPALLL