; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G16600 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G16600
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr3:12470349..12471050
RNA-Seq ExpressionCSPI03G16600
SyntenyCSPI03G16600
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.8e-3851.16Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDGNKA G D F +  +K +W+  K  I++ F+DFFE                            RPISLTTSIYK + KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD IL+ANEA+DYWK KKIKGF+ KLDIEKAFD +NWNFI+ +L K  +P    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

KAA0045262.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-3851.74Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDG KA G D F +  +K FW+  K  I++ F+DFFE                            RPISLTTSIYK++ KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD IL+ANEA+DYWK KKIKGF+ KLDIEKAFD +NW+FI+Y+L  K FP    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

KAA0047998.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-3836.27Show/hide
Query:  RITSDHFPIVLQTANLKWGLIPFRFNNYIINDKSFIFLCGGLTLSKKATRK-------------------------------------------------
        R  SDHFPI L++ ++ WG  PFR NN  +N+K F         S K  +                                                  
Subjt:  RITSDHFPIVLQTANLKWGLIPFRFNNYIINDKSFIFLCGGLTLSKKATRK-------------------------------------------------

Query:  ------------DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF--------------------------EMRPISLTTSIYKIV
                    DE EIK  + S   +KA G D F +  YKK W+T K  ++E F++F                           + RPISLTTS+YKI+
Subjt:  ------------DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF--------------------------EMRPISLTTSIYKIV

Query:  VKTLSTRLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        VKTL+ ++KE LP+T+++ Q+ FVKGRQITD ILIANE IDYWK KK KGF+ KLD+EKAFDKI+ +FI YML +K +  ++ K
Subjt:  VKTLSTRLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

KAA0058104.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.6e-4052.91Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDGNKA G D F +  +K +W+  K  IM+ F+DFFE                            RPISLTTSIYKI+ KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD ILIANEA+DYWK KKIKGF+ KLDIEKAFD +NW+FI+++L KK +P    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.8e-3851.16Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDGNKA G D F +  +K +W+  K  I++ F+DFFE                            RPISLTTSIYK + KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD IL+ANEA+DYWK KKIKGF+ KLDIEKAFD +NWNFI+ +L K  +P    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein8.8e-3951.16Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDGNKA G D F +  +K +W+  K  I++ F+DFFE                            RPISLTTSIYK + KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD IL+ANEA+DYWK KKIKGF+ KLDIEKAFD +NWNFI+ +L K  +P    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

A0A5A7TYF4 LINE-1 retrotransposable element ORF2 protein5.2e-3936.27Show/hide
Query:  RITSDHFPIVLQTANLKWGLIPFRFNNYIINDKSFIFLCGGLTLSKKATRK-------------------------------------------------
        R  SDHFPI L++ ++ WG  PFR NN  +N+K F         S K  +                                                  
Subjt:  RITSDHFPIVLQTANLKWGLIPFRFNNYIINDKSFIFLCGGLTLSKKATRK-------------------------------------------------

Query:  ------------DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF--------------------------EMRPISLTTSIYKIV
                    DE EIK  + S   +KA G D F +  YKK W+T K  ++E F++F                           + RPISLTTS+YKI+
Subjt:  ------------DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF--------------------------EMRPISLTTSIYKIV

Query:  VKTLSTRLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        VKTL+ ++KE LP+T+++ Q+ FVKGRQITD ILIANE IDYWK KK KGF+ KLD+EKAFDKI+ +FI YML +K +  ++ K
Subjt:  VKTLSTRLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein8.8e-3952.02Show/hide
Query:  DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF---------------------------EMRPISLTTSIYKIVVKTLSTRLKEV
        DE EIK  ++SF   KA G D +TM  YKK W   K  ++  F+DF                            + RPISLTTS+YKI+ K L+ RLK  
Subjt:  DEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFF---------------------------EMRPISLTTSIYKIVVKTLSTRLKEV

Query:  LPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        LP+TI+E Q+ F+KGRQI D ILIANEAID WK +KIKGFV KLDIEKAFDKI+W+FI+YML KK FP K  K
Subjt:  LPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

A0A5A7USG2 LINE-1 retrotransposable element ORF2 protein2.7e-4052.91Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDGNKA G D F +  +K +W+  K  IM+ F+DFFE                            RPISLTTSIYKI+ KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD ILIANEA+DYWK KKIKGF+ KLDIEKAFD +NW+FI+++L KK +P    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

A0A5D3DZ07 LINE-1 retrotransposable element ORF2 protein5.2e-3951.74Show/hide
Query:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL
        E EIK  + SFDG KA G D F +  +K FW+  K  I++ F+DFFE                            RPISLTTSIYK++ KTLS RLK  L
Subjt:  EMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFE---------------------------MRPISLTTSIYKIVVKTLSTRLKEVL

Query:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK
        P+TIS  Q+ F+K RQITD IL+ANEA+DYWK KKIKGF+ KLDIEKAFD +NW+FI+Y+L  K FP    K
Subjt:  PNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEK

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein8.3e-1029.68Show/hide
Query:  EIKDALLSFDGNKASGLDDFTMELYKKFWNT----FKIKIMEAFQD-----------------------FFEMRPISLTTSIYKIVVKTLSTRLKEVLPN
        E+  AL     NK+ GLD  T+E ++ FW+T    F   + EAF+                            RP+SL ++ YKIV K +S RLK VL  
Subjt:  EIKDALLSFDGNKASGLDDFTMELYKKFWNT----FKIKIMEAFQD-----------------------FFEMRPISLTTSIYKIVVKTLSTRLKEVLPN

Query:  TISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFI
         I   Q   V GR I D + +  + + + +   +      LD EKAFD+++  ++
Subjt:  TISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFI

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.5e-0940Show/hide
Query:  RLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKK-IKGF-VFKLDIEKAFDKINWNFINYMLMKKQFP
        RLK ++ N I   Q +F+ GR  TD I+   EA+   + KK +KG+ + KLD+EKA+D+I W+++   L+   FP
Subjt:  RLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKK-IKGF-VFKLDIEKAFDKINWNFINYMLMKKQFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGCAAGCTTCTCACCAAGGATCACATCTGATCACTTCCCTATTGTTCTTCAAACTGCTAATCTCAAATGGGGACTTATTCCATTTCGTTTCAACAATTATATTAT
CAACGACAAAAGTTTTATATTCCTTTGTGGTGGACTAACTCTTAGCAAGAAGGCCACCCGAAAAGATGAAATGGAAATCAAAGATGCTCTTCTCTCTTTTGATGGCAACA
AAGCTTCTGGACTTGATGACTTCACCATGGAACTTTATAAAAAGTTTTGGAACACTTTCAAGATTAAAATTATGGAGGCGTTCCAAGACTTTTTTGAAATGAGACCAATT
AGCCTGACTACATCCATATACAAAATAGTGGTCAAAACTTTGTCTACTAGACTCAAAGAAGTTCTTCCCAACACTATATCTGAGAAACAAATTACTTTTGTTAAAGGCAG
ACAAATCACAGATGAAATCCTTATTGCTAATGAGGCCATTGATTACTGGAAATGTAAAAAGATAAAAGGCTTTGTTTTCAAGTTGGATATTGAGAAAGCTTTTGACAAGA
TCAACTGGAATTTCATCAACTATATGCTTATGAAGAAGCAATTTCCTAGAAAAATGGAGAAGATAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGCAAGCTTCTCACCAAGGATCACATCTGATCACTTCCCTATTGTTCTTCAAACTGCTAATCTCAAATGGGGACTTATTCCATTTCGTTTCAACAATTATATTAT
CAACGACAAAAGTTTTATATTCCTTTGTGGTGGACTAACTCTTAGCAAGAAGGCCACCCGAAAAGATGAAATGGAAATCAAAGATGCTCTTCTCTCTTTTGATGGCAACA
AAGCTTCTGGACTTGATGACTTCACCATGGAACTTTATAAAAAGTTTTGGAACACTTTCAAGATTAAAATTATGGAGGCGTTCCAAGACTTTTTTGAAATGAGACCAATT
AGCCTGACTACATCCATATACAAAATAGTGGTCAAAACTTTGTCTACTAGACTCAAAGAAGTTCTTCCCAACACTATATCTGAGAAACAAATTACTTTTGTTAAAGGCAG
ACAAATCACAGATGAAATCCTTATTGCTAATGAGGCCATTGATTACTGGAAATGTAAAAAGATAAAAGGCTTTGTTTTCAAGTTGGATATTGAGAAAGCTTTTGACAAGA
TCAACTGGAATTTCATCAACTATATGCTTATGAAGAAGCAATTTCCTAGAAAAATGGAGAAGATAGATTAA
Protein sequenceShow/hide protein sequence
MLASFSPRITSDHFPIVLQTANLKWGLIPFRFNNYIINDKSFIFLCGGLTLSKKATRKDEMEIKDALLSFDGNKASGLDDFTMELYKKFWNTFKIKIMEAFQDFFEMRPI
SLTTSIYKIVVKTLSTRLKEVLPNTISEKQITFVKGRQITDEILIANEAIDYWKCKKIKGFVFKLDIEKAFDKINWNFINYMLMKKQFPRKMEKID