; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G10400 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G10400
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr7:8501993..8506164
RNA-Seq ExpressionCSPI07G10400
SyntenyCSPI07G10400
Gene Ontology termsGO:0008757 - S-adenosylmethionine-dependent methyltransferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR044995 - Thiocyanate methyltransferase/thiol methyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040845.1 uncharacterized protein E6C27_scaffold333G00680 [Cucumis melo var. makuwa]3.1e-8551.89Show/hide
Query:  MSKKFDHKWIEWILGCVKNLRYSIFLNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYTLLFANAKA
        MSKKFDHKWIEWILGCVKN RYS+F+NRRP GR LA + IRQGDPLS FLFILVSEVL ALIENLHQNGLYEGFVVGKDKIHVPILQFAD TLLF     
Subjt:  MSKKFDHKWIEWILGCVKNLRYSIFLNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYTLLFANAKA

Query:  TFLDKIQLKLDEWKRFNLSRGGELCFPNGSGDTWKKTLRGARLSGVSMTKIPLIGILPEKKSNALLKDEEISDFQVLLRKLASRNISEVQDRQIWSPGGS
          L  ++  ++    F      ++   N S D  +  + G +    ++ K  L        S  L+ +E  S  + +LR    +             G  
Subjt:  TFLDKIQLKLDEWKRFNLSRGGELCFPNGSGDTWKKTLRGARLSGVSMTKIPLIGILPEKKSNALLKDEEISDFQVLLRKLASRNISEVQDRQIWSPGGS

Query:  RKFTVKSLTLH--LSASSPLDKFLY---TALWKSNSPRRLC-NINWLIVASLPPFARCVYNMKKIANTCSSYAAFRPIVGKGHTFVLKKSNYLQEWAAVP
            V    +H  L    P    ++   +  W +N     C   +W+I+      +R    M+ +A+           +G G         +L  W  V 
Subjt:  RKFTVKSLTLH--LSASSPLDKFLY---TALWKSNSPRRLC-NINWLIVASLPPFARCVYNMKKIANTCSSYAAFRPIVGKGHTFVLKKSNYLQEWAAVP

Query:  QTYKKS--ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK
         +   S  ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK
Subjt:  QTYKKS--ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK

XP_004139811.1 uncharacterized protein LOC101209189 isoform X1 [Cucumis sativus]2.2e-5190.32Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKVRNVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

XP_008447166.1 PREDICTED: uncharacterized protein LOC103489683 isoform X1 [Cucumis melo]4.9e-5189.52Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

XP_011659013.1 uncharacterized protein LOC101209189 isoform X2 [Cucumis sativus]2.2e-5190.32Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKVRNVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

XP_038888205.1 uncharacterized protein LOC120078074 isoform X2 [Benincasa hispida]4.9e-5189.52Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

TrEMBL top hitse value%identityAlignment
A0A1S3BHE5 uncharacterized protein LOC103489683 isoform X12.4e-5189.52Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

A0A1S3BHN4 uncharacterized protein LOC103489683 isoform X22.4e-5189.52Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHLSLIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

A0A5D3D0Q1 Reverse transcriptase domain-containing protein1.5e-8551.89Show/hide
Query:  MSKKFDHKWIEWILGCVKNLRYSIFLNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYTLLFANAKA
        MSKKFDHKWIEWILGCVKN RYS+F+NRRP GR LA + IRQGDPLS FLFILVSEVL ALIENLHQNGLYEGFVVGKDKIHVPILQFAD TLLF     
Subjt:  MSKKFDHKWIEWILGCVKNLRYSIFLNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYTLLFANAKA

Query:  TFLDKIQLKLDEWKRFNLSRGGELCFPNGSGDTWKKTLRGARLSGVSMTKIPLIGILPEKKSNALLKDEEISDFQVLLRKLASRNISEVQDRQIWSPGGS
          L  ++  ++    F      ++   N S D  +  + G +    ++ K  L        S  L+ +E  S  + +LR    +             G  
Subjt:  TFLDKIQLKLDEWKRFNLSRGGELCFPNGSGDTWKKTLRGARLSGVSMTKIPLIGILPEKKSNALLKDEEISDFQVLLRKLASRNISEVQDRQIWSPGGS

Query:  RKFTVKSLTLH--LSASSPLDKFLY---TALWKSNSPRRLC-NINWLIVASLPPFARCVYNMKKIANTCSSYAAFRPIVGKGHTFVLKKSNYLQEWAAVP
            V    +H  L    P    ++   +  W +N     C   +W+I+      +R    M+ +A+           +G G         +L  W  V 
Subjt:  RKFTVKSLTLH--LSASSPLDKFLY---TALWKSNSPRRLC-NINWLIVASLPPFARCVYNMKKIANTCSSYAAFRPIVGKGHTFVLKKSNYLQEWAAVP

Query:  QTYKKS--ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK
         +   S  ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK
Subjt:  QTYKKS--ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSK

A0A6J1DT66 uncharacterized protein LOC111024157 isoform X29.1e-5188.71Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHL LIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

A0A6J1DWJ8 uncharacterized protein LOC111024157 isoform X19.1e-5188.71Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E+  VP      ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHL LIEAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

SwissProt top hitse value%identityAlignment
P92555 Uncharacterized mitochondrial protein AtMg012503.2e-0538.89Show/hide
Query:  YSIF-LNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYT
        Y +F +N  P G    ++ +RQGDPLS +LFIL +EVL  L     + G   G  V  +   +  L FAD T
Subjt:  YSIF-LNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYT

Arabidopsis top hitse value%identityAlignment
AT2G43945.1 unknown protein4.7e-5292.73Show/hide
Query:  ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVRNVIEKPHND
        ELLAIQQQGPR+IGFFGTRNMGF+HQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAE+PELLTVILPQSLKKQPPESQELLSKV+NV+EKPHND
Subjt:  ELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVRNVIEKPHND

Query:  HLSLIEASSL
        HL L+EAS L
Subjt:  HLSLIEASSL

AT3G59870.1 unknown protein6.2e-5283.87Show/hide
Query:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL
        + E   VP      ELLAIQQQGPR IGFFGTRNMGF+HQELI+ILSYAMVITKNHIYTSGA+GTNAAVIRGALRAE+PELLTVILPQSLKKQPPESQEL
Subjt:  LQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIEILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQEL

Query:  LSKVRNVIEKPHNDHLSLIEASSL
        LSKV+NVIEKPHNDHL L+EAS L
Subjt:  LSKVRNVIEKPHNDHLSLIEASSL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-0638.89Show/hide
Query:  YSIF-LNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYT
        Y +F +N  P G    ++ +RQGDPLS +LFIL +EVL  L     + G   G  V  +   +  L FAD T
Subjt:  YSIF-LNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGAAGTTTGACCACAAGTGGATTGAATGGATCTTGGGTTGTGTTAAGAACCTGAGATACTCAATATTCCTCAATAGAAGACCGGGAGGAAGAACGTTGGCTAC
AAAGGTTATCAGGCAGGGTGACCCCCTTTCGTCGTTCCTCTTCATACTTGTAAGTGAAGTTCTTGATGCTCTAATTGAAAATCTACATCAAAATGGCCTTTATGAAGGTT
TTGTAGTGGGAAAGGACAAGATTCATGTTCCAATCCTTCAATTTGCAGATTACACCCTACTTTTTGCAAATGCGAAAGCAACATTTCTTGATAAAATTCAACTAAAGTTA
GATGAATGGAAGAGGTTTAACCTCTCGAGAGGGGGGGAGCTATGCTTTCCAAATGGGAGTGGAGATACATGGAAGAAGACTCTCCGTGGTGCAAGGTTATCAGGAGTATC
CATGACAAAGATTCCTTTAATCGGCATACTGCCAGAAAAGAAGTCAAATGCTTTGCTAAAAGATGAAGAGATTTCAGACTTCCAAGTACTGCTTCGAAAATTAGCATCAA
GAAATATTTCTGAAGTTCAAGACAGACAAATATGGTCACCTGGCGGCAGCAGAAAATTTACAGTAAAGTCCCTTACTCTTCACCTGTCTGCTTCCTCCCCTCTGGATAAA
TTCCTCTATACGGCCCTTTGGAAGTCTAATAGCCCAAGAAGGCTTTGCAACATAAATTGGCTAATAGTTGCCTCTCTCCCTCCATTTGCCCGCTGTGTCTACAACATGAA
GAAGATTGCCAACACTTGTTCTTCCTATGCAGCTTTTCGGCCAATTGTTGGGAAAGGACATACATTCGTTCTCAAAAAGTCCAACTATCTACAAGAATGGGCTGCGGTCC
CTCAAACTTACAAGAAATCCGAGTTATTGGCCATTCAACAACAAGGTCCTAGAGCCATTGGCTTCTTTGGAACCCGAAATATGGGTTTCTTGCATCAAGAACTCATTGAG
ATTCTTAGCTATGCAATGGTTATAACGAAGAACCACATCTATACTTCAGGAGCATCTGGAACCAATGCAGCAGTTATCAGAGGTGCATTGAGGGCTGAAAAACCAGAACT
ACTTACTGTCATTTTGCCACAAAGTTTGAAAAAACAACCTCCCGAAAGCCAGGAATTGTTATCCAAAGTTAGGAATGTGATAGAGAAGCCCCACAATGATCATTTATCTT
TGATAGAAGCTAGCAGTTTATTGGTAAAGAAAAAGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAGAAGTTTGACCACAAGTGGATTGAATGGATCTTGGGTTGTGTTAAGAACCTGAGATACTCAATATTCCTCAATAGAAGACCGGGAGGAAGAACGTTGGCTAC
AAAGGTTATCAGGCAGGGTGACCCCCTTTCGTCGTTCCTCTTCATACTTGTAAGTGAAGTTCTTGATGCTCTAATTGAAAATCTACATCAAAATGGCCTTTATGAAGGTT
TTGTAGTGGGAAAGGACAAGATTCATGTTCCAATCCTTCAATTTGCAGATTACACCCTACTTTTTGCAAATGCGAAAGCAACATTTCTTGATAAAATTCAACTAAAGTTA
GATGAATGGAAGAGGTTTAACCTCTCGAGAGGGGGGGAGCTATGCTTTCCAAATGGGAGTGGAGATACATGGAAGAAGACTCTCCGTGGTGCAAGGTTATCAGGAGTATC
CATGACAAAGATTCCTTTAATCGGCATACTGCCAGAAAAGAAGTCAAATGCTTTGCTAAAAGATGAAGAGATTTCAGACTTCCAAGTACTGCTTCGAAAATTAGCATCAA
GAAATATTTCTGAAGTTCAAGACAGACAAATATGGTCACCTGGCGGCAGCAGAAAATTTACAGTAAAGTCCCTTACTCTTCACCTGTCTGCTTCCTCCCCTCTGGATAAA
TTCCTCTATACGGCCCTTTGGAAGTCTAATAGCCCAAGAAGGCTTTGCAACATAAATTGGCTAATAGTTGCCTCTCTCCCTCCATTTGCCCGCTGTGTCTACAACATGAA
GAAGATTGCCAACACTTGTTCTTCCTATGCAGCTTTTCGGCCAATTGTTGGGAAAGGACATACATTCGTTCTCAAAAAGTCCAACTATCTACAAGAATGGGCTGCGGTCC
CTCAAACTTACAAGAAATCCGAGTTATTGGCCATTCAACAACAAGGTCCTAGAGCCATTGGCTTCTTTGGAACCCGAAATATGGGTTTCTTGCATCAAGAACTCATTGAG
ATTCTTAGCTATGCAATGGTTATAACGAAGAACCACATCTATACTTCAGGAGCATCTGGAACCAATGCAGCAGTTATCAGAGGTGCATTGAGGGCTGAAAAACCAGAACT
ACTTACTGTCATTTTGCCACAAAGTTTGAAAAAACAACCTCCCGAAAGCCAGGAATTGTTATCCAAAGTTAGGAATGTGATAGAGAAGCCCCACAATGATCATTTATCTT
TGATAGAAGCTAGCAGTTTATTGGTAAAGAAAAAGAATTAG
Protein sequenceShow/hide protein sequence
MSKKFDHKWIEWILGCVKNLRYSIFLNRRPGGRTLATKVIRQGDPLSSFLFILVSEVLDALIENLHQNGLYEGFVVGKDKIHVPILQFADYTLLFANAKATFLDKIQLKL
DEWKRFNLSRGGELCFPNGSGDTWKKTLRGARLSGVSMTKIPLIGILPEKKSNALLKDEEISDFQVLLRKLASRNISEVQDRQIWSPGGSRKFTVKSLTLHLSASSPLDK
FLYTALWKSNSPRRLCNINWLIVASLPPFARCVYNMKKIANTCSSYAAFRPIVGKGHTFVLKKSNYLQEWAAVPQTYKKSELLAIQQQGPRAIGFFGTRNMGFLHQELIE
ILSYAMVITKNHIYTSGASGTNAAVIRGALRAEKPELLTVILPQSLKKQPPESQELLSKVRNVIEKPHNDHLSLIEASSLLVKKKN