; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G22650 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G22650
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:18202349..18203081
RNA-Seq ExpressionCSPI01G22650
SyntenyCSPI01G22650
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026100.1 uncharacterized protein E6C27_scaffold19G00360 [Cucumis melo var. makuwa]3.0e-5259.89Show/hide
Query:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT
        TLALPILKGYKLE HLTGE PCP  F+ S                                             LYNS+TP+V +QL+GFTN +D+W+AT
Subjt:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT

Query:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQS
         DFFGV+SRAEEDFLRQ  QTTRKGN+ ME+YL +MKTN DNLGQ  SP+PRRALISQVLLGLDEVYN VIVVIQGKP+ISWLDMQS
Subjt:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQS

KAA0057475.1 uncharacterized protein E6C27_scaffold280G003560 [Cucumis melo var. makuwa]1.2e-4074.34Show/hide
Query:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
        +YNS+ P+V +QL+GF  AKD+WEA  + FG++SRAEE FLR TFQTTR+GN  MEDYLRIMK NADNLGQA SP+P R LISQVLLGLDEVYNPV  VI
Subjt:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI

Query:  QGKPEISWLDMQS
        QGKP+ISWLDMQS
Subjt:  QGKPEISWLDMQS

KGN65684.1 hypothetical protein Csa_019689 [Cucumis sativus]4.4e-56100Show/hide
Query:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
        LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
Subjt:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI

Query:  QGKPEISWLDMQS
        QGKPEISWLDMQS
Subjt:  QGKPEISWLDMQS

TYJ96311.1 uncharacterized protein E5676_scaffold1970G00140 [Cucumis melo var. makuwa]2.3e-3653.8Show/hide
Query:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT
        TLALPILKGYKLE HLTGE PCP  F+ S                                             LYNS+TP+V +QL+GFTN +D+W+AT
Subjt:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT

Query:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQ
         DFFGV+SRAEEDFLRQ  QTTRKGN+ ME+YL +MKTN DNLGQ  SP+PRRALISQ
Subjt:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQ

TYK05754.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.9e-4078.1Show/hide
Query:  VVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISW
        + +QL+GFTNAKD+WEAT D FGV+SRAEEDFLRQ FQTTRK  ++ EDYLRIMKTN+D LGQA SP+P+RA ISQ LLGLDEVYNPVI VIQGKPEISW
Subjt:  VVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISW

Query:  LDMQS
        +DMQS
Subjt:  LDMQS

TrEMBL top hitse value%identityAlignment
A0A0A0LXB7 Uncharacterized protein2.1e-56100Show/hide
Query:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
        LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
Subjt:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI

Query:  QGKPEISWLDMQS
        QGKPEISWLDMQS
Subjt:  QGKPEISWLDMQS

A0A5A7SIT7 Uncharacterized protein1.4e-5259.89Show/hide
Query:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT
        TLALPILKGYKLE HLTGE PCP  F+ S                                             LYNS+TP+V +QL+GFTN +D+W+AT
Subjt:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT

Query:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQS
         DFFGV+SRAEEDFLRQ  QTTRKGN+ ME+YL +MKTN DNLGQ  SP+PRRALISQVLLGLDEVYN VIVVIQGKP+ISWLDMQS
Subjt:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQS

A0A5D3BCH9 Uncharacterized protein1.1e-3653.8Show/hide
Query:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT
        TLALPILKGYKLE HLTGE PCP  F+ S                                             LYNS+TP+V +QL+GFTN +D+W+AT
Subjt:  TLALPILKGYKLERHLTGEKPCPEKFITSLP-------------------------------------------LYNSVTPEVVVQLIGFTNAKDMWEAT

Query:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQ
         DFFGV+SRAEEDFLRQ  QTTRKGN+ ME+YL +MKTN DNLGQ  SP+PRRALISQ
Subjt:  HDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQ

A0A5D3C373 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-4078.1Show/hide
Query:  VVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISW
        + +QL+GFTNAKD+WEAT D FGV+SRAEEDFLRQ FQTTRK  ++ EDYLRIMKTN+D LGQA SP+P+RA ISQ LLGLDEVYNPVI VIQGKPEISW
Subjt:  VVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISW

Query:  LDMQS
        +DMQS
Subjt:  LDMQS

A0A5D3E3L7 Uncharacterized protein5.7e-4174.34Show/hide
Query:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI
        +YNS+ P+V +QL+GF  AKD+WEA  + FG++SRAEE FLR TFQTTR+GN  MEDYLRIMK NADNLGQA SP+P R LISQVLLGLDEVYNPV  VI
Subjt:  LYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVI

Query:  QGKPEISWLDMQS
        QGKP+ISWLDMQS
Subjt:  QGKPEISWLDMQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)2.2e-0527.5Show/hide
Query:  ITSLPLYNSVTP-EVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYN
        I  L LY ++TP +     +  + ++D+W    + F     A    L    +T   G+  + DY R MK  AD+L   + P+  R L+  VL GL+  ++
Subjt:  ITSLPLYNSVTP-EVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYN

Query:  PVIVVIQGKPEISWLDMQST
         +I VI+ +      D  +T
Subjt:  PVIVVIQGKPEISWLDMQST

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.4e-0525.96Show/hide
Query:  LYNSVTPEVVVQLIGF-TNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVV
        +Y ++T  ++  +I     A+D+W +  + F     A         +TT   + ++ +Y + +K+ +D L   +SPI  R L+  +L GL E Y+ ++ V
Subjt:  LYNSVTPEVVVQLIGF-TNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTNADNLGQAESPIPRRALISQVLLGLDEVYNPVIVV

Query:  IQGK
        I+ K
Subjt:  IQGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAACGTCACTCCAAATTTCTCTGTTCTTGACACACCTACGCTAGCTCTACCCATTCTGAAAGGATACAAGTTAGAAAGACATTTAACAGGTGAGAAACCTTGCCC
TGAAAAGTTTATCACTTCACTACCATTGTACAACTCAGTGACACCTGAAGTAGTTGTTCAACTAATAGGCTTCACAAATGCCAAAGATATGTGGGAAGCAACACATGATT
TCTTTGGCGTTCGATCAAGAGCAGAGGAGGACTTCCTTCGACAAACCTTTCAAACAACAAGAAAAGGTAATTCTAACATGGAGGATTATCTAAGAATTATGAAAACTAAT
GCTGACAATCTTGGCCAAGCCGAAAGTCCTATTCCGAGACGTGCCCTTATTTCACAGGTTTTGTTGGGATTGGATGAAGTTTATAATCCTGTCATAGTAGTCATTCAAGG
TAAGCCAGAGATATCATGGCTTGATATGCAGTCAACTTCTAATTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCAACGTCACTCCAAATTTCTCTGTTCTTGACACACCTACGCTAGCTCTACCCATTCTGAAAGGATACAAGTTAGAAAGACATTTAACAGGTGAGAAACCTTGCCC
TGAAAAGTTTATCACTTCACTACCATTGTACAACTCAGTGACACCTGAAGTAGTTGTTCAACTAATAGGCTTCACAAATGCCAAAGATATGTGGGAAGCAACACATGATT
TCTTTGGCGTTCGATCAAGAGCAGAGGAGGACTTCCTTCGACAAACCTTTCAAACAACAAGAAAAGGTAATTCTAACATGGAGGATTATCTAAGAATTATGAAAACTAAT
GCTGACAATCTTGGCCAAGCCGAAAGTCCTATTCCGAGACGTGCCCTTATTTCACAGGTTTTGTTGGGATTGGATGAAGTTTATAATCCTGTCATAGTAGTCATTCAAGG
TAAGCCAGAGATATCATGGCTTGATATGCAGTCAACTTCTAATTTTTGA
Protein sequenceShow/hide protein sequence
MANVTPNFSVLDTPTLALPILKGYKLERHLTGEKPCPEKFITSLPLYNSVTPEVVVQLIGFTNAKDMWEATHDFFGVRSRAEEDFLRQTFQTTRKGNSNMEDYLRIMKTN
ADNLGQAESPIPRRALISQVLLGLDEVYNPVIVVIQGKPEISWLDMQSTSNF