; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G30150 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G30150
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr1:24663962..24665015
RNA-Seq ExpressionCSPI01G30150
SyntenyCSPI01G30150
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036703.1 pol polyprotein [Cucumis melo var. makuwa]2.3e-7372.6Show/hide
Query:  LAKKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA----------
        L KKYKTEVA AKKF V  FLDYKM+DSKTVASI+EKL PSWKDFKNYLK+K KEIKLEE VVRLGIEENN+KA+KC +D   ++   +           
Subjt:  LAKKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA----------

Query:  -----DIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSL
             DIDLC VI ECNMV NSK+WWVDT ATHHI ANKNMFTSYVSVS+GEQLFM N S SKVE +GKVILKMT +KELTLNNV HVPDI KNLV GSL
Subjt:  -----DIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSL

Query:  LSKNGFKL
        LSKNGFKL
Subjt:  LSKNGFKL

KAA0042920.1 pol polyprotein [Cucumis melo var. makuwa]2.9e-7654.8Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FL E+APILLEGETDKEKQLA                                         KKY+ EVA  KKF V  FLDYKM+DSKTVASI+EKL P
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD------------------------------------------------------------
        SWKDFKNYLKHK KEIKLEE VVRLGIEENN+KA+KC +D                                                            
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD------------------------------------------------------------

Query:  ----------KVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNV
                  +VDEVSDGV DIDLC VI ECNMVGNSK+WWVDT AT HICANKNMFTSYVSVS+GEQLFM NS TSKV+ +GKVILKMT  KELTLNNV
Subjt:  ----------KVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNV

Query:  LHVPDICKNLVYGSLLSKNGFKL
        LHVPDI KNLV GSLLSKNGFKL
Subjt:  LHVPDICKNLVYGSLLSKNGFKL

KAA0048117.1 hypothetical protein E6C27_scaffold385G001960 [Cucumis melo var. makuwa]1.4e-5755.34Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FLTE+APIL EGET+KEKQLA                                         KKYKTEV  AKKF V  FLDYKM+ SKTV + V+++  
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLF
          K  KN+                       KK  +  + KVDEVSDGVADIDLC VILECNM+GNSK+WWVDT ATHHICANKN+FT YVS+S+GEQLF
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLF

Query:  MCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
        M +SSTSKV+ +GKVILKMT  KELTLNNVLHVPDI KNL+ GSLLSKN FKL
Subjt:  MCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

TYK03675.1 pol polyprotein [Cucumis melo var. makuwa]3.0e-6052.36Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FL E+A ILLEGETDKEKQLA                                         KKYKTEVA AKKFTV   LDYKM+DSKTV S V+++  
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA-------------------------------------DIDLCRVIL----
          +DFKNYLKHK KEIKLEE VVRLGIEENN+KA+KC +D   +    +                                      DI+L  VIL    
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA-------------------------------------DIDLCRVIL----

Query:  --ECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
          +     NSK+WWVDT  THHIC NKNMFTSYVSVS+GEQLFM N S SKVE +GKVILKMT  KELTLNNVLHVPDI KNLV GSLLSKNGFKL
Subjt:  --ECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

XP_008467076.1 PREDICTED: uncharacterized protein LOC103504514, partial [Cucumis melo]5.8e-5651.22Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKT---------V
        FL E+APIL EGETDKEKQLA                                          KYKTEVA AKKF V  FLDYKM+DSKT         V
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKT---------V

Query:  ASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD-------------------------KVDEVSDGVADIDLCRVILECNMVGN
        ASI+EKL PSWKDFKNYLKHK K+IKLEE VVRLGIEENN+KA+KC +D                         +VDEVSDGVADIDLC VI ECNMVGN
Subjt:  ASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD-------------------------KVDEVSDGVADIDLCRVILECNMVGN

Query:  SKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
        SK+WW                                     VER+GKVILKMT  KELTLNNVLHV DI KNLV GSLLSKNGFKL
Subjt:  SKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

TrEMBL top hitse value%identityAlignment
A0A1S3CTZ4 uncharacterized protein LOC1035045142.8e-5651.22Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKT---------V
        FL E+APIL EGETDKEKQLA                                          KYKTEVA AKKF V  FLDYKM+DSKT         V
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKT---------V

Query:  ASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD-------------------------KVDEVSDGVADIDLCRVILECNMVGN
        ASI+EKL PSWKDFKNYLKHK K+IKLEE VVRLGIEENN+KA+KC +D                         +VDEVSDGVADIDLC VI ECNMVGN
Subjt:  ASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD-------------------------KVDEVSDGVADIDLCRVILECNMVGN

Query:  SKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
        SK+WW                                     VER+GKVILKMT  KELTLNNVLHV DI KNLV GSLLSKNGFKL
Subjt:  SKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

A0A5A7T1C5 Pol polyprotein1.1e-7372.6Show/hide
Query:  LAKKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA----------
        L KKYKTEVA AKKF V  FLDYKM+DSKTVASI+EKL PSWKDFKNYLK+K KEIKLEE VVRLGIEENN+KA+KC +D   ++   +           
Subjt:  LAKKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA----------

Query:  -----DIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSL
             DIDLC VI ECNMV NSK+WWVDT ATHHI ANKNMFTSYVSVS+GEQLFM N S SKVE +GKVILKMT +KELTLNNV HVPDI KNLV GSL
Subjt:  -----DIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSL

Query:  LSKNGFKL
        LSKNGFKL
Subjt:  LSKNGFKL

A0A5A7THY2 Pol polyprotein1.4e-7654.8Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FL E+APILLEGETDKEKQLA                                         KKY+ EVA  KKF V  FLDYKM+DSKTVASI+EKL P
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD------------------------------------------------------------
        SWKDFKNYLKHK KEIKLEE VVRLGIEENN+KA+KC +D                                                            
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMD------------------------------------------------------------

Query:  ----------KVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNV
                  +VDEVSDGV DIDLC VI ECNMVGNSK+WWVDT AT HICANKNMFTSYVSVS+GEQLFM NS TSKV+ +GKVILKMT  KELTLNNV
Subjt:  ----------KVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNV

Query:  LHVPDICKNLVYGSLLSKNGFKL
        LHVPDI KNLV GSLLSKNGFKL
Subjt:  LHVPDICKNLVYGSLLSKNGFKL

A0A5A7U1K6 Uncharacterized protein6.7e-5855.34Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FLTE+APIL EGET+KEKQLA                                         KKYKTEV  AKKF V  FLDYKM+ SKTV + V+++  
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLF
          K  KN+                       KK  +  + KVDEVSDGVADIDLC VILECNM+GNSK+WWVDT ATHHICANKN+FT YVS+S+GEQLF
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVADIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLF

Query:  MCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
        M +SSTSKV+ +GKVILKMT  KELTLNNVLHVPDI KNL+ GSLLSKN FKL
Subjt:  MCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

A0A5D3BX52 Pol polyprotein1.4e-6052.36Show/hide
Query:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP
        FL E+A ILLEGETDKEKQLA                                         KKYKTEVA AKKFTV   LDYKM+DSKTV S V+++  
Subjt:  FLTENAPILLEGETDKEKQLA-----------------------------------------KKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLP

Query:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA-------------------------------------DIDLCRVIL----
          +DFKNYLKHK KEIKLEE VVRLGIEENN+KA+KC +D   +    +                                      DI+L  VIL    
Subjt:  SWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA-------------------------------------DIDLCRVIL----

Query:  --ECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL
          +     NSK+WWVDT  THHIC NKNMFTSYVSVS+GEQLFM N S SKVE +GKVILKMT  KELTLNNVLHVPDI KNLV GSLLSKNGFKL
Subjt:  --ECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-0532.89Show/hide
Query:  NSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLV
        +S  W +D+ ATHHI ++ N  + +   + G+ + + + ST  +   G   L  T  + L L+N+L+VP+I KNL+
Subjt:  NSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.3e-0633.72Show/hide
Query:  RVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLV
        R  L  N   N+  W +D+ ATHHI ++ N  + +   + G+ + + + ST  +   G   L  T  + L LN VL+VP+I KNL+
Subjt:  RVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCTCACAGAGAATGCTCCCATCTTACTTGAAGGAGAAACTGACAAAGAAAAGCAACTTGCAAAGAAGTACAAAACTGAAGTTGCTAGTGCAAAGAAATTTACCGT
TGAAAATTTTCTGGATTACAAAATGATGGATTCCAAAACTGTAGCATCAATAGTTGAAAAATTGCTACCCTCATGGAAGGACTTCAAAAATTATCTCAAGCATAAACACA
AAGAGATAAAACTTGAGGAATTTGTGGTCCGACTTGGGATTGAAGAGAATAATAAAAAGGCGAAAAAGTGTACTATGGATAAAGTTGATGAAGTTTCAGATGGTGTTGCA
GATATTGACCTTTGTAGAGTCATTTTAGAATGCAACATGGTGGGCAATTCCAAGAAATGGTGGGTAGACACTAGGGCTACTCATCATATTTGTGCCAACAAGAATATGTT
CACATCATATGTGTCAGTCTCTAGTGGAGAACAATTATTTATGTGTAACTCATCTACTTCAAAGGTTGAGAGACGAGGAAAAGTGATTCTTAAGATGACCTATGACAAGG
AACTCACTCTCAATAATGTGCTTCATGTTCCTGATATTTGCAAGAACTTAGTTTATGGTTCTTTGCTTAGTAAGAATGGCTTCAAGTTG
mRNA sequenceShow/hide mRNA sequence
ATGTTTCTCACAGAGAATGCTCCCATCTTACTTGAAGGAGAAACTGACAAAGAAAAGCAACTTGCAAAGAAGTACAAAACTGAAGTTGCTAGTGCAAAGAAATTTACCGT
TGAAAATTTTCTGGATTACAAAATGATGGATTCCAAAACTGTAGCATCAATAGTTGAAAAATTGCTACCCTCATGGAAGGACTTCAAAAATTATCTCAAGCATAAACACA
AAGAGATAAAACTTGAGGAATTTGTGGTCCGACTTGGGATTGAAGAGAATAATAAAAAGGCGAAAAAGTGTACTATGGATAAAGTTGATGAAGTTTCAGATGGTGTTGCA
GATATTGACCTTTGTAGAGTCATTTTAGAATGCAACATGGTGGGCAATTCCAAGAAATGGTGGGTAGACACTAGGGCTACTCATCATATTTGTGCCAACAAGAATATGTT
CACATCATATGTGTCAGTCTCTAGTGGAGAACAATTATTTATGTGTAACTCATCTACTTCAAAGGTTGAGAGACGAGGAAAAGTGATTCTTAAGATGACCTATGACAAGG
AACTCACTCTCAATAATGTGCTTCATGTTCCTGATATTTGCAAGAACTTAGTTTATGGTTCTTTGCTTAGTAAGAATGGCTTCAAGTTG
Protein sequenceShow/hide protein sequence
MFLTENAPILLEGETDKEKQLAKKYKTEVASAKKFTVENFLDYKMMDSKTVASIVEKLLPSWKDFKNYLKHKHKEIKLEEFVVRLGIEENNKKAKKCTMDKVDEVSDGVA
DIDLCRVILECNMVGNSKKWWVDTRATHHICANKNMFTSYVSVSSGEQLFMCNSSTSKVERRGKVILKMTYDKELTLNNVLHVPDICKNLVYGSLLSKNGFKL