; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G22340 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G22340
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr4:20680619..20681179
RNA-Seq ExpressionCSPI04G22340
SyntenyCSPI04G22340
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036574.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

KAA0043149.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

KAA0065380.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

KAE8649813.1 hypothetical protein Csa_012717 [Cucumis sativus]1.1e-9894.18Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAY +KFWGE ALTSVYI N LSSKVTHNVSPFERLYGTSPSYFNLKIF CACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTSHCG
        PHEHTKLEPRA LCCFLGYGTEHKGFRCWD ISQRL  SRHVTFWEH LFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTSHCG
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTSHCG

TYK12316.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]9.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

TrEMBL top hitse value%identityAlignment
A0A5A7SZ66 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

A0A5A7VDW0 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

A0A5D3BW47 Retrovirus-related Pol polyprotein from transposon TNT 1-941.8e-8381.18Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDS+RAQLL  + SEKFWGEA LTSVY+IN L S+V HN+SPFERLYGT PSY +LK+FGCA FVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
        PHEHTKLEPRACLCCFLGYGT+HKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSP+ FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

A0A5D3DG18 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

A0A5D3DWU7 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-8482.26Show/hide
Query:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH
        + FLA QGTLIQRSCPHTSQQN R ERKHRHILDSVRAQLLS +  EKFWGEAALTSVY+IN L S+V HN+SPFERLYGT P+Y +LK+FGCACFVLLH
Subjt:  MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLH

Query:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS
         HEHTKLEPRA LCCFLGYGTEHKGFRCWDPISQRL  SRHVTFWEHR+FSSLSSFH  LSSPH FF DPS  LFPT DSPS+TTS
Subjt:  PHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTS

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.2e-1329.41Show/hide
Query:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSK--VTHNVSPFERLYGTSPSYFNLKIFGCACFVLL
        +F   +G     + PHT Q N   ER  R I +  R  +  A   + FWGEA LT+ Y+IN + S+  V  + +P+E  +   P   +L++FG   +V +
Subjt:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSK--VTHNVSPFERLYGTSPSYFNLKIFGCACFVLL

Query:  HPHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQR--LSRHVTFWEHRLFSS
          ++  K + ++    F+GY  E  GF+ WD ++++  ++R V   E  + +S
Subjt:  HPHEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQR--LSRHVTFWEHRLFSS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.1e-2032.63Show/hide
Query:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP
        E+ +  G   +++ P T Q N   ER +R I++ VR+ L  A   + FWGEA  T+ Y+IN   S       P         SY +LK+FGC  F  +  
Subjt:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP

Query:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFID-PSIDLFPT-LDSPSDTTSHCG
         + TKL+ ++  C F+GYG E  G+R WDP+ +++  SR V F E  + ++     +  +   P F+  PS    PT  +S +D  S  G
Subjt:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSFHEFLSSPHPFFID-PSIDLFPT-LDSPSDTTSHCG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.4e-2338.36Show/hide
Query:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP
        E+ +  G     S PHT + N   ERKHRHI+++    L  A+  + +W  A   +VY+IN L + +    SPF++L+GTSP+Y  L++FGCAC+  L P
Subjt:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP

Query:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEH
        +   KL+ ++  C FLGY      + C    + RL  SRHV F E+
Subjt:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-2236.77Show/hide
Query:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP
        ++L+  G     S PHT + N   ERKHRHI++     L  A+  + +W  A   +VY+IN L + +    SPF++L+G  P+Y  LK+FGCAC+  L P
Subjt:  EFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHP

Query:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSF
        +   KLE ++  C F+GY      + C    + RL  SRHV F E     S ++F
Subjt:  HEHTKLEPRACLCCFLGYGTEHKGFRCWDPISQRL--SRHVTFWEHRLFSSLSSF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTTTCTTGCCCATCAAGGTACTCTAATTCAACGCTCATGTCCTCATACATCTCAGCAAAATAGAAGAGTTGAACGCAAACATCGTCACATTCTTGACTCTGTTCG
CGCTCAGCTTCTATCTGCTGCTTATTCAGAAAAATTCTGGGGCGAGGCTGCCCTCACCTCTGTCTATATCATCAATTGTCTTTCTTCAAAAGTCACTCACAATGTTTCCC
CCTTTGAACGACTATACGGTACTTCCCCCTCTTACTTCAATCTCAAGATTTTTGGTTGTGCATGTTTCGTATTATTACATCCTCATGAACATACCAAACTTGAACCACGT
GCATGTCTATGTTGTTTCTTGGGTTATGGTACTGAACATAAAGGATTTCGTTGTTGGGACCCCATCTCTCAACGATTATCTCGTCACGTTACCTTTTGGGAACATCGTCT
GTTTTCTAGTCTTTCTTCATTCCATGAATTTCTTTCAAGTCCTCACCCATTCTTCATCGATCCTTCTATTGACCTCTTTCCCACACTTGACTCGCCGTCTGACACTACAT
CACATTGTGGG
mRNA sequenceShow/hide mRNA sequence
ATGGAGTTTCTTGCCCATCAAGGTACTCTAATTCAACGCTCATGTCCTCATACATCTCAGCAAAATAGAAGAGTTGAACGCAAACATCGTCACATTCTTGACTCTGTTCG
CGCTCAGCTTCTATCTGCTGCTTATTCAGAAAAATTCTGGGGCGAGGCTGCCCTCACCTCTGTCTATATCATCAATTGTCTTTCTTCAAAAGTCACTCACAATGTTTCCC
CCTTTGAACGACTATACGGTACTTCCCCCTCTTACTTCAATCTCAAGATTTTTGGTTGTGCATGTTTCGTATTATTACATCCTCATGAACATACCAAACTTGAACCACGT
GCATGTCTATGTTGTTTCTTGGGTTATGGTACTGAACATAAAGGATTTCGTTGTTGGGACCCCATCTCTCAACGATTATCTCGTCACGTTACCTTTTGGGAACATCGTCT
GTTTTCTAGTCTTTCTTCATTCCATGAATTTCTTTCAAGTCCTCACCCATTCTTCATCGATCCTTCTATTGACCTCTTTCCCACACTTGACTCGCCGTCTGACACTACAT
CACATTGTGGG
Protein sequenceShow/hide protein sequence
MEFLAHQGTLIQRSCPHTSQQNRRVERKHRHILDSVRAQLLSAAYSEKFWGEAALTSVYIINCLSSKVTHNVSPFERLYGTSPSYFNLKIFGCACFVLLHPHEHTKLEPR
ACLCCFLGYGTEHKGFRCWDPISQRLSRHVTFWEHRLFSSLSSFHEFLSSPHPFFIDPSIDLFPTLDSPSDTTSHCG