CuGenDBv2

CSPI03G05950 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI03G05950
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Chr3:5029033..5031176
RNA-Seq Expression: CSPI03G05950
Synteny: CSPI03G05950
Gene Ontology terms: NA
InterPro domains: IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
                  IPR043502 - DNA/RNA polymerase superfamily
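
The genome location above uses a `chromosome:start..end` notation. The following is a minimal sketch of how such a string could be split into its parts; the function names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Chr3:5029033..5031176")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene spans 2144 bp, which is longer than the 810-nt CDS listed under Sequences.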


Homology
GenBank top hits
CAN80490.1 hypothetical protein VITISV_004703 [Vitis vinifera] (e-value: 8.7e-25; identity: 44.44%)
Query:  ASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRN
        A+I+D  +  T+ LT  L+  NH       +WVLDSGCTYHM P R WF++Y+E+NG  + +GNN  CN+ GIG + + + D   + L+ VRHVP LKRN
Subjt:  ASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRN

Query:  LISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK
        LISLG LD  G  +K K G   +   + + + G+K
Subjt:  LISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK

RVX11324.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 1.1e-24; identity: 35.62%)
Query:  DFNKLILG--------ETKRKTDLQMAIARVEIEKFDE-----KGDFT---LWKAKIKALLGQQKSHKALLDPPELPTTLTTPEASIVDSSFTYTDALTS
        +FNKL+L         E + K  + +      ++ F E     K D T   +  A    +L  + S K   D  +   T     A+I+D  +  T+ LT 
Subjt:  DFNKLILG--------ETKRKTDLQMAIARVEIEKFDE-----KGDFT---LWKAKIKALLGQQKSHKALLDPPELPTTLTTPEASIVDSSFTYTDALTS

Query:  TLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKG
         L+  NH       +WVLDSGCTYHM P R WF++Y+E+NG  + +GNN  CN+ GIG + + + D   + L+ VRHVP LKRNLISLG LD  G  +K 
Subjt:  TLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKG

Query:  KGGVFRVFMGSKLALVGEK
        K G   +   + + + G+K
Subjt:  KGGVFRVFMGSKLALVGEK

RVX13343.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera] (e-value: 1.4e-27; identity: 35.81%)
Query:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPEA-------------------------------------------SIVDS
        M+  + +IEKF+ + DF LWK  +KA+L QQ   KALL   +LP ++T  E                                            SI+D 
Subjt:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPEA-------------------------------------------SIVDS

Query:  SFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGM
         +     LT TL+Q NH       +WVLDSGCTYHM P   WF++Y+E NG  V +GNN  C++ GIG V + + D     L+ VRHVP LK NLISLG 
Subjt:  SFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGM

Query:  LDSLGCEYKGKGGVFRVFMGSKLALVGEK
        LD  G  +K +     +  G+ + + G+K
Subjt:  LDSLGCEYKGKGGVFRVFMGSKLALVGEK

XP_038891593.1 uncharacterized protein LOC120080985 [Benincasa hispida] (e-value: 4.6e-26; identity: 62.77%)
Query:  MTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK
        MTPFR WFNTY++++GE+V+MGNN  C I G+ LV++KLK   +KLLRNVRHVP LKRNLISLG+LD++GCE +GK G+  V   SK  +VGEK
Subjt:  MTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK

XP_038904482.1 uncharacterized protein LOC120090851 [Benincasa hispida] (e-value: 1.5e-40; identity: 41.04%)
Query:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLT----------------------------------------------------
        MA+AR +IEKFD KG+F LWKAKIKA+LGQQ++HKA+ DP +LPT  T                                                    
Subjt:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLT----------------------------------------------------

Query:  ---------------------TPEASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLV
                             + EAS+ +  + Y+DAL +TL+      S    DWVLDS C++HMTP + W +TYR+++G  V+MGNNN C + GI  V
Subjt:  ---------------------TPEASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLV

Query:  TMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGS
        ++KL+D +VKLLRNVRHVP LKRNLISLGM DS+ CEY+GK G   V   S
Subjt:  TMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGS

TrEMBL top hits
A0A2N9GSE2 Uncharacterized protein (e-value: 1.6e-24; identity: 31.28%)
Query:  LQMAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPE------ASIVDSSFTYTD-----------------------------
        +  +IA V + KFD  G+F LW+ ++K LL QQ   KAL    + P  +T  E       +IV+   T  D                             
Subjt:  LQMAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPE------ASIVDSSFTYTD-----------------------------

Query:  ------------ALTSTLDQANHVNSLG--------KHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVR
                    ++    D  +  + L          + W+LDS C++H+TP R WF+TYR IN   V MGN+  C I G+G + +K+ D  V+ L  VR
Subjt:  ------------ALTSTLDQANHVNSLG--------KHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVR

Query:  HVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK
        H+P +++NLISLG LDS G  YK + G+ +V  G+ + + G+K
Subjt:  HVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK

A0A2U1MW82 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 2.1e-24; identity: 47.66%)
Query:  NSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVF
        N   + + ++DSGCT+HMTP R+WF+TY   NG  V+MGN+ +C + G G + +K+ D  V+ +  VRHVP LKRNLISL  L++ GC+Y G+GGV ++F
Subjt:  NSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVF

Query:  MGSKLAL
         G+ L +
Subjt:  MGSKLAL

A0A438JQU4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 5.5e-25; identity: 35.62%)
Query:  DFNKLILG--------ETKRKTDLQMAIARVEIEKFDE-----KGDFT---LWKAKIKALLGQQKSHKALLDPPELPTTLTTPEASIVDSSFTYTDALTS
        +FNKL+L         E + K  + +      ++ F E     K D T   +  A    +L  + S K   D  +   T     A+I+D  +  T+ LT 
Subjt:  DFNKLILG--------ETKRKTDLQMAIARVEIEKFDE-----KGDFT---LWKAKIKALLGQQKSHKALLDPPELPTTLTTPEASIVDSSFTYTDALTS

Query:  TLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKG
         L+  NH       +WVLDSGCTYHM P R WF++Y+E+NG  + +GNN  CN+ GIG + + + D   + L+ VRHVP LKRNLISLG LD  G  +K 
Subjt:  TLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKG

Query:  KGGVFRVFMGSKLALVGEK
        K G   +   + + + G+K
Subjt:  KGGVFRVFMGSKLALVGEK

A0A438JWM8 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 7.0e-28; identity: 35.81%)
Query:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPEA-------------------------------------------SIVDS
        M+  + +IEKF+ + DF LWK  +KA+L QQ   KALL   +LP ++T  E                                            SI+D 
Subjt:  MAIARVEIEKFDEKGDFTLWKAKIKALLGQQKSHKALLDPPELPTTLTTPEA-------------------------------------------SIVDS

Query:  SFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGM
         +     LT TL+Q NH       +WVLDSGCTYHM P   WF++Y+E NG  V +GNN  C++ GIG V + + D     L+ VRHVP LK NLISLG 
Subjt:  SFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGM

Query:  LDSLGCEYKGKGGVFRVFMGSKLALVGEK
        LD  G  +K +     +  G+ + + G+K
Subjt:  LDSLGCEYKGKGGVFRVFMGSKLALVGEK

A5AVX7 Uncharacterized protein (e-value: 4.2e-25; identity: 44.44%)
Query:  ASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRN
        A+I+D  +  T+ LT  L+  NH       +WVLDSGCTYHM P R WF++Y+E+NG  + +GNN  CN+ GIG + + + D   + L+ VRHVP LKRN
Subjt:  ASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRN

Query:  LISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK
        LISLG LD  G  +K K G   +   + + + G+K
Subjt:  LISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK

SwissProt top hits
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e-value: 2.2e-07; identity: 46.67%)
Query:  EEPVLGITNVTQPFEVETDVCDYALGCVLLQNGHPISYNSWKLNAAKKKYIVSEKEMLAM
        E+P+L + + T+ F + TD  D ALG VL Q+GHP+SY S  LN  +  Y   EKE+LA+
Subjt:  EEPVLGITNVTQPFEVETDVCDYALGCVLLQNGHPISYNSWKLNAAKKKYIVSEKEMLAM

P0CT41 Transposon Tf2-12 polyprotein (e-value: 8.0e-05; identity: 32.74%)
Query:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNG-----HPISYNSWKLNAAKKKYIVSEKEMLAMSKVLDFNKLILGET----KRKTDLQMAIARVEI
        ++  PVL   + ++   +ETD  D A+G VL Q       +P+ Y S K++ A+  Y VS+KEMLA+ K L   +  L  T    K  TD +  I R+  
Subjt:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNG-----HPISYNSWKLNAAKKKYIVSEKEMLAMSKVLDFNKLILGET----KRKTDLQMAIARVEI

Query:  EKFDEKGDFTLWK
        E   E      W+
Subjt:  EKFDEKGDFTLWK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 1.6e-08; identity: 36.89%)
Query:  DWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLA
        +WV+D+  ++H TP R  F  Y   +   V MGN +   IAGIG + +K       +L++VRHVP L+ NLIS   LD  G E       +R+  GS + 
Subjt:  DWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLA

Query:  LVG
          G
Subjt:  LVG

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e-value: 3.2e-06; identity: 43.55%)
Query:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNGHPISYNSWKLNAAKKKYIVSEKEMLAM
        +I +P+L + +  + F + TD  + ALG VL QNGHPIS+ S  LN  +  Y   EKE+LA+
Subjt:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNGHPISYNSWKLNAAKKKYIVSEKEMLAM

Q9UR07 Transposon Tf2-11 polyprotein (e-value: 8.0e-05; identity: 32.74%)
Query:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNG-----HPISYNSWKLNAAKKKYIVSEKEMLAMSKVLDFNKLILGET----KRKTDLQMAIARVEI
        ++  PVL   + ++   +ETD  D A+G VL Q       +P+ Y S K++ A+  Y VS+KEMLA+ K L   +  L  T    K  TD +  I R+  
Subjt:  MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNG-----HPISYNSWKLNAAKKKYIVSEKEMLAMSKVLDFNKLILGET----KRKTDLQMAIARVEI

Query:  EKFDEKGDFTLWK
        E   E      W+
Subjt:  EKFDEKGDFTLWK

Arabidopsis top hits
AT3G21000.1 Gag-Pol-related retrotransposon family protein (e-value: 5.3e-04; identity: 30.77%)
Query:  WVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDS
        W++      +MTP+  +F T        V   +  V  + G G V +++K+   K +RNV  VP L RN++S G + S
Subjt:  WVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVKLLRNVRHVPHLKRNLISLGMLDS


Sequences
CDS sequence
ATGATTGAGGAGCCAGTTCTTGGAATCACTAATGTGACCCAACCTTTTGAAGTTGAGACTGATGTGTGTGATTATGCTCTAGGTTGTGTTCTCCTGCAGAATGGACACCC
CATCTCATACAATAGTTGGAAACTGAATGCCGCAAAAAAGAAATATATAGTGTCTGAAAAAGAAATGCTTGCCATGAGCAAGGTTCTTGATTTCAACAAACTGATTCTTG
GAGAAACTAAAAGAAAGACTGACCTACAAATGGCCATAGCAAGAGTAGAAATCGAGAAGTTCGATGAAAAGGGAGACTTTACATTATGGAAAGCAAAGATCAAAGCCTTG
CTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTCCAGAACTTCCAACAACCCTCACAACACCGGAAGCCTCCATAGTAGATAGTTCTTTTACCTACACTGATGC
CTTGACATCAACCTTAGATCAAGCCAACCATGTAAACTCCTTAGGAAAACATGATTGGGTTCTAGACTCTGGATGCACCTACCATATGACACCTTTTAGAACATGGTTTA
ATACCTATAGGGAAATCAATGGAGAATATGTGTTCATGGGAAATAATAATGTATGTAACATTGCTGGAATTGGATTAGTTACCATGAAATTAAAAGATGAGACTGTAAAA
CTCCTTAGAAATGTAAGACATGTTCCTCACCTCAAAAGAAATTTAATCTCCCTAGGAATGCTAGACTCTCTAGGGTGTGAATACAAAGGGAAAGGTGGAGTTTTTCGAGT
CTTTATGGGATCTAAGTTAGCCTTGGTTGGGGAAAAGTAA
mRNA sequence
ATGATTGAGGAGCCAGTTCTTGGAATCACTAATGTGACCCAACCTTTTGAAGTTGAGACTGATGTGTGTGATTATGCTCTAGGTTGTGTTCTCCTGCAGAATGGACACCC
CATCTCATACAATAGTTGGAAACTGAATGCCGCAAAAAAGAAATATATAGTGTCTGAAAAAGAAATGCTTGCCATGAGCAAGGTTCTTGATTTCAACAAACTGATTCTTG
GAGAAACTAAAAGAAAGACTGACCTACAAATGGCCATAGCAAGAGTAGAAATCGAGAAGTTCGATGAAAAGGGAGACTTTACATTATGGAAAGCAAAGATCAAAGCCTTG
CTTGGACAGCAAAAGTCTCATAAAGCCCTTTTAGATCCTCCAGAACTTCCAACAACCCTCACAACACCGGAAGCCTCCATAGTAGATAGTTCTTTTACCTACACTGATGC
CTTGACATCAACCTTAGATCAAGCCAACCATGTAAACTCCTTAGGAAAACATGATTGGGTTCTAGACTCTGGATGCACCTACCATATGACACCTTTTAGAACATGGTTTA
ATACCTATAGGGAAATCAATGGAGAATATGTGTTCATGGGAAATAATAATGTATGTAACATTGCTGGAATTGGATTAGTTACCATGAAATTAAAAGATGAGACTGTAAAA
CTCCTTAGAAATGTAAGACATGTTCCTCACCTCAAAAGAAATTTAATCTCCCTAGGAATGCTAGACTCTCTAGGGTGTGAATACAAAGGGAAAGGTGGAGTTTTTCGAGT
CTTTATGGGATCTAAGTTAGCCTTGGTTGGGGAAAAGTAA
Protein sequence
MIEEPVLGITNVTQPFEVETDVCDYALGCVLLQNGHPISYNSWKLNAAKKKYIVSEKEMLAMSKVLDFNKLILGETKRKTDLQMAIARVEIEKFDEKGDFTLWKAKIKAL
LGQQKSHKALLDPPELPTTLTTPEASIVDSSFTYTDALTSTLDQANHVNSLGKHDWVLDSGCTYHMTPFRTWFNTYREINGEYVFMGNNNVCNIAGIGLVTMKLKDETVK
LLRNVRHVPHLKRNLISLGMLDSLGCEYKGKGGVFRVFMGSKLALVGEK
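
The protein sequence above corresponds to the translation of the CDS. As a rough sketch of how that correspondence could be checked, the snippet below translates a DNA string with the standard genetic code (an assumption; the record does not say which translation table was used). The function name `translate` and the short test fragments, taken from the start and end of the CDS, are illustrative only.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon: end of the reading frame
            break
        protein.append(residue)
    return "".join(protein)


# First six codons and last five codons of the CDS listed above.
print(translate("ATGATTGAGGAGCCAGTT"))  # MIEEPV
print(translate("GTTGGGGAAAAGTAA"))     # VGEK
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the listed protein sequence would be one way to confirm the two are consistent.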