; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G10170 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G10170
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase
Genome locationChr4:8171184..8173218
RNA-Seq ExpressionCSPI04G10170
SyntenyCSPI04G10170
Gene Ontology termsNA
InterPro domainsIPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0048713.1 hypothetical protein E6C27_scaffold43G00050 [Cucumis melo var. makuwa]7.9e-5242.17Show/hide
Query:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
        M+K C HH I  C+++EQFYFGLS++T Q    VF GGMLR S N IK  LD M SNSQEWRD  FGS N+S+G +  +GR   G + + M+ LQ QV E
Subjt:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE

Query:  MNKLLQSMALSQLNAIGSSIKVVHQVLNL--------------VVDREIISTLAG-VDKIKIRNSDRAIDLTTENKNMSSEEVEHSRWEEQHKDAFDKAK
        MN +LQSMAL Q+N + SS+++V QV  +               ++ EI++ L      I I        ++++  ++ + E++  +          ++ 
Subjt:  MNKLLQSMALSQLNAIGSSIKVVHQVLNL--------------VVDREIISTLAG-VDKIKIRNSDRAIDLTTENKNMSSEEVEHSRWEEQHKDAFDKAK

Query:  SYNPLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQ
          N  +P    E+ P+     K++ +  ++  G E+    ++TS+   I  P KM    SFTVP SI RM+L R LC+LGASINLM LSIFKK +IGE+Q
Subjt:  SYNPLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQ

Query:  PTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMF
        PT MRL F DRSI KP+  +ED+LVK DKF F
Subjt:  PTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMF

KGN53801.1 hypothetical protein Csa_014772 [Cucumis sativus]2.2e-6298.47Show/hide
Query:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
        MMKGCLHHNILGC+IIEQFYFGLSRDTQQFV AVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
Subjt:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE

Query:  MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV
        MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV
Subjt:  MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV

XP_030477929.1 uncharacterized protein LOC115694963 [Cannabis sativa]2.5e-4554.68Show/hide
Query:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR
        EQMP+YV FLKDIL  KR++   ET+ALTE  S +   + P K+ DPGSFT+P SIG  ++GR LC+LGASINLM +SIFKK  IGE +PT + L   DR
Subjt:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR

Query:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEE--NFEEETSSLEEVNVIVGEKS
        S+A PEG IEDVLV+VDKF+F  +FIIL+Y ADR+VPIILG   FLAT  TLIDV  GE+TM   +++   ++FN     +  EE S +  ++ IV EK 
Subjt:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEE--NFEEETSSLEEVNVIVGEKS

Query:  VEE
         +E
Subjt:  VEE

XP_030478287.1 uncharacterized protein LOC115695357 [Cannabis sativa]4.2e-4549.38Show/hide
Query:  NKNMSSEEVEHSRWEEQHKDAFDKAKSYN---PLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMN
        + ++SS+  + S+ + Q K   D  K  +   PL   L  EQMP+YV FLKDIL  KR++   ET+ALTE  S +   + P K+ DPGSFT+P SIG  +
Subjt:  NKNMSSEEVEHSRWEEQHKDAFDKAKSYN---PLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMN

Query:  LGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEM
        +GR LC+LGASINLM +SIFKK  IGE +PT + L   DRS+A PEG IEDVLV+VDKF+F  +FIIL+Y ADR+VPIILG   FLAT  TLIDV  GE+
Subjt:  LGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEM

Query:  TMYYCEEEAYQDLFNTEE--NFEEETSSLEEVNVIVGEKSVEE
        TM   +++   ++FN     +  EE S +  ++ +V EK  +E
Subjt:  TMYYCEEEAYQDLFNTEE--NFEEETSSLEEVNVIVGEKSVEE

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]3.5e-4454.73Show/hide
Query:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR
        EQMP+YV FLKDIL  KR++   ET+ALTE  S +   + P K+ DPGSFT+P SIG  ++GR LC+LGASINLM +SIFKK  IGE +PT + L   DR
Subjt:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR

Query:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEENFEEETSSLEEVNVIVGEKSVE
        S+A PEG IEDVLV+VDKF+F  +FIIL+Y ADR+VPIILG   FLAT  TLIDV  GE+TM   +E              EE S +  ++ IV EK  +
Subjt:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEENFEEETSSLEEVNVIVGEKSVE

Query:  E
        E
Subjt:  E

TrEMBL top hitse value%identityAlignment
A0A0A0L0Y4 Uncharacterized protein1.1e-6298.47Show/hide
Query:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
        MMKGCLHHNILGC+IIEQFYFGLSRDTQQFV AVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
Subjt:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE

Query:  MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV
        MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV
Subjt:  MNKLLQSMALSQLNAIGSSIKVVHQVLNLVV

A0A5A7VJ95 Uncharacterized protein3.2e-4352.56Show/hide
Query:  SRW-EEQHKDAFDKAKSYNPLSPPLFP-------------------EQMPSYVMFLKDI--LANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTV
        S W +EQ  DA  K  S NPL PP FP                   +Q+   ++F+  +  +A K +    ET+ALT  TS+I     P KMTD  SFTV
Subjt:  SRW-EEQHKDAFDKAKSYNPLSPPLFP-------------------EQMPSYVMFLKDI--LANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTV

Query:  PSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTL
        P SI  M+LG  LC+LGASINLM LSIFKK +I E+QP  MRL F DRSIAK EG IED+LVKVDKF+FH +FIIL+Y A++EVPII G   FL+T H L
Subjt:  PSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTL

Query:  IDVHQGEMTMYYCEE
        IDVHQGE+TM + +E
Subjt:  IDVHQGEMTMYYCEE

A0A5D3CC26 Uncharacterized protein3.8e-5242.17Show/hide
Query:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE
        M+K C HH I  C+++EQFYFGLS++T Q    VF GGMLR S N IK  LD M SNSQEWRD  FGS N+S+G +  +GR   G + + M+ LQ QV E
Subjt:  MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTE

Query:  MNKLLQSMALSQLNAIGSSIKVVHQVLNL--------------VVDREIISTLAG-VDKIKIRNSDRAIDLTTENKNMSSEEVEHSRWEEQHKDAFDKAK
        MN +LQSMAL Q+N + SS+++V QV  +               ++ EI++ L      I I        ++++  ++ + E++  +          ++ 
Subjt:  MNKLLQSMALSQLNAIGSSIKVVHQVLNL--------------VVDREIISTLAG-VDKIKIRNSDRAIDLTTENKNMSSEEVEHSRWEEQHKDAFDKAK

Query:  SYNPLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQ
          N  +P    E+ P+     K++ +  ++  G E+    ++TS+   I  P KM    SFTVP SI RM+L R LC+LGASINLM LSIFKK +IGE+Q
Subjt:  SYNPLSPPLFPEQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQ

Query:  PTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMF
        PT MRL F DRSI KP+  +ED+LVK DKF F
Subjt:  PTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMF

A0A6J1CPJ3 uncharacterized protein LOC1110129477.2e-4359.88Show/hide
Query:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR
        EQMP+Y  FLKDI+  K+K+   ET+ALTE +SN+   + P K+ DPGSFT+   IG  ++GR LC+LGA INLM LSIFKK +IG+  PT + L   DR
Subjt:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR

Query:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTM
        SI KPEG IEDVLVKVDKF+F  +FIIL+  AD++VPIILG   FLAT  TLIDV +GE+TM
Subjt:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTM

A0A6J1DY39 uncharacterized protein LOC1110256531.7e-4452.26Show/hide
Query:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR
        EQMP+Y  F+KDI+  K+K+   ET+ALTE +SN+   + P K+ DPGSFT+P  IG  ++GR LC+LGASINLM LSIFKK++IG+  PT + L   DR
Subjt:  EQMPSYVMFLKDILANKRKITGLETMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDR

Query:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEENFEEETSSLEEVNVIVGEKSV
        SI KPEG IEDVLVKVDKF+F T+FIIL+  AD++VPIILG   FLAT  TLIDV +GE+TM   +++   ++ +  +  ++    +EE  VI  +K +
Subjt:  SIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADREVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEENFEEETSSLEEVNVIVGEKSV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGAAAGGGTGTCTTCATCACAACATTCTAGGATGCATTATAATCGAACAATTCTATTTTGGATTGAGTAGAGATACACAACAATTTGTATACGCAGTATTTATAGG
TGGGATGCTTCGATTATCCTGCAACCATATTAAGACAACGCTAGACGCTATGGTCAGTAACAGTCAAGAATGGAGAGATAACGAATTTGGCTCACATAATGAAAGCAAAG
GAAACAGAAGAGAAAAAGGAAGGACAAACAAAGGATCAAATGGGAATGCAATGATAACATTGCAGAGTCAGGTTACTGAAATGAATAAGCTCTTGCAGTCCATGGCTTTG
TCACAACTCAATGCCATTGGAAGTTCAATTAAAGTAGTACACCAAGTGTTGAATTTGGTTGTGGATAGAGAAATCATCTCAACTTTAGCTGGGGTGGACAAGATCAAAAT
ACGAAACAGTGACAGGGCAATTGATTTAACCACAGAGAATAAGAACATGTCAAGCGAGGAAGTGGAACATTCAAGATGGGAAGAACAACACAAAGATGCGTTTGATAAAG
CAAAGAGTTACAATCCATTGTCGCCCCCTTTGTTTCCTGAGCAAATGCCATCTTATGTCATGTTTCTGAAAGACATCTTAGCTAACAAAAGAAAAATCACTGGGTTAGAA
ACAATGGCTTTAACTGAAATCACGAGCAACATCTGTAATATAAGAACACCAGCAAAGATGACAGACCCTGGAAGTTTCACTGTTCCCTCTTCGATAGGCAGAATGAATTT
AGGTCGCACACTTTGCGAACTAGGAGCAAGTATCAACTTGATGACACTATCTATTTTTAAAAAATGGAAAATAGGGGAGATTCAACCAACGCTAATGAGGCTTAATTTCG
AGGACAGATCCATAGCTAAGCCAGAGGGTAATATTGAAGATGTTTTGGTAAAAGTTGACAAATTCATGTTTCACACAAATTTCATCATTCTGAACTATGTAGCTGATAGA
GAAGTGCCAATCATTTTAGGGCTGCTATTATTTCTAGCAACATGTCATACTCTCATAGATGTACACCAAGGGGAAATGACCATGTACTATTGTGAAGAAGAAGCGTACCA
AGATTTGTTTAACACGGAAGAAAACTTTGAAGAAGAGACAAGTTCACTTGAAGAAGTAAACGTCATAGTGGGTGAAAAGAGCGTTGAGGAGGGAGAGGAGTTTGAAACAC
GAGAATTTCCACCACCTCCCGTGCTTAAATGCAAACTAAAAGAAACGCTGATATTGAATGTTGAGGAACGAACAGACCTGGAGCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGAAAGGGTGTCTTCATCACAACATTCTAGGATGCATTATAATCGAACAATTCTATTTTGGATTGAGTAGAGATACACAACAATTTGTATACGCAGTATTTATAGG
TGGGATGCTTCGATTATCCTGCAACCATATTAAGACAACGCTAGACGCTATGGTCAGTAACAGTCAAGAATGGAGAGATAACGAATTTGGCTCACATAATGAAAGCAAAG
GAAACAGAAGAGAAAAAGGAAGGACAAACAAAGGATCAAATGGGAATGCAATGATAACATTGCAGAGTCAGGTTACTGAAATGAATAAGCTCTTGCAGTCCATGGCTTTG
TCACAACTCAATGCCATTGGAAGTTCAATTAAAGTAGTACACCAAGTGTTGAATTTGGTTGTGGATAGAGAAATCATCTCAACTTTAGCTGGGGTGGACAAGATCAAAAT
ACGAAACAGTGACAGGGCAATTGATTTAACCACAGAGAATAAGAACATGTCAAGCGAGGAAGTGGAACATTCAAGATGGGAAGAACAACACAAAGATGCGTTTGATAAAG
CAAAGAGTTACAATCCATTGTCGCCCCCTTTGTTTCCTGAGCAAATGCCATCTTATGTCATGTTTCTGAAAGACATCTTAGCTAACAAAAGAAAAATCACTGGGTTAGAA
ACAATGGCTTTAACTGAAATCACGAGCAACATCTGTAATATAAGAACACCAGCAAAGATGACAGACCCTGGAAGTTTCACTGTTCCCTCTTCGATAGGCAGAATGAATTT
AGGTCGCACACTTTGCGAACTAGGAGCAAGTATCAACTTGATGACACTATCTATTTTTAAAAAATGGAAAATAGGGGAGATTCAACCAACGCTAATGAGGCTTAATTTCG
AGGACAGATCCATAGCTAAGCCAGAGGGTAATATTGAAGATGTTTTGGTAAAAGTTGACAAATTCATGTTTCACACAAATTTCATCATTCTGAACTATGTAGCTGATAGA
GAAGTGCCAATCATTTTAGGGCTGCTATTATTTCTAGCAACATGTCATACTCTCATAGATGTACACCAAGGGGAAATGACCATGTACTATTGTGAAGAAGAAGCGTACCA
AGATTTGTTTAACACGGAAGAAAACTTTGAAGAAGAGACAAGTTCACTTGAAGAAGTAAACGTCATAGTGGGTGAAAAGAGCGTTGAGGAGGGAGAGGAGTTTGAAACAC
GAGAATTTCCACCACCTCCCGTGCTTAAATGCAAACTAAAAGAAACGCTGATATTGAATGTTGAGGAACGAACAGACCTGGAGCCTTAA
Protein sequenceShow/hide protein sequence
MMKGCLHHNILGCIIIEQFYFGLSRDTQQFVYAVFIGGMLRLSCNHIKTTLDAMVSNSQEWRDNEFGSHNESKGNRREKGRTNKGSNGNAMITLQSQVTEMNKLLQSMAL
SQLNAIGSSIKVVHQVLNLVVDREIISTLAGVDKIKIRNSDRAIDLTTENKNMSSEEVEHSRWEEQHKDAFDKAKSYNPLSPPLFPEQMPSYVMFLKDILANKRKITGLE
TMALTEITSNICNIRTPAKMTDPGSFTVPSSIGRMNLGRTLCELGASINLMTLSIFKKWKIGEIQPTLMRLNFEDRSIAKPEGNIEDVLVKVDKFMFHTNFIILNYVADR
EVPIILGLLLFLATCHTLIDVHQGEMTMYYCEEEAYQDLFNTEENFEEETSSLEEVNVIVGEKSVEEGEEFETREFPPPPVLKCKLKETLILNVEERTDLEP