CuGenDBv2

Cucsat.G5923 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G5923
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: ctg1402:2172431..2173760
RNA-Seq Expression: Cucsat.G5923
Synteny: Cucsat.G5923
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG7013723.1 hypothetical protein SDJN02_23890, partial [Cucurbita argyrosperma subsp. argyrosperma] | 9.02e-88 | 54.47
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLV+L+ FEP   ATS L  I+++ADV+FT    S+  S+  PRFVA L M+   F NY VD+ ++S++SLESFHDA+LDGG   SM+IH+L +  QMI
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFES-SSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGI
        LR+E+ SS+ P +  EL L P Q E LG+++Y KFF+++SK LR++I+ LP+FH D + V AT ++VKFSIASKEI +TKE+  C IVGYEGE+ET++ I
Subjt:  LRFES-SSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGI

Query:  NLNPMLFFLNFTHDTLRVWFYKTTTYHGAMVVPSFGFYSQYVILFP
           PM+FFLNFT+   RVWFYKTT     + VP+FG Y QYV+ FP
Subjt:  NLNPMLFFLNFTHDTLRVWFYKTTTYHGAMVVPSFGFYSQYVILFP

XP_016903187.1 PREDICTED: uncharacterized protein LOC103502263 [Cucumis melo] | 1.50e-104 | 80.79
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRLK+F+P F ATSRLA IAREAD+KFTPLFFSI  SN+ PRFVAYL MT++CFINYKVDNDHTSRISLESFHDALLDGG SPSMTIHLLANI Q+I
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECI----IVGYEGEEETK
        LRFESSSHAP+V HELSL PSQEEDLGE+DYAKFFSIDSK LRRVIRNLPIFHGDSICVTAT SQVKFSIASKEIVLTKE +  I    I G   EEE  
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECI----IVGYEGEEETK

Query:  MGI
        + +
Subjt:  MGI

XP_023000630.1 uncharacterized protein LOC111494874 [Cucurbita maxima] | 2.13e-89 | 55.69
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRLKNF P    TSRLA IARE+D+ FTPL   +T S   PRF+A L + + CF  Y V+ DH SRISLES HDALLD G S +MTIHLL N N M+
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN
        LRFE+ +H PQ+RH+  L P QE+ + EI+Y+K  ++DS+ LR+VI+ LP+FHGDS+CVT T S+V+FSIAS+E++  KE   C I+G++G+  T+  I 
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN

Query:  LNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP
        L PMLFFLN T+D   VWF+KT T  H  M+ P F  ++QYVI FP
Subjt:  LNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP

XP_031744160.1 uncharacterized protein LOC116404808 [Cucumis sativus] | 6.82e-178 | 100
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN
        LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN

Query:  LNPMLFFLNFTHDTLRVWFYKTTTYHGAMVVPSFGFYSQYVILFPNYN
        LNPMLFFLNFTHDTLRVWFYKTTTYHGAMVVPSFGFYSQYVILFPNYN
Subjt:  LNPMLFFLNFTHDTLRVWFYKTTTYHGAMVVPSFGFYSQYVILFPNYN

XP_038875055.1 uncharacterized protein LOC120067580 [Benincasa hispida] | 4.13e-80 | 56.18
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLV+L NFEP   ATS LA I+  ADVKFTPL F +      PRFVA L ++  CF NY VD++HTS++ LESFHDA+LDGG   SMTIHLL   NQMI
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFES-SSHAPQVRHELSLKPSQEEDL---GEIDYAKFFSIDSKALRRVIRNLPIFHGDSI-CVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEET
        LRF++ SS  P + HEL+  P Q  D    G+++  KFF + S+ALRR+I+ LPIF  DS+ CV  T SQ+KFSIASKEIVL  ++  C IVG+E E ET
Subjt:  LRFES-SSHAPQVRHELSLKPSQEEDL---GEIDYAKFFSIDSKALRRVIRNLPIFHGDSI-CVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEET

Query:  KMGINLNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP
        +  I L PMLFFLNFT+   +VWFYKT    +  M VP+FG   QYVI FP
Subjt:  KMGINLNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP

TrEMBL top hits | e value | % identity | Alignment
A0A1S3C8J1 uncharacterized protein LOC103498010 | 2.88e-80 | 58.22
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRL+ FEP   ATS LA +A++ADVKFTPL   I VSN+ P+FVA L ++   F N+ VD++ +S++SL+ FHDA+LDGG   SMTIHLL   NQM+
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSH-APQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGI
        LRFE+ SH  P + HEL+L P Q E+LG+++Y  FF++ S+ LRR+I+ LP+FH D++ VT T SQVKFSI SKEI+LTKE   C IVGYEGE ETK+ +
Subjt:  LRFESSSH-APQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGI

Query:  NLNPMLFFLNFTH
         L PM+FFLNFT+
Subjt:  NLNPMLFFLNFTH

A0A1S3CL88 uncharacterized protein LOC103502250 | 1.35e-78 | 56.13
Query:  MFLVRLKNFEPFFHATSRLALIARE-ADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQM
        MFLV+LKNF+P   ATS LA I+ + AD+KFTP  F I  S++ PRF+A L ++   F  + VDNDH+S++SLESFHDA+LDGG   SMTIHLL   NQM
Subjt:  MFLVRLKNFEPFFHATSRLALIARE-ADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQM

Query:  ILRFES-SSHAPQVRHELSLKPSQEED--LG--EIDYAKFFSIDSKALRRVIRNLPIFHGDSIC-VTATRSQVKFSIASKEIVLTKENEECIIVGYEGEE
        ILRF++ SS    + HEL+L P Q ED  +G  E+D  K+F + SKALRR+I++LPIF  DSI  V  T S+VKFSIASKEI+LT E   C I G+E E 
Subjt:  ILRFES-SSHAPQVRHELSLKPSQEED--LG--EIDYAKFFSIDSKALRRVIRNLPIFHGDSIC-VTATRSQVKFSIASKEIVLTKENEECIIVGYEGEE

Query:  ETKMGINLNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP
        ET+  I L PM+FFLNFT+   RVWFYKT    +  MVVP++G + QYVI FP
Subjt:  ETKMGINLNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP

A0A1S4E4N8 uncharacterized protein LOC103502263 | 7.27e-105 | 80.79
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRLK+F+P F ATSRLA IAREAD+KFTPLFFSI  SN+ PRFVAYL MT++CFINYKVDNDHTSRISLESFHDALLDGG SPSMTIHLLANI Q+I
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECI----IVGYEGEEETK
        LRFESSSHAP+V HELSL PSQEEDLGE+DYAKFFSIDSK LRRVIRNLPIFHGDSICVTAT SQVKFSIASKEIVLTKE +  I    I G   EEE  
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECI----IVGYEGEEETK

Query:  MGI
        + +
Subjt:  MGI

A0A6J1HGU2 uncharacterized protein LOC111464209 | 1.46e-71 | 54.84
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVR+KNF P    TSRLA IARE+D+ FTPL   +T S   PRF+A L + + CF  Y V+ DH SRISLES HDALLD G S SMTIHLL N N M 
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN
        LRFE+ +H PQ+RH++ L P QE+ + EI+Y+K  ++D + LR+VI+ LP+F GDS+CVT T S+V+FSIAS+E++  KE   C I+G++G+  ++  I 
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN

Query:  LNPMLFFLNFTHDTLRV
        L PMLFFLN T+D   V
Subjt:  LNPMLFFLNFTHDTLRV

A0A6J1KIW5 uncharacterized protein LOC111494874 | 1.03e-89 | 55.69
Query:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI
        MFLVRLKNF P    TSRLA IARE+D+ FTPL   +T S   PRF+A L + + CF  Y V+ DH SRISLES HDALLD G S +MTIHLL N N M+
Subjt:  MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMI

Query:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN
        LRFE+ +H PQ+RH+  L P QE+ + EI+Y+K  ++DS+ LR+VI+ LP+FHGDS+CVT T S+V+FSIAS+E++  KE   C I+G++G+  T+  I 
Subjt:  LRFESSSHAPQVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGIN

Query:  LNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP
        L PMLFFLN T+D   VWF+KT T  H  M+ P F  ++QYVI FP
Subjt:  LNPMLFFLNFTHDTLRVWFYKT-TTYHGAMVVPSFGFYSQYVILFP

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGTTCTTAGTTAGGCTTAAAAACTTTGAACCTTTTTTTCATGCAACTTCTCGTCTTGCTCTAATTGCTAGAGAAGCCGATGTCAAATTCACACCATTGTTCTTTTCAAT
AACTGTTTCCAATCAATTCCCTCGTTTTGTTGCATATCTTGTAATGACTTACAATTGCTTCATCAATTATAAAGTCGATAATGATCATACCTCAAGAATATCCCTTGAAT
CTTTCCATGACGCTCTCTTGGACGGCGGAGGTTCTCCTTCAATGACTATTCATCTTCTTGCAAACATTAACCAAATGATCCTTAGATTTGAATCTTCAAGCCATGCCCCG
CAAGTGCGTCATGAATTGTCATTGAAACCGTCGCAAGAAGAAGATCTAGGAGAAATTGATTATGCAAAATTTTTCTCAATTGATTCGAAGGCTTTAAGACGTGTTATAAG
AAATTTACCCATCTTCCATGGGGACTCAATATGTGTTACTGCAACGAGGTCACAAGTCAAATTCTCTATTGCTTCTAAAGAGATTGTTCTTACCAAAGAGAATGAAGAAT
GTATAATTGTAGGTTACGAGGGAGAAGAAGAAACTAAAATGGGAATAAATCTAAATCCAATGTTGTTTTTTCTTAATTTCACACATGATACACTTAGGGTATGGTTCTAT
AAGACAACCACTTATCATGGTGCCATGGTTGTCCCATCTTTTGGATTTTATTCTCAATATGTAATCCTTTTTCCCAACTATAATTAG
mRNA sequence
ATGTTCTTAGTTAGGCTTAAAAACTTTGAACCTTTTTTTCATGCAACTTCTCGTCTTGCTCTAATTGCTAGAGAAGCCGATGTCAAATTCACACCATTGTTCTTTTCAAT
AACTGTTTCCAATCAATTCCCTCGTTTTGTTGCATATCTTGTAATGACTTACAATTGCTTCATCAATTATAAAGTCGATAATGATCATACCTCAAGAATATCCCTTGAAT
CTTTCCATGACGCTCTCTTGGACGGCGGAGGTTCTCCTTCAATGACTATTCATCTTCTTGCAAACATTAACCAAATGATCCTTAGATTTGAATCTTCAAGCCATGCCCCG
CAAGTGCGTCATGAATTGTCATTGAAACCGTCGCAAGAAGAAGATCTAGGAGAAATTGATTATGCAAAATTTTTCTCAATTGATTCGAAGGCTTTAAGACGTGTTATAAG
AAATTTACCCATCTTCCATGGGGACTCAATATGTGTTACTGCAACGAGGTCACAAGTCAAATTCTCTATTGCTTCTAAAGAGATTGTTCTTACCAAAGAGAATGAAGAAT
GTATAATTGTAGGTTACGAGGGAGAAGAAGAAACTAAAATGGGAATAAATCTAAATCCAATGTTGTTTTTTCTTAATTTCACACATGATACACTTAGGGTATGGTTCTAT
AAGACAACCACTTATCATGGTGCCATGGTTGTCCCATCTTTTGGATTTTATTCTCAATATGTAATCCTTTTTCCCAACTATAATTAG
Protein sequence
MFLVRLKNFEPFFHATSRLALIAREADVKFTPLFFSITVSNQFPRFVAYLVMTYNCFINYKVDNDHTSRISLESFHDALLDGGGSPSMTIHLLANINQMILRFESSSHAP
QVRHELSLKPSQEEDLGEIDYAKFFSIDSKALRRVIRNLPIFHGDSICVTATRSQVKFSIASKEIVLTKENEECIIVGYEGEEETKMGINLNPMLFFLNFTHDTLRVWFY
KTTTYHGAMVVPSFGFYSQYVILFPNYN