; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy6G000367 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy6G000367
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationGy14Chr6:274494..275635
RNA-Seq ExpressionCsGy6G000367
SyntenyCsGy6G000367
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN45638.2 hypothetical protein Csa_005465, partial [Cucumis sativus]6.49e-73100Show/hide
Query:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
        MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
Subjt:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH

Query:  HADLTFCSRD
        HADLTFCSRD
Subjt:  HADLTFCSRD

TYK02997.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.54e-9491.77Show/hide
Query:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
        MFVFIL+EIHSFVDAVS LAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYIN FFPEMF +ERTGFSSLTFSFADS 
Subjt:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH

Query:  HADLTFCSRDGRFREIDFPMYHSDELMDV-NDQLDWETFVSFSSQEFINIVTLNNFDS
        HADLTFCSRDG  REIDFPMYHSD  MD   DQLDWETFVSFSSQEFINIVTLNNFDS
Subjt:  HADLTFCSRDGRFREIDFPMYHSDELMDV-NDQLDWETFVSFSSQEFINIVTLNNFDS

XP_022958855.1 uncharacterized protein LOC111460009 [Cucurbita moschata]5.88e-6847.97Show/hide
Query:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS
        H  VDA S LA     +F++KFSP MFSIMA  TPS  C I LQL P FFN  Y C QLHY +IYI  F+  M + ER GFSSLTF+F +    D     
Subjt:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS

Query:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM
        R                               +G F E++ PM+ S ++MDV    D  TFVS  SQEFINIVT  N+FD VLVT+ NSQV FSY  T +
Subjt:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM

Query:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN
        +LT+ER++CIIGGV AS+E+ FV++L+     C LA R +RVWLFKS +S++G I  PLGLYAR++  FS+
Subjt:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN

XP_023006011.1 uncharacterized protein LOC111498888 [Cucurbita maxima]5.65e-7149.82Show/hide
Query:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS
        H  VDA S LA     +F++KFSP MFSIMA  TPS  C IALQL P FFN  Y C QLHY +IYI  F+  M + ER GFSSLTF+F +    D     
Subjt:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS

Query:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM
        R                               +G F E++ PM+ S ++MDV    D+ TFVS  SQEFINIVT  N+FD VLVT+TNSQV FSY  T +
Subjt:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM

Query:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN
        ILTQER++CIIGGV AS E+ FV++L+     C LA R +RVWLFKS +S++G I  PLGLYAR+L  FS+
Subjt:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN

XP_023548336.1 uncharacterized protein LOC111807004 [Cucurbita pepo subsp. pepo]4.54e-6848.16Show/hide
Query:  HSFVDAVSALAG-LGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS
        H  VDA S LA     +F++KFSP MFSIMA  TPS  C IALQL P FFN  Y C QLHY +IYI  F+  M + ER GFSSLTF+F +    D     
Subjt:  HSFVDAVSALAG-LGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS

Query:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM
        R                               +G F E++ PM+ S ++MDV    D+ TFVS  SQEFINIVT  N+FD VLVT+ NSQV FSY  T +
Subjt:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM

Query:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSS-KGAICVPLGLYARFLVSFSN
        +LTQER++CIIGGV AS+E+ FV++L+     C LA R +RVWLFKS +S+ +G +  PLGLYAR++  FS+
Subjt:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSS-KGAICVPLGLYARFLVSFSN

TrEMBL top hitse value%identityAlignment
A0A0A0K833 Uncharacterized protein3.98e-74100Show/hide
Query:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
        MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
Subjt:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH

Query:  HADLTFCSRDG
        HADLTFCSRDG
Subjt:  HADLTFCSRDG

A0A5D3BU35 LINE-1 retrotransposable element ORF2 protein1.23e-9491.77Show/hide
Query:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
        MFVFIL+EIHSFVDAVS LAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYIN FFPEMF +ERTGFSSLTFSFADS 
Subjt:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH

Query:  HADLTFCSRDGRFREIDFPMYHSDELMDV-NDQLDWETFVSFSSQEFINIVTLNNFDS
        HADLTFCSRDG  REIDFPMYHSD  MD   DQLDWETFVSFSSQEFINIVTLNNFDS
Subjt:  HADLTFCSRDGRFREIDFPMYHSDELMDV-NDQLDWETFVSFSSQEFINIVTLNNFDS

A0A6J1H4N0 uncharacterized protein LOC1114600092.85e-6847.97Show/hide
Query:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS
        H  VDA S LA     +F++KFSP MFSIMA  TPS  C I LQL P FFN  Y C QLHY +IYI  F+  M + ER GFSSLTF+F +    D     
Subjt:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS

Query:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM
        R                               +G F E++ PM+ S ++MDV    D  TFVS  SQEFINIVT  N+FD VLVT+ NSQV FSY  T +
Subjt:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM

Query:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN
        +LT+ER++CIIGGV AS+E+ FV++L+     C LA R +RVWLFKS +S++G I  PLGLYAR++  FS+
Subjt:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN

A0A6J1KWL1 uncharacterized protein LOC1114988882.73e-7149.82Show/hide
Query:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS
        H  VDA S LA     +F++KFSP MFSIMA  TPS  C IALQL P FFN  Y C QLHY +IYI  F+  M + ER GFSSLTF+F +    D     
Subjt:  HSFVDAVSALAGL-GYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCS

Query:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM
        R                               +G F E++ PM+ S ++MDV    D+ TFVS  SQEFINIVT  N+FD VLVT+TNSQV FSY  T +
Subjt:  R-------------------------------DGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYAGTHM

Query:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN
        ILTQER++CIIGGV AS E+ FV++L+     C LA R +RVWLFKS +S++G I  PLGLYAR+L  FS+
Subjt:  ILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN

A0A6J1L2R2 uncharacterized protein LOC1114993081.31e-5542.97Show/hide
Query:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH
        +F F+L+++   VDA S +A    + ++KFS EMF+IMA+  P+    IAL L+P FF++ Y C +L  SW ++ + FP M  +E +GF+SL+F+     
Subjt:  MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSH

Query:  HADLTFCSRDGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYA-GTHMILTQERKRCIIGGVGASNEIEF
         A+L F + +G   E++F +  S + ++V D  D+ +FVS  S+EF+NIVT  + FD V VT+T+++V FSYA     ILTQE   C+IGG+ A N +EF
Subjt:  HADLTFCSRDGRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVT-LNNFDSVLVTITNSQVKFSYA-GTHMILTQERKRCIIGGVGASNEIEF

Query:  VVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN
        ++ L   E+   +A RSKRVWL+KS  ++KG I  PLGLY RF+  F N
Subjt:  VVNLKAKESLCRLASRSKRVWLFKSKNSSKGAICVPLGLYARFLVSFSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCGTCTTCATACTCGACGAAATCCACAGCTTCGTCGATGCCGTCTCCGCTCTCGCCGGCCTTGGCTACGTATTCAATCTCAAATTCTCACCAGAAATGTTCTCAAT
AATGGCCAACCCCACACCTTCCCCTTCTTGTACCATAGCCCTTCAACTATTTCCTCCATTCTTCAATCAACAATATTCCTGCCAACAACTTCACTACTCATGGATTTACA
TCAACCACTTTTTCCCCGAAATGTTCCATTTGGAACGAACCGGTTTTTCTTCGCTCACATTCTCTTTCGCAGATTCGCATCATGCCGACCTCACATTTTGCAGCCGTGAT
GGCCGTTTTCGAGAGATTGATTTCCCAATGTATCATTCGGACGAGTTGATGGATGTCAATGATCAGTTGGATTGGGAAACTTTCGTCTCCTTCAGTTCACAAGAGTTCAT
TAACATTGTAACGTTGAATAATTTTGATTCTGTTTTAGTCACAATAACGAATTCACAAGTCAAATTCTCTTATGCTGGGACACATATGATTCTTACTCAAGAGAGAAAAC
GGTGCATTATTGGTGGAGTTGGGGCAAGCAATGAAATTGAGTTTGTAGTAAATTTGAAAGCAAAGGAAAGTTTATGTAGATTGGCAAGTAGATCAAAAAGGGTTTGGTTA
TTCAAGTCAAAAAATTCGTCCAAAGGTGCAATATGTGTCCCTCTTGGTTTGTATGCTCGATTTTTGGTTTCTTTTTCTAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCGTCTTCATACTCGACGAAATCCACAGCTTCGTCGATGCCGTCTCCGCTCTCGCCGGCCTTGGCTACGTATTCAATCTCAAATTCTCACCAGAAATGTTCTCAAT
AATGGCCAACCCCACACCTTCCCCTTCTTGTACCATAGCCCTTCAACTATTTCCTCCATTCTTCAATCAACAATATTCCTGCCAACAACTTCACTACTCATGGATTTACA
TCAACCACTTTTTCCCCGAAATGTTCCATTTGGAACGAACCGGTTTTTCTTCGCTCACATTCTCTTTCGCAGATTCGCATCATGCCGACCTCACATTTTGCAGCCGTGAT
GGCCGTTTTCGAGAGATTGATTTCCCAATGTATCATTCGGACGAGTTGATGGATGTCAATGATCAGTTGGATTGGGAAACTTTCGTCTCCTTCAGTTCACAAGAGTTCAT
TAACATTGTAACGTTGAATAATTTTGATTCTGTTTTAGTCACAATAACGAATTCACAAGTCAAATTCTCTTATGCTGGGACACATATGATTCTTACTCAAGAGAGAAAAC
GGTGCATTATTGGTGGAGTTGGGGCAAGCAATGAAATTGAGTTTGTAGTAAATTTGAAAGCAAAGGAAAGTTTATGTAGATTGGCAAGTAGATCAAAAAGGGTTTGGTTA
TTCAAGTCAAAAAATTCGTCCAAAGGTGCAATATGTGTCCCTCTTGGTTTGTATGCTCGATTTTTGGTTTCTTTTTCTAATTAG
Protein sequenceShow/hide protein sequence
MFVFILDEIHSFVDAVSALAGLGYVFNLKFSPEMFSIMANPTPSPSCTIALQLFPPFFNQQYSCQQLHYSWIYINHFFPEMFHLERTGFSSLTFSFADSHHADLTFCSRD
GRFREIDFPMYHSDELMDVNDQLDWETFVSFSSQEFINIVTLNNFDSVLVTITNSQVKFSYAGTHMILTQERKRCIIGGVGASNEIEFVVNLKAKESLCRLASRSKRVWL
FKSKNSSKGAICVPLGLYARFLVSFSN