CuGenDBv2

CSPI04G07030 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G07030
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Retrotrans_gag domain-containing protein
Genome location: Chr4:4939863..4941813
RNA-Seq Expression: CSPI04G07030
Synteny: CSPI04G07030
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0045000.1 uncharacterized protein E6C27_scaffold74G003100 [Cucumis melo var. makuwa] | e-value: 2.0e-118 | % identity: 92.95
Query:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF
        MAQLEEVVSTVKTN+HLVAAEPSVNLGNVYDEICNLVSE+RIIAAKQTDEER+NKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSD+ KVSF
Subjt:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF

Query:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
        A LRLKDAAFSWWMV+HTDLEADGVTVTW+K KELFEKRYLP WLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
Subjt:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR

Query:  GQVCVLNNVSYAEIRNLALMVEQRINK
         Q+C  NNVSYAE+RNLALM EQRINK
Subjt:  GQVCVLNNVSYAEIRNLALMVEQRINK

TYK16471.1 uncharacterized protein E5676_scaffold21G002740 [Cucumis melo var. makuwa] | e-value: 4.1e-119 | % identity: 93.39
Query:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF
        MAQLEEVVSTVKTN+HLVAAEPSVNLGNVYDEICNLVSE+RIIAAKQTDEER+NKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSD+ KVSF
Subjt:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF

Query:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
        A LRLKDAAFSWWMV+HTDLEADGVTVTW+KFKELFEKRYLP WLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
Subjt:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR

Query:  GQVCVLNNVSYAEIRNLALMVEQRINK
         Q+C  NNVSYAE+RNLALM EQRINK
Subjt:  GQVCVLNNVSYAEIRNLALMVEQRINK

XP_004147729.1 uncharacterized protein LOC101209793 [Cucumis sativus] | e-value: 4.4e-129 | % identity: 98.31
Query:  MKIVDIRKTMAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR
        MKIVDIRKTMAQLEEVVSTVKTND LVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR
Subjt:  MKIVDIRKTMAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR

Query:  CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF
        CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF
Subjt:  CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF

Query:  GDGLRADIRGQVCVLNNVSYAEIRNLALMVEQRINK
        GDGLRADIRGQVC+LNN S AEIRNLALMVEQRINK
Subjt:  GDGLRADIRGQVCVLNNVSYAEIRNLALMVEQRINK

XP_022931358.1 uncharacterized protein LOC111437567 isoform X1 [Cucurbita moschata] | e-value: 1.5e-73 | % identity: 62.84
Query:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE  +K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSDE KVSFACLRLKD+A
Subjt:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA

Query:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV
         SWW+V    L ADG  VTW+KFK+LF KRY PSWLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+C  +NV
Subjt:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV

Query:  SYAEIRNLALMVEQRINK
        SY+++RN AL+VEQ +N+
Subjt:  SYAEIRNLALMVEQRINK

XP_023522569.1 uncharacterized protein LOC111786568 [Cucurbita pepo subsp. pepo] | e-value: 5.8e-73 | % identity: 62.39
Query:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE  +K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSDE  VSFACLRLKD+A
Subjt:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA

Query:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV
         SWW+V    L ADG  VTW+KFK+LF KRY PSWLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+C  +NV
Subjt:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV

Query:  SYAEIRNLALMVEQRINK
        SY+++RN AL+VEQ +N+
Subjt:  SYAEIRNLALMVEQRINK

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0L042 Retrotrans_gag domain-containing protein | e-value: 2.1e-129 | % identity: 98.31
Query:  MKIVDIRKTMAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR
        MKIVDIRKTMAQLEEVVSTVKTND LVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR
Subjt:  MKIVDIRKTMAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIR

Query:  CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF
        CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF
Subjt:  CSDEHKVSFACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLF

Query:  GDGLRADIRGQVCVLNNVSYAEIRNLALMVEQRINK
        GDGLRADIRGQVC+LNN S AEIRNLALMVEQRINK
Subjt:  GDGLRADIRGQVCVLNNVSYAEIRNLALMVEQRINK

A0A5A7TP79 Retrotrans_gag domain-containing protein | e-value: 9.8e-119 | % identity: 92.95
Query:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF
        MAQLEEVVSTVKTN+HLVAAEPSVNLGNVYDEICNLVSE+RIIAAKQTDEER+NKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSD+ KVSF
Subjt:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF

Query:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
        A LRLKDAAFSWWMV+HTDLEADGVTVTW+K KELFEKRYLP WLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
Subjt:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR

Query:  GQVCVLNNVSYAEIRNLALMVEQRINK
         Q+C  NNVSYAE+RNLALM EQRINK
Subjt:  GQVCVLNNVSYAEIRNLALMVEQRINK

A0A5D3CX20 Retrotrans_gag domain-containing protein | e-value: 2.0e-119 | % identity: 93.39
Query:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF
        MAQLEEVVSTVKTN+HLVAAEPSVNLGNVYDEICNLVSE+RIIAAKQTDEER+NKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSD+ KVSF
Subjt:  MAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSF

Query:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
        A LRLKDAAFSWWMV+HTDLEADGVTVTW+KFKELFEKRYLP WLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR
Subjt:  ACLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIR

Query:  GQVCVLNNVSYAEIRNLALMVEQRINK
         Q+C  NNVSYAE+RNLALM EQRINK
Subjt:  GQVCVLNNVSYAEIRNLALMVEQRINK

A0A6J1EYB6 uncharacterized protein LOC111437567 isoform X1 | e-value: 7.4e-74 | % identity: 62.84
Query:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE  +K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSDE KVSFACLRLKD+A
Subjt:  VKTNDHLVAAEPSVNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAA

Query:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV
         SWW+V    L ADG  VTW+KFK+LF KRY PSWLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+C  +NV
Subjt:  FSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNV

Query:  SYAEIRNLALMVEQRINK
        SY+++RN AL+VEQ +N+
Subjt:  SYAEIRNLALMVEQRINK

A0A6J1EZ79 uncharacterized protein LOC111437567 isoform X2 | e-value: 4.2e-69 | % identity: 62.93
Query:  VNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAAFSWWMVMHTDLEA
        +NLG+V DEIC+L+ E +R++AA+QT+EE  +K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSDE KVSFACLRLKD+A SWW+V    L A
Subjt:  VNLGNVYDEICNLVSE-MRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFACLRLKDAAFSWWMVMHTDLEA

Query:  DGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNVSYAEIRNLALMVE
        DG  VTW+KFK+LF KRY PSWLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+C  +NVSY+++RN AL+VE
Subjt:  DGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNVSYAEIRNLALMVE

Query:  QRINK
        Q +N+
Subjt:  QRINK

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAATCGTAGACATACGAAAAACAATGGCTCAATTGGAAGAAGTTGTTTCTACAGTGAAGACTAACGATCACTTAGTTGCAGCTGAACCTTCAGTGAATTTGGGAAA
TGTTTATGACGAAATTTGCAACTTAGTAAGTGAAATGAGGATTATTGCAGCTAAACAGACTGATGAAGAACGTACCAACAAGTCTAGTATGTTTTTTGCCTGTGAACCTC
CTAGTTTTGGAGAAGACACCGATCCCTTAGTTGCTCAACGCTGGATTTTGATTTTAGAGAATATCTTTGATTTGATACGATGTTCAGACGAGCACAAAGTTTCTTTCGCT
TGTTTACGTCTGAAAGATGCTGCATTTTCTTGGTGGATGGTGATGCATACGGACTTGGAGGCTGATGGGGTTACAGTTACATGGCAGAAATTTAAGGAATTGTTTGAGAA
GAGATATTTACCTAGTTGGTTGAAGCTCGAGAAGTTTCGTGAACTGTGTAATTTGGAGCAAGGAGATGGAACTGTAGCTGAGTATGACGAACAATTCATTAAGTTGGCTT
CTCTTGCCCACGAGTTTATTCCAGATGAAGCTTGGGAAGCTCGCTTGTTTGGTGATGGTTTAAGGGCAGATATTAGAGGACAAGTTTGCGTCTTAAACAATGTCTCGTAC
GCCGAGATTAGAAACCTGGCATTAATGGTGGAACAGCGCATTAACAAATAA
mRNA sequence
AAATATTTTAAACAATTTTGTTGTTTGATAGTTTCAAAATTTGGGACAAAATAATAAAGAAATTTTGGAGCAGTGTGTATAGTACGTCGGGTTTTAGCGTAAGGCATAAG
GCAGTAAATAGCACTAACGCCAAAATCGGTCACCCAATCCCGGGTTCGTATGAAAATCGTAGACATACGAAAAACAATGGCTCAATTGGAAGAAGTTGTTTCTACAGTGA
AGACTAACGATCACTTAGTTGCAGCTGAACCTTCAGTGAATTTGGGAAATGTTTATGACGAAATTTGCAACTTAGTAAGTGAAATGAGGATTATTGCAGCTAAACAGACT
GATGAAGAACGTACCAACAAGTCTAGTATGTTTTTTGCCTGTGAACCTCCTAGTTTTGGAGAAGACACCGATCCCTTAGTTGCTCAACGCTGGATTTTGATTTTAGAGAA
TATCTTTGATTTGATACGATGTTCAGACGAGCACAAAGTTTCTTTCGCTTGTTTACGTCTGAAAGATGCTGCATTTTCTTGGTGGATGGTGATGCATACGGACTTGGAGG
CTGATGGGGTTACAGTTACATGGCAGAAATTTAAGGAATTGTTTGAGAAGAGATATTTACCTAGTTGGTTGAAGCTCGAGAAGTTTCGTGAACTGTGTAATTTGGAGCAA
GGAGATGGAACTGTAGCTGAGTATGACGAACAATTCATTAAGTTGGCTTCTCTTGCCCACGAGTTTATTCCAGATGAAGCTTGGGAAGCTCGCTTGTTTGGTGATGGTTT
AAGGGCAGATATTAGAGGACAAGTTTGCGTCTTAAACAATGTCTCGTACGCCGAGATTAGAAACCTGGCATTAATGGTGGAACAGCGCATTAACAAATAATCAAAAAATT
TGAAGTAGTAAGAAAAGGGGTAGTAGAAGCAAACAGATTTTGGAGGTTTCAGCTATGAATTATCTTCACTTCTCAATGTAGGTATTGACTGTTATTGTTAGTTCATTACT
CCATCACTAGAAAGTTTCATAGCTTTAGGAAGTTTATTTTGATAGTGTTTTGTTCTTCCCCAAAAAAACAGAATTCCACGCCCCCCTCTCTGGTCTATAGGATTTATGGT
AATGGTAGTCTTATATACTTTCGTTTGGTGTGAAGTTTATCAGCTTTGTGTAAACAGTTCTTTGTTTCATTTCTTCATCCCATTAATGTTTTCTATTGG
Protein sequence
MKIVDIRKTMAQLEEVVSTVKTNDHLVAAEPSVNLGNVYDEICNLVSEMRIIAAKQTDEERTNKSSMFFACEPPSFGEDTDPLVAQRWILILENIFDLIRCSDEHKVSFA
CLRLKDAAFSWWMVMHTDLEADGVTVTWQKFKELFEKRYLPSWLKLEKFRELCNLEQGDGTVAEYDEQFIKLASLAHEFIPDEAWEARLFGDGLRADIRGQVCVLNNVSY
AEIRNLALMVEQRINK