; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G005400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G005400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationCmo_Chr16:2631754..2632419
RNA-Seq ExpressionCmoCh16G005400
SyntenyCmoCh16G005400
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577099.1 hypothetical protein SDJN03_24673, partial [Cucurbita argyrosperma subsp. sororia]2.9e-12799.55Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDT PLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
        LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNRKDV
        SYSDLRNSALLVEQCLNRKDV
Subjt:  SYSDLRNSALLVEQCLNRKDV

KAG7015101.1 hypothetical protein SDJN02_22734, partial [Cucurbita argyrosperma subsp. argyrosperma]5.7e-12396.83Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDT PLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
        LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQ  +   EYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNRKDV
        SYSDLRNSALLVEQCLNRKDV
Subjt:  SYSDLRNSALLVEQCLNRKDV

XP_022931358.1 uncharacterized protein LOC111437567 isoform X1 [Cucurbita moschata]4.5e-128100Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
        LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNRKDV
        SYSDLRNSALLVEQCLNRKDV
Subjt:  SYSDLRNSALLVEQCLNRKDV

XP_022931360.1 uncharacterized protein LOC111437567 isoform X2 [Cucurbita moschata]2.6e-120100Show/hide
Query:  MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA
        MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA
Subjt:  MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA

Query:  DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE
        DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE
Subjt:  DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE

Query:  QCLNRKDV
        QCLNRKDV
Subjt:  QCLNRKDV

XP_023522569.1 uncharacterized protein LOC111786568 [Cucurbita pepo subsp. pepo]1.7e-12799.55Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQ VSFACLRLKDSA
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
        LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNRKDV
        SYSDLRNSALLVEQCLNRKDV
Subjt:  SYSDLRNSALLVEQCLNRKDV

TrEMBL top hitse value%identityAlignment
A0A0A0L042 Retrotrans_gag domain-containing protein5.8e-7361.93Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE  +K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSDE KVSFACLRLKD+A
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
         SWW+V    L ADG  VTW+KFK+LF KRY PSWLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+C  +N 
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNR
        S +++RN AL+VEQ +N+
Subjt:  SYSDLRNSALLVEQCLNR

A0A5A7TP79 Retrotrans_gag domain-containing protein4.5e-7361.93Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE ++K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSD+QKVSFA LRLKD+A
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
         SWW+V    L ADG  VTWEK K+LF KRY P WLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+CF++NV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNR
        SY+++RN AL+ EQ +N+
Subjt:  SYSDLRNSALLVEQCLNR

A0A5D3CX20 Retrotrans_gag domain-containing protein9.0e-7462.39Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        +KTN  L+AAEP++NLG+V DEIC+L+ E +R++AA+QT+EE ++K + FF C+PP FGEDTDPLVA+RW+L LENIFD I CSD+QKVSFA LRLKD+A
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
         SWW+V    L ADG  VTWEKFK+LF KRY P WLK EK REL  L QG+ TV EYDE+FI L+SL  E  PD++ EARLF +GLR DI  Q+CF++NV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNR
        SY+++RN AL+ EQ +N+
Subjt:  SYSDLRNSALLVEQCLNR

A0A6J1EYB6 uncharacterized protein LOC111437567 isoform X12.2e-128100Show/hide
Query:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
        MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA
Subjt:  MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSA

Query:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
        LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV
Subjt:  LSWWIVSKRYLNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNV

Query:  SYSDLRNSALLVEQCLNRKDV
        SYSDLRNSALLVEQCLNRKDV
Subjt:  SYSDLRNSALLVEQCLNRKDV

A0A6J1EZ79 uncharacterized protein LOC111437567 isoform X21.3e-120100Show/hide
Query:  MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA
        MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA
Subjt:  MNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRYLNA

Query:  DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE
        DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE
Subjt:  DGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVE

Query:  QCLNRKDV
        QCLNRKDV
Subjt:  QCLNRKDV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAACTAACAGACCCCTAATTGCAGCTGAGCCAGCTATGAATTTGGGAGATGTTTGTGACGAAATTTGCAGTTTAATCAGGGAAGGGCTTAGGGTAGTTGCTGCTAG
ACAGACTAATGAAGAACATAATAGCAAGTTCAATCAGTTTTTTGTCTGTCAACCTCCTTGGTTTGGAGAGGATACCGACCCCTTAGTTGCTAAACGCTGGGTTTTGGGGT
TAGAGAACATCTTTGATTGCATAGGCTGTTCAGACGAGCAAAAGGTTTCCTTTGCTTGTTTAAGACTGAAAGATTCAGCACTTTCTTGGTGGATAGTGTCAAAAAGATAC
TTGAATGCTGATGGGGCTGCAGTGACATGGGAGAAGTTTAAGGACTTATTCTATAAGAGATATTTCCCTAGTTGGTTGAAGCATGAGAAGCATCGTGAACTCTGGAAGTT
GGGACAAGGAAACAGAACCGTGATTGAGTATGATGAAGAATTCATTACCTTGTCTTCTCTTGTTTCTGAACTAAATCCAGACGATTCTTTGGAAGCTAGGCTGTTCTATG
AAGGGTTAAGGCCAGATATTAGTCGACAACTTTGCTTCACGGATAACGTGTCGTACTCCGATCTTAGAAACTCGGCGTTACTGGTCGAACAGTGCCTTAACCGGAAAGAT
GTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAAACTAACAGACCCCTAATTGCAGCTGAGCCAGCTATGAATTTGGGAGATGTTTGTGACGAAATTTGCAGTTTAATCAGGGAAGGGCTTAGGGTAGTTGCTGCTAG
ACAGACTAATGAAGAACATAATAGCAAGTTCAATCAGTTTTTTGTCTGTCAACCTCCTTGGTTTGGAGAGGATACCGACCCCTTAGTTGCTAAACGCTGGGTTTTGGGGT
TAGAGAACATCTTTGATTGCATAGGCTGTTCAGACGAGCAAAAGGTTTCCTTTGCTTGTTTAAGACTGAAAGATTCAGCACTTTCTTGGTGGATAGTGTCAAAAAGATAC
TTGAATGCTGATGGGGCTGCAGTGACATGGGAGAAGTTTAAGGACTTATTCTATAAGAGATATTTCCCTAGTTGGTTGAAGCATGAGAAGCATCGTGAACTCTGGAAGTT
GGGACAAGGAAACAGAACCGTGATTGAGTATGATGAAGAATTCATTACCTTGTCTTCTCTTGTTTCTGAACTAAATCCAGACGATTCTTTGGAAGCTAGGCTGTTCTATG
AAGGGTTAAGGCCAGATATTAGTCGACAACTTTGCTTCACGGATAACGTGTCGTACTCCGATCTTAGAAACTCGGCGTTACTGGTCGAACAGTGCCTTAACCGGAAAGAT
GTTTGA
Protein sequenceShow/hide protein sequence
MKTNRPLIAAEPAMNLGDVCDEICSLIREGLRVVAARQTNEEHNSKFNQFFVCQPPWFGEDTDPLVAKRWVLGLENIFDCIGCSDEQKVSFACLRLKDSALSWWIVSKRY
LNADGAAVTWEKFKDLFYKRYFPSWLKHEKHRELWKLGQGNRTVIEYDEEFITLSSLVSELNPDDSLEARLFYEGLRPDISRQLCFTDNVSYSDLRNSALLVEQCLNRKD
V