; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0006857 (gene) of Chayote v1 genome

Gene IDSed0006857
OrganismSechium edule (Chayote v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationLG06:34582587..34583210
RNA-Seq ExpressionSed0006857
SyntenySed0006857
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]5.1e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

KAG6588985.1 Retrovirus-related Pol polyprotein from transposon RE1, partial [Cucurbita argyrosperma subsp. sororia]2.6e-2538.46Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DWFA+D+A+   IN+TLS   L+ V                           L+S LQ++ KKSDE+ID Y++  K ++D L+N++ V++DE++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS---------DKGTNRGRGH-YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S+P++FE L  LLK EE ++  Q K        TA+L+ ++   S          +G  RGRGH +G  ++   GRG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS---------DKGTNRGRGH-YGRGNYGNSGRGRG

Query:  SYTPQQQL
        S + QQQL
Subjt:  SYTPQQQL

KAG7015254.1 hypothetical protein SDJN02_22888, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-2538.46Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DWFA+D+A+   IN+TLS   L+ V                           L+S LQ++ KKSDE+ID Y++  K ++D L+N++ V++DE++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS---------DKGTNRGRGH-YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S+P++FE L  LLK EE ++  Q K        TA+L+ ++   S          +G  RGRGH +G  ++   GRG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS---------DKGTNRGRGH-YGRGNYGNSGRGRG

Query:  SYTPQQQL
        S + QQQL
Subjt:  SYTPQQQL

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]5.1e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]5.1e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X22.5e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X32.5e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X12.5e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

A0A5B7BD59 Retrotran_gag_3 domain-containing protein3.4e-2338.12Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLS---------------------------LVLRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y  W  QD+A+ A IN+TL+ + LS                           L L+++LQSL K  D  ID Y+Q  K  RDSL+ ++V IDDE++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLS---------------------------LVLRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQP--SDKGTNRGRGHYGRGNYGNSGRGRGSYTPQQQL
        IY LNGLPSE +AF+TAIRT++ P++ E +  LL+ EE S+E   K ++   T +AM++ +   P  +++G NRGRG       G S    G ++  Q+ 
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQP--SDKGTNRGRGHYGRGNYGNSGRGRGSYTPQQQL

Query:  TP
         P
Subjt:  TP

A0A5D3CLI6 T4.52.5e-2135.27Show/hide
Query:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML
        NP Y DW A+D+A+   IN+TLS   L+ V                           L+S LQ++ KK DE+ID Y++  K ++D L+N++  I++E++L
Subjt:  NPQYIDWFAQDRAIKAPINSTLSTSDLSLV---------------------------LRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEML

Query:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG
        IY LNGLP+E + F+T++RT+S P++FE L  LL+ EE ++  Q K   +    T +LS ++   S     D    RG GH     +GR ++    RG G
Subjt:  IYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLVTATAMLSINRGQPS-----DKGTNRGRGH-----YGRGNYGNSGRGRG

Query:  SYTPQQQ
        S +P+Q+
Subjt:  SYTPQQQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.9e-0532.03Show/hide
Query:  DENIDTYVQSAKTLRDSLSNINVVIDDEEMLIYVLNGL-PSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLV---TATAMLSINRGQP
        D  +  Y +  K L DSL N++V + D  +++YVLNGL P  D+        Q  P SF+   T+L+ EE  ++   K + T V   +++ +L+ +   P
Subjt:  DENIDTYVQSAKTLRDSLSNINVVIDDEEMLIYVLNGL-PSEDSAFQTAIRTQSSPISFEHLFTLLKTEEHSIELQHKSHKTLV---TATAMLSINRGQP

Query:  SDKGTNRG---RGHYGRGNYGNSGRGRG
               G    G+ GRG   N  RGRG
Subjt:  SDKGTNRG---RGHYGRGNYGNSGRGRG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCCACAATACATTGATTGGTTCGCTCAAGATCGAGCAATTAAAGCCCCGATCAATAGCACATTATCTACGTCTGATCTTTCTCTCGTTCTGCGATCATCACTTCA
ATCTCTAATCAAGAAGTCAGATGAAAATATCGACACCTATGTTCAATCTGCCAAAACATTGCGAGACTCTCTTTCTAACATTAATGTTGTTATTGATGATGAGGAAATGC
TGATCTATGTCCTAAATGGACTCCCCAGCGAAGATAGTGCCTTTCAGACAGCGATACGCACTCAATCTTCACCAATCTCGTTTGAGCATTTGTTTACCCTTCTCAAGACT
GAGGAACACTCAATTGAACTACAACATAAAAGCCATAAGACCTTAGTAACCGCTACTGCAATGCTTTCGATAAATCGTGGACAGCCTTCTGATAAAGGAACAAATCGAGG
CAGAGGTCACTATGGTCGAGGTAACTATGGGAATTCTGGTAGAGGTCGTGGATCCTACACTCCACAACAACAGCTCACACCTCCACCTCCCCCCCTCCACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATCCACAATACATTGATTGGTTCGCTCAAGATCGAGCAATTAAAGCCCCGATCAATAGCACATTATCTACGTCTGATCTTTCTCTCGTTCTGCGATCATCACTTCA
ATCTCTAATCAAGAAGTCAGATGAAAATATCGACACCTATGTTCAATCTGCCAAAACATTGCGAGACTCTCTTTCTAACATTAATGTTGTTATTGATGATGAGGAAATGC
TGATCTATGTCCTAAATGGACTCCCCAGCGAAGATAGTGCCTTTCAGACAGCGATACGCACTCAATCTTCACCAATCTCGTTTGAGCATTTGTTTACCCTTCTCAAGACT
GAGGAACACTCAATTGAACTACAACATAAAAGCCATAAGACCTTAGTAACCGCTACTGCAATGCTTTCGATAAATCGTGGACAGCCTTCTGATAAAGGAACAAATCGAGG
CAGAGGTCACTATGGTCGAGGTAACTATGGGAATTCTGGTAGAGGTCGTGGATCCTACACTCCACAACAACAGCTCACACCTCCACCTCCCCCCCTCCACTGA
Protein sequenceShow/hide protein sequence
MNPQYIDWFAQDRAIKAPINSTLSTSDLSLVLRSSLQSLIKKSDENIDTYVQSAKTLRDSLSNINVVIDDEEMLIYVLNGLPSEDSAFQTAIRTQSSPISFEHLFTLLKT
EEHSIELQHKSHKTLVTATAMLSINRGQPSDKGTNRGRGHYGRGNYGNSGRGRGSYTPQQQLTPPPPPLH