; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0022533 (gene) of Chayote v1 genome

Gene IDSed0022533
OrganismSechium edule (Chayote v1)
DescriptionDUF4228 domain-containing protein
Genome locationLG10:1338355..1339171
RNA-Seq ExpressionSed0022533
SyntenySed0022533
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6581359.1 hypothetical protein SDJN03_21361, partial [Cucurbita argyrosperma subsp. sororia]2.0e-5876.4Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKPSS SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P ++ +KK+ R
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR

Query:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        E P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

XP_022925910.1 uncharacterized protein LOC111433184 [Cucurbita moschata]2.0e-5876.4Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKPSS SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P ++ +KK+ R
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR

Query:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        E P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

XP_022978779.1 uncharacterized protein LOC111478638 [Cucurbita maxima]2.0e-5876.4Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKPSS SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P ++ +KK+ R
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR

Query:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        E P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

XP_023543275.1 uncharacterized protein LOC111803203 isoform X1 [Cucurbita pepo subsp. pepo]9.8e-5875.31Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRS--NNKKRN
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKP+S SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P    + KK+ 
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRS--NNKKRN

Query:  RELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        RE P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  RELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

XP_038882770.1 uncharacterized protein LOC120073923 [Benincasa hispida]9.8e-5874.85Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRS------N
        M N IRCCISCILPCGSLDVIRIVHCNGHVEEIAGTI ASDVMKA+PKHVLKKPSS  SDD +VPKIVI+PPDAQL+RG IYFLMP+PP P         
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRS------N

Query:  NKKRNRELPVTGSG----------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
         KK+ +ELP TG G          SDRYLSEILS +KL+T QKDKRRGRVGVWRPHLESISEFPTDL
Subjt:  NKKRNRELPVTGSG----------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

TrEMBL top hitse value%identityAlignment
A0A0A0KI99 Uncharacterized protein1.2e-5672.02Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPP-----RSNN
        M NAIRCC+SCILPCGSLDVIRIVHC+GHV+EIAG+I ASDVMKA+PKHVLKKPSS  SDD +VPKIVI+PPDA+L+RG IYFLMP+PP P     +S +
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPP-----RSNN

Query:  KKRNRELPVTGSG------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        KK+ +ELP+ G+G            SDRYLSEILS +KL+T QKDKRRGRVGVWRPHLESISEFPTDL
Subjt:  KKRNRELPVTGSG------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

A0A5D3CP55 DUF4228 domain-containing protein2.6e-5672.02Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPP-----RSNN
        M NAIRCCISCILPCGSLDVIRIVHC+GHV+EIAG+I ASDVMKA+PKHVLKKPSS  SDD +VPKIVI+PPDA+L+RG IYFLMP+PP P     +S  
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSS-ASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPP-----RSNN

Query:  KKRNRELPVTGSG------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        KK+ +ELP+ G+G            SD+YLSEILS +KL+T QKDKRRGRVGVWRPHLESISEFPTDL
Subjt:  KKRNRELPVTGSG------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

A0A6J1EJJ7 uncharacterized protein LOC1114331849.5e-5976.4Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKPSS SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P ++ +KK+ R
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR

Query:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        E P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

A0A6J1IUZ9 uncharacterized protein LOC1114786389.5e-5976.4Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR
        M NAIRCCISCI PCGSLDVIRIVHCNGHVEEIAG+I ASD+MKA+PKHVLKKPSS SD G+VPKIVI+PPDA+L+RG IYFLMP+PP P ++ +KK+ R
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPP-PPRSNNKKRNR

Query:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL
        E P T  GSG        SDRYLS+ILS +KLSTGQKDKRRGRVGVWRPHLESISEFP DL
Subjt:  ELPVT--GSG--------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDL

A0A6P3Z812 uncharacterized protein LOC1074108513.9e-5268.42Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSA-SDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPR--------
        M N IRCCISCILPCG+LDVIRIVH NG VEEI+GTI AS++MKAHPKHVLKKPSS  SDDG+VPKIVIVPPDA+L+RG IYFLMP PPPP         
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSA-SDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPR--------

Query:  SNNKKRNRELPVTGSG-------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTD
        S  KKR   + +TG+              SDRYLSEILS +K+ST QKD+RRGRVGVWRPHLESISE PTD
Subjt:  SNNKKRNRELPVTGSG-------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G06980.1 unknown protein3.0e-2841.24Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRSN-------
        M N++RCC++C+LPCG+LD+IRIVH NG+VEEI  +I A ++++A+P HVL KP S    G+V KI+I+ P+++LKRG+IYFL+P    P          
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRSN-------

Query:  NKKRNRELPVTGSGS-------------DRYLSEILSDQKLSTGQKDKRRGR------VGVWRPHLESISEFPTDLN
         +K+N + P   +               ++YL E++S    STG++ + R R      V  WRP L+SISE   DLN
Subjt:  NKKRNRELPVTGSGS-------------DRYLSEILSDQKLSTGQKDKRRGR------VGVWRPHLESISEFPTDLN

AT1G29195.1 unknown protein4.0e-4151.03Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDG------LVPKIVIVPPDAQLKRGNIYFLMPVPP------
        M   IRCCI+CILPCG+LDVIRIVH NGHVEEI+GTI AS++MKAHPKHVLKKPSS + D          KIVIVPP+A+L+RG IYFLMP         
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDG------LVPKIVIVPPDAQLKRGNIYFLMPVPP------

Query:  ---------------PPRSNNKKRNRELPVTGSG------------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTD
                         RS  ++++R+     +G                  SDRYL+EILS +K++T QKD+R+GRVGVWRPHLESISE  T+
Subjt:  ---------------PPRSNNKKRNRELPVTGSG------------------SDRYLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTD

AT2G30230.1 unknown protein1.8e-3039.89Show/hide
Query:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRSNNKKRNRE
        M N++RCC++C+LPCG+LD+IRIVH NGHV+EI   + A ++++A+P HVL KP S    G+V KI+I+ P+++LKRG+IYFL+P    P     K+ +E
Subjt:  MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRSNNKKRNRE

Query:  L----PVTGSGSD---------------------RYLSEILSDQKLSTGQKDKRRGR-------VGVWRPHLESISEFPTDLN
        L        SG+D                     +YL +++  +K+S+  K+ R  R       V  WRPHL+SI+E   DLN
Subjt:  L----PVTGSGSD---------------------RYLSEILSDQKLSTGQKDKRRGR-------VGVWRPHLESISEFPTDLN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAACGCAATCAGATGCTGCATCTCCTGCATACTCCCATGTGGATCTCTCGACGTGATCAGGATCGTCCATTGCAACGGCCACGTGGAGGAAATCGCCGGCACCAT
CCTCGCCAGTGACGTCATGAAGGCCCATCCCAAACACGTCCTCAAGAAACCCTCCTCCGCCTCCGACGACGGCCTCGTCCCCAAGATCGTGATCGTGCCGCCGGACGCCC
AGCTCAAACGCGGCAACATTTACTTTCTCATGCCGGTGCCGCCGCCGCCCCGGTCCAACAACAAGAAGAGGAACAGGGAATTGCCGGTCACCGGTTCGGGTTCGGATCGG
TACTTGAGTGAAATCCTTTCGGATCAGAAATTGTCCACGGGTCAGAAGGATAAGCGGCGCGGTCGGGTCGGAGTTTGGAGGCCTCATTTGGAGAGCATTTCGGAGTTCCC
AACTGATCTTAATTAA
mRNA sequenceShow/hide mRNA sequence
AAAACAAACAACCATTAGCTTTCACTTTCATTTTTCAAATTCTATTATTTCAATTCTCTCTCTCTCTCTCTTTATGAGAAACCACAACCCAAAGAGAAACACAAATCGGC
GCTCATGTTGAACGCAATCAGATGCTGCATCTCCTGCATACTCCCATGTGGATCTCTCGACGTGATCAGGATCGTCCATTGCAACGGCCACGTGGAGGAAATCGCCGGCA
CCATCCTCGCCAGTGACGTCATGAAGGCCCATCCCAAACACGTCCTCAAGAAACCCTCCTCCGCCTCCGACGACGGCCTCGTCCCCAAGATCGTGATCGTGCCGCCGGAC
GCCCAGCTCAAACGCGGCAACATTTACTTTCTCATGCCGGTGCCGCCGCCGCCCCGGTCCAACAACAAGAAGAGGAACAGGGAATTGCCGGTCACCGGTTCGGGTTCGGA
TCGGTACTTGAGTGAAATCCTTTCGGATCAGAAATTGTCCACGGGTCAGAAGGATAAGCGGCGCGGTCGGGTCGGAGTTTGGAGGCCTCATTTGGAGAGCATTTCGGAGT
TCCCAACTGATCTTAATTAAATTTTTAATGCAAATTATAAAAATCCAGCTTGTAGGGCTTTTTTCTTTTTTCCTAAATTGATTCATTTCCCTTTTTTCTATATGAATCTC
TCCCTTATATATATATATGTGTGTATAAATGAAGTTAAAGGGAAATTATATAATTTTGCTCCCTTCATTATAAAGTGAAGATCATTTTGGTATGTCAAATGGAAATTATG
TGCTTTTGTGTTGATACCGATTTTTTTAATCGAACTTGTGATCATGG
Protein sequenceShow/hide protein sequence
MLNAIRCCISCILPCGSLDVIRIVHCNGHVEEIAGTILASDVMKAHPKHVLKKPSSASDDGLVPKIVIVPPDAQLKRGNIYFLMPVPPPPRSNNKKRNRELPVTGSGSDR
YLSEILSDQKLSTGQKDKRRGRVGVWRPHLESISEFPTDLN