; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G003620 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G003620
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionDUF4228 domain protein
Genome locationCmo_Chr10:1644822..1645331
RNA-Seq ExpressionCmoCh10G003620
SyntenyCmoCh10G003620
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589641.1 hypothetical protein SDJN03_15064, partial [Cucurbita argyrosperma subsp. sororia]9.2e-8198.74Show/hide
Query:  MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF
        MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEMRSLTYTATKALKHGNSSG 
Subjt:  MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF

Query:  GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

KAG7023330.1 hypothetical protein SDJN02_14355, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-8698.82Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASC PSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEMRSLTYTATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

XP_022921761.1 uncharacterized protein LOC111429917 [Cucurbita moschata]2.3e-87100Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

XP_022987495.1 uncharacterized protein LOC111485040 [Cucurbita maxima]2.1e-8598.22Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPSMASNGAAKVLSLDGKLQSY K VQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEMRSLTYTATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

XP_023516747.1 uncharacterized protein LOC111780552 [Cucurbita pepo subsp. pepo]1.8e-8199.37Show/hide
Query:  MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF
        MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEMRSLTYTATKALKHGNSSGF
Subjt:  MASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF

Query:  GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  GRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

TrEMBL top hitse value%identityAlignment
A0A5D3E1W3 DUF4228 domain protein1.8e-6983.33Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPS+ASNGAAKVLSLDG LQS+ KPV AAELMIEHSGKFLCDS+DL VGHRIQGLLPDEDLE RRLYFLLPMDLLYSVLT+EEM SLT+ ATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GNSSGFGRIFPVLI++ C S +DV  LK  D D EN+SSK V+RLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1C0W6 uncharacterized protein LOC1110072545.8e-7385.88Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPSM SNGAAKVLSLDGKL+SY KPV+AAELMIE+SGKFLCDS DL VGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEM SLTY ATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GNSSGFGRIFPVLI++LCI  S+VNRLKS   DR  EN + KPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDR--ENRSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1E2A0 uncharacterized protein LOC1114299171.1e-87100Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

A0A6J1HLR9 uncharacterized protein LOC1114653635.4e-7185.8Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSA CAPSMA NGAAKVL+LDG LQS+ KPV+AAELMIEHSGKFLCDS DL VGHRI GLL +E LECRRLYFLLPMDLLYSVLTIEEM SLTY ATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSC
        ALK GNSSGFGRIFPVLI++LCI  SDVN LKSADGD E N+SSKPVQRLMSKQRSWKPALETIAETSC
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRE-NRSSKPVQRLMSKQRSWKPALETIAETSC

A0A6J1JH13 uncharacterized protein LOC1114850401.0e-8598.22Show/hide
Query:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK
        MGNSASCAPSMASNGAAKVLSLDGKLQSY K VQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLT+EEMRSLTYTATK
Subjt:  MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATK

Query:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
        ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA
Subjt:  ALKHGNSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G18290.1 unknown protein5.5e-2338.07Show/hide
Query:  MGNSASCAP----SMASNGAAKVLS-LDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMRSL
        MGN++SCAP    + +S+G  K+L+   G L+ + KP++ ++++  HSG F+ DS  L + HR+  + PDE L  RR LY LLP D+L+SVLT EE+  +
Subjt:  MGNSASCAP----SMASNGAAKVLS-LDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRR-LYFLLPMDLLYSVLTIEEMRSL

Query:  TYTATKALKHGNSSGFGRIFPVLI----TDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAET
        +  A + L     +   RIFPV I     D   + S VN  ++ DG     +        SK  SW+P LETI E+
Subjt:  TYTATKALKHGNSSGFGRIFPVLI----TDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAET

AT1G28190.1 unknown protein2.5e-0434.83Show/hide
Query:  PSMASNGA-AKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLL---YSVLTIEEMRS
        P   SNG   K++  DG LQ Y +PV  +EL  +     +C S  L +G +   L   E L+    YFLLP D      S LTI  +++
Subjt:  PSMASNGA-AKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLL---YSVLTIEEMRS

AT3G03280.1 unknown protein9.0e-1031.43Show/hide
Query:  MGNSASCAPSMASNG-AAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMRSLTYT
        MGN  SCA +  S+   AKV+  DG ++    P +AAELM+E    FL D+  + VG +   L  D+DL+     +Y   PM    S     +M  L  T
Subjt:  MGNSASCAPSMASNG-AAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECR--RLYFLLPMDLLYSVLTIEEMRSLTYT

Query:  ATKALKHGNSSGFGRIFP-------VLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAE
          K  K   + G  R+ P       V +    ++L D+    +A+          + R+ S  +S KP LETIAE
Subjt:  ATKALKHGNSSGFGRIFP-------VLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAE

AT4G37240.1 unknown protein4.1e-1028.36Show/hide
Query:  CAPSMASNGA-AKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHG
        C+ S ++  A AK++  DG++  +  PV+   +++++   F+C+S D+     +  +  DE+L+  ++YF LP+  L   L  EEM +L   A+ AL  G
Subjt:  CAPSMASNGA-AKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHG

Query:  NSSGFGRIFPVLITDLCIS--LSDVNRLKSADGD
           G  R         C+   +SD  R++   GD
Subjt:  NSSGFGRIFPVLITDLCIS--LSDVNRLKSADGD

AT5G66580.1 unknown protein4.5e-0930.53Show/hide
Query:  AAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF
        +AK++ LDG LQ +  PV+  +++ ++   F+C+S ++     +  +  +E+L   +LYF+LP+  L   L  EEM +L   A+ AL      G+
Subjt:  AAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHGNSSGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAATTCTGCGTCTTGTGCGCCTTCAATGGCCTCCAATGGCGCCGCGAAGGTCTTATCCTTAGACGGAAAATTGCAGAGCTACAGGAAGCCAGTGCAGGCC
GCGGAACTAATGATCGAGCATTCCGGCAAATTCCTCTGCGATTCCGCTGATCTCAGCGTCGGCCACCGGATTCAAGGCCTTTTGCCAGACGAAGATCTCGAGTGC
CGGCGATTGTACTTTCTACTTCCGATGGATCTCCTCTACTCCGTGCTAACGATCGAAGAAATGAGATCTCTGACGTACACCGCTACAAAGGCTCTGAAACATGGA
AATTCGAGTGGATTTGGAAGGATCTTCCCTGTTTTGATCACTGATCTCTGTATTTCTCTGTCGGATGTGAATCGATTGAAATCGGCGGACGGCGATCGAGAGAAT
CGAAGTTCGAAACCGGTGCAGAGATTGATGTCGAAACAGAGATCGTGGAAGCCTGCGCTCGAAACCATCGCTGAAACTTCGTGCGCGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAATTCTGCGTCTTGTGCGCCTTCAATGGCCTCCAATGGCGCCGCGAAGGTCTTATCCTTAGACGGAAAATTGCAGAGCTACAGGAAGCCAGTGCAGGCC
GCGGAACTAATGATCGAGCATTCCGGCAAATTCCTCTGCGATTCCGCTGATCTCAGCGTCGGCCACCGGATTCAAGGCCTTTTGCCAGACGAAGATCTCGAGTGC
CGGCGATTGTACTTTCTACTTCCGATGGATCTCCTCTACTCCGTGCTAACGATCGAAGAAATGAGATCTCTGACGTACACCGCTACAAAGGCTCTGAAACATGGA
AATTCGAGTGGATTTGGAAGGATCTTCCCTGTTTTGATCACTGATCTCTGTATTTCTCTGTCGGATGTGAATCGATTGAAATCGGCGGACGGCGATCGAGAGAAT
CGAAGTTCGAAACCGGTGCAGAGATTGATGTCGAAACAGAGATCGTGGAAGCCTGCGCTCGAAACCATCGCTGAAACTTCGTGCGCGTAG
Protein sequenceShow/hide protein sequence
MGNSASCAPSMASNGAAKVLSLDGKLQSYRKPVQAAELMIEHSGKFLCDSADLSVGHRIQGLLPDEDLECRRLYFLLPMDLLYSVLTIEEMRSLTYTATKALKHG
NSSGFGRIFPVLITDLCISLSDVNRLKSADGDRENRSSKPVQRLMSKQRSWKPALETIAETSCA