CuGenDBv2

CmoCh20G000070 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G000070
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of Unknown Function (DUF239)
Genome location: Cmo_Chr20:102063..103893
RNA-Seq Expression: CmoCh20G000070
Synteny: CmoCh20G000070
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0031225 - anchored component of membrane (cellular component)
InterPro domains: IPR004314 - Neprosin


Homology
GenBank top hits (e value, % identity, alignment)
KAG6570311.1 hypothetical protein SDJN03_29226, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.9e-38; identity: 69.33%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGKT---KNPTAMASKPDTVGIGAITFTMEALAETKTAHEGYHFPLLAMEAIKLKLF
        S+   YLAD+ASMIEWGGEVVNSEAGGLHTLTQMG+GHFPEEGFGKT   +N   + S                 +    A +G     +AMEAIKLKLF
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGKT---KNPTAMASKPDTVGIGAITFTMEALAETKTAHEGYHFPLLAMEAIKLKLF

Query:  LVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        LVMLLVAFAATTAQPAAPSEAPAPAPASDAPLF PTFFASLSALVFAFFL
Subjt:  LVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

XP_022944304.1 arabinogalactan peptide 13-like [Cucurbita moschata] (e value: 1.7e-18; identity: 100%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

XP_023513159.1 arabinogalactan peptide 14-like [Cucurbita pepo subsp. pepo] (e value: 3.0e-15; identity: 94.92%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        MEAIKLKL LVMLLVAFAATTAQPAAPSE  APAPASDAPLFYPTFFASLSALVFAFFL
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

XP_038901449.1 uncharacterized protein LOC120088312 [Benincasa hispida] (e value: 2.1e-13; identity: 86.96%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YLAD+ASMIEWGGEVVNSEA GLHTLTQMGSGHFPEEGFGK
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

XP_038902540.1 arabinogalactan protein 13-like [Benincasa hispida] (e value: 5.6e-14; identity: 86.44%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        MEAIKLKLFLV+LL+AFA T AQ A PSEAPAPAPASDA LFYPTFFASLSAL+FAFFL
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TEM0 Uncharacterized protein (e value: 1.0e-13; identity: 86.96%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YLAD+ASMIEWGGEVVNSEA GLHTLTQMGSGHFPEEGFGK
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

A0A5D3DDU5 Uncharacterized protein (e value: 1.0e-13; identity: 86.96%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YLAD+ASMIEWGGEVVNSEA GLHTLTQMGSGHFPEEGFGK
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

A0A6J1FWG8 arabinogalactan peptide 13-like (e value: 8.2e-19; identity: 100%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

A0A6J1J5K6 arabinogalactan peptide 13-like (e value: 8.2e-19; identity: 100%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
        MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL

A0A6J1J6Z9 uncharacterized protein LOC111484047 (e value: 1.0e-13; identity: 86.96%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+   YLA++ASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

SwissProt top hits (e value, % identity, alignment)
Q9LVC0 Arabinogalactan protein 14 (e value: 2.9e-05; identity: 50.85%)
Query:  MEAIKLKLFLVMLLVAFA-ATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFF
        MEA+K+KL++V+L+   A +T  Q  A  +APAP+P SDA  F PTFFAS++ + F FF
Subjt:  MEAIKLKLFLVMLLVAFA-ATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFF

Q9STQ3 Arabinogalactan protein 13 (e value: 7.6e-06; identity: 52.63%)
Query:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAF
        MEA+K++LF+ +L+ A A +  Q AA  EAPAP+P SDA L  P FFAS++ L F F
Subjt:  MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAF

Arabidopsis top hits (e value, % identity, alignment)
AT1G55360.1 Protein of Unknown Function (DUF239) (e value: 1.1e-12; identity: 69.57%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YL ++ASMIEWGGEVVNS++ G HT TQMGSG FPEEGF K
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

AT3G13510.1 Protein of Unknown Function (DUF239) (e value: 1.7e-13; identity: 71.74%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YL ++ASMIEWGGEVVNS++ G HT TQMGSGHFPEEGF K
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

AT5G18460.1 Protein of Unknown Function (DUF239) (e value: 5.1e-13; identity: 73.81%)
Query:  TYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        T+LAD A+ +EWGGEVVN+ A G HT TQMGSGHFP+EGFGK
Subjt:  TYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

AT5G56530.1 Protein of Unknown Function (DUF239) (e value: 4.3e-12; identity: 67.39%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YLAD+AS++EWGGEVVN E  G HT TQMGSG FP+EGF K
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK

AT5G56530.2 Protein of Unknown Function (DUF239) (e value: 4.3e-12; identity: 67.39%)
Query:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
        S+  +YLAD+AS++EWGGEVVN E  G HT TQMGSG FP+EGF K
Subjt:  SYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGK
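
The % identity values reported for the hits above can be reproduced from the alignment midline (the unlabeled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter in the midline marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical, and the example strings are copied from the XP_023513159.1 hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity = identical positions / alignment length * 100.

    Assumes a BLAST-style midline: letters are identities, '+' and
    spaces are non-identical positions. Gap characters in the query
    count toward the alignment length.
    """
    length = len(query)
    # Trailing spaces in the midline are often stripped by text extraction.
    midline = midline.ljust(length)[:length]
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / length, 2)


# Query and midline of the XP_023513159.1 alignment.
query = "MEAIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL"
midline = "MEAIKLKL LVMLLVAFAATTAQPAAPSE  APAPASDAPLFYPTFFASLSALVFAFFL"

print(percent_identity(query, midline))  # 94.92
```

Here 56 of the 59 aligned positions are identical, which matches the 94.92 reported for that hit.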


Sequences
CDS sequence
ATGAGGGACACTGGCGGATGCAATTTGGCAATGAGTATGTGTTGGGATATTGGCCGTCTTTCTTATTCTTGTACCTACCTGGCTGACACTGCCTCCATGATTGAGTGGGG
AGGGGAGGTTGTGAATTCAGAGGCTGGTGGACTGCACACCTTAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAGGGATTTGGGAAGACTAAAAATCCAACTGCTATGG
CGTCCAAACCGGATACAGTGGGGATTGGGGCCATTACTTTTACTATGGAGGCCCTAGCAGAAACCAAAACTGCCCATGAGGGCTACCACTTTCCTCTCCTTGCAATGGAG
GCAATCAAGTTGAAGCTCTTCCTCGTAATGCTGCTGGTGGCCTTCGCTGCCACCACTGCCCAACCTGCTGCACCGTCTGAGGCTCCTGCTCCTGCTCCTGCCTCTGATGC
TCCTCTCTTTTACCCCACCTTCTTTGCTTCTCTTTCTGCTCTTGTTTTTGCCTTCTTCCTTTAA
mRNA sequence
ATGAGGGACACTGGCGGATGCAATTTGGCAATGAGTATGTGTTGGGATATTGGCCGTCTTTCTTATTCTTGTACCTACCTGGCTGACACTGCCTCCATGATTGAGTGGGG
AGGGGAGGTTGTGAATTCAGAGGCTGGTGGACTGCACACCTTAACCCAGATGGGCAGTGGTCATTTTCCTGAAGAGGGATTTGGGAAGACTAAAAATCCAACTGCTATGG
CGTCCAAACCGGATACAGTGGGGATTGGGGCCATTACTTTTACTATGGAGGCCCTAGCAGAAACCAAAACTGCCCATGAGGGCTACCACTTTCCTCTCCTTGCAATGGAG
GCAATCAAGTTGAAGCTCTTCCTCGTAATGCTGCTGGTGGCCTTCGCTGCCACCACTGCCCAACCTGCTGCACCGTCTGAGGCTCCTGCTCCTGCTCCTGCCTCTGATGC
TCCTCTCTTTTACCCCACCTTCTTTGCTTCTCTTTCTGCTCTTGTTTTTGCCTTCTTCCTTTAA
Protein sequence
MRDTGGCNLAMSMCWDIGRLSYSCTYLADTASMIEWGGEVVNSEAGGLHTLTQMGSGHFPEEGFGKTKNPTAMASKPDTVGIGAITFTMEALAETKTAHEGYHFPLLAME
AIKLKLFLVMLLVAFAATTAQPAAPSEAPAPAPASDAPLFYPTFFASLSALVFAFFL
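
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1); the helper name `translate` is hypothetical, and the two short fragments are copied from the start and the end of the CDS rather than the full sequence.

```python
# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
cds_start = "ATGAGGGACACTGGCGGATGCAATTTGGCA"
cds_end = "GCCTTCTTCCTTTAA"

print(translate(cds_start))  # MRDTGGCNLA
print(translate(cds_end))    # AFFL
```

Both fragments agree with the listed protein, which begins with MRDTGGCNLA and ends with AFFL. Passing the full CDS, with its line breaks, to the same function should yield the complete protein sequence.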