CuGenDBv2

CmaCh14G014170 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh14G014170
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: SANT domain-containing protein
Genome location: Cma_Chr14:10908220..10912951
RNA-Seq Expression: CmaCh14G014170
Synteny: CmaCh14G014170
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR001005 - SANT/Myb domain
                  IPR009057 - Homeobox-like domain superfamily
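
The genome location field follows a `chromosome:start..end` pattern. As a minimal sketch of how it could be read programmatically, the snippet below splits the string and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names, not part of CuGenDB.

```python
from typing import NamedTuple


class Location(NamedTuple):
    chromosome: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string (hypothetical helper)."""
    chromosome, _, span = text.partition(":")
    start, _, end = span.partition("..")
    return Location(chromosome, int(start), int(end))


loc = parse_location("Cma_Chr14:10908220..10912951")
print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the locus spans 4732 bp on Cma_Chr14, which covers introns and untranslated regions as well as the coding sequence.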


Homology
GenBank top hits (e value, % identity, alignment)
KAG6581997.1 hypothetical protein SDJN03_21999, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.8e-47 | % identity: 100
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

XP_004152146.1 uncharacterized protein LOC101222201 isoform X2 [Cucumis sativus] | e value: 7.4e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

XP_011653023.1 uncharacterized protein LOC101222201 isoform X1 [Cucumis sativus] | e value: 7.4e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

XP_016901549.1 PREDICTED: uncharacterized protein LOC103494613 isoform X2 [Cucumis melo] | e value: 7.4e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

XP_022955472.1 uncharacterized protein LOC111457487 [Cucurbita moschata] | e value: 1.8e-47 | % identity: 100
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KX84 SANT domain-containing protein | e value: 3.6e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

A0A1S3BYL7 uncharacterized protein LOC103494613 isoform X1 | e value: 3.6e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

A0A1S4E017 uncharacterized protein LOC103494613 isoform X2 | e value: 3.6e-46 | % identity: 97
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP ADNSSSA+AMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

A0A6J1GTR4 uncharacterized protein LOC111457487 | e value: 8.6e-48 | % identity: 100
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN

A0A6J1I3R2 uncharacterized protein LOC111470710 | e value: 8.9e-45 | % identity: 94
Query:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
        MANPSGNH EAGQPSSSFDGGNPSNGNSTPVP AD+SSSA+AMKHNPGISTDWTSDEQ+TLEEGLKKYA ESSVIRYAKIAMQLPNKTVRDVALRCRWMN
Subjt:  MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMN
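
In these alignment blocks the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank for a mismatch. As a rough sketch of how a % identity figure relates to that midline, the hypothetical helper below counts letter columns over the total number of columns. It assumes the midline string still has one character per alignment column; scraped text can lose trailing blanks, and the displayed alignment may not cover the full hit.

```python
def percent_identity(midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Letters mark identical residues; '+' and blanks do not count.
    Hypothetical helper, assuming one midline character per column.
    """
    if not midline:
        raise ValueError("midline is empty")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


# Toy midline: 6 identical columns, 1 blank (mismatch), 1 '+' (conservative).
print(percent_identity("MAN SG+H"))
```

For the A0A6J1I3R2 hit, the midline shows six non-letter columns (three blanks and three `+`), which agrees with the reported 94 if the alignment is 100 columns long.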

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G10820.1 Protein of unknown function (DUF3755) | e value: 3.4e-12 | % identity: 50
Query:  VPVADNSSS-AIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWM
        +P  D S S A  +K    +  DW+ +EQ  LE GL K   E  + +Y KIA  LP+KTVRDVALRCRWM
Subjt:  VPVADNSSS-AIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWM

AT3G07565.1 Protein of unknown function (DUF3755) | e value: 1.5e-20 | % identity: 50.93
Query:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+ E    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM
        VALRCRWM
Subjt:  VALRCRWM

AT3G07565.2 Protein of unknown function (DUF3755) | e value: 1.5e-20 | % identity: 50.93
Query:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+ E    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM
        VALRCRWM
Subjt:  VALRCRWM

AT3G07565.3 Protein of unknown function (DUF3755) | e value: 1.5e-20 | % identity: 50.93
Query:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+ E    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM
        VALRCRWM
Subjt:  VALRCRWM

AT3G07565.4 Protein of unknown function (DUF3755) | e value: 1.5e-20 | % identity: 50.93
Query:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD
        ANPSGN+ E    +      + +  N   V            ADNS +  A++HNPGISTDWT +EQ  LE+ L KYA E SV RYAKIAM++ +KTVRD
Subjt:  ANPSGNHHEAGQPSSSFDGGNPSNGNSTPV----------PVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRD

Query:  VALRCRWM
        VALRCRWM
Subjt:  VALRCRWM


Sequences
CDS sequence
ATGGCTAACCCATCTGGGAACCATCACGAAGCGGGCCAACCATCGTCTTCCTTTGATGGAGGGAACCCCAGCAATGGTAATTCGACTCCTGTGCCGGTGGCGGAT
AATTCGAGCTCGGCTATTGCTATGAAGCACAACCCAGGTATCTCGACGGATTGGACATCTGATGAGCAGGTCACTCTGGAAGAAGGGCTTAAGAAATATGCTGCA
GAGTCTAGTGTTATTAGGTATGCAAAGATTGCAATGCAGCTACCAAATAAGACCGTACGAGATGTTGCTTTGCGTTGCAGATGGATGAATGTCGGTTCTTTTTCT
CAGTTCTTGCCTTTATCAACATATAAATTATTTACAAGATTTCTGTTACTTGACTTGAAATCATGCATACCATTAGCATGA
mRNA sequence
GGAAGTTAAAAAGATGTAACGTTAGCTTACGAATATGTTGACACCTTCCGACGTCAAACCTCCAAACCCTCTCTCGAATTGCTCTCTCGTCTCTGCAACATCCTT
TAGTTTTGTTTAGTTTACTTGAGCTCCCTTCAGTCTTATTCCCTACAAATCTTTCACATTTGTTTCCCTCCTTTTTGTTTTTTGTTTCTTCACTGATTGCTTCAT
CTATCTTCAATTTTGGTTACTGTTTGTTTGTTTTTGTTTGTTTTTTGGGCTTCCTTCTTCTGGGTTCATTGATTTTGGTTGAAGGAATCTGAATTGGCGGGAGCC
AACGTTCAATTATGTGGTTTTAGGAGGTGGGTTTTGCTTGGATTTGGATTTGGGATTTGGGATTGGAAAAGGTTTCGAAATTGTTGGGTGTTTTGATGGCTAACC
CATCTGGGAACCATCACGAAGCGGGCCAACCATCGTCTTCCTTTGATGGAGGGAACCCCAGCAATGGTAATTCGACTCCTGTGCCGGTGGCGGATAATTCGAGCT
CGGCTATTGCTATGAAGCACAACCCAGGTATCTCGACGGATTGGACATCTGATGAGCAGGTCACTCTGGAAGAAGGGCTTAAGAAATATGCTGCAGAGTCTAGTG
TTATTAGGTATGCAAAGATTGCAATGCAGCTACCAAATAAGACCGTACGAGATGTTGCTTTGCGTTGCAGATGGATGAATGTCGGTTCTTTTTCTCAGTTCTTGC
CTTTATCAACATATAAATTATTTACAAGATTTCTGTTACTTGACTTGAAATCATGCATACCATTAGCATGA
Protein sequence
MANPSGNHHEAGQPSSSFDGGNPSNGNSTPVPVADNSSSAIAMKHNPGISTDWTSDEQVTLEEGLKKYAAESSVIRYAKIAMQLPNKTVRDVALRCRWMNVGSFS
QFLPLSTYKLFTRFLLLDLKSCIPLA
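
The protein sequence corresponds to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard nuclear genetic code; `translate` is a hypothetical helper, not a CuGenDB tool. The example input is the final line of the CDS as displayed on this page, which begins on a codon boundary.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# this nuclear gene uses the standard table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# Final line of the CDS shown on this page, including the TGA stop codon.
cds_tail = (
    "CAGTTCTTGCCTTTATCAACATATAAATTATTTACAAGATTTCTGTTACTTGACTTGAAA"
    "TCATGCATACCATTAGCATGA"
)
print(translate(cds_tail))
```

The printed peptide should match the last line of the protein sequence, QFLPLSTYKLFTRFLLLDLKSCIPLA. Passing the whole CDS with its line breaks works the same way, since whitespace is removed before translation.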