CuGenDBv2

Tan0020332 (gene) of Snake gourd v1 genome

Gene ID: Tan0020332
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cupin_3 domain-containing protein
Genome location: LG10:57005870..57009951
RNA-Seq Expression: Tan0020332
Synteny: Tan0020332
Gene Ontology terms:
  GO:0016310 - phosphorylation (biological process)
  GO:0016301 - kinase activity (molecular function)
InterPro domains:
  IPR008579 - (S)-ureidoglycine aminohydrolase, cupin-3 domain
  IPR011051 - RmlC-like cupin domain superfamily
  IPR014710 - RmlC-like jelly roll fold


Homology
GenBank top hits | e value | % identity
XP_004141517.1 uncharacterized protein LOC101218376 [Cucumis sativus] | 2.9e-64 | 89.78
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+VDGFLLLNLKP +LLT P N SLY  KRA SL+IRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF
        AGS ESVEIGAGDLVVFPKGMSCTWDVSV VDKHY F
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF

XP_008459495.1 PREDICTED: uncharacterized protein LOC103498612 [Cucumis melo] | 7.5e-65 | 89.13
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+VDGFL LNLKP +LLT P N SLY  KRA SL+IRADSMATERLGIKVEKNP ESKL+ELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+ESVEIGAGDLVVFPKGMSCTWDVSV VDKHYNFG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

XP_022974049.1 uncharacterized protein LOC111472678 [Cucurbita maxima] | 2.7e-62 | 85.51
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+V+GF LLNLKP T L+ PRN SL   KRA SL+IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+E VE+GAGDLVVFP GMSCTWDVS+ VDKHY FG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

XP_023534884.1 uncharacterized protein LOC111796480 isoform X1 [Cucurbita pepo subsp. pepo] | 5.4e-63 | 86.96
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+V+GF LLNLKP T L+ PRN SL  AKRA SL+IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+E VEIGAGDLVVFP GMSCTWDVS+ VDKHY FG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

XP_038889944.1 uncharacterized protein LOC120079700 [Benincasa hispida] | 8.3e-64 | 88.41
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+V GF LLNLKP  LL+ P N SLY  KRA SL+IRADSMATE LGIKVEKNPPESKLTELGVR+WPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGSEESVEIGAGDLVVFPKGMSCTWDVSV VDKHYNFG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

TrEMBL top hits | e value | % identity
A0A0A0KVZ0 Cupin_3 domain-containing protein | 1.4e-64 | 89.78
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+VDGFLLLNLKP +LLT P N SLY  KRA SL+IRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF
        AGS ESVEIGAGDLVVFPKGMSCTWDVSV VDKHY F
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF

A0A1S3C9T6 uncharacterized protein LOC103498612 | 3.6e-65 | 89.13
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+VDGFL LNLKP +LLT P N SLY  KRA SL+IRADSMATERLGIKVEKNP ESKL+ELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+ESVEIGAGDLVVFPKGMSCTWDVSV VDKHYNFG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

A0A6J1DS18 uncharacterized protein LOC111023000 | 2.2e-62 | 84.06
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+VDGFLLLNL+P  LL+ PR  SL+S K A SL +RA+SMATE+LGIK+EKNPPESKLT+LGVRQWPKWGC PSKFPWTYSDKETC+LLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGSEESVEIG+GDLVVFPKGMSCTWDVSV VDKHYNFG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

A0A6J1EQ24 uncharacterized protein LOC111436648 | 2.9e-62 | 85.51
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+++GF LLNLKP T L+ PRN SL  AK A SL+IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+E VEIGAGDLVVFP GMSCTWDVS+ VDKHY FG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

A0A6J1IGD9 uncharacterized protein LOC111472678 | 1.3e-62 | 85.51
Query:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
        MAS+V+GF LLNLKP T L+ PRN SL   KRA SL+IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP
Subjt:  MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP

Query:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG
        AGS+E VE+GAGDLVVFP GMSCTWDVS+ VDKHY FG
Subjt:  AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNFG

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT3G04300.1 RmlC-like cupins superfamily protein | 8.4e-30 | 57.78
Query:  LGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF
        + I +E NP   +L++LGV  WPKW C P K+   + ++ETCYL++GKVKV P GS E VE GAGDLV  PKG+SCTWDVS+ +DKHY F
Subjt:  LGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDKHYNF

AT4G10280.1 RmlC-like cupins superfamily protein | 6.4e-22 | 46.6
Query:  IRADS-MATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---SEESVEIGAGDLVVFPKGMSCTWDVSVTVDK
        + +DS + TE  G+K+ +   ++KL +LGV  WPKW   PSKFPW +   ET Y +EGKVKV   G    EE+ EIG GD+VVFPK M   W+++  V K
Subjt:  IRADS-MATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAG---SEESVEIGAGDLVVFPKGMSCTWDVSVTVDK

Query:  HYN
         Y+
Subjt:  HYN

AT4G10290.1 RmlC-like cupins superfamily protein | 6.9e-16 | 43.56
Query:  IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSE---ESVEIGAGDLVVFPKGMSCTWDVSVTVDKH
        I A  + TE  G+KV +   ++KL ELGV  W  W   P KFPW +   ET Y +EGK+KV         E++E  AGDLVVFP+ M+   DV   V K 
Subjt:  IRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSE---ESVEIGAGDLVVFPKGMSCTWDVSVTVDKH

Query:  Y
        Y
Subjt:  Y

AT4G10300.1 RmlC-like cupins superfamily protein | 5.4e-45 | 72.73
Query:  YSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIGAGDLVVFPKGMSCTWDV
        Y+++R  S+     + +TE+LGI +EKNPPESKLT+LGVR WPKWGC PSKFPWTYS KETCYLL+GKVKV P GS+E VEI AGD VVFPKGMSCTWDV
Subjt:  YSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIGAGDLVVFPKGMSCTWDV

Query:  SVTVDKHYNF
        SV VDKHY F
Subjt:  SVTVDKHYNF

AT4G28703.1 RmlC-like cupins superfamily protein | 1.9e-26 | 52.88
Query:  MATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP---------AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDK
        MA +   I VE+NP +++L EL  + WPKWGC P K+   Y  +E CY+L GKVKV P         A  E  VE GAGD+V FPKG+SCTWDVS++VDK
Subjt:  MATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP---------AGSEESVEIGAGDLVVFPKGMSCTWDVSVTVDK

Query:  HYNF
        HY F
Subjt:  HYNF
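
The % identity values listed for the hits can be recomputed from the middle (match) line of each alignment. The sketch below does this for the first GenBank hit, XP_004141517.1. It assumes the usual BLAST-style convention, which this page does not state explicitly: a letter on the match line marks an identical residue, "+" a conservative substitution, and a space a mismatch, with identity taken as identical positions divided by alignment length. The function name percent_identity is illustrative only.

```python
# Match lines of the two alignment blocks for XP_004141517.1, copied from the
# GenBank hit above (internal spaces are significant).
MIDLINE_BLOCKS = [
    "MAS+VDGFLLLNLKP +LLT P N SLY  KRA SL+IRADSMATERLGIKVEKNP ESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTP",
    "AGS ESVEIGAGDLVVFPKGMSCTWDVSV VDKHY F",
]


def percent_identity(midline_blocks):
    """Return identical positions as a percentage of the alignment length.

    Assumption: letters mark identities; '+' and spaces are non-identical.
    """
    midline = "".join(midline_blocks)
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(midline)


pct = percent_identity(MIDLINE_BLOCKS)
print(f"{pct:.2f}")
```

For these two blocks the match line has 123 letters over 137 aligned positions, so the sketch gives 89.78, the same figure listed for this hit.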


Sequences
CDS sequence
ATGGCTTCCTCTGTGGACGGCTTCTTATTACTCAATTTAAAACCATCCACCCTCCTCACTAATCCTAGAAACTTCTCTCTGTACTCTGCAAAAAGGGCACTCTCTCTTCG
AATTCGAGCAGATTCTATGGCGACTGAGAGGCTGGGCATCAAAGTTGAGAAGAATCCTCCTGAATCCAAGCTTACTGAACTCGGCGTCCGTCAATGGCCCAAGTGGGGAT
GTGGACCGAGCAAATTCCCATGGACATATTCGGACAAGGAGACCTGTTATCTGCTAGAAGGAAAGGTAAAGGTTACTCCTGCTGGGTCCGAGGAGTCAGTTGAGATTGGC
GCTGGGGATTTGGTTGTGTTCCCAAAAGGAATGAGCTGCACTTGGGATGTCTCAGTTACTGTGGACAAGCACTATAACTTTGGTTAG
mRNA sequence
GAATTACCAGATAAGGAACGAGTAGATGGAATCAATTCCAAGTGGCATTCAAAGACACTGAACATGAACTTGATAATCGATATGGCCGTTGCTTTCCCTTCATGAGAAAC
CCAAAAAATTGGGGGCTTTGGGAATTGGGATGGCTTCCTCTGTGGACGGCTTCTTATTACTCAATTTAAAACCATCCACCCTCCTCACTAATCCTAGAAACTTCTCTCTG
TACTCTGCAAAAAGGGCACTCTCTCTTCGAATTCGAGCAGATTCTATGGCGACTGAGAGGCTGGGCATCAAAGTTGAGAAGAATCCTCCTGAATCCAAGCTTACTGAACT
CGGCGTCCGTCAATGGCCCAAGTGGGGATGTGGACCGAGCAAATTCCCATGGACATATTCGGACAAGGAGACCTGTTATCTGCTAGAAGGAAAGGTAAAGGTTACTCCTG
CTGGGTCCGAGGAGTCAGTTGAGATTGGCGCTGGGGATTTGGTTGTGTTCCCAAAAGGAATGAGCTGCACTTGGGATGTCTCAGTTACTGTGGACAAGCACTATAACTTT
GGTTAGTGCTACAATAAAACTTGCTGTATTATCTTCAGCCTTTGTATTTTGATGAACTCCATGACTGCCTTTCTTGTTCGATGACGGGCGTGTAGGGTTCTATAACTGAT
TTGCTTCCATAACAAATTCAGTCGATATGAATCCAGATCACATATGAGATTGAGGTTTCTAATGGGAACCTAAACTTTTTCTTGATATATATAAGCAGCGGTAAAAGGAT
AAAAATACACGAATACTAGGTTTATATGGATCCACTCCGACGCAGGGCTATGCCTTGTTGTTGGTAGACTTTGCAGCTTCATTAGTAATGTAGAGATGCAAAGGCATACA
CATCACTTTGTAACACATAACTAGTTAGGTTAACTGTGAGGGCGAAATTTGATTTCACTGAACGAAAGGACACCCCATCTCTCTTTGGATGTGTTGGTGTTGATATAATT
AAATTTGCCATAATCTATCAGCTTAAACTTTTGGGTTAATTGGTGATTTACATGGTATCAGAGCAGAAGGTCCTGTGTTCGAACCCTGGTGAAGTTGTTTTCTCCCAATT
AATATTAATAAATTTTCAAGCCCACAAGTGAGGGAGAGCTTAAGCTTTTGGGGTTAATTGGTGATTTACTGGATGATAGATTGTTAAGGAAAAGGTCCAAGAATCTGTCA
AGAGAAAATTAGAGGAAACTAGGACTAGGGGGATTGTGGGATTTACAATTGTAAATTTGTTGAATTTAAA
Protein sequence
MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIG
AGDLVVFPKGMSCTWDVSVTVDKHYNFG
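
As a consistency check on the three sequences, the CDS can be translated and compared with the protein sequence. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record itself does not say which table was used. The names CODON_TABLE and translate are illustrative, and the sequences are copied from the record with the line wrapping removed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS of Tan0020332, joined from the wrapped lines of the record.
CDS = (
    "ATGGCTTCCTCTGTGGACGGCTTCTTATTACTCAATTTAAAACCATCCACCCTCCTCACTAATCCTAGAAACTTCTCTCTGTACTCTGCAAAAAGGGCACTCTCTCTTCG"
    "AATTCGAGCAGATTCTATGGCGACTGAGAGGCTGGGCATCAAAGTTGAGAAGAATCCTCCTGAATCCAAGCTTACTGAACTCGGCGTCCGTCAATGGCCCAAGTGGGGAT"
    "GTGGACCGAGCAAATTCCCATGGACATATTCGGACAAGGAGACCTGTTATCTGCTAGAAGGAAAGGTAAAGGTTACTCCTGCTGGGTCCGAGGAGTCAGTTGAGATTGGC"
    "GCTGGGGATTTGGTTGTGTTCCCAAAAGGAATGAGCTGCACTTGGGATGTCTCAGTTACTGTGGACAAGCACTATAACTTTGGTTAG"
)

# Protein sequence of Tan0020332, joined the same way.
PROTEIN = (
    "MASSVDGFLLLNLKPSTLLTNPRNFSLYSAKRALSLRIRADSMATERLGIKVEKNPPESKLTELGVRQWPKWGCGPSKFPWTYSDKETCYLLEGKVKVTPAGSEESVEIG"
    "AGDLVVFPKGMSCTWDVSVTVDKHYNFG"
)


def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


translated = translate(CDS)
# Compare after dropping the terminal stop codon symbol.
print(translated.rstrip("*") == PROTEIN)
```

The same translate helper could be applied to the coding region inside the mRNA sequence once its start position is located, for example with str.find on the CDS string.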