; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021151 (gene) of Snake gourd v1 genome

Gene IDTan0021151
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCupin_3 domain-containing protein
Genome locationLG06:17962806..17963869
RNA-Seq ExpressionTan0021151
SyntenyTan0021151
Gene Ontology termsNA
InterPro domainsIPR008579 - (S)-ureidoglycine aminohydrolase, cupin-3 domain
IPR011051 - RmlC-like cupin domain superfamily
IPR014710 - RmlC-like jelly roll fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592849.1 hypothetical protein SDJN03_12325, partial [Cucurbita argyrosperma subsp. sororia]4.5e-5293.75Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRII+ERNPSQAKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

KAG7025258.1 hypothetical protein SDJN02_11753 [Cucurbita argyrosperma subsp. argyrosperma]3.5e-5294.64Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRIIVERNPSQAKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

XP_022960185.1 uncharacterized protein LOC111460998 [Cucurbita moschata]5.9e-5294.64Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRIIVERNPSQAKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

XP_023005099.1 uncharacterized protein LOC111498188 [Cucurbita maxima]1.3e-5193.75Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRIIVERNPS+AKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

XP_038906875.1 uncharacterized protein LOC120092760 [Benincasa hispida]8.3e-4684.03Show/hide
Query:  MANEEGNSNNNPST-NNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKG------SADSEYTEFGAGDLVTIPKGLS
        MAN E   N NPST NNL+IIVERNPSQAKLSQLNI  WPKWGCSAGKYQLKFEAEETCYLVKGKVKAY KG      S+  EYTEFGAGDLVTIPKGLS
Subjt:  MANEEGNSNNNPST-NNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKG------SADSEYTEFGAGDLVTIPKGLS

Query:  CTWDVSVAVDKFYKFESQS
        CTWDVSVAVDKFYKFESQS
Subjt:  CTWDVSVAVDKFYKFESQS

TrEMBL top hitse value%identityAlignment
A0A1S3CBJ5 uncharacterized protein LOC1034989324.4e-4580.8Show/hide
Query:  MANEEG---NSNNNPSTNN-LRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKG---------SADSEYTEFGAGDLVT
        MANEEG   N++NNPSTNN L+IIVERNPSQAKLSQLNI  WPKWGCSAGKYQLKFEAEETCYLVKGKVKAY KG         S   EY EFGAGDLV 
Subjt:  MANEEG---NSNNNPSTNN-LRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKG---------SADSEYTEFGAGDLVT

Query:  IPKGLSCTWDVSVAVDKFYKFESQS
        IPKGLSCTWDVSVAVDKFYKFESQS
Subjt:  IPKGLSCTWDVSVAVDKFYKFESQS

A0A2P5AKT6 RmlC-like cupins superfamily protein3.5e-4275.68Show/hide
Query:  ANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVA
        ++   +S ++ S  +LRIIVE+NPS+AKLS+LNI+CWPKWGCS GKYQLKF+AEETCYL+KGKVKAY KGS+ SE+ EFGAGDLVTIPKGLSCTWDVSVA
Subjt:  ANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVA

Query:  VDKFYKFESQS
        VDK+YKFES S
Subjt:  VDKFYKFESQS

A0A2P5C0K2 RmlC-like cupins superfamily protein7.1e-4380Show/hide
Query:  SNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYK
        SN++ S  +LRIIVE+NPS+A+LS+LNI+CWPKWGCS GKYQLKF+AEETCYL+KGKVKAY KGS+ SE+ EFGAGDLVTIPKGLSCTWDVSVAVDK+YK
Subjt:  SNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYK

Query:  FESQS
        FES S
Subjt:  FESQS

A0A6J1H6X6 uncharacterized protein LOC1114609982.9e-5294.64Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRIIVERNPSQAKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

A0A6J1KU03 uncharacterized protein LOC1114981886.4e-5293.75Show/hide
Query:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV
        MANEEGNS NNPSTNNLRIIVERNPS+AKLS LNIQ WPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSA++EYTEFGAGDLVTIPKGLSCTWDVSV
Subjt:  MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSV

Query:  AVDKFYKFESQS
        AVDKFYKFES S
Subjt:  AVDKFYKFESQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32180.1 plastid transcriptionally active 186.9e-0634.72Show/hide
Query:  VERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGL
        VER  SQ +L +L +  W  W     K    ++ ++  Y+ +G+V+   +GS    Y +F AGDLV  PK L
Subjt:  VERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGL

AT2G32650.1 RmlC-like cupins superfamily protein6.9e-0634.72Show/hide
Query:  VERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGL
        VER  SQ +L +L +  W  W     K    ++ ++  Y+ +G+V+   +GS    Y +F AGDLV  PK L
Subjt:  VERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGL

AT3G04300.1 RmlC-like cupins superfamily protein3.0e-3366.67Show/hide
Query:  LRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYKFE
        + I++E NPS  +LS L +  WPKW C  GKY L FE  ETCYLVKGKVK Y KGS  SE+ EFGAGDLVTIPKGLSCTWDVS+ +DK YKF+
Subjt:  LRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYKFE

AT4G10300.1 RmlC-like cupins superfamily protein3.7e-2856.7Show/hide
Query:  STNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYKFE
        ST  L I +E+NP ++KL+QL ++ WPKWGC   K+   + A+ETCYL++GKVK Y  GS   E  E  AGD V  PKG+SCTWDVSVAVDK Y+FE
Subjt:  STNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYKFE

AT4G28703.1 RmlC-like cupins superfamily protein2.5e-3263.73Show/hide
Query:  NLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAK----GSADSEY---TEFGAGDLVTIPKGLSCTWDVSVAVDKFYKF
        N RIIVE+NPSQA+L +L  + WPKWGCS GKY LK+EAEE CY+++GKVK Y K     S+D+E     EFGAGD+VT PKGLSCTWDVS++VDK Y F
Subjt:  NLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAK----GSADSEY---TEFGAGDLVTIPKGLSCTWDVSVAVDKFYKF

Query:  ES
         S
Subjt:  ES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAATGAAGAAGGAAATAGTAACAATAATCCTTCAACAAACAACCTCAGAATCATTGTAGAAAGGAACCCTTCTCAAGCCAAACTTTCTCAGCTCAACATCCAGTG
CTGGCCAAAATGGGGTTGTTCGGCTGGGAAATACCAGCTGAAATTTGAGGCAGAAGAGACATGCTATTTAGTGAAGGGAAAAGTGAAGGCTTATGCAAAAGGATCGGCTG
ATTCTGAATACACAGAGTTTGGGGCTGGCGATTTGGTCACTATTCCTAAAGGCCTTAGCTGCACTTGGGATGTTTCTGTTGCTGTTGATAAGTTCTATAAATTTGAATCT
CAATCTTAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAACAAACCCAGAGAAAATATGGCCAATGAAGAAGGAAATAGTAACAATAATCCTTCAACAAACAACCTCAGAATCATTGTAGAAAGGAACCCTTCTCAAGCCAA
ACTTTCTCAGCTCAACATCCAGTGCTGGCCAAAATGGGGTTGTTCGGCTGGGAAATACCAGCTGAAATTTGAGGCAGAAGAGACATGCTATTTAGTGAAGGGAAAAGTGA
AGGCTTATGCAAAAGGATCGGCTGATTCTGAATACACAGAGTTTGGGGCTGGCGATTTGGTCACTATTCCTAAAGGCCTTAGCTGCACTTGGGATGTTTCTGTTGCTGTT
GATAAGTTCTATAAATTTGAATCTCAATCTTAATTTATTTATTATTAACCCCCTCCCCCCTAATTTACCTCCAATTTCATTCTTTCCCCTCCTTTTTGTTCTAATTTCTA
TTTCCCCCCACTGTAATTAAAATATATATTCTTCTTCTCACATTTCTTTCATATGCATCTTTCTTATGGGGTTCATGTTTTTTTTTCCTCTCTCTCTTTAGAAAATGGGA
GATAAATTATAGGGTGTACAAAAAAAGTGGTTTTGATCTAATCCATTTTTGGGTATGGGTTAATTAAATTTTGATTTAGTTACCTGACTTTTTAGGATTGTATATCAACT
TTTAAATTTGTGTGTAATAAGTTTCCGAATTTAAAGAATGGTTAGGGAATTTTTGGAATTTTAATTTTATATTTAATAAGTGTTTTTTAAATTTAGAGATTTATCAAACA
CAAAATAGATAATTTAAAGACAATTTTTCTTTTTAAGTTGAAAGGTATGAG
Protein sequenceShow/hide protein sequence
MANEEGNSNNNPSTNNLRIIVERNPSQAKLSQLNIQCWPKWGCSAGKYQLKFEAEETCYLVKGKVKAYAKGSADSEYTEFGAGDLVTIPKGLSCTWDVSVAVDKFYKFES
QS