; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012444 (gene) of Snake gourd v1 genome

Gene IDTan0012444
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationLG04:57061125..57062514
RNA-Seq ExpressionTan0012444
SyntenyTan0012444
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607098.1 hypothetical protein SDJN03_00440, partial [Cucurbita argyrosperma subsp. sororia]2.1e-5166.49Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEPTIP T  QS+ P  SEQ  E SGAG+T A          TPDREKK +G+++ME SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG             +  QR   GGI LPKNGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

XP_022948389.1 uncharacterized protein LOC111452080 [Cucurbita moschata]2.5e-5267.02Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEPTIP TP QS+ P  SEQ  E SGAG+T A          TPDREKK +G+++ME SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG             +  QR   GGI LPKNGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

XP_022998048.1 uncharacterized protein LOC111492813 [Cucurbita maxima]1.6e-5165.97Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEP IP TP QS+ P  SEQ  E SGAG+T A          TPDREKK + +N+ME SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG+          +KQ       GGI  PKNGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

XP_023525350.1 uncharacterized protein LOC111788976 [Cucurbita pepo subsp. pepo]3.3e-5267.02Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEPTIP TP QS+ P  SEQ  E SGAG+T A          TPDREKK +GEN+ E SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG             +  QR   GGI LP+NGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

XP_038881324.1 uncharacterized protein LOC120072868 [Benincasa hispida]7.8e-4655.87Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAG--QTAATPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLE
        M  +T+PTIP++PI       S+Q  +I G+G  QTAATP  E     + ++E SK ECKTP P+E+TEEKQRILV +NGS+D   VPKSP   GKLNLE
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAG--QTAATPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLE

Query:  CKTPTPIKQT---------------------------------GQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVP
        CKTP+P ++T                                  QRV  GGIE PKNGTPNRLK+P AFKYPERY SPTDLM+SPISKG+LARTRKGAVP
Subjt:  CKTPTPIKQT---------------------------------GQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVP

Query:  SKMHELRVPEMSL
        SKMHELR PE+SL
Subjt:  SKMHELRVPEMSL

TrEMBL top hitse value%identityAlignment
A0A1S3CI15 uncharacterized protein LOC1035011891.2e-4467.27Show/hide
Query:  EISGAGQTAATPDR--EKKSRGENEMEDSK---------SECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFG
        EIS +     TPD   E+K R     ++SK          ECKTP PDE+T++K+RILVPKNGSMD  KVPKSP   GK+NLECKTPT      QRV  G
Subjt:  EISGAGQTAATPDR--EKKSRGENEMEDSK---------SECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFG

Query:  GIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        GIELPKNGTPNRLK+P AFKYPERYKSPTDLM+SPISKGLLARTRKGAVPSKMHELR  EMSL S
Subjt:  GIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

A0A5A7UZK0 Uncharacterized protein1.2e-4467.27Show/hide
Query:  EISGAGQTAATPDR--EKKSRGENEMEDSK---------SECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFG
        EIS +     TPD   E+K R     ++SK          ECKTP PDE+T++K+RILVPKNGSMD  KVPKSP   GK+NLECKTPT      QRV  G
Subjt:  EISGAGQTAATPDR--EKKSRGENEMEDSK---------SECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFG

Query:  GIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        GIELPKNGTPNRLK+P AFKYPERYKSPTDLM+SPISKGLLARTRKGAVPSKMHELR  EMSL S
Subjt:  GIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

A0A6J1DLV5 uncharacterized protein LOC1110221708.7e-4356.25Show/hide
Query:  EPTIPQTPIQSESP------ARSEQNPEISG----------------------AGQTAATPD----REKKSRGENEMEDSKSECKTPIPDERTE---EKQ
        + +IP +P QSE          S+Q PEISG                      AG+T ATP+    +++K+R +N+ E S SE KT  P E+ E   EK+
Subjt:  EPTIPQTPIQSESP------ARSEQNPEISG----------------------AGQTAATPD----REKKSRGENEMEDSKSECKTPIPDERTE---EKQ

Query:  RILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHE
        RILVPKNGSMD LKVPKSPN AGK                 V F GIELPKNGTPNRLKVPKAFKYPERY SPTDLMMSPISKGLLARTRKGAVPSKMHE
Subjt:  RILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHE

Query:  LRVPEMSL
        LR+ EMSL
Subjt:  LRVPEMSL

A0A6J1G9R7 uncharacterized protein LOC1114520801.2e-5267.02Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEPTIP TP QS+ P  SEQ  E SGAG+T A          TPDREKK +G+++ME SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG             +  QR   GGI LPKNGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

A0A6J1KFQ1 uncharacterized protein LOC1114928137.9e-5265.97Show/hide
Query:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP
        M   TEP IP TP QS+ P  SEQ  E SGAG+T A          TPDREKK + +N+ME SK ECKTP PDE+TEEKQRILVP NG  MD LKVPKSP
Subjt:  MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAA----------TPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNG-SMDGLKVPKSP

Query:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS
        NVAG+          +KQ       GGI  PKNGTPNRLKVPKAFKY ERY SPTDLMMSPI+KGLLARTRKGAVPSKMHELR+ EMSL S
Subjt:  NVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G02120.1 hydroxyproline-rich glycoprotein family protein1.7e-1439.72Show/hide
Query:  MEDSKSECKTPIPDERTEEKQRILVPKNGSMDGLKVPKS--PNVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLM
        ME  +S  +TP+   +T+   R +   N        P+S  P    + + +  +P P   T +        + K GTP RL+VP AFKYPERY+SPTD M
Subjt:  MEDSKSECKTPIPDERTEEKQRILVPKNGSMDGLKVPKS--PNVAGKLNLECKTPTPIKQTGQRVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLM

Query:  MSPISKGLLARTRKGA---VP-----SKMHELRVPEMSLQS
        MSP++KGLLARTRK +   +P     +K+ ELR PE  L S
Subjt:  MSPISKGLLARTRKGA---VP-----SKMHELRVPEMSLQS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGACTGTAACTGAACCAACCATTCCCCAAACCCCAATTCAGTCGGAATCTCCTGCACGTTCCGAACAAAACCCTGAAATCTCCGGCGCCGGACAAACGGCGGCGAC
CCCAGATCGAGAGAAGAAGAGCAGAGGCGAAAACGAAATGGAAGATTCAAAATCCGAGTGTAAAACGCCAATCCCAGATGAGAGAACAGAGGAAAAACAGAGGATTTTGG
TCCCTAAAAATGGATCGATGGATGGATTGAAGGTGCCGAAGAGTCCGAATGTGGCTGGAAAATTGAATTTGGAGTGCAAAACTCCGACCCCAATTAAACAAACAGGGCAA
AGGGTGAATTTTGGAGGCATTGAATTGCCTAAAAATGGTACACCAAATCGGCTGAAAGTGCCCAAAGCATTCAAATATCCTGAAAGGTATAAAAGTCCCACTGATTTGAT
GATGTCTCCTATCAGCAAAGGCCTTCTTGCCAGAACCAGGAAAGGGGCTGTGCCTTCCAAGATGCATGAGTTGAGAGTTCCAGAGATGAGTCTTCAAAGCTGA
mRNA sequenceShow/hide mRNA sequence
CTCAAAACTCAAATTCCCACAGAATCTCCCACTCTTATTCTTCATTCAATTCCACCATTTTCAAACTTAAATTTTCTTTTAAACCTAATCCAATCGCGCATCGATTCCAA
TGGGGACTGTAACTGAACCAACCATTCCCCAAACCCCAATTCAGTCGGAATCTCCTGCACGTTCCGAACAAAACCCTGAAATCTCCGGCGCCGGACAAACGGCGGCGACC
CCAGATCGAGAGAAGAAGAGCAGAGGCGAAAACGAAATGGAAGATTCAAAATCCGAGTGTAAAACGCCAATCCCAGATGAGAGAACAGAGGAAAAACAGAGGATTTTGGT
CCCTAAAAATGGATCGATGGATGGATTGAAGGTGCCGAAGAGTCCGAATGTGGCTGGAAAATTGAATTTGGAGTGCAAAACTCCGACCCCAATTAAACAAACAGGGCAAA
GGGTGAATTTTGGAGGCATTGAATTGCCTAAAAATGGTACACCAAATCGGCTGAAAGTGCCCAAAGCATTCAAATATCCTGAAAGGTATAAAAGTCCCACTGATTTGATG
ATGTCTCCTATCAGCAAAGGCCTTCTTGCCAGAACCAGGAAAGGGGCTGTGCCTTCCAAGATGCATGAGTTGAGAGTTCCAGAGATGAGTCTTCAAAGCTGAATGAATGA
GTTCTGTTTTTCCTCTTCAAGAATTTGCCTCACTCAGATTGTGATGTAAAAATCAGTTCTACTTCTTAGAATATTCTTCTATTTCCAAGGGTTTTTTTTAAAAAAAAAAT
TATTTTCATTGGTTTTATTATTTTCAATGTGTAGTTGTTGGTGAGTCATGTAAGATATATTGAGGGACAAGTTTTCTCAATTGCTTTGAACAACATATATACATTTGTAA
TTGTAACCATAAAAGTTGCTAGTAATGCAGATGGTTTGATGAACTTTTGGACCATTTGAAC
Protein sequenceShow/hide protein sequence
MGTVTEPTIPQTPIQSESPARSEQNPEISGAGQTAATPDREKKSRGENEMEDSKSECKTPIPDERTEEKQRILVPKNGSMDGLKVPKSPNVAGKLNLECKTPTPIKQTGQ
RVNFGGIELPKNGTPNRLKVPKAFKYPERYKSPTDLMMSPISKGLLARTRKGAVPSKMHELRVPEMSLQS