CuGenDBv2

Tan0019540 (gene) of Snake gourd v1 genome

Gene ID: Tan0019540
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Hydroxyproline-rich glycoprotein family protein
Genome location: LG04:24161229..24161857
RNA-Seq Expression: Tan0019540
Synteny: Tan0019540
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
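The genome location field can be split into its parts with a few lines of Python. This is a minimal sketch, not part of the database record: the `Location` type and `parse_location` helper are hypothetical names, and the length calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; hypothetical helper type for illustration."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'SEQID:START..END' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG04:24161229..24161857")
print(loc.seqid, loc.start, loc.end, loc.length)
```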


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600825.1 hypothetical protein SDJN03_06058, partial [Cucurbita argyrosperma subsp. sororia] (e value: 6.4e-34; % identity: 68.25)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M IQTRP YA  I   F +LSINVV  HG + S+KLDE  G +D  +KCTPCT    PPPPPPPPKKPS AYCPPPP PPSSFIY+LGPPGNLYPID+DF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASA-RSRMVVELPVVALLGLIGFIAL
        A A R R+VVEL  VAL GLIGF+ +
Subjt:  ASA-RSRMVVELPVVALLGLIGFIAL

KAG7031461.1 hypothetical protein SDJN02_05501, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.2e-33; % identity: 67.46)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M IQTRP YA  I     +LSINVV  HG + S+KLDE  G +D  +KCTPCT    PPPPPPPPKKPS AYCPPPP PPSSFIY+LGPPGNLYPID+DF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASA-RSRMVVELPVVALLGLIGFIAL
        A A R R+VVEL  VAL GLIGF+ +
Subjt:  ASA-RSRMVVELPVVALLGLIGFIAL

KGN50831.1 hypothetical protein Csa_004723 [Cucumis sativus] (e value: 1.2e-32; % identity: 69.05)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M IQ  P Y+ +IL  FLILSIN+  IHG I S+KLDEP   +DSS+KCTPCTRY   PPPPPPPKKP   YCPPPP PPSSFIY+LGPP NLYPI+ DF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASARSRMV-VELPVVALLGLIGFIAL
        ASA  R V +ELPVVA  GLIG IAL
Subjt:  ASARSRMV-VELPVVALLGLIGFIAL

XP_022155381.1 acrosin-like [Momordica charantia] (e value: 2.1e-37; % identity: 73.81)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M+IQT+      IL L LILSINVV IHGLI S KLD   G  DS++KCTPCTRY PPPPPPPPPKKP S+YCPPPP PPSSFIY+LGPPGNLYPI QDF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASARSRMVVELPVVALLGLIGFIALW
        A  R R+ VELPVVALLGL+GFIA+W
Subjt:  ASARSRMVVELPVVALLGLIGFIALW

XP_023546806.1 formin-like protein 20 [Cucurbita pepo subsp. pepo] (e value: 1.6e-32; % identity: 67.72)
Query:  MFIQTRPCYAPMILILFL-ILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQTRP YA  I   F+  LSINVV  HG + S+KLDE  G +D S+KCTPCT    PPPPPPPPKKPS AYCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQTRPCYAPMILILFL-ILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FASA-RSRMVVELPVVALLGLIGFIAL
        FA A R R+ VEL  VAL GLIGF+ +
Subjt:  FASA-RSRMVVELPVVALLGLIGFIAL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KSV7 Uncharacterized protein (e value: 5.8e-33; % identity: 69.05)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M IQ  P Y+ +IL  FLILSIN+  IHG I S+KLDEP   +DSS+KCTPCTRY   PPPPPPPKKP   YCPPPP PPSSFIY+LGPP NLYPI+ DF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASARSRMV-VELPVVALLGLIGFIAL
        ASA  R V +ELPVVA  GLIG IAL
Subjt:  ASARSRMV-VELPVVALLGLIGFIAL

A0A4S4DUT5 Uncharacterized protein (e value: 3.1e-18; % identity: 50.38)
Query:  TRPCYAP-MILILFLILSINVVAIHGLIPSEKLDEP--PGSNDSSIKCT----PCTRYPPPPPPPPP--PKKPSSAYCPPPPSPPSSFIYVLGPPGNLYP
        T P + P  +L++F +++     IHGLIP  KLD+    G  DS IKCT    PC + PPPPPPPPP  PK P + YCPPPPSP  SFIYV G PGNLYP
Subjt:  TRPCYAP-MILILFLILSINVVAIHGLIPSEKLDEP--PGSNDSSIKCT----PCTRYPPPPPPPPP--PKKPSSAYCPPPPSPPSSFIYVLGPPGNLYP

Query:  IDQDFASARSRMVVELPVVALLGLIGFIALW
        ID  F+SA     V +P++    L+G +A W
Subjt:  IDQDFASARSRMVVELPVVALLGLIGFIALW

A0A6J1DRJ1 acrosin-like (e value: 1.0e-37; % identity: 73.81)
Query:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        M+IQT+      IL L LILSINVV IHGLI S KLD   G  DS++KCTPCTRY PPPPPPPPPKKP S+YCPPPP PPSSFIY+LGPPGNLYPI QDF
Subjt:  MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASARSRMVVELPVVALLGLIGFIALW
        A  R R+ VELPVVALLGL+GFIA+W
Subjt:  ASARSRMVVELPVVALLGLIGFIALW

A0A6J1FVH5 probable glycosyltransferase 4 (e value: 9.9e-33; % identity: 67.72)
Query:  MFIQTRPCYAPMIL-ILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQD
        M IQTRP YA  IL   F  LS NVV  HG + S+KLDE  G +D S+KCTPCT    PPPPPPPPKKPS AYCPPPP PPSSFIY+LGPPGNLYPID+D
Subjt:  MFIQTRPCYAPMIL-ILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQD

Query:  FASA-RSRMVVELPVVALLGLIGFIAL
        FA A R R+ VEL  VAL GLIGF+ +
Subjt:  FASA-RSRMVVELPVVALLGLIGFIAL

A5B8N9 Uncharacterized protein (e value: 6.9e-18; % identity: 48.8)
Query:  ILILFLILSINVVAIHGLIPSEKLDEPP--GSNDSSIKC-------TPCTRYPPPPP---PPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF
        +L +FL+++     ++G   S KLDE P  GS D+ +KC        PC + PPPPP   PPPPPKKP + YCPPPP PP+SF+YV GPPG LYPIDQD+
Subjt:  ILILFLILSINVVAIHGLIPSEKLDEPP--GSNDSSIKC-------TPCTRYPPPPP---PPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF

Query:  ASARSRMVVELPVVALLGLIGFIAL
          A    +V LP++A  GL+  + L
Subjt:  ASARSRMVVELPVVALLGLIGFIAL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G02405.1 proline-rich family protein (e value: 6.6e-05; % identity: 36.73)
Query:  IPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKK---PSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVELPVVALLGLIGFIAL
        IP  +   PP  +     CTP     PPPP PPPPKK   P S   PPPP PP ++++   PPG+LYPI+  + +A +     +  + + G++ F+ L
Subjt:  IPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKK---PSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVELPVVALLGLIGFIAL

AT1G23040.1 hydroxyproline-rich glycoprotein family protein (e value: 4.3e-12; % identity: 42.11)
Query:  IPSEKLDEPPGSNDSSIKCTP-CTRYPPPPPP----------------PPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF--ASARSRMVVEL
        + + KLDE        IKC+P C + PPPP P                PPPP K  S+YCPPP  PP++F+Y+ GPPGNLYP+D+ F  A+ +S MVV+L
Subjt:  IPSEKLDEPPGSNDSSIKCTP-CTRYPPPPPP----------------PPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDF--ASARSRMVVEL

Query:  PVVALLGLIGFIAL
          +   G++ F+ L
Subjt:  PVVALLGLIGFIAL

AT1G70990.1 proline-rich family protein (e value: 3.3e-12; % identity: 46.08)
Query:  SEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPP------------PSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVELPVVALLGLI
        + KL+E P      IKCTPC +  PPP PPPP   P S  CPPP            P PPS++IY+ GPPG LYPIDQ F +A ++      VV + GLI
Subjt:  SEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPP------------PSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVELPVVALLGLI

Query:  GF
         F
Subjt:  GF


Sequences
CDS sequence
ATGTTCATCCAAACGCGGCCTTGTTATGCTCCAATGATCCTCATCTTGTTCTTGATTCTTTCAATCAATGTGGTAGCCATCCATGGCTTGATCCCTTCCGAGAAGCTCGA
CGAGCCACCCGGCAGCAACGATTCGAGCATCAAGTGTACACCGTGCACCCGTTATCCACCACCGCCTCCTCCACCACCTCCACCGAAGAAACCATCATCGGCGTACTGCC
CTCCGCCTCCATCTCCTCCGTCTTCTTTCATATATGTACTTGGCCCGCCGGGAAACTTGTATCCCATTGACCAGGACTTCGCCAGTGCCCGGAGTAGGATGGTTGTGGAG
CTGCCGGTGGTTGCTCTCTTGGGGTTGATTGGTTTTATTGCTTTGTGGCGATTTTAA
mRNA sequence
TTTTAGTCTTCAAATTTTGATTCAATATGTTCATCCAAACGCGGCCTTGTTATGCTCCAATGATCCTCATCTTGTTCTTGATTCTTTCAATCAATGTGGTAGCCATCCAT
GGCTTGATCCCTTCCGAGAAGCTCGACGAGCCACCCGGCAGCAACGATTCGAGCATCAAGTGTACACCGTGCACCCGTTATCCACCACCGCCTCCTCCACCACCTCCACC
GAAGAAACCATCATCGGCGTACTGCCCTCCGCCTCCATCTCCTCCGTCTTCTTTCATATATGTACTTGGCCCGCCGGGAAACTTGTATCCCATTGACCAGGACTTCGCCA
GTGCCCGGAGTAGGATGGTTGTGGAGCTGCCGGTGGTTGCTCTCTTGGGGTTGATTGGTTTTATTGCTTTGTGGCGATTTTAATGGAGTTTGAGTGATGGAAAGTTTAAT
GAATTGACGAAGATAATAAACTCGAAATGAAGGTGGTAGGTTCGAATCTTCTCTTATTATTATTTTTTTTTTTTTTTGTATTCTTGGTATTATTGACTAAAAGTTTTTTA
ATGATGAGTTTTTAGTCATGGTGTAATTCATCACATTAGGAAAATCTAATAAATGTATATAAATTTGGATTCTGGATTA
Protein sequence
MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVE
LPVVALLGLIGFIALWRF
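
The relationship between the CDS and protein sequences listed above can be checked with a short translation script. This is a rough sketch that assumes the standard genetic code (the record does not name a translation table); `translate`, `CDS`, and `PROTEIN` are illustrative names, and the sequences are copied from this page.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: do not include it in the protein
            break
        residues.append(aa)
    return "".join(residues)


# CDS and protein sequences as listed in the record for Tan0019540.
CDS = (
    "ATGTTCATCCAAACGCGGCCTTGTTATGCTCCAATGATCCTCATCTTGTTCTTGATTCTTTCAATCAATGTGGTAGCCATCCATGGCTTGATCCCTTCCGAGAAGCTCGA"
    "CGAGCCACCCGGCAGCAACGATTCGAGCATCAAGTGTACACCGTGCACCCGTTATCCACCACCGCCTCCTCCACCACCTCCACCGAAGAAACCATCATCGGCGTACTGCC"
    "CTCCGCCTCCATCTCCTCCGTCTTCTTTCATATATGTACTTGGCCCGCCGGGAAACTTGTATCCCATTGACCAGGACTTCGCCAGTGCCCGGAGTAGGATGGTTGTGGAG"
    "CTGCCGGTGGTTGCTCTCTTGGGGTTGATTGGTTTTATTGCTTTGTGGCGATTTTAA"
)
PROTEIN = (
    "MFIQTRPCYAPMILILFLILSINVVAIHGLIPSEKLDEPPGSNDSSIKCTPCTRYPPPPPPPPPPKKPSSAYCPPPPSPPSSFIYVLGPPGNLYPIDQDFASARSRMVVE"
    "LPVVALLGLIGFIALWRF"
)

predicted = translate(CDS)
print(len(predicted), predicted == PROTEIN)
```

If the sequences were copied without error, the translated CDS should reproduce the listed protein; a mismatch would point to a copy error or a different translation table.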