; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015351 (gene) of Snake gourd v1 genome

Gene IDTan0015351
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionhydroxyproline-rich glycoprotein family protein
Genome locationLG01:103786138..103788267
RNA-Seq ExpressionTan0015351
SyntenyTan0015351
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR037699 - Uncharacterized protein At5g65660-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591711.1 hypothetical protein SDJN03_14057, partial [Cucurbita argyrosperma subsp. sororia]3.6e-5684.5Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS+LGCPDH DHR P +PPET A SPP+KFSPIHTIWKENRPESL+VLMPGD+VPRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAVVEIVVQKP-QSIPDS
        IAMACPPC  AA +VEIV+QKP QSI ++
Subjt:  IAMACPPCAAAAAVVEIVVQKP-QSIPDS

XP_008466113.1 PREDICTED: uncharacterized protein At5g65660-like [Cucumis melo]3.6e-5684.33Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP
        MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDHL H  PP+PP    AA SPP+KFSPIHTIWKENRP+S+SVLMPGD+VP
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP

Query:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL
        RFIA+ACPPCA AAAA+VEIVVQKP QSI DSSL
Subjt:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL

XP_022936981.1 uncharacterized protein At5g65660-like [Cucurbita moschata]6.2e-5686.51Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS+LGCPDH DHR P +PPET A SPP+KFSPIHTIWKENRPESL+VLMPGD+VPRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAVVEIVVQKP-QSI
        IAMACPPC  AA +VEIV+QKP QSI
Subjt:  IAMACPPCAAAAAVVEIVVQKP-QSI

XP_023536259.1 uncharacterized protein At5g65660-like [Cucurbita pepo subsp. pepo]3.1e-5585.71Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS+LGCPDH DHR P +PPET A SPP+KFSPIHTIWKENRP SL+VLMPGD+VPRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAVVEIVVQKP-QSI
        IAMACPPC  AA +VEIV+QKP QSI
Subjt:  IAMACPPCAAAAAVVEIVVQKP-QSI

XP_038899315.1 uncharacterized protein At5g65660-like [Benincasa hispida]1.9e-5786.36Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHT-IWKENRPESLSVLMPGDDVPR
        MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS LGCPDHL H  PP+PP+TAA SPP+K SPIHT IW+ENRP+SLSVLMPGD+VPR
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHT-IWKENRPESLSVLMPGDDVPR

Query:  FIAMACPPCAAAAAVVEIVVQKP-QSIPDSSL
        FIAMACPPC AAAA+VEIVVQKP QS PDSSL
Subjt:  FIAMACPPCAAAAAVVEIVVQKP-QSIPDSSL

TrEMBL top hitse value%identityAlignment
A0A1S3CQS4 uncharacterized protein At5g65660-like1.8e-5684.33Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP
        MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDHL H  PP+PP    AA SPP+KFSPIHTIWKENRP+S+SVLMPGD+VP
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP

Query:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL
        RFIA+ACPPCA AAAA+VEIVVQKP QSI DSSL
Subjt:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL

A0A5D3E545 Uncharacterized protein1.8e-5684.33Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP
        MEYD+REAQDPASLSFPVGLVLLLTFLFCMCCFF CCLHWEKLRS LGCPDHL H  PP+PP    AA SPP+KFSPIHTIWKENRP+S+SVLMPGD+VP
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPP--ETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVP

Query:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL
        RFIA+ACPPCA AAAA+VEIVVQKP QSI DSSL
Subjt:  RFIAMACPPCA-AAAAVVEIVVQKP-QSIPDSSL

A0A6J1CGJ4 uncharacterized protein At5g65660-like2.8e-4672.26Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        M+YDDREAQDPASLSFPVGLVLLLTF FCMCCFFSCCLHW+KLRS LGCPDH     PP+PPE AASSPP+K  P+HT    N+ +SL V+MPGD+ PRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAV---VEIVVQK-PQSIPDSSLELS
        IAMACPPCAAA A    V+++VQK PQS PDSS  LS
Subjt:  IAMACPPCAAAAAV---VEIVVQK-PQSIPDSSLELS

A0A6J1F906 uncharacterized protein At5g65660-like3.0e-5686.51Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS+LGCPDH DHR P +PPET A SPP+KFSPIHTIWKENRPESL+VLMPGD+VPRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAVVEIVVQKP-QSI
        IAMACPPC  AA +VEIV+QKP QSI
Subjt:  IAMACPPCAAAAAVVEIVVQKP-QSI

A0A6J1IEI5 uncharacterized protein At5g65660-like2.5e-5585.71Show/hide
Query:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF
        MEYDDREAQD ASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRS+LGCPDH DH  P +PPET A SPP+KFSPIHTIWKENRPESL+VLMPGD+VPRF
Subjt:  MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRF

Query:  IAMACPPCAAAAAVVEIVVQKP-QSI
        IAMACPPC  AA +VEIV+QKP QSI
Subjt:  IAMACPPCAAAAAVVEIVVQKP-QSI

SwissProt top hitse value%identityAlignment
Q9LSK9 Uncharacterized protein At5g656609.1e-1039.47Show/hide
Query:  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAA
        SL FP+G  LLL  +F +   FSCC HW+K RSL      L + RP    E    S P K  P     K+ +  S+ VLMPGD+ P+FIA+ CP      
Subjt:  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAA

Query:  AVVEIVVQKPQSIP
          + + VQ P   P
Subjt:  AVVEIVVQKPQSIP

Arabidopsis top hitse value%identityAlignment
AT4G28170.1 unknown protein5.9e-0443.75Show/hide
Query:  SPP--EKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAAA
        SPP  +KF P  +   +     +SVLMPG+DVP FIA  CPP +++++
Subjt:  SPP--EKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAAA

AT5G65660.1 hydroxyproline-rich glycoprotein family protein6.5e-1139.47Show/hide
Query:  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAA
        SL FP+G  LLL  +F +   FSCC HW+K RSL      L + RP    E    S P K  P     K+ +  S+ VLMPGD+ P+FIA+ CP      
Subjt:  SLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAAAA

Query:  AVVEIVVQKPQSIP
          + + VQ P   P
Subjt:  AVVEIVVQKPQSIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTATGACGACCGCGAAGCACAGGATCCGGCCTCTCTAAGCTTTCCAGTTGGTTTGGTTCTTTTGTTGACATTTTTGTTCTGCATGTGTTGTTTTTTTTCTTGTTG
TCTTCACTGGGAAAAGCTCCGATCATTGCTCGGCTGTCCCGATCATCTCGACCACCGCCGTCCTCCTCTTCCGCCGGAAACAGCCGCCTCCTCGCCGCCCGAGAAATTTT
CGCCAATCCATACGATATGGAAGGAGAATCGACCGGAAAGCCTGTCAGTGTTGATGCCCGGCGACGATGTTCCGAGATTTATAGCAATGGCATGTCCGCCGTGTGCGGCG
GCGGCGGCGGTTGTGGAAATTGTTGTACAGAAACCTCAGAGTATTCCAGATTCTTCTCTGGAGCTCAGTCTTTAA
mRNA sequenceShow/hide mRNA sequence
CATTAACCCCTAAGTCTATAAAAATAAAAATAATAATGAATAAATAAAATAATACACCAATTAATATTTTTGTCCATAAATTTAATTGTCGCCACTCGTCGCTATTAAAC
GAATTCTGCAGAACTTTGCTGGGAGAAAGCTCGAGAGCTACAAAAATGGAGTATGACGACCGCGAAGCACAGGATCCGGCCTCTCTAAGCTTTCCAGTTGGTTTGGTTCT
TTTGTTGACATTTTTGTTCTGCATGTGTTGTTTTTTTTCTTGTTGTCTTCACTGGGAAAAGCTCCGATCATTGCTCGGCTGTCCCGATCATCTCGACCACCGCCGTCCTC
CTCTTCCGCCGGAAACAGCCGCCTCCTCGCCGCCCGAGAAATTTTCGCCAATCCATACGATATGGAAGGAGAATCGACCGGAAAGCCTGTCAGTGTTGATGCCCGGCGAC
GATGTTCCGAGATTTATAGCAATGGCATGTCCGCCGTGTGCGGCGGCGGCGGCGGTTGTGGAAATTGTTGTACAGAAACCTCAGAGTATTCCAGATTCTTCTCTGGAGCT
CAGTCTTTAAGTTAATGCAGTTTTTGTTCGCTTCTGTGCGTAAATATATATATTTAAATTCAAGTGAATTTGAATTCGATTATTCCTTCGGGATTTTCTAGCA
Protein sequenceShow/hide protein sequence
MEYDDREAQDPASLSFPVGLVLLLTFLFCMCCFFSCCLHWEKLRSLLGCPDHLDHRRPPLPPETAASSPPEKFSPIHTIWKENRPESLSVLMPGDDVPRFIAMACPPCAA
AAAVVEIVVQKPQSIPDSSLELSL