CuGenDBv2

Tan0017806 (gene) of Snake gourd v1 genome

Gene ID: Tan0017806
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Arabinogalactan protein 22, putative
Genome location: LG08:1491075..1493223
RNA-Seq Expression: Tan0017806
Synteny: Tan0017806
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009424 - Arabinogalactan protein 16/20/22/41


Homology

GenBank top hits
KAA0061559.1 putative Arabinogalactan protein 22 [Cucumis melo var. makuwa] | e value: 2.5e-18 | % identity: 79.37
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        +S +KLTA P VGF+FLI+LQLA+GHSHDISPA  PSNDGAAIDQGIAYVLLL+ALA+TYI+H
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

KAB2027289.1 hypothetical protein ES319_D05G024500v1 [Gossypium barbadense] | e value: 1.4e-10 | % identity: 62.3
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG+AIDQGIAY+LLL+ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

KAB2079758.1 hypothetical protein ES319_A05G024400v1 [Gossypium barbadense] | e value: 1.9e-10 | % identity: 62.3
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG AIDQGIAY+LLL+ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

KGN60622.1 hypothetical protein Csa_019490 [Cucumis sativus] | e value: 1.7e-19 | % identity: 85.71
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        +S +KLTA PIVGF+FLI+LQLA+GHSHDISPAA PSNDGAAIDQGIAYVLLL+ALAVTYIVH
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

KJB54180.1 hypothetical protein B456_009G024200 [Gossypium raimondii] | e value: 3.2e-10 | % identity: 60.66
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG+AIDQGIAY+LL++ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

TrEMBL top hits
A0A0A0LIU0 Uncharacterized protein | e value: 8.2e-20 | % identity: 85.71
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        +S +KLTA PIVGF+FLI+LQLA+GHSHDISPAA PSNDGAAIDQGIAYVLLL+ALAVTYIVH
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

A0A5A7V7C0 Putative Arabinogalactan protein 22 | e value: 1.2e-18 | % identity: 79.37
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        +S +KLTA P VGF+FLI+LQLA+GHSHDISPA  PSNDGAAIDQGIAYVLLL+ALA+TYI+H
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

A0A5D2CEW7 Uncharacterized protein | e value: 7.0e-11 | % identity: 62.3
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG+AIDQGIAY+LLL+ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

A0A5J5RBR4 Uncharacterized protein | e value: 7.0e-11 | % identity: 62.3
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG+AIDQGIAY+LLL+ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

A0A7J9JNB2 Uncharacterized protein | e value: 7.0e-11 | % identity: 62.3
Query:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M KL     +GF+FL L QL+YG S   SPA  PSNDG+AIDQGIAY+LLL+ALA+TY++H
Subjt:  MMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

SwissProt top hits
O82337 Arabinogalactan protein 16 | e value: 1.2e-07 | % identity: 46.88
Query:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M S   +T   +  FVF ++L LA   S  ++PA AP++DG +IDQGIAY+L++VAL +TY++H
Subjt:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

Q8L9T8 Arabinogalactan protein 41 | e value: 2.4e-08 | % identity: 56.6
Query:  IVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        IV  +F ILL +A  H+   +PA AP++DG  IDQGIAYVL+LVAL +TY++H
Subjt:  IVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

Q9FK16 Arabinogalactan protein 22 | e value: 4.1e-08 | % identity: 46.03
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        ++ +K     +  FV + ++ L    SH  SPA AP++DG +IDQGIAYVL++VALA+TY +H
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

Q9M373 Arabinogalactan protein 20 | e value: 1.4e-05 | % identity: 40.62
Query:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M S   +    +  FVF ++   A   S  ++PA +P++DG +IDQGIAY+L++VAL +TY++H
Subjt:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

Arabidopsis top hits
AT2G46330.1 arabinogalactan protein 16 | e value: 8.4e-09 | % identity: 46.88
Query:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M S   +T   +  FVF ++L LA   S  ++PA AP++DG +IDQGIAY+L++VAL +TY++H
Subjt:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

AT3G61640.1 arabinogalactan protein 20 | e value: 1.0e-06 | % identity: 40.62
Query:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        M S   +    +  FVF ++   A   S  ++PA +P++DG +IDQGIAY+L++VAL +TY++H
Subjt:  MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

AT5G24105.1 arabinogalactan protein 41 | e value: 1.7e-09 | % identity: 56.6
Query:  IVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        IV  +F ILL +A  H+   +PA AP++DG  IDQGIAYVL+LVAL +TY++H
Subjt:  IVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH

AT5G53250.1 arabinogalactan protein 22 | e value: 2.9e-09 | % identity: 46.03
Query:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
        ++ +K     +  FV + ++ L    SH  SPA AP++DG +IDQGIAYVL++VALA+TY +H
Subjt:  ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
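
The % identity values listed with each hit can be cross-checked against the middle line of the corresponding alignment. The sketch below assumes the usual BLAST-style convention, which this page does not state explicitly: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch. The function name `percent_identity` is illustrative, and the example uses the query and middle line of the KAA0061559.1 hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity of one alignment block, rounded to two decimals.

    Assumes a BLAST-style middle line: letters are identical residues,
    '+' and spaces are non-identical positions.
    """
    aligned_length = len(query)
    # Trailing mismatches may have been lost as stripped whitespace,
    # so pad the middle line back to the aligned length.
    padded = midline.ljust(aligned_length)
    identical = sum(1 for ch in padded if ch.isalpha())
    return round(100.0 * identical / aligned_length, 2)


# Query and middle line copied from the KAA0061559.1 alignment.
query = "ISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH"
midline = "+S +KLTA P VGF+FLI+LQLA+GHSHDISPA  PSNDGAAIDQGIAYVLLL+ALA+TYI+H"

print(percent_identity(query, midline))
```

For this hit, 50 of the 63 aligned positions carry a letter, which agrees with the listed 79.37. The sketch ignores gapped alignments, where the aligned length would also count gap columns.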


Sequences
CDS sequence
ATGATTTCGATGATGAAGCTAACTGCCGCTCCGATCGTCGGATTCGTGTTCTTGATTCTTCTGCAATTGGCCTACGGTCACTCTCACGATATTTCTCCGGCGGCGGCGCC
TTCGAACGACGGAGCTGCAATTGACCAAGGAATTGCTTACGTTCTTCTTCTGGTGGCTCTCGCCGTTACTTACATCGTCCATTGA
mRNA sequence
TCGCGTTTTTGACTCGTCCAATATCGCATTCTATTTCCAAGAAAAACCACCCAATTTTTTTTCATATAAAATCCCTAATTTTCAATATCCAATATTCAACTTCAATTTCA
GCTTCTCTGTTATTTTCGTTTCTCTTCGGATTCAATAGAAGATAAACATTTTTCAAAGTTTCAGAGATTATTCGCTTCGATTCGAAAAAGATGATTTCGATGATGAAGCT
AACTGCCGCTCCGATCGTCGGATTCGTGTTCTTGATTCTTCTGCAATTGGCCTACGGTCACTCTCACGATATTTCTCCGGCGGCGGCGCCTTCGAACGACGGAGCTGCAA
TTGACCAAGGAATTGCTTACGTTCTTCTTCTGGTGGCTCTCGCCGTTACTTACATCGTCCATTGATAATTTTTATGGAGTTGTCTGAGAGATTCAAATGAGTTCGTTTGG
ATATGTAATTTACTTTCTTTTATTTTGTAGTTTTTTCTCCGTTTATTGGTGAAGAAATTGCGTTGGAACTTTGAAATTCATTATTCTTTATTTACAATATAAATATAATT
TGCTCTGTTGATTCGGTGACAAGGAAAGTTTATTTAGCAGCTTGGAATTAAAAAAGAAACAAAAGAAAAGTCTTATTGGATGAATTTGAATAATAGTACAGCGCTAGCCG
CCCTACAACTACCATTGCTTTTAGCCTTTAAGAAAAAATAATTAAAAAATAAAAAACAGTGGTCTAGCTCCTCGAGAAATTATTCGGTGAGGCCGCCACAATGTGACCGA
CAGTTTTGCACTGTTAGATGAAAATAGAGAAGTACCTCGAATTTATATACAAAGAATATGAGGGGGCATGAAAATTATTGGATTCATATCAAACGGTTATTGTGAGGGAG
CCACATTGTGGCGGCCTCATTGAATAATTTGTCTAGCTCGAGGCCTCTCGTACTCCTGTTGTATCGTTGTCTATATTCCATTCTAGAAATAAAGTAATGATGTTTCTGAT
ATAGTGTTTGGACAAGATGATAATAGCAATGCTCGCTACCTTGTGTTTGAGGAAGTTGGTGCAAGTGAGACATTTGACTTCGAGTGGCATTGATACAAACATAAAAAAGA
GACAAGACATAATAATAACATTAAAATAGCTTACTTTATGGGGCTTGGAGGTTTATTTTTTCTATTCATGTTCCTCGGTTTAAACTAGCAAGAGGGGAAAGGACTAAGTC
ACAACAACCTTGCTGGGGCAACCAATCACCCGGAGAAGAGTTGAGATATGCTTACTCACTCATTACAATTATGCAAGTTCACCTAAAGACCATTCGATCGAAAAAATCGA
CAATAGAGTTGCAAACTTGAAAGCTAACTTACACAACAAAATGAAGAGAAACTTCAACCATGCATGTTGAGACCTCACATCAGAAAATGAGGATAAACTTTACAACTTAT
AAGATATATAGACCACTTTTCTTATTGTCAATTAATTTTGAGATACAACTCGAAGATAAAAAATCAAAATTTA
Protein sequence
MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH
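
As a consistency check, the CDS sequence can be translated and compared with the protein sequence above. This is a minimal sketch that assumes the standard genetic code (the page does not name a translation table); `CODON_TABLE`, `translate`, `CDS`, and `PROTEIN` are illustrative names, and the sequences are copied from this record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# CDS of Tan0017806, joined from the two wrapped lines of the record.
CDS = (
    "ATGATTTCGATGATGAAGCTAACTGCCGCTCCGATCGTCGGATTCGTGTTCTTGATTCTTCTGCAATTGGCCTACGGTCACTCTCACGATATTTCTCCGGCGGCGGCGCC"
    "TTCGAACGACGGAGCTGCAATTGACCAAGGAATTGCTTACGTTCTTCTTCTGGTGGCTCTCGCCGTTACTTACATCGTCCATTGA"
)
PROTEIN = "MISMMKLTAAPIVGFVFLILLQLAYGHSHDISPAAAPSNDGAAIDQGIAYVLLLVALAVTYIVH"

print(translate(CDS) == PROTEIN)
```

The CDS is 195 nucleotides long: 64 codons for the 64-residue protein plus the terminal TGA stop codon.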