; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009341 (gene) of Snake gourd v1 genome

Gene IDTan0009341
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionarabinogalactan peptide 22-like
Genome locationLG07:66900208..66900688
RNA-Seq ExpressionTan0009341
SyntenyTan0009341
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009424 - Arabinogalactan protein 16/20/22/41


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579545.1 Arabinogalactan protein 16, partial [Cucurbita argyrosperma subsp. sororia]3.2e-1377.61Show/hide
Query:  VPF----AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF
        +PF     V+FVLAML PA  A + +PAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHS DLSISF
Subjt:  VPF----AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF

KAG6606052.1 Arabinogalactan protein 20, partial [Cucurbita argyrosperma subsp. sororia]1.9e-1076.67Show/hide
Query:  AVLFVLAMLSPAFAAAAF-SPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        AV F LAML PA AA  F SPAQPP ASDGTS+DQGIAYVLMLLAL+LTYIIH  +L IS
Subjt:  AVLFVLAMLSPAFAAAAF-SPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

KAG7017004.1 Arabinogalactan peptide 16 [Cucurbita argyrosperma subsp. argyrosperma]1.1e-1381.97Show/hide
Query:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF
        F V+FVLAM+ PA  A + +PAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHS DLSISF
Subjt:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF

XP_022996281.1 arabinogalactan peptide 22-like [Cucurbita maxima]1.5e-1076.67Show/hide
Query:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        AV FVLAML PA AA  + SPAQPP ASDGTS+DQGIAYVLMLLAL+LTYIIH  +L IS
Subjt:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

XP_023532560.1 arabinogalactan peptide 20-like [Cucurbita pepo subsp. pepo]1.5e-1076.67Show/hide
Query:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        AV FVLAML PA AA  + SPAQPP ASDGTS+DQGIAYVLMLLAL+LTYIIH  +L IS
Subjt:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

TrEMBL top hitse value%identityAlignment
A0A0A0KK44 Uncharacterized protein2.7e-1073.33Show/hide
Query:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        F V  VL  LS A+ A + SPAQPPA  DGTSIDQGIAYVLMLLALVLTYIIHS DL +S
Subjt:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

A0A1S3ATI9 arabinogalactan peptide 22-like4.6e-1073.33Show/hide
Query:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        F V  VL  LSPA  A + SPAQPPA SDGTSIDQGIAY LMLLALVLTYIIHS D  +S
Subjt:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

A0A5D3C2F1 Arabinogalactan peptide 22-like4.6e-1073.33Show/hide
Query:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        F V  VL  LSPA  A + SPAQPPA SDGTSIDQGIAY LMLLALVLTYIIHS D  +S
Subjt:  FAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

A0A6J1CXM0 arabinogalactan peptide 22-like7.8e-1071.21Show/hide
Query:  VPFAVLFVLAMLSPAFAAA---AFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF
        V  AV FVL    P  A A   + SPAQP A SDGTSIDQGIAY LMLLALVLTYIIHS DLSISF
Subjt:  VPFAVLFVLAMLSPAFAAA---AFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF

A0A6J1K4A3 arabinogalactan peptide 22-like7.1e-1176.67Show/hide
Query:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        AV FVLAML PA AA  + SPAQPP ASDGTS+DQGIAYVLMLLAL+LTYIIH  +L IS
Subjt:  AVLFVLAMLSPAFAA-AAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

SwissProt top hitse value%identityAlignment
O82337 Arabinogalactan protein 161.0e-0656.14Show/hide
Query:  FVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF
        FV A++     A + +PA P   SDGTSIDQGIAY+LM++ALVLTY+IH  D S S+
Subjt:  FVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF

Q8L9T8 Arabinogalactan protein 418.6e-0660.78Show/hide
Query:  VLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH
        V  + A+L P   A + +PA P   SDGT+IDQGIAYVLML+ALVLTY+IH
Subjt:  VLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH

Q9FK16 Arabinogalactan protein 221.6e-0451.92Show/hide
Query:  AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH
        AV  +++++    A +  S   P   SDGTSIDQGIAYVLM++AL LTY IH
Subjt:  AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH

Q9M373 Arabinogalactan protein 202.1e-0756.25Show/hide
Query:  MAVPFAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        +AV     FV A++SP   A + +PA P   SDGTSIDQGIAY+LM++ALVLTY+IH  D S S
Subjt:  MAVPFAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

Arabidopsis top hitse value%identityAlignment
AT2G46330.1 arabinogalactan protein 167.2e-0856.14Show/hide
Query:  FVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF
        FV A++     A + +PA P   SDGTSIDQGIAY+LM++ALVLTY+IH  D S S+
Subjt:  FVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF

AT3G61640.1 arabinogalactan protein 201.5e-0856.25Show/hide
Query:  MAVPFAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS
        +AV     FV A++SP   A + +PA P   SDGTSIDQGIAY+LM++ALVLTY+IH  D S S
Subjt:  MAVPFAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSIS

AT5G24105.1 arabinogalactan protein 416.1e-0760.78Show/hide
Query:  VLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH
        V  + A+L P   A + +PA P   SDGT+IDQGIAYVLML+ALVLTY+IH
Subjt:  VLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH

AT5G53250.1 arabinogalactan protein 221.2e-0551.92Show/hide
Query:  AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH
        AV  +++++    A +  S   P   SDGTSIDQGIAYVLM++AL LTY IH
Subjt:  AVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGTCCCTTTTGCCGTTTTATTTGTTTTGGCTATGCTTTCCCCTGCTTTTGCTGCTGCTGCTTTCTCTCCGGCTCAACCTCCTGCAGCTAGCGATGGGACAAGCAT
AGACCAAGGAATTGCCTATGTCTTGATGTTGTTGGCTTTGGTGCTCACTTACATTATCCATTCAACCGACCTTTCTATCTCTTTTTAA
mRNA sequenceShow/hide mRNA sequence
CATTGGTAGAGAGGCAGACCGCCATTGCAACGCCAGTGATCAGAGTGCAGCCATCGTCGGTGGCCCGGCCCAATAATCCCTCTCTGCTGCGCCTCTCCGGCCGGAGCCTT
CTTCCATTTCTGAGAGACGTTGCAGAGAGAGATTTTCAGTTCTTTCCTTCATTCATTCTTTGATTCATTTTAAGAATGGCGGTCCCTTTTGCCGTTTTATTTGTTTTGGC
TATGCTTTCCCCTGCTTTTGCTGCTGCTGCTTTCTCTCCGGCTCAACCTCCTGCAGCTAGCGATGGGACAAGCATAGACCAAGGAATTGCCTATGTCTTGATGTTGTTGG
CTTTGGTGCTCACTTACATTATCCATTCAACCGACCTTTCTATCTCTTTTTAA
Protein sequenceShow/hide protein sequence
MAVPFAVLFVLAMLSPAFAAAAFSPAQPPAASDGTSIDQGIAYVLMLLALVLTYIIHSTDLSISF