; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000221 (gene) of Snake gourd v1 genome

Gene IDTan0000221
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionarabinogalactan peptide 22-like
Genome locationLG01:1408816..1409628
RNA-Seq ExpressionTan0000221
SyntenyTan0000221
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009424 - Arabinogalactan protein 16/20/22/41


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022140891.1 arabinogalactan peptide 22-like [Momordica charantia]1.6e-1783.08Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M + RACFGIYA IIAI +V+ILPVS AEHSS PAPAP+SDGTTIDQ IAYILMLLALVLTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

XP_022945187.1 arabinogalactan peptide 22-like [Cucurbita moschata]5.0e-1986.15Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M +LRACFGIYAVIIAIFYVV+LPVS AEH+S PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

XP_022968216.1 arabinogalactan peptide 22-like isoform X1 [Cucurbita maxima]8.0e-1782.09Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M  LRACFG+YAVIIAIFYVV+LPVS AEH+S   PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

XP_022968217.1 arabinogalactan peptide 22-like isoform X2 [Cucurbita maxima]8.0e-1782.09Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M  LRACFG+YAVIIAIFYVV+LPVS AEH+S   PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

XP_038898060.1 arabinogalactan protein 41-like [Benincasa hispida]2.1e-1783.33Show/hide
Query:  MDILRACFGIY-AVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M +L+ACFGIY AVIIA+FYVV LPVSAAE SS PAPAPTSDGTTIDQ IAY+LML+ALVLTYIIH
Subjt:  MDILRACFGIY-AVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

TrEMBL top hitse value%identityAlignment
A0A5D3BTZ5 Arabinogalactan peptide 221.6e-1579.1Show/hide
Query:  MDILRA-CFGIY-AVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M +LRA CFGIY AV+IA+FYV+ LPVS+AE SS PAPAPTSDGTTIDQ IAY+LML+ALVLTYIIH
Subjt:  MDILRA-CFGIY-AVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

A0A6J1CHE0 arabinogalactan peptide 22-like7.8e-1883.08Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M + RACFGIYA IIAI +V+ILPVS AEHSS PAPAP+SDGTTIDQ IAYILMLLALVLTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

A0A6J1G081 arabinogalactan peptide 22-like2.4e-1986.15Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M +LRACFGIYAVIIAIFYVV+LPVS AEH+S PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

A0A6J1HT15 arabinogalactan peptide 22-like isoform X23.9e-1782.09Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M  LRACFG+YAVIIAIFYVV+LPVS AEH+S   PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

A0A6J1HU96 arabinogalactan peptide 22-like isoform X13.9e-1782.09Show/hide
Query:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        M  LRACFG+YAVIIAIFYVV+LPVS AEH+S   PAPAPTSDGT IDQ IAYILMLLAL LTYIIH
Subjt:  MDILRACFGIYAVIIAIFYVVILPVSAAEHSS--PPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

SwissProt top hitse value%identityAlignment
O82337 Arabinogalactan protein 162.1e-0759.62Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        + +  + VIL ++ A+ S  PAPAPTSDGT+IDQ IAY+LM++ALVLTY+IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

Q8L9T8 Arabinogalactan protein 418.9e-1160.66Show/hide
Query:  RACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        R  FG+ + I++I + ++LP++ A+ S+ PAPAPTSDGTTIDQ IAY+LML+ALVLTY+IH
Subjt:  RACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

Q9FK16 Arabinogalactan protein 221.7e-0963.46Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        +  I  V++LP+ A  HSS PAPAPTSDGT+IDQ IAY+LM++AL LTY IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

Q9M373 Arabinogalactan protein 202.1e-0759.62Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        + A  + VI P + A+ S  PAP+PTSDGT+IDQ IAY+LM++ALVLTY+IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

Arabidopsis top hitse value%identityAlignment
AT2G46330.1 arabinogalactan protein 161.5e-0859.62Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        + +  + VIL ++ A+ S  PAPAPTSDGT+IDQ IAY+LM++ALVLTY+IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

AT3G61640.1 arabinogalactan protein 201.5e-0859.62Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        + A  + VI P + A+ S  PAP+PTSDGT+IDQ IAY+LM++ALVLTY+IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

AT5G24105.1 arabinogalactan protein 416.3e-1260.66Show/hide
Query:  RACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        R  FG+ + I++I + ++LP++ A+ S+ PAPAPTSDGTTIDQ IAY+LML+ALVLTY+IH
Subjt:  RACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH

AT5G53250.1 arabinogalactan protein 221.2e-1063.46Show/hide
Query:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH
        +  I  V++LP+ A  HSS PAPAPTSDGT+IDQ IAY+LM++AL LTY IH
Subjt:  IIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATATTTTGAGGGCTTGTTTTGGAATCTATGCTGTGATTATTGCTATCTTTTATGTTGTTATTTTGCCTGTATCTGCAGCTGAACATTCCTCTCCTCCAGCTCCGGC
TCCCACTAGTGATGGCACCACAATAGACCAATGCATAGCATACATTCTAATGCTGTTGGCTTTAGTGCTCACTTATATCATCCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGATATTTTGAGGGCTTGTTTTGGAATCTATGCTGTGATTATTGCTATCTTTTATGTTGTTATTTTGCCTGTATCTGCAGCTGAACATTCCTCTCCTCCAGCTCCGGC
TCCCACTAGTGATGGCACCACAATAGACCAATGCATAGCATACATTCTAATGCTGTTGGCTTTAGTGCTCACTTATATCATCCATTGA
Protein sequenceShow/hide protein sequence
MDILRACFGIYAVIIAIFYVVILPVSAAEHSSPPAPAPTSDGTTIDQCIAYILMLLALVLTYIIH