; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC03G048450 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC03G048450
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
Descriptionnon-classical arabinogalactan protein 31
Genome locationCicolChr03:6294600..6295657
RNA-Seq ExpressionCcUC03G048450
SyntenyCcUC03G048450
Gene Ontology termsGO:0071944 - cell periphery (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583991.1 Non-classical arabinogalactan protein 31, partial [Cucurbita argyrosperma subsp. sororia]3.2e-9579.92Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+   AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HHR   S SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK
        PPK STY+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPTVETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS+C+K SK
Subjt:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

KAG7019613.1 Non-classical arabinogalactan protein 31, partial [Cucurbita argyrosperma subsp. argyrosperma]3.2e-9579.92Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+   AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HH   HS SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK
        PPK STY+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPTVETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS+C+K SK
Subjt:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_004146606.2 non-classical arabinogalactan protein 31 [Cucumis sativus]1.7e-10485.12Show/hide
Query:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPS
        MAS+TAN SFLLLLLCC+ LNAFQ AAY  T APAP+HH+  H PVAAPTPSFH   HHHHHH P+QSP SHH HPHSPSPAPSPVYP  PPAHYAPVPS
Subjt:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPS

Query:  PVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSK
        P   PKPST +PRSFV+VQGVVYCKSCKYP VDTLLGAKPLSGATVKLSCKNTKYAP VETATSD+NGYFRLAAPKNVTSYAFHRCKVYLVKSPDS C K
Subjt:  PVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSK

Query:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        ASK+NGG DGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_008442660.1 PREDICTED: non-classical arabinogalactan protein 31 [Cucumis melo]2.1e-10787.4Show/hide
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  T APAP+HH+  H PVAAP PSFH   HHHHHH PSQSP SHH HPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
         C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_038876988.1 non-classical arabinogalactan protein 31 [Benincasa hispida]2.1e-11090.12Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFHH--HHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----PV
        MASVTAN SFL+LLLCCS+NAFQTAAYHGT APAPSHH GGH PVAAPTPSFHH  HH HH PS SP+SHH HPH  SPAPSPVYP PP AHYA    PV
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFHH--HHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----PV

Query:  PSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTC
        PSPV P KPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS+C
Subjt:  PSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTC

Query:  SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC
        SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC
Subjt:  SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC

TrEMBL top hitse value%identityAlignment
A0A0A0LXF0 Structural constituent of cell wall8.2e-10585.12Show/hide
Query:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPS
        MAS+TAN SFLLLLLCC+ LNAFQ AAY  T APAP+HH+  H PVAAPTPSFH   HHHHHH P+QSP SHH HPHSPSPAPSPVYP  PPAHYAPVPS
Subjt:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPS

Query:  PVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSK
        P   PKPST +PRSFV+VQGVVYCKSCKYP VDTLLGAKPLSGATVKLSCKNTKYAP VETATSD+NGYFRLAAPKNVTSYAFHRCKVYLVKSPDS C K
Subjt:  PVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSK

Query:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        ASK+NGG DGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A1S3B5Q3 non-classical arabinogalactan protein 311.0e-10787.4Show/hide
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  T APAP+HH+  H PVAAP PSFH   HHHHHH PSQSP SHH HPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
         C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A5A7URZ5 Non-classical arabinogalactan protein 311.0e-10787.4Show/hide
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  T APAP+HH+  H PVAAP PSFH   HHHHHH PSQSP SHH HPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
         C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  TCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A6J1EHN2 non-classical arabinogalactan protein 31-like3.4e-9579.5Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+   AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HH   HS SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK
        PPK STY+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPT+ETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS+C+K SK
Subjt:  PPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A6J1KM21 non-classical arabinogalactan protein 31-like3.9e-9177.41Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRP
        M SV A  + ++LLL C+LNAFQ AA HG+  PAP   D  H PV    PS HHHHHHHPPSQSP+ HH   H+ SPAPSPVY PPPPAHYA    PV+P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRP

Query:  PKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKL
        PK STY+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GA VKLSCKNTKYAPTVETAT+DKNGYFRLAAPKNVTSYAFHRCKV+LVKSPDS+CSK SK+
Subjt:  PKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKL

Query:  NGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCSH
        NGG DGAELKPA+AFTD EKKPVVLYNVGPLAFEP+C+H
Subjt:  NGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCSH

SwissProt top hitse value%identityAlignment
P93013 Non-classical arabinogalactan protein 302.0e-2837.9Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSP
        M  +  +VS  L  L C  ++  T   +   +  P H    H    P+  PT          PP+++P+    +P + +P   P  PP       P   P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSP

Query:  VRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDST
        ++PP  P  Y P   ++ V V+GVVYCK+CKY GV+ + GAKP+  A V+L CKN K   ++    +DKNGYF L APK VT+Y    C+ +LVKSPD+ 
Subjt:  VRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDST

Query:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC
        CSK S L+ G  G+ LKP     F+    +     +YNVGP AFEPTC
Subjt:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC

Q03211 Pistil-specific extensin-like protein6.6e-1937.89Show/hide
Query:  PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPV----PSPV----------RPPKPSTYVPRSFVE--------------VQ
        PV AP+PS        P +Q P    + P  P    SP+ PPPPP  Y PV    PSP            PP     +PR                  V 
Subjt:  PVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPV----PSPV----------RPPKPSTYVPRSFVE--------------VQ

Query:  GVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAF
        G+VYCKSC   GV TLL A  L GA VKL C   K    V+ AT+D  G FR+  PK++T+    +CKVYLVKSP+  C+  +  NGG+ G  LKP    
Subjt:  GVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAF

Query:  TDDEKKPVV--------LYNVGPLAFE
               VV        LY VGP  FE
Subjt:  TDDEKKPVV--------LYNVGPLAFE

Q9FZA2 Non-classical arabinogalactan protein 311.3e-3552.15Show/hide
Query:  PPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV
        PP++ PV     P +  P   PVYPP       PV  P +PP  P  Y P   RS V V+G VYCKSCKY   +TLLGAKP+ GATVKL CK+ K   T 
Subjt:  PPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV

Query:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC
        ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NVGP AF P+C
Subjt:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC

Arabidopsis top hitse value%identityAlignment
AT1G28290.1 arabinogalactan protein 319.4e-3752.15Show/hide
Query:  PPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV
        PP++ PV     P +  P   PVYPP       PV  P +PP  P  Y P   RS V V+G VYCKSCKY   +TLLGAKP+ GATVKL CK+ K   T 
Subjt:  PPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV

Query:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC
        ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NVGP AF P+C
Subjt:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC

AT1G28290.2 arabinogalactan protein 311.1e-3439.48Show/hide
Query:  LLLLLCCSLNAFQTAAYHGTH------APAPSHHDGGHQPVAAPTPSFHHHHHHH------------PPSQSPVSHHRHP--------------------
        L+ L C + + F     H T       APAP HH   H     P P  HHH H H            PP ++PVS    P                    
Subjt:  LLLLLCCSLNAFQTAAYHGTH------APAPSHHDGGHQPVAAPTPSFHHHHHHH------------PPSQSPVSHHRHP--------------------

Query:  ------------------HSPSPAP----------SPVYPPPPPAHYAPVPSPVRPP---------KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLL
                          + P+ AP           PVYPP  P  Y P  +PV+PP          P  Y P   RS V V+G VYCKSCKY   +TLL
Subjt:  ------------------HSPSPAP----------SPVYPPPPPAHYAPVPSPVRPP---------KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLL

Query:  GAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNV
        GAKP+ GATVKL CK+ K   T ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NV
Subjt:  GAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNV

Query:  GPLAFEPTC
        GP AF P+C
Subjt:  GPLAFEPTC

AT2G33790.1 arabinogalactan protein 301.5e-2937.9Show/hide
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSP
        M  +  +VS  L  L C  ++  T   +   +  P H    H    P+  PT          PP+++P+    +P + +P   P  PP       P   P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSP

Query:  VRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDST
        ++PP  P  Y P   ++ V V+GVVYCK+CKY GV+ + GAKP+  A V+L CKN K   ++    +DKNGYF L APK VT+Y    C+ +LVKSPD+ 
Subjt:  VRPP-KPSTYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDST

Query:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC
        CSK S L+ G  G+ LKP     F+    +     +YNVGP AFEPTC
Subjt:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein8.8e-3552.38Show/hide
Query:  SPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SPD
        SP+ PP     + R  V V+G+VYCKSCKY GVDTLL A PL GATVKL+C NTK   T+ET T DKNGYF + APK +T+YAFH C+ +       +  
Subjt:  SPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SPD

Query:  STCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
         TC+  SKLN G  GA LKP++     E    VL++VGP AFEP C+
Subjt:  STCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

AT5G05500.1 Pollen Ole e 1 allergen and extensin family protein3.4e-1029.75Show/hide
Query:  PPPPPAHYAPVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPT-VETATSDKNGYF-----RLAAPKNVTSY
        PPPPP    P    V               V+G+VYC+SC   G  +L GA+ ++GA + + CKN +   +  +   +D  G+F           +   +
Subjt:  PPPPPAHYAPVPSPVRPPKPSTYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPT-VETATSDKNGYF-----RLAAPKNVTSY

Query:  AFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEP
          H C+  LV SP   C+  S +N   DGA L+             V+Y  GPLAF P
Subjt:  AFHRCKVYLVKSPDSTCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTGTTACTGCCAACGTCTCCTTCCTTCTTCTTCTCCTCTGCTGCAGCCTCAATGCCTTCCAAACCGCCGCATATCACGGAACCCATGCTCCGGCACCA
AGTCACCATGACGGCGGCCACCAGCCAGTAGCTGCCCCAACTCCAAGCTTCCACCACCACCACCACCACCACCCGCCGAGCCAGTCTCCGGTTTCCCACCACCGC
CATCCTCATTCCCCTTCCCCTGCTCCTTCCCCTGTTTACCCACCTCCCCCTCCGGCTCACTACGCCCCGGTTCCGTCACCGGTTCGACCACCCAAACCGTCCACC
TACGTCCCAAGAAGCTTTGTTGAAGTCCAAGGCGTTGTTTATTGCAAGTCTTGCAAGTACCCTGGCGTCGATACCCTTCTCGGAGCTAAACCCCTTTCCGGTGCT
ACAGTGAAGCTATCATGCAAGAACACCAAGTACGCTCCGACAGTGGAAACCGCCACCAGCGACAAGAATGGTTACTTCCGGCTAGCGGCGCCGAAGAATGTGACG
AGCTACGCATTCCACCGCTGCAAGGTTTATCTGGTGAAGTCGCCGGATAGCACTTGCAGTAAGGCGTCAAAACTGAACGGCGGAGAGGACGGAGCGGAGTTGAAG
CCGGCGAGAGCATTCACGGATGACGAGAAAAAGCCTGTTGTGCTTTACAATGTTGGGCCATTGGCTTTTGAACCCACTTGCAGTCATTGA
mRNA sequenceShow/hide mRNA sequence
CATTCATATCATTACCAAATGGCTTCTGTTACTGCCAACGTCTCCTTCCTTCTTCTTCTCCTCTGCTGCAGCCTCAATGCCTTCCAAACCGCCGCATATCACGGA
ACCCATGCTCCGGCACCAAGTCACCATGACGGCGGCCACCAGCCAGTAGCTGCCCCAACTCCAAGCTTCCACCACCACCACCACCACCACCCGCCGAGCCAGTCT
CCGGTTTCCCACCACCGCCATCCTCATTCCCCTTCCCCTGCTCCTTCCCCTGTTTACCCACCTCCCCCTCCGGCTCACTACGCCCCGGTTCCGTCACCGGTTCGA
CCACCCAAACCGTCCACCTACGTCCCAAGAAGCTTTGTTGAAGTCCAAGGCGTTGTTTATTGCAAGTCTTGCAAGTACCCTGGCGTCGATACCCTTCTCGGAGCT
AAACCCCTTTCCGGTGCTACAGTGAAGCTATCATGCAAGAACACCAAGTACGCTCCGACAGTGGAAACCGCCACCAGCGACAAGAATGGTTACTTCCGGCTAGCG
GCGCCGAAGAATGTGACGAGCTACGCATTCCACCGCTGCAAGGTTTATCTGGTGAAGTCGCCGGATAGCACTTGCAGTAAGGCGTCAAAACTGAACGGCGGAGAG
GACGGAGCGGAGTTGAAGCCGGCGAGAGCATTCACGGATGACGAGAAAAAGCCTGTTGTGCTTTACAATGTTGGGCCATTGGCTTTTGAACCCACTTGCAGTCAT
TGAAAGCAGGGCGGAGGATGGAGGGGTAGTTTAGTAATATTTGGGGTTTTGTTTTGATATGATGCCTTTGGATTCCTCTCTTTTCTAGGTTATTGTGAGTGACAT
TGTTATTATTATGAATCGTTTCAATGGCATGGTATTAAGTTTTTAGTCGTT
Protein sequenceShow/hide protein sequence
MASVTANVSFLLLLLCCSLNAFQTAAYHGTHAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHRHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPPKPST
YVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSTCSKASKLNGGEDGAELK
PARAFTDDEKKPVVLYNVGPLAFEPTCSH