CuGenDBv2

Clc03G08700 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc03G08700
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: non-classical arabinogalactan protein 31
Genome location: ClcChr03:8951334..8952223
RNA-Seq Expression: Clc03G08700
Synteny: Clc03G08700
Gene Ontology terms: GO:0071944 - cell periphery (cellular component)
InterPro domains: NA
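
The genome location above is written as `chromosome:start..end`. The following is a minimal sketch of parsing that string, assuming the coordinates are 1-based and inclusive (the page does not state the convention). `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("ClcChr03:8951334..8952223")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 890 bp on ClcChr03.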


Homology
GenBank top hits | e value | % identity
KAG7019613.1 Non-classical arabinogalactan protein 31, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.1e-95 | 80.33
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+P  AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HH   HS SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK
        PPK S+Y+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPTVETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC+K SK
Subjt:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_004146606.2 non-classical arabinogalactan protein 31 [Cucumis sativus] | 1.3e-104 | 85.12
Query:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPS
        MAS+TAN SFLLLLLCC+ LNAFQ AAY  TPAPAP+HH+  H PVAAPTPSFH   HHHHHH P+QSP S HHHPHSPSPAPSPVYP  PPAHYAPVPS
Subjt:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPS

Query:  PVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSK
        P   PKPS+ +PRSFV+VQGVVYCKSCKYP VDTLLGAKPLSGATVKLSCKNTKYAP VETATSD+NGYFRLAAPKNVTSYAFHRCKVYLVKSPDS C K
Subjt:  PVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSK

Query:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        ASK+NGG DGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_008442660.1 PREDICTED: non-classical arabinogalactan protein 31 [Cucumis melo] | 1.6e-107 | 87.4
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  TPAPAP+HH+  H PVAAP PSFH   HHHHHH PSQSP S HHHPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPS+YVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_022927607.1 non-classical arabinogalactan protein 31-like [Cucurbita moschata] | 2.4e-95 | 79.92
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+P  AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HH   HS SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK
        PPK S+Y+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPT+ETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC+K SK
Subjt:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

XP_038876988.1 non-classical arabinogalactan protein 31 [Benincasa hispida] | 7.1e-111 | 90.53
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFHH--HHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----PV
        MASVTAN SFL+LLLCCS+NAFQTAAYHGTPAPAPSHH GGH PVAAPTPSFHH  HH HH PS SP+S HHHPH  SPAPSPVYP PP AHYA    PV
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFHH--HHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----PV

Query:  PSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC
        PSPV P KPS+YVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC
Subjt:  PSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC

Query:  SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC
        SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC
Subjt:  SKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTC

TrEMBL top hits | e value | % identity
A0A0A0LXF0 Structural constituent of cell wall | 6.2e-105 | 85.12
Query:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPS
        MAS+TAN SFLLLLLCC+ LNAFQ AAY  TPAPAP+HH+  H PVAAPTPSFH   HHHHHH P+QSP S HHHPHSPSPAPSPVYP  PPAHYAPVPS
Subjt:  MASVTANVSFLLLLLCCS-LNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPS

Query:  PVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSK
        P   PKPS+ +PRSFV+VQGVVYCKSCKYP VDTLLGAKPLSGATVKLSCKNTKYAP VETATSD+NGYFRLAAPKNVTSYAFHRCKVYLVKSPDS C K
Subjt:  PVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSK

Query:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        ASK+NGG DGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  ASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A1S3B5Q3 non-classical arabinogalactan protein 31 | 7.9e-108 | 87.4
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  TPAPAP+HH+  H PVAAP PSFH   HHHHHH PSQSP S HHHPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPS+YVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A5A7URZ5 Non-classical arabinogalactan protein 31 | 7.9e-108 | 87.4
Query:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----
        MAS+TAN SFL LLLLCCSLNAFQ AAY  TPAPAP+HH+  H PVAAP PSFH   HHHHHH PSQSP S HHHPHSPSPAPSPVYP PP AHYA    
Subjt:  MASVTANVSFL-LLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFH---HHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYA----

Query:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS
        PVPSPV  PKPS+YVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLV+SPDS
Subjt:  PVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDS

Query:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +C KASKLNGGEDGAELKPARAFTD+EKKPVVLYNVGPLAFEPTCS
Subjt:  SCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A6J1EHN2 non-classical arabinogalactan protein 31-like | 1.2e-95 | 79.92
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR
        M SV A  + ++LLL C+LNAFQ AA HG+P  AP   D  +  PVAAP+   HHHHHHHPPSQSP+ HH   HS SPAPSPVY PPPPAHYAPVPSPV+
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQ-PVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVR

Query:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK
        PPK S+Y+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GATVKLSCKNTKYAPT+ETAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSC+K SK
Subjt:  PPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASK

Query:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
        +NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+
Subjt:  LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

A0A6J1KM21 non-classical arabinogalactan protein 31-like | 3.9e-91 | 78.24
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRP
        M SV A  + ++LLL C+LNAFQ AA HG+P PAP   D  H PV    PS HHHHHHHPPSQSP+ +HHH H  SPAPSPVY PPPPAHYA    PV+P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRP

Query:  PKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKL
        PK S+Y+PRSFVEVQGVVYCKSC YPGVDTLLGAKPL+GA VKLSCKNTKYAPTVETAT+DKNGYFRLAAPKNVTSYAFHRCKV+LVKSPDSSCSK SK+
Subjt:  PKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKL

Query:  NGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCSH
        NGG DGAELKPA+AFTD EKKPVVLYNVGPLAFEP+C+H
Subjt:  NGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCSH

SwissProt top hits | e value | % identity
P93013 Non-classical arabinogalactan protein 30 | 1.3e-27 | 37.9
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSP
        M  +  +VS  L  L C  ++  T   +   +  P H    H    P+  PT          PP+++P+    +P + +P   P  PP       P   P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSP

Query:  VRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSS
        ++PP  P  Y P   ++ V V+GVVYCK+CKY GV+ + GAKP+  A V+L CKN K   ++    +DKNGYF L APK VT+Y    C+ +LVKSPD+ 
Subjt:  VRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSS

Query:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC
        CSK S L+ G  G+ LKP     F+    +     +YNVGP AFEPTC
Subjt:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC

Q03211 Pistil-specific extensin-like protein | 6.2e-17 | 36.93
Query:  PAPAPSHH-DGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPV----PSPV----------RPPKPSSYVPRSFVE--
        P+P+P+        PV AP+PS        PP++ P      P  P    SP+ PPPPP  Y PV    PSP            PP     +PR      
Subjt:  PAPAPSHH-DGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPV----PSPV----------RPPKPSSYVPRSFVE--

Query:  ------------VQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLN
                    V G+VYCKSC   GV TLL A  L GA VKL C   K    V+ AT+D  G FR+  PK++T+    +CKVYLVKSP+ +C+  +  N
Subjt:  ------------VQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLN

Query:  GGEDGAELKPARAFTDDEKKPVV--------LYNVGPLAFE
        GG+ G  LKP           VV        LY VGP  FE
Subjt:  GGEDGAELKPARAFTDDEKKPVV--------LYNVGPLAFE

Q9FZA2 Non-classical arabinogalactan protein 31 | 6.6e-35 | 52.15
Query:  PPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV
        PP++ PV     P +  P   PVYPP       PV  P +PP  P  Y P   RS V V+G VYCKSCKY   +TLLGAKP+ GATVKL CK+ K   T 
Subjt:  PPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV

Query:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC
        ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NVGP AF P+C
Subjt:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC

Arabidopsis top hits | e value | % identity
AT1G28290.1 arabinogalactan protein 31 | 4.7e-36 | 52.15
Query:  PPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV
        PP++ PV     P +  P   PVYPP       PV  P +PP  P  Y P   RS V V+G VYCKSCKY   +TLLGAKP+ GATVKL CK+ K   T 
Subjt:  PPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTV

Query:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC
        ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NVGP AF P+C
Subjt:  ETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNVGPLAFEPTC

AT1G28290.2 arabinogalactan protein 31 | 2.0e-34 | 39.81
Query:  LLLLLCCSLNAFQTAAYHGT------PAPAPSHHDGGHQPVAAPTPSFHHHHHHH------------PPSQSPVSHHHHP--------------------
        L+ L C + + F     H T      PAPAP HH   H     P P  HHH H H            PP ++PVS    P                    
Subjt:  LLLLLCCSLNAFQTAAYHGT------PAPAPSHHDGGHQPVAAPTPSFHHHHHHH------------PPSQSPVSHHHHP--------------------

Query:  ------------------HSPSPAP----------SPVYPPPPPAHYAPVPSPVRPP---------KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLL
                          + P+ AP           PVYPP  P  Y P  +PV+PP          P  Y P   RS V V+G VYCKSCKY   +TLL
Subjt:  ------------------HSPSPAP----------SPVYPPPPPAHYAPVPSPVRPP---------KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLL

Query:  GAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNV
        GAKP+ GATVKL CK+ K   T ET T+DKNGYF L APK VT++ F  C+VYLVKS D  CSK SKL GG+ GAELKP +          K    L+NV
Subjt:  GAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDD----EKKPVVLYNV

Query:  GPLAFEPTC
        GP AF P+C
Subjt:  GPLAFEPTC

AT2G33790.1 arabinogalactan protein 30 | 9.4e-29 | 37.9
Query:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSP
        M  +  +VS  L  L C  ++  T   +   +  P H    H    P+  PT          PP+++P+    +P + +P   P  PP       P   P
Subjt:  MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGH---QPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSP

Query:  VRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSS
        ++PP  P  Y P   ++ V V+GVVYCK+CKY GV+ + GAKP+  A V+L CKN K   ++    +DKNGYF L APK VT+Y    C+ +LVKSPD+ 
Subjt:  VRPP-KPSSYVP---RSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSS

Query:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC
        CSK S L+ G  G+ LKP     F+    +     +YNVGP AFEPTC
Subjt:  CSKASKLNGGEDGAELKPA--RAFTDDEKK--PVVLYNVGPLAFEPTC

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein | 2.6e-34 | 51.7
Query:  SPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SPD
        SP+ PP   + + R  V V+G+VYCKSCKY GVDTLL A PL GATVKL+C NTK   T+ET T DKNGYF + APK +T+YAFH C+ +       +  
Subjt:  SPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SPD

Query:  SSCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS
         +C+  SKLN G  GA LKP++     E    VL++VGP AFEP C+
Subjt:  SSCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS

AT5G05500.1 Pollen Ole e 1 allergen and extensin family protein | 3.4e-10 | 29.75
Query:  PPPPPAHYAPVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPT-VETATSDKNGYF-----RLAAPKNVTSY
        PPPPP    P    V               V+G+VYC+SC   G  +L GA+ ++GA + + CKN +   +  +   +D  G+F           +   +
Subjt:  PPPPPAHYAPVPSPVRPPKPSSYVPRSFVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPT-VETATSDKNGYF-----RLAAPKNVTSY

Query:  AFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEP
          H C+  LV SP   C+  S +N   DGA L+             V+Y  GPLAF P
Subjt:  AFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEP
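
In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and other positions with a space. As a rough sketch, the counts for a single block can be recovered from that middle line alone. The example uses the final block of the KAG7019613.1 alignment, and it assumes the middle line has been trimmed to the same width as the query line with its internal spaces preserved. `alignment_stats` is a hypothetical helper, not part of any BLAST tool.

```python
def alignment_stats(midline):
    """Return (identities, positives, columns) for one alignment block."""
    # A letter on the middle line means query and subject share that residue.
    identities = sum(ch.isalpha() for ch in midline)
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


query = "LNGGEDGAELKPARAFTDDEKKPVVLYNVGPLAFEPTCS"
midline = "+NGGEDGAELKPA+AFTD EKKPVVLYNVGPLAFEPTC+"

# The two lines must cover the same alignment columns.
assert len(query) == len(midline)

identities, positives, columns = alignment_stats(midline)
print(f"identities {identities}/{columns}, positives {positives}/{columns}")
```

This covers one block only. The % identity reported for a hit is computed over the whole alignment, so a per-block figure will generally differ from it.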


Sequences
CDS sequence
ATGGCTTCTGTTACTGCCAACGTCTCTTTCCTTCTTCTTCTCCTCTGCTGCAGCCTCAATGCCTTCCAAACCGCCGCATACCACGGAACCCCTGCTCCGGCACCAAGTCA
CCACGACGGCGGCCACCAGCCAGTAGCTGCCCCAACTCCAAGCTTCCACCACCACCACCACCACCACCCGCCGAGCCAGTCTCCGGTTTCCCACCACCACCATCCTCATT
CCCCTTCCCCTGCTCCTTCCCCTGTTTACCCACCTCCCCCTCCGGCTCACTACGCCCCGGTTCCGTCACCGGTTCGACCACCCAAACCGTCCTCCTACGTCCCAAGAAGC
TTCGTTGAAGTCCAAGGCGTTGTTTACTGCAAGTCTTGCAAGTACCCTGGCGTCGATACCCTTCTCGGAGCTAAACCCCTTTCCGGTGCTACAGTGAAGCTATCATGCAA
GAACACCAAGTACGCTCCGACAGTGGAAACCGCCACCAGCGACAAGAATGGTTACTTCCGGCTAGCGGCGCCGAAGAATGTGACGAGCTACGCATTCCACCGCTGCAAGG
TTTATCTGGTGAAGTCGCCGGATAGCAGTTGCAGTAAGGCGTCGAAACTGAACGGCGGAGAGGACGGAGCGGAGTTGAAGCCGGCGAGGGCATTCACGGATGACGAGAAA
AAGCCTGTTGTGCTTTACAATGTTGGACCATTGGCTTTTGAACCCACCTGCAGTCATTGA
mRNA sequence
ATGGCTTCTGTTACTGCCAACGTCTCTTTCCTTCTTCTTCTCCTCTGCTGCAGCCTCAATGCCTTCCAAACCGCCGCATACCACGGAACCCCTGCTCCGGCACCAAGTCA
CCACGACGGCGGCCACCAGCCAGTAGCTGCCCCAACTCCAAGCTTCCACCACCACCACCACCACCACCCGCCGAGCCAGTCTCCGGTTTCCCACCACCACCATCCTCATT
CCCCTTCCCCTGCTCCTTCCCCTGTTTACCCACCTCCCCCTCCGGCTCACTACGCCCCGGTTCCGTCACCGGTTCGACCACCCAAACCGTCCTCCTACGTCCCAAGAAGC
TTCGTTGAAGTCCAAGGCGTTGTTTACTGCAAGTCTTGCAAGTACCCTGGCGTCGATACCCTTCTCGGAGCTAAACCCCTTTCCGGTGCTACAGTGAAGCTATCATGCAA
GAACACCAAGTACGCTCCGACAGTGGAAACCGCCACCAGCGACAAGAATGGTTACTTCCGGCTAGCGGCGCCGAAGAATGTGACGAGCTACGCATTCCACCGCTGCAAGG
TTTATCTGGTGAAGTCGCCGGATAGCAGTTGCAGTAAGGCGTCGAAACTGAACGGCGGAGAGGACGGAGCGGAGTTGAAGCCGGCGAGGGCATTCACGGATGACGAGAAA
AAGCCTGTTGTGCTTTACAATGTTGGACCATTGGCTTTTGAACCCACCTGCAGTCATTGA
Protein sequence
MASVTANVSFLLLLLCCSLNAFQTAAYHGTPAPAPSHHDGGHQPVAAPTPSFHHHHHHHPPSQSPVSHHHHPHSPSPAPSPVYPPPPPAHYAPVPSPVRPPKPSSYVPRS
FVEVQGVVYCKSCKYPGVDTLLGAKPLSGATVKLSCKNTKYAPTVETATSDKNGYFRLAAPKNVTSYAFHRCKVYLVKSPDSSCSKASKLNGGEDGAELKPARAFTDDEK
KPVVLYNVGPLAFEPTCSH
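
The protein sequence can be checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first 18 and last 12 nucleotides of the CDS listed above as short excerpts. `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the start and end of the CDS shown above.
print(translate("ATGGCTTCTGTTACTGCC"))
print(translate("TGCAGTCATTGA"))
```

The two excerpts translate to the first six residues (MASVTA) and the last three residues (CSH) of the listed protein. Passing the full 720-nucleotide CDS as one string is expected to reproduce the 239-residue protein sequence.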