; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC01g1416 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC01g1416
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionnon-classical arabinogalactan protein 31-like
Genome locationMC01:18766079..18767902
RNA-Seq ExpressionMC01g1416
SyntenyMC01g1416
Gene Ontology termsGO:0071944 - cell periphery (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594600.1 Non-classical arabinogalactan protein 30, partial [Cucurbita argyrosperma subsp. sororia]8.15e-13772.09Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHHH     APKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAAP----------------------------PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ AP                            PT APAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAAP----------------------------PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

Query:  R
         
Subjt:  R

XP_022926154.1 non-classical arabinogalactan protein 31-like [Cucurbita moschata]6.03e-13877.12Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH
        M S + K    LLF AIF VS ADD TAAETLA  PHH AAVP    PAHHHHHH       +P PAPVAPPTH P+H PAQPP HHHHHH +H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH

Query:  PPVNAAPPTHAPAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF
        PP    PP HAPAPGHHH  DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYF
Subjt:  PPVNAAPPTHAPAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF

Query:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        FITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

XP_022926436.1 non-classical arabinogalactan protein 30-like isoform X1 [Cucurbita moschata]9.47e-13672.09Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHHH     APKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAAP-----PTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ AP     PTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAAP-----PTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

Query:  R
         
Subjt:  R

XP_022926437.1 non-classical arabinogalactan protein 30-like isoform X2 [Cucurbita moschata]1.09e-13877.7Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHHH     APKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAAP-----PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD
        PPVN AP     PT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD
Subjt:  PPVNAAP-----PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD

Query:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
         NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP 
Subjt:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

XP_023003678.1 non-classical arabinogalactan protein 31-like [Cucurbita maxima]2.76e-13770.87Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHH----------
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHHHHHHHH     PKPAPV+PPTH PVH PAQPP HHHHH          
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHH----------

Query:  ----------------------HVHGQPPVHPPVNAAPPTHAP-----APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD
                              +VH QPPVHPPVN APP H P     APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVD
Subjt:  ----------------------HVHGQPPVHPPVNAAPPTHAP-----APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD

Query:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP
        TL GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGP
Subjt:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP

Query:  FAFEPTCPR
        FAFEPTCP 
Subjt:  FAFEPTCPR

TrEMBL top hitse value%identityAlignment
A0A6J1EEB1 non-classical arabinogalactan protein 31-like2.92e-13877.12Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH
        M S + K    LLF AIF VS ADD TAAETLA  PHH AAVP    PAHHHHHH       +P PAPVAPPTH P+H PAQPP HHHHHH +H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH

Query:  PPVNAAPPTHAPAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF
        PP    PP HAPAPGHHH  DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYF
Subjt:  PPVNAAPPTHAPAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF

Query:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        FITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1EEG4 non-classical arabinogalactan protein 30-like isoform X25.27e-13977.7Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHHH     APKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAAP-----PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD
        PPVN AP     PT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD
Subjt:  PPVNAAP-----PTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD

Query:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
         NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP 
Subjt:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

A0A6J1EEX2 non-classical arabinogalactan protein 30-like isoform X14.59e-13672.09Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHHH     APKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAAP-----PTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ AP     PTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAAP-----PTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

Query:  R
         
Subjt:  R

A0A6J1IR37 non-classical arabinogalactan protein 31-like6.20e-13375.27Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH
        M S + K    LLF AIF    ADD TAAETLA  PHH AAVP    P HHHHHH       +P PAPVAPPTH P+H PAQPP HHHHHH +H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHH-VHGQPPVH

Query:  PPVNAAPPTH----APAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK
        PP +  PPTH    APAPGHHH  DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDK
Subjt:  PPVNAAPPTH----APAPGHHHR-DVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK

Query:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        NGYFFITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1KXA6 non-classical arabinogalactan protein 31-like1.33e-13770.87Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHH----------
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHHHHHHHH     PKPAPV+PPTH PVH PAQPP HHHHH          
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHH----------

Query:  ----------------------HVHGQPPVHPPVNAAPPTHAP-----APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD
                              +VH QPPVHPPVN APP H P     APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVD
Subjt:  ----------------------HVHGQPPVHPPVNAAPPTHAP-----APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD

Query:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP
        TL GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGP
Subjt:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP

Query:  FAFEPTCPR
        FAFEPTCP 
Subjt:  FAFEPTCPR

SwissProt top hitse value%identityAlignment
P93013 Non-classical arabinogalactan protein 305.2e-2838.63Show/hide
Query:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR
        P S  P H    H   PP+  P   P   P   P + PA+ P+         + P  PP  A  P   P        + P+ PP     P+YPPK    +
Subjt:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR

Query:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---
        + ++V+GVVYCK+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+   
Subjt:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---

Query:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
          L+P  S        + +Y+VGPFAFEPTCP+
Subjt:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

Q03211 Pistil-specific extensin-like protein1.6e-1637.79Show/hide
Query:  PHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPV---------------APPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPA
        P  SA  P   PPA         PPV AP P+P                +P T  P   P  PP       +   PPV +PPV        A PP  AP 
Subjt:  PHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPV---------------APPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPA

Query:  PGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFH
        P       +P   P   A P+  P P + +  I V G+VYCKSC   GV TLL A+ + GA VKLIC   K  +VQ  TTD  G F I  PK++T+    
Subjt:  PGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFH

Query:  KCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSYIDANKLPFV-----LYSVGPFAFE
        KCKV L  SP+  C+ P+  +GG SG    PL P K  I    +P       LY VGPF FE
Subjt:  KCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSYIDANKLPFV-----LYSVGPFAFE

Q9FZA2 Non-classical arabinogalactan protein 311.0e-3947.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

Arabidopsis top hitse value%identityAlignment
AT1G28290.1 arabinogalactan protein 317.1e-4147.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

AT1G28290.2 arabinogalactan protein 311.1e-3838.63Show/hide
Query:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSP---HHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPP
        VS   L    S++FT    +  T   +LA +P   HH    P   PP  HHHH H HP  H P  +PV PP  +PV  PA+PPV         +PPV+PP
Subjt:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSP---HHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPP

Query:  VNA--APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCK
          A   PPT  P              P  +     P+ PPT           +  P+YPP                       P+  RS ++V+G VYCK
Subjt:  VNA--APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCK

Query:  SCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDA
        SCKYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  
Subjt:  SCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDA

Query:  NKLPFVLYSVGPFAFEPTCPR
        NKL + L++VGPFAF P+CP+
Subjt:  NKLPFVLYSVGPFAFEPTCPR

AT2G33790.1 arabinogalactan protein 303.7e-2938.63Show/hide
Query:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR
        P S  P H    H   PP+  P   P   P   P + PA+ P+         + P  PP  A  P   P        + P+ PP     P+YPPK    +
Subjt:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR

Query:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---
        + ++V+GVVYCK+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+   
Subjt:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---

Query:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
          L+P  S        + +Y+VGPFAFEPTCP+
Subjt:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein4.0e-3651.7Show/hide
Query:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA
        SA+P+ PP    +M R  ++V+G+VYCKSCKY+GVDTLL A+P+ GA+VKL C NTK  +     TDKNGYFF+ APK +T+YAFH C+    +  P+ A
Subjt:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA

Query:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
           C+ PS L+ G++GA L+P K+ I+  +  +VL+SVGPFAFEP C
Subjt:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

AT3G62680.1 proline-rich protein 33.3e-0627.42Show/hide
Query:  LLLLLFSAIFTVSAADDFT-AAETLAVSPHHSAAVPS--------------------SLPPAHHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-
        L + L  ++ T++ AD ++ ++  +  SP H   +PS                    ++PP  +    + H   PPV+     P PV  PP + P   P 
Subjt:  LLLLLFSAIFTVSAADDFT-AAETLAVSPHHSAAVPS--------------------SLPPAHHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-

Query:  ------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---HHHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLL
                PPV+    +    V+ +P + PPV   PP + P P    +     +  PPP +   P Y P  KP +P    +V G++ CK+    G +T  
Subjt:  ------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---HHHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLL

Query:  GATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPF
           P+ GA ++++C         NT+  ++ S  TD  GYF ++   +I   A+  C+V L  SP   C  P+ ++ GL+G PL    ++ Y D N    
Subjt:  GATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPF

Query:  VLYSVGPFAF
         L+SVGPF +
Subjt:  VLYSVGPFAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGTTTCCCCCCA
CCATTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCTCCACCGACCC
ACTCGCCGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCCCCTCCGACTCACGCG
CCGGCGCCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCGGCTCCGATCTACCCTCCGAAGCCTCGGATGCCGAGGAGCTTCATCTCCGT
TCAAGGCGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTGAAGCTAATTTGCCAGAACACGAAAT
ACCCACTTGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCCTTCCACAAGTGCAAGGTCCTGCTCTCGGCC
TCCCCCTCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAGTCTTACATCGATGCCAACAAACTCCCCTTCGTCCT
CTACTCCGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGCTAG
mRNA sequenceShow/hide mRNA sequence
CCGAAAAGAATAAGCTGGATAATGGAGACAGAAAAGGGAAAAAAAGACACAATGATAATTTTGTCATTATATAAGAAAAATGATATTGTGAAAATTACAAATAAACTAAT
TTTTTTTAAAATATAATTGTAGAAAAAGAGCTACAGTACATTATGCAGAAGCTGCTATATATTTTTGCCCTAAACTCTGTAAATGTCGTCCTCTACCTTCTAATTTTCAC
AGCTCAGTTATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGT
TTCCCCCCACCATTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCTC
CACCGACCCACTCGCCGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCCCCTCCG
ACTCACGCGCCGGCGCCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCGGCTCCGATCTACCCTCCGAAGCCTCGGATGCCGAGGAGCTT
CATCTCCGTTCAAGGCGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTGAAGCTAATTTGCCAGA
ACACGAAATACCCACTTGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCCTTCCACAAGTGCAAGGTCCTG
CTCTCGGCCTCCCCCTCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAGTCTTACATCGATGCCAACAAACTCCC
CTTCGTCCTCTACTCCGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGCTAGAAACACTTTCTCTCTATCCAAACGCTGTCGTCTCGTTTCGGCCTTCTCTCTGTGG
CTTGCCTTGTTCTCACTTGTTTAAATAAGTTACATTTTAGGTTTTTCTTATTTCAAATCACTTCTTATGAACTTATTGTCTTCTGATTTCGGTTTTTTTCGCTGCACTGC
ATTTATATTTTACTCTTTTAATTTGGAGCATGTTTCTCAATCCTGGT
Protein sequenceShow/hide protein sequence
MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHA
PAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSA
SPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR