; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc01g31780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc01g31780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionnon-classical arabinogalactan protein 31-like
Genome locationchr1:22366602..22367984
RNA-Seq ExpressionMoc01g31780
SyntenyMoc01g31780
Gene Ontology termsGO:0071944 - cell periphery (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594600.1 Non-classical arabinogalactan protein 30, partial [Cucurbita argyrosperma subsp. sororia]5.4e-10772.33Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHH     HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAA----------------------------PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ A                            PPT APAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAA----------------------------PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_022926154.1 non-classical arabinogalactan protein 31-like [Cucurbita moschata]1.4e-10777.12Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH
        M S + K    LLF AIF VS ADD TAAETLA  PHH AAV    PPAHHHHH       H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH

Query:  PPVNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF
        PP    PP HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYF
Subjt:  PPVNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF

Query:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        FITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

XP_022926436.1 non-classical arabinogalactan protein 30-like isoform X1 [Cucurbita moschata]3.5e-10672.33Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHH     HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ A     PPTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_022926437.1 non-classical arabinogalactan protein 30-like isoform X2 [Cucurbita moschata]3.7e-10877.98Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHH     HAPKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD
        PPVN A     PPT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD
Subjt:  PPVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD

Query:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
         NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_023003678.1 non-classical arabinogalactan protein 31-like [Cucurbita maxima]1.8e-10771.1Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH-------------
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHHHHHHH    H PKPAPV+PPTH PVH PAQPP HH             
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH-------------

Query:  -------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD
                           HHH+VH QPPVHPPVN APP H      PAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVD
Subjt:  -------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD

Query:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP
        TL GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGP
Subjt:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP

Query:  FAFEPTCP
        FAFEPTCP
Subjt:  FAFEPTCP

TrEMBL top hitse value%identityAlignment
A0A6J1EEB1 non-classical arabinogalactan protein 31-like6.8e-10877.12Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH
        M S + K    LLF AIF VS ADD TAAETLA  PHH AAV    PPAHHHHH       H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH

Query:  PPVNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF
        PP    PP HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYF
Subjt:  PPVNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYF

Query:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        FITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  FITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1EEG4 non-classical arabinogalactan protein 30-like isoform X21.8e-10877.98Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHH     HAPKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD
        PPVN A     PPT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD
Subjt:  PPVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTD

Query:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
         NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  KNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

A0A6J1EEX2 non-classical arabinogalactan protein 30-like isoform X11.7e-10672.33Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHH+HHH     HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVH

Query:  PPVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV
        PPV+ A     PPTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPV
Subjt:  PPVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPV

Query:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        AGASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  AGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

A0A6J1IR37 non-classical arabinogalactan protein 31-like6.0e-10475.27Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH
        M S + K    LLF AIF    ADD TAAETLA  PHH AAV    PP HHHHH       H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVH
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVH

Query:  PPVNAAPPT----HAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK
        PP +  PPT    HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDK
Subjt:  PPVNAAPPT----HAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK

Query:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        NGYFFITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1KXA6 non-classical arabinogalactan protein 31-like8.9e-10871.1Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH-------------
        MD A+SK    LL  AIF VS A+D TAAETLA SPHH AAVP + PPAHHHHHHHH    H PKPAPV+PPTH PVH PAQPP HH             
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH-------------

Query:  -------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD
                           HHH+VH QPPVHPPVN APP H      PAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVD
Subjt:  -------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVD

Query:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP
        TL GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGP
Subjt:  TLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGP

Query:  FAFEPTCP
        FAFEPTCP
Subjt:  FAFEPTCP

SwissProt top hitse value%identityAlignment
P93013 Non-classical arabinogalactan protein 305.2e-2838.63Show/hide
Query:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR
        P S  P H    H   PP+  P   P   P   P + PA+ P+         + P  PP  A  P   P        + P+ PP     P+YPPK    +
Subjt:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR

Query:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---
        + ++V+GVVYCK+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+   
Subjt:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---

Query:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
          L+P  S        + +Y+VGPFAFEPTCP+
Subjt:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

Q03211 Pistil-specific extensin-like protein1.6e-1637.79Show/hide
Query:  PHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPV---------------APPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPA
        P  SA  P   PPA         PPV AP P+P                +P T  P   P  PP       +   PPV +PPV        A PP  AP 
Subjt:  PHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPV---------------APPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPA

Query:  PGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFH
        P       +P   P   A P+  P P + +  I V G+VYCKSC   GV TLL A+ + GA VKLIC   K  +VQ  TTD  G F I  PK++T+    
Subjt:  PGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFH

Query:  KCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSYIDANKLPFV-----LYSVGPFAFE
        KCKV L  SP+  C+ P+  +GG SG    PL P K  I    +P       LY VGPF FE
Subjt:  KCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSYIDANKLPFV-----LYSVGPFAFE

Q9FZA2 Non-classical arabinogalactan protein 311.0e-3947.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

Arabidopsis top hitse value%identityAlignment
AT1G28290.1 arabinogalactan protein 317.1e-4147.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

AT1G28290.2 arabinogalactan protein 311.1e-3838.63Show/hide
Query:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSP---HHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPP
        VS   L    S++FT    +  T   +LA +P   HH    P   PP  HHHH H HP  H P  +PV PP  +PV  PA+PPV         +PPV+PP
Subjt:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSP---HHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPP

Query:  VNA--APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCK
          A   PPT  P              P  +     P+ PPT           +  P+YPP                       P+  RS ++V+G VYCK
Subjt:  VNA--APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCK

Query:  SCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDA
        SCKYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  
Subjt:  SCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDA

Query:  NKLPFVLYSVGPFAFEPTCPR
        NKL + L++VGPFAF P+CP+
Subjt:  NKLPFVLYSVGPFAFEPTCPR

AT2G33790.1 arabinogalactan protein 303.7e-2938.63Show/hide
Query:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR
        P S  P H    H   PP+  P   P   P   P + PA+ P+         + P  PP  A  P   P        + P+ PP     P+YPPK    +
Subjt:  PSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPR

Query:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---
        + ++V+GVVYCK+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+   
Subjt:  SFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---

Query:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR
          L+P  S        + +Y+VGPFAFEPTCP+
Subjt:  -PLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein4.0e-3651.7Show/hide
Query:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA
        SA+P+ PP    +M R  ++V+G+VYCKSCKY+GVDTLL A+P+ GA+VKL C NTK  +     TDKNGYFF+ APK +T+YAFH C+    +  P+ A
Subjt:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA

Query:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
           C+ PS L+ G++GA L+P K+ I+  +  +VL+SVGPFAFEP C
Subjt:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

AT3G62680.1 proline-rich protein 33.3e-0627.42Show/hide
Query:  LLLLLFSAIFTVSAADDFT-AAETLAVSPHHSAAVPS--------------------SLPPAHHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-
        L + L  ++ T++ AD ++ ++  +  SP H   +PS                    ++PP  +    + H   PPV+     P PV  PP + P   P 
Subjt:  LLLLLFSAIFTVSAADDFT-AAETLAVSPHHSAAVPS--------------------SLPPAHHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-

Query:  ------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---HHHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLL
                PPV+    +    V+ +P + PPV   PP + P P    +     +  PPP +   P Y P  KP +P    +V G++ CK+    G +T  
Subjt:  ------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---HHHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLL

Query:  GATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPF
           P+ GA ++++C         NT+  ++ S  TD  GYF ++   +I   A+  C+V L  SP   C  P+ ++ GL+G PL    ++ Y D N    
Subjt:  GATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPF

Query:  VLYSVGPFAF
         L+SVGPF +
Subjt:  VLYSVGPFAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGTTTCC
CCCCACCATTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCT
CCACCGACCCACTCGCCGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCC
CCTCCGACTCACGCGCCGGCGCCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCGGCTCCGATCTACCCTCCGAAGCCTCGGATG
CCGAGGAGCTTCATCTCCGTTCAAGGCGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTG
AAGCTAATTTGCCAGAACACGAAATACCCACTTGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCC
TTCCACAAGTGCAAGGTCCTGCTCTCGGCCTCCCCCTCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAG
TCTTACATCGATGCCAACAAACTCCCCTTCGTCCTCTACTCCGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGTTTCC
CCCCACCATTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCT
CCACCGACCCACTCGCCGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCC
CCTCCGACTCACGCGCCGGCGCCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCGGCTCCGATCTACCCTCCGAAGCCTCGGATG
CCGAGGAGCTTCATCTCCGTTCAAGGCGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTG
AAGCTAATTTGCCAGAACACGAAATACCCACTTGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCC
TTCCACAAGTGCAAGGTCCTGCTCTCGGCCTCCCCCTCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAG
TCTTACATCGATGCCAACAAACTCCCCTTCGTCCTCTACTCCGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGCTAG
Protein sequenceShow/hide protein sequence
MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHHSAAVPSSLPPAHHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAA
PPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYA
FHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR