; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS010868 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS010868
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionnon-classical arabinogalactan protein 31-like
Genome locationscaffold35:2625950..2627448
RNA-Seq ExpressionMS010868
SyntenyMS010868
Gene Ontology termsGO:0071944 - cell periphery (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6594600.1 Non-classical arabinogalactan protein 30, partial [Cucurbita argyrosperma subsp. sororia]2.9e-10572.24Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHH+HHH    HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVHP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP

Query:  PVNAA----------------------------PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA
        PV+ A                            PPT APAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVA
Subjt:  PVNAA----------------------------PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA

Query:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        GASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_022926154.1 non-classical arabinogalactan protein 31-like [Cucurbita moschata]2.0e-10676.58Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP
        M S + K    LLF AIF VS ADD TAAETLA  PH  A   ++PPAHHHHH      H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVHPP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP

Query:  VNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFI
            PP HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYFFI
Subjt:  VNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFI

Query:  TAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        TAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  TAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

XP_022926436.1 non-classical arabinogalactan protein 30-like isoform X1 [Cucurbita moschata]1.9e-10472.24Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHH+HHH    HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVHP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP

Query:  PVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA
        PV+ A     PPTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVA
Subjt:  PVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA

Query:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        GASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_022926437.1 non-classical arabinogalactan protein 30-like isoform X2 [Cucurbita moschata]2.0e-10677.9Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHH+HHH    HAPKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVHP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP

Query:  PVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK
        PVN A     PPT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD 
Subjt:  PVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK

Query:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

XP_023003678.1 non-classical arabinogalactan protein 31-like [Cucurbita maxima]1.0e-10571.01Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH--------------
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHHHHHHH   H PKPAPV+PPTH PVH PAQPP HH              
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH--------------

Query:  ------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDT
                          HHH+VH QPPVHPPVN APP H      PAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDT
Subjt:  ------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDT

Query:  LLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPF
        L GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPF
Subjt:  LLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPF

Query:  AFEPTCP
        AFEPTCP
Subjt:  AFEPTCP

TrEMBL top hitse value%identityAlignment
A0A6J1EEB1 non-classical arabinogalactan protein 31-like9.8e-10776.58Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP
        M S + K    LLF AIF VS ADD TAAETLA  PH  A   ++PPAHHHHH      H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVHPP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP

Query:  VNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFI
            PP HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNGYFFI
Subjt:  VNAAPPTHAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFI

Query:  TAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        TAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  TAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1EEG4 non-classical arabinogalactan protein 30-like isoform X29.8e-10777.9Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHH+HHH    HAPKPAPV+PPTH PVH PAQPP H HHHHHVH  PPVHP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP

Query:  PVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK
        PVN A     PPT A APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVAGASVKLICQNTK+PLVQ+ TTD 
Subjt:  PVNAA-----PPTHAPAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDK

Query:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  NGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

A0A6J1EEX2 non-classical arabinogalactan protein 30-like isoform X19.2e-10572.24Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHH+HHH    HAPKPAPV+PPTH PVH PAQPP H HHHHHVH QPPVHP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVH-HHHHHVHGQPPVHP

Query:  PVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA
        PV+ A     PPTH P                       APGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDTL GATPVA
Subjt:  PVNAA-----PPTHAP-----------------------APGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVA

Query:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP
        GASVKLICQNTK+PLVQ+ TTD NGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPFAFEPTCP
Subjt:  GASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCP

A0A6J1IR37 non-classical arabinogalactan protein 31-like8.6e-10374.73Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP
        M S + K    LLF AIF    ADD TAAETLA  PH  A   ++PP HHHHH      H+P PAPVAPPTH P+H PAQPP  HHHHHH+H QPPVHPP
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPV-HHHHHHVHGQPPVHPP

Query:  VNAAPPT----HAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNG
         +  PPT    HAPAPG HHH DVHPLPPPTHS AP++PPKPR+ RSFISVQGVVYCKSCKYAGVDTLLGAT VAGA+VKLICQNTKYPLVQ+ TTDKNG
Subjt:  VNAAPPT----HAPAPG-HHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNG

Query:  YFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
        YFFITAPKA+TSYAFHKCKV+L +SP+ +CSKPSALHGG +GAPLRP KSYIDANKLP+VLYSVGPFAFEPTC
Subjt:  YFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

A0A6J1KXA6 non-classical arabinogalactan protein 31-like4.9e-10671.01Show/hide
Query:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH--------------
        MD A+SK    LL  AIF VS A+D TAAETLA SPH  AAVP + PPAHHHHHHHH   H PKPAPV+PPTH PVH PAQPP HH              
Subjt:  MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHS-AAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHH--------------

Query:  ------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDT
                          HHH+VH QPPVHPPVN APP H      PAPGH HHRDV PL PP HS +P+YPPKPR+ RSFISVQGVVYCKSCKYAGVDT
Subjt:  ------------------HHHHVHGQPPVHPPVNAAPPTH-----APAPGH-HHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDT

Query:  LLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPF
        L GATPVAGASVKLICQNTKYPLVQ+ TTDKNGYFFI APKAITSYAFHKCKVLL +SP AACSKPSALHGG +GA L+P KSYIDANKLPFVLYSVGPF
Subjt:  LLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPF

Query:  AFEPTCP
        AFEPTCP
Subjt:  AFEPTCP

SwissProt top hitse value%identityAlignment
P93013 Non-classical arabinogalactan protein 303.5e-2940.09Show/hide
Query:  HHPPVHAPKP---APVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYC
        H  P H P P    P  PP  +P+  PA PP          + P+  P    PP  AP        + P+ PP     P+YPPK    ++ ++V+GVVYC
Subjt:  HHPPVHAPKP---APVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYC

Query:  KSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA----PLRPHKSYID
        K+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+     L+P  S   
Subjt:  KSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA----PLRPHKSYID

Query:  ANKLPFVLYSVGPFAFEPTCPR
             + +Y+VGPFAFEPTCP+
Subjt:  ANKLPFVLYSVGPFAFEPTCPR

Q03211 Pistil-specific extensin-like protein1.7e-1840.62Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGV
        PPV AP P+P   P   P   P  PP       +   PPV +PPV        A PP  AP P       +P   P   A P+  P P + +  I V G+
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPV-HPPV-------NAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGV

Query:  VYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSY
        VYCKSC   GV TLL A+ + GA VKLIC   K  +VQ  TTD  G F I  PK++T+    KCKV L  SP+  C+ P+  +GG SG    PL P K  
Subjt:  VYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA---PLRPHKSY

Query:  IDANKLPFV-----LYSVGPFAFE
        I    +P       LY VGPF FE
Subjt:  IDANKLPFV-----LYSVGPFAFE

Q9FZA2 Non-classical arabinogalactan protein 311.0e-3947.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

Arabidopsis top hitse value%identityAlignment
AT1G28290.1 arabinogalactan protein 317.1e-4147.49Show/hide
Query:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC
        PPV+ P  APV PPT  PV  P  PP          +PPV PPV   PPT AP          P+ PPT      P+YPPK    RS ++V+G VYCKSC
Subjt:  PPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHS--AAPIYPPKPRMPRSFISVQGVVYCKSC

Query:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK
        KYA  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NK
Subjt:  KYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANK

Query:  LPFVLYSVGPFAFEPTCPR
        L + L++VGPFAF P+CP+
Subjt:  LPFVLYSVGPFAFEPTCPR

AT1G28290.2 arabinogalactan protein 311.1e-3838.17Show/hide
Query:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLP-PAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNA-
        VS   L    S++FT    +  T   +LA +P         P P HHHH H HP  H P  +PV PP  +PV  PA+PPV         +PPV+PP  A 
Subjt:  VSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLP-PAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNA-

Query:  -APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCKSCKY
          PPT  P              P  +     P+ PPT           +  P+YPP                       P+  RS ++V+G VYCKSCKY
Subjt:  -APPTHAPA-------------PGHHHRDVHPLPPPTH----------SAAPIYPP----------------------KPRMPRSFISVQGVVYCKSCKY

Query:  AGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANKLP
        A  +TLLGA P+ GA+VKL+C++ K  +    TTDKNGYF + APK +T++ F  C+V L  S    CSK S L GG  GA L+P     KS +  NKL 
Subjt:  AGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLRPH----KSYIDANKLP

Query:  FVLYSVGPFAFEPTCPR
        + L++VGPFAF P+CP+
Subjt:  FVLYSVGPFAFEPTCPR

AT2G33790.1 arabinogalactan protein 302.5e-3040.09Show/hide
Query:  HHPPVHAPKP---APVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYC
        H  P H P P    P  PP  +P+  PA PP          + P+  P    PP  AP        + P+ PP     P+YPPK    ++ ++V+GVVYC
Subjt:  HHPPVHAPKP---APVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPAPGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYC

Query:  KSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA----PLRPHKSYID
        K+CKYAGV+ + GA PV  A V+L+C+N K  + ++  TDKNGYF + APK +T+Y    C+  L  SP   CSK S+LH G  G+     L+P  S   
Subjt:  KSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASPSAACSKPSALHGGLSGA----PLRPHKSYID

Query:  ANKLPFVLYSVGPFAFEPTCPR
             + +Y+VGPFAFEPTCP+
Subjt:  ANKLPFVLYSVGPFAFEPTCPR

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein4.0e-3651.7Show/hide
Query:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA
        SA+P+ PP    +M R  ++V+G+VYCKSCKY+GVDTLL A+P+ GA+VKL C NTK  +     TDKNGYFF+ APK +T+YAFH C+    +  P+ A
Subjt:  SAAPIYPPK--PRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVL-LSASPSAA

Query:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC
           C+ PS L+ G++GA L+P K+ I+  +  +VL+SVGPFAFEP C
Subjt:  ---CSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTC

AT3G62680.1 proline-rich protein 31.3e-0729.77Show/hide
Query:  VSPHSAAVPSSLPPAHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---H
        +SP     P+  PP +    + H   PPV+     P PV  PP + P   P         PPV+    +    V+ +P + PPV   PP + P P    +
Subjt:  VSPHSAAVPSSLPPAHH---HHHHHHPPVHAPK--PAPV-APPTHSPVHHP-------AQPPVH----HHHHHVHGQPPVHPPVNAAPPTHAPAPG---H

Query:  HHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKA
             +  PPP +   P Y P  KP +P    +V G++ CK+    G +T     P+ GA ++++C         NT+  ++ S  TD  GYF ++   +
Subjt:  HHRDVHPLPPPTHSAAPIYPP--KPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLIC--------QNTKYPLVQSGTTDKNGYFFITAPKA

Query:  ITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPFVLYSVGPFAF
        I   A+  C+V L  SP   C  P+ ++ GL+G PL    ++ Y D N     L+SVGPF +
Subjt:  ITSYAFHKCKVLLSASPSAACSKPSALHGGLSGAPLR--PHKSYIDANKLPFVLYSVGPFAF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGTTTCCCCCCA
TTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCTCCACCGACCCACTCGC
CGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCCCCTCCGACTCACGCGCCGGCG
CCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCTGCTCCGATCTACCCTCCGAAGCCTCGGATGCCGAGGAGCTTCATCTCCGTTCAAGG
CGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTGAAGCTAATTTGCCAGAACACGAAATACCCAC
TGGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCCTTCCACAAGTGCAAGGTCCTGCTCTCGGCCTCCCCC
TCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAGTCTTACATCGATGCCAACAAACTCCCCTTCGTCCTCTACTC
CGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGC
mRNA sequenceShow/hide mRNA sequence
ATGGATTCCGCTGTCTCCAAATTTCTCCTCCTCCTCCTCTTCTCCGCCATTTTCACCGTCTCCGCCGCCGACGATTTCACGGCGGCGGAAACACTCGCCGTTTCCCCCCA
TTCCGCCGCCGTGCCGTCGTCATTACCTCCTGCCCACCATCACCACCACCACCACCACCCTCCGGTTCATGCTCCGAAGCCGGCACCCGTGGCTCCACCGACCCACTCGC
CGGTTCACCATCCGGCTCAACCGCCGGTTCACCACCACCACCACCATGTCCACGGCCAGCCTCCAGTTCACCCACCTGTAAATGCAGCCCCTCCGACTCACGCGCCGGCG
CCCGGCCACCACCACCGCGACGTCCACCCATTGCCGCCCCCGACTCACTCGGCTGCTCCGATCTACCCTCCGAAGCCTCGGATGCCGAGGAGCTTCATCTCCGTTCAAGG
CGTTGTTTATTGTAAGTCCTGCAAGTACGCCGGAGTCGACACCCTCCTCGGAGCCACCCCAGTCGCCGGTGCGAGCGTGAAGCTAATTTGCCAGAACACGAAATACCCAC
TGGTCCAGAGCGGGACGACGGACAAGAACGGCTACTTCTTCATCACAGCGCCCAAGGCCATAACCAGCTACGCCTTCCACAAGTGCAAGGTCCTGCTCTCGGCCTCCCCC
TCCGCCGCCTGCAGCAAACCCTCCGCCCTCCACGGCGGCCTCTCCGGCGCCCCCCTCAGGCCTCACAAGTCTTACATCGATGCCAACAAACTCCCCTTCGTCCTCTACTC
CGTCGGCCCTTTTGCCTTCGAACCCACTTGCCCTCGC
Protein sequenceShow/hide protein sequence
MDSAVSKFLLLLLFSAIFTVSAADDFTAAETLAVSPHSAAVPSSLPPAHHHHHHHHPPVHAPKPAPVAPPTHSPVHHPAQPPVHHHHHHVHGQPPVHPPVNAAPPTHAPA
PGHHHRDVHPLPPPTHSAAPIYPPKPRMPRSFISVQGVVYCKSCKYAGVDTLLGATPVAGASVKLICQNTKYPLVQSGTTDKNGYFFITAPKAITSYAFHKCKVLLSASP
SAACSKPSALHGGLSGAPLRPHKSYIDANKLPFVLYSVGPFAFEPTCPR