; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G17210 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G17210
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Descriptionlysine-rich arabinogalactan protein 18-like
Genome locationClcChr01:29951411..29953589
RNA-Seq ExpressionClc01G17210
SyntenyClc01G17210
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR044981 - Lysine-rich arabinogalactan protein AGP9/17/18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595001.1 hypothetical protein SDJN03_11554, partial [Cucurbita argyrosperma subsp. sororia]2.7e-8183.4Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALV+ICAVVAGVGGQSPAAAPTTTP    P AAKY +PAASPV PPTNSSPAAAPQKP TPAPVST     PPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVP +SPP ATVPASSPPVPVP SSPPVPVP SSPPVPTP ESPPAPE++PPAPVASPP EVP+PAPSKKKSKKHKAPAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF
        PG SEEGPSP+PSL+DKSGAE L  N+QKV GSLA G++A   SF+F
Subjt:  PGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF

NP_001267612.1 arabinogalactan protein precursor [Cucumis sativus]3.4e-9291.46Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTP ATPP AA Y  PAASPVS PTN SPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPV VPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF
        P  SEEGP+PSPSLEDKSGAE    L KVAGSLA GWAAVAVS IF
Subjt:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF

TYK12844.1 lysine-rich arabinogalactan protein 18-like [Cucumis melo var. makuwa]1.9e-8294.47Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVA VGGQSPAAAPTTTP ATPP AAKY  PAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPP EVPAPAPS KKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDK
        P  SEEGPSPSPSLEDK
Subjt:  PGASEEGPSPSPSLEDK

XP_008440341.2 PREDICTED: lysine-rich arabinogalactan protein 18-like [Cucumis melo]2.6e-9292.28Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVA VGGQSPAAAPTTTP ATPP AAKY  PAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPP EVPAPAPS KKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF
        P  SEEGPSPSPSLEDKSGAE    L KVAGSLA GWAAVAVS IF
Subjt:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF

XP_023003133.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]4.6e-8181.64Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTP    P AAKY  PAASPV PPTNSSPAAAPQKP TPAPVST     PPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPA---------ATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALL
        SSPPAASVP +SPPA         ATVPASSPPVPVP SSPPVPVP SSPPVPTP ESPPAPES+PPAPVASPP EVP+PAPSKKKSKKHKAPAPSPALL
Subjt:  SSPPAASVPPTSPPA---------ATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALL

Query:  GPPAPPSEAPGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF
        GPPAPPSEAPG SEEGPSP+PSL+DKSGAE L  N+QKV GSLA G++A   SF+F
Subjt:  GPPAPPSEAPGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF

TrEMBL top hitse value%identityAlignment
A0A1S3B0X7 lysine-rich arabinogalactan protein 18-like1.3e-9292.28Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVA VGGQSPAAAPTTTP ATPP AAKY  PAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPP EVPAPAPS KKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF
        P  SEEGPSPSPSLEDKSGAE    L KVAGSLA GWAAVAVS IF
Subjt:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF

A0A5D3CP79 Lysine-rich arabinogalactan protein 18-like9.0e-8394.47Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVA VGGQSPAAAPTTTP ATPP AAKY  PAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPP EVPAPAPS KKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDK
        P  SEEGPSPSPSLEDK
Subjt:  PGASEEGPSPSPSLEDK

A0A6J1HFT1 lysine-rich arabinogalactan protein 18-like3.2e-8083Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTP    P AAK   PAASPV PPTNSSPAAAPQKP TPAPVST     PPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVP +SPP ATVPASSPPVPVP SSPPVPVP SSPPVPTP ESPPAPE++PPAPVASPP EVP+PAPSKKKSKKHKAPAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF
        PG SEEGP+P+PSL+DKSGAE L  N+QKV GSLA G++A   SF+F
Subjt:  PGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF

A0A6J1KVL2 lysine-rich arabinogalactan protein 18-like2.2e-8181.64Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTP    P AAKY  PAASPV PPTNSSPAAAPQKP TPAPVST     PPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPA---------ATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALL
        SSPPAASVP +SPPA         ATVPASSPPVPVP SSPPVPVP SSPPVPTP ESPPAPES+PPAPVASPP EVP+PAPSKKKSKKHKAPAPSPALL
Subjt:  SSPPAASVPPTSPPA---------ATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALL

Query:  GPPAPPSEAPGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF
        GPPAPPSEAPG SEEGPSP+PSL+DKSGAE L  N+QKV GSLA G++A   SF+F
Subjt:  GPPAPPSEAPGASEEGPSPSPSLEDKSGAETL-WNLQKVAGSLAFGWAAVAVSFIF

Q9XIV1 Arabinogalactan protein1.6e-9291.46Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTP ATPP AA Y  PAASPVS PTN SPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA
        SSPPAASVPP+SPPAATVPASSPPVPVPVSSPPV VPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKH+APAPSPALLGPPAPPSEA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEA

Query:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF
        P  SEEGP+PSPSLEDKSGAE    L KVAGSLA GWAAVAVS IF
Subjt:  PGASEEGPSPSPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSFIF

SwissProt top hitse value%identityAlignment
O22194 Lysine-rich arabinogalactan protein 171.2e-0736.65Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        M R  ++ + LIC V   VGGQSPA AP                            SP+ +P KP         P S  PA++P A  P S         
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPA--PSKKKSKKHK-APAPSPA--LLGPPA
                  T  PA T      PV  PV +PP P P S+P +  P  SP A   +P AP  +P  +VPAPA    KKK+KKHK APAP PA  LL PPA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPA--PSKKKSKKHK-APAPSPA--LLGPPA

Query:  PPSEAPGASEEGPSP--SPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSF
        PP EAPG    GPS   SP+ +D+SGA+ +  + ++ G+ A  W+ + ++F
Subjt:  PPSEAPGASEEGPSP--SPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSF

Q9C5S0 Classical arabinogalactan protein 95.2e-1142.32Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV
        M R   IA++ I  ++AGV GQ+P + PT TP                  +PPT ++P  A    ATP PVS PP   ++PP V   A PPA+ PP    
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV

Query:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS
        P SSPP AS PP +PP    P +SP  P PV+SPP   P + PPV TP          PPAP+ASPP +VPAPAP+ K      +P+PSP+   PP P S
Subjt:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS

Query:  EAPGASEEGPSPSPS---LEDKSGAETLWNLQKVAGSLAFG
        +APG S +  SP+PS   + D++GA       K+  SL FG
Subjt:  EAPGASEEGPSPSPS---LEDKSGAETLWNLQKVAGSLAFG

Q9FPR2 Lysine-rich arabinogalactan protein 183.5e-2346.61Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        M R  ++ + LIC VVAGVGGQSP ++PT +P  T P+A     P  SP   P  +SP        T AP  TP ASA    +PV SP +  P       
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSK-----KKSKKHK-APAPSPALLGPP
               V  +SPP   VP SSPPVP P+    V  PVSSPPVP P         SPPAPVA+P  +VPAPAPSK     KKSKKH+ APAP+P LLGPP
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSK-----KKSKKHK-APAPSPALLGPP

Query:  APPSEAPGASEEGPSPSPSLEDKSGAETLWNLQKVA-GSLAFGWAAVAVSF
        APP+E+PG + +  SP PS +D+SGA +   L+ VA G++A  WA + ++F
Subjt:  APPSEAPGASEEGPSPSPSLEDKSGAETLWNLQKVA-GSLAFGWAAVAVSF

Q9S740 Lysine-rich arabinogalactan protein 196.6e-0640.16Show/hide
Query:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPA---TPAPVSTPPASAPPAVAPVASPPASTP
        M   S+I +L+L  A+++   V  Q PAA+P T+T  A PP  A   T AA P  P T + P +A Q PA   TP P  TP +   P VAPV SP    P
Subjt:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPA---TPAPVSTPPASAPPAVAPVASPPASTP

Query:  PTASVPTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSP----PAPVASPPVEVPAPAPSKKKSK---KHKAPAP
             P +S P  S PP SPP    PA + P P P S PP P   +SPP P P   PPAP S P    P+P++ PP   PAP   K+K K    H APAP
Subjt:  PTASVPTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSP----PAPVASPPVEVPAPAPSKKKSK---KHKAPAP

Query:  SPALLGPPAPPSEAPGASEEGPSPSP------SLEDKSGAETLW
        +P    PP+PP       +  P+PSP      +L    G   +W
Subjt:  SPALLGPPAPPSEAPGASEEGPSPSP------SLEDKSGAETLW

Arabidopsis top hitse value%identityAlignment
AT1G68725.1 arabinogalactan protein 194.7e-0740.16Show/hide
Query:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPA---TPAPVSTPPASAPPAVAPVASPPASTP
        M   S+I +L+L  A+++   V  Q PAA+P T+T  A PP  A   T AA P  P T + P +A Q PA   TP P  TP +   P VAPV SP    P
Subjt:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPA---TPAPVSTPPASAPPAVAPVASPPASTP

Query:  PTASVPTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSP----PAPVASPPVEVPAPAPSKKKSK---KHKAPAP
             P +S P  S PP SPP    PA + P P P S PP P   +SPP P P   PPAP S P    P+P++ PP   PAP   K+K K    H APAP
Subjt:  PTASVPTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSP----PAPVASPPVEVPAPAPSKKKSK---KHKAPAP

Query:  SPALLGPPAPPSEAPGASEEGPSPSP------SLEDKSGAETLW
        +P    PP+PP       +  P+PSP      +L    G   +W
Subjt:  SPALLGPPAPPSEAPGASEEGPSPSP------SLEDKSGAETLW

AT2G14890.1 arabinogalactan protein 93.7e-1242.32Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV
        M R   IA++ I  ++AGV GQ+P + PT TP                  +PPT ++P  A    ATP PVS PP   ++PP V   A PPA+ PP    
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV

Query:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS
        P SSPP AS PP +PP    P +SP  P PV+SPP   P + PPV TP          PPAP+ASPP +VPAPAP+ K      +P+PSP+   PP P S
Subjt:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS

Query:  EAPGASEEGPSPSPS---LEDKSGAETLWNLQKVAGSLAFG
        +APG S +  SP+PS   + D++GA       K+  SL FG
Subjt:  EAPGASEEGPSPSPS---LEDKSGAETLWNLQKVAGSLAFG

AT2G14890.2 arabinogalactan protein 95.3e-1143.58Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV
        M R   IA++ I  ++AGV GQ+P + PT TP                  +PPT ++P  A    ATP PVS PP   ++PP V   A PPA+ PP    
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPP--ASAPPAVAPVASPPASTPPTASV

Query:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS
        P SSPP AS PP +PP    P +SP  P PV+SPP   P + PPV TP          PPAP+ASPP +VPAPAP+ K      +P+PSP+   PP P S
Subjt:  PTSSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPS

Query:  EAPGASEEGPSPSPSLED
        +APG S +  SP+PS  D
Subjt:  EAPGASEEGPSPSPSLED

AT2G23130.1 arabinogalactan protein 178.5e-0936.65Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        M R  ++ + LIC V   VGGQSPA AP                            SP+ +P KP         P S  PA++P A  P S         
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPA--PSKKKSKKHK-APAPSPA--LLGPPA
                  T  PA T      PV  PV +PP P P S+P +  P  SP A   +P AP  +P  +VPAPA    KKK+KKHK APAP PA  LL PPA
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPA--PSKKKSKKHK-APAPSPA--LLGPPA

Query:  PPSEAPGASEEGPSP--SPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSF
        PP EAPG    GPS   SP+ +D+SGA+ +  + ++ G+ A  W+ + ++F
Subjt:  PPSEAPGASEEGPSP--SPSLEDKSGAETLWNLQKVAGSLAFGWAAVAVSF

AT4G37450.1 arabinogalactan protein 182.5e-2446.61Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT
        M R  ++ + LIC VVAGVGGQSP ++PT +P  T P+A     P  SP   P  +SP        T AP  TP ASA    +PV SP +  P       
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPT

Query:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSK-----KKSKKHK-APAPSPALLGPP
               V  +SPP   VP SSPPVP P+    V  PVSSPPVP P         SPPAPVA+P  +VPAPAPSK     KKSKKH+ APAP+P LLGPP
Subjt:  SSPPAASVPPTSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSK-----KKSKKHK-APAPSPALLGPP

Query:  APPSEAPGASEEGPSPSPSLEDKSGAETLWNLQKVA-GSLAFGWAAVAVSF
        APP+E+PG + +  SP PS +D+SGA +   L+ VA G++A  WA + ++F
Subjt:  APPSEAPGASEEGPSPSPSLEDKSGAETLWNLQKVA-GSLAFGWAAVAVSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGACAGTCCGTAATAGCACTTGTTCTCATCTGCGCCGTCGTCGCTGGTGTCGGTGGCCAGTCTCCGGCTGCAGCTCCGACTACTACGCCGGGGGCCACCCCGCC
TGCAGCCGCGAAGTATCTAACTCCTGCCGCATCTCCTGTATCACCACCGACAAATTCATCTCCCGCCGCCGCTCCTCAAAAGCCGGCCACTCCAGCACCGGTCTCCACGC
CGCCTGCTTCGGCGCCACCAGCAGTCGCTCCAGTAGCCTCTCCACCAGCCAGCACTCCTCCGACTGCTTCAGTTCCTACTAGCTCTCCACCGGCTGCTTCTGTACCACCT
ACCTCTCCGCCGGCTGCTACCGTTCCAGCGAGCTCCCCACCTGTTCCGGTCCCGGTGAGTTCTCCACCAGTTCCCGTACCGGTGAGCTCTCCTCCAGTACCAACTCCAAC
TGAATCTCCTCCCGCTCCCGAAAGTTCTCCTCCTGCTCCAGTCGCATCGCCTCCCGTTGAGGTTCCGGCGCCGGCGCCTAGCAAGAAGAAGTCCAAGAAGCACAAAGCAC
CAGCTCCTTCTCCGGCGTTGCTTGGTCCACCTGCTCCTCCTTCCGAAGCCCCTGGAGCAAGCGAGGAAGGTCCTTCGCCCAGCCCTTCACTAGAGGACAAGAGTGGAGCT
GAAACATTGTGGAACCTGCAGAAGGTGGCCGGAAGTTTGGCTTTCGGATGGGCTGCCGTCGCCGTCAGCTTCATCTTCTAG
mRNA sequenceShow/hide mRNA sequence
CTGTATTCAACAACATGTTGAAGTAAAAAAAAAAATTCTACAGTTCTACATATTTGTTGAATGCAAGAGAGAAATTAAACGGAAAAAACCCCCATGGGCTTTGGACAGTC
TGTTTAAATCTCCTATATTTATACCGTTCTTTTGTTTGATTCCGTTCGTGGGCCCTTTTCCCTCTCCCAACGGCTAGTACAGGCCCATTAATAGTGGGCCCTTTTGCCTT
TTTTTATTTTTCTTTATTTTCTTTATTTTTTTAATTAAAAGAAACTCCTTTGGGACGTTATTTACTGGTCTATATATACACTATTATATCTCGGGCTCCGAAACATTCAC
TTCATCATCTACTACTCACGCTCACACAGTAGAAATGGGGAGACAGTCCGTAATAGCACTTGTTCTCATCTGCGCCGTCGTCGCTGGTGTCGGTGGCCAGTCTCCGGCTG
CAGCTCCGACTACTACGCCGGGGGCCACCCCGCCTGCAGCCGCGAAGTATCTAACTCCTGCCGCATCTCCTGTATCACCACCGACAAATTCATCTCCCGCCGCCGCTCCT
CAAAAGCCGGCCACTCCAGCACCGGTCTCCACGCCGCCTGCTTCGGCGCCACCAGCAGTCGCTCCAGTAGCCTCTCCACCAGCCAGCACTCCTCCGACTGCTTCAGTTCC
TACTAGCTCTCCACCGGCTGCTTCTGTACCACCTACCTCTCCGCCGGCTGCTACCGTTCCAGCGAGCTCCCCACCTGTTCCGGTCCCGGTGAGTTCTCCACCAGTTCCCG
TACCGGTGAGCTCTCCTCCAGTACCAACTCCAACTGAATCTCCTCCCGCTCCCGAAAGTTCTCCTCCTGCTCCAGTCGCATCGCCTCCCGTTGAGGTTCCGGCGCCGGCG
CCTAGCAAGAAGAAGTCCAAGAAGCACAAAGCACCAGCTCCTTCTCCGGCGTTGCTTGGTCCACCTGCTCCTCCTTCCGAAGCCCCTGGAGCAAGCGAGGAAGGTCCTTC
GCCCAGCCCTTCACTAGAGGACAAGAGTGGAGCTGAAACATTGTGGAACCTGCAGAAGGTGGCCGGAAGTTTGGCTTTCGGATGGGCTGCCGTCGCCGTCAGCTTCATCT
TCTAGAGAGAGAGAGAGAGATGTGATCGTGCTTCATTTATTTATATTTAATCTCTTTATTTTATTTTTGGTTAAAATTTTCCCCCATTGTACTCCATATCATAAGAGGTG
CTTTATTGATTGAATTGATTGATGAAATTTGTTTTACAAAATTGAGCACTGGGGTGCTTTCTTCGATGAACATTTTGGCTTGGAGTATATGATTTGTTGTAATTCTTTCT
GTGGAGGACTTTGGAGTGGGGGAAATTCTTTTTATTACATTTGGATCCTTTTTGTGTTCTGTAATTAAATCTCTTTGATTTCTGTTCTACGTCTGTGTCAATTTACTTTT
ACATTTCCATTCTTTGATATAATATTATGCTTAATCATTACATTAAGCTTCTAAAATAGCATTAGCCAACGAATTTATCAATAATTACAACACTCGAATCTTTCACACTA
ATGTGTAAAGTGTAACTTAAACCTCCGACCAATTAAAAGAACACTCCAAATCGACATGTACACGGATCTATGGGGTCGGTCGGTC
Protein sequenceShow/hide protein sequence
MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPGATPPAAAKYLTPAASPVSPPTNSSPAAAPQKPATPAPVSTPPASAPPAVAPVASPPASTPPTASVPTSSPPAASVPP
TSPPAATVPASSPPVPVPVSSPPVPVPVSSPPVPTPTESPPAPESSPPAPVASPPVEVPAPAPSKKKSKKHKAPAPSPALLGPPAPPSEAPGASEEGPSPSPSLEDKSGA
ETLWNLQKVAGSLAFGWAAVAVSFIF