; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001021 (gene) of Snake gourd v1 genome

Gene IDTan0001021
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionlysine-rich arabinogalactan protein 18-like
Genome locationContig00035_ERROPOS14500000+:220505..222418
RNA-Seq ExpressionTan0001021
SyntenyTan0001021
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR044981 - Lysine-rich arabinogalactan protein AGP9/17/18


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6595001.1 hypothetical protein SDJN03_11554, partial [Cucurbita argyrosperma subsp. sororia]6.1e-8988.49Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALV+ICAVVAGVGGQSPAAAPTTTPA PAPVAAKYP PAASPVVPPTNSSPAAAPQKP TPAPVSTPPA     VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVP SSPPVPTP +SPP      APE+APPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALLGPPA
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF
        PPSEAPG SE+GPSPAPSLDDKSGAEAL RNMQKVVGSLALG +AA  SFMF
Subjt:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF

NP_001267612.1 arabinogalactan protein precursor [Cucumis sativus]2.1e-8184.13Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAA  PVAA YPPPAASPV  PTN SPAAAPQKPATPAPVSTPPAS PP VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP
        SSPPAASVP SSPP ATVPASSPPVPVP SSPPV VP SSPPVPTPT+SPP      APES+PPAPVASPPVEVP+PAPSKKKSKKH+APAPSPALLGPP
Subjt:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP

Query:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF
        APPSEAP  SE+GP+P+PSL+DKSGAEAL    KV GSLALG AA AVS +F
Subjt:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF

XP_022963306.1 lysine-rich arabinogalactan protein 18-like [Cucurbita moschata]1.0e-8888.49Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPA PAPVAAK PPPAASPVVPPTNSSPAAAPQKP TPAPVSTPPA     VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVP SSPPVPTP +SPP      APE+APPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALLGPPA
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF
        PPSEAPG SE+GP+PAPSLDDKSGAEAL RNMQKVVGSLALG +AA  SFMF
Subjt:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF

XP_023003133.1 lysine-rich arabinogalactan protein 18-like [Cucurbita maxima]1.4e-8888.67Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPA PAPVAAKYPPPAASPVVPPTNSSPAAAPQKP TPAPVSTPPA     VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTP---TIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALL
        SSPPAASVPASSPP A+VPASSPP  VPASSPPVPVPASSPPVP P  SPPVPTP     APESAPPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALL
Subjt:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTP---TIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALL

Query:  GPPAPPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF
        GPPAPPSEAPG SE+GPSPAPSLDDKSGAEAL RNMQKVVGSLALG +AA  SFMF
Subjt:  GPPAPPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF

XP_023517454.1 lysine-rich arabinogalactan protein 18-like [Cucurbita pepo subsp. pepo]1.2e-8988.89Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPA PAPVAAKYPPPAASPVVPPTNSSPAAAPQKP TPAPVST     PP VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVP SSPPVPTP +SPP      APE+APPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALLGPPA
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPSLDDKSGAEALR-NMQKVVGSLALGLAAAAVSFMF
        PPSEAPG SE+GPSPAPSLDDKSGAEALR NMQKVVGSLALG +AA  SFMF
Subjt:  PPSEAPGASEDGPSPAPSLDDKSGAEALR-NMQKVVGSLALGLAAAAVSFMF

TrEMBL top hitse value%identityAlignment
A0A1S3B0X7 lysine-rich arabinogalactan protein 18-like1.0e-8184.92Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVA VGGQSPAAAPTTTPAA  PVAAKYPPPAASPV PPTNSSPAAAPQKPATPAPVSTPPAS PP VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP
        SSPPAASVP SSPP ATVPASSPPVPVP SSPPVPVP SSPPVPTPT+SPP      APES+PPAPVASPP EVP+PAPS KKSKKH+APAPSPALLGPP
Subjt:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP

Query:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF
        APPSEAP  SE+GPSP+PSL+DKSGAEAL    KV GSLALG AA AVS +F
Subjt:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF

A0A6J1HFT1 lysine-rich arabinogalactan protein 18-like5.0e-8988.49Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPA PAPVAAK PPPAASPVVPPTNSSPAAAPQKP TPAPVSTPPA     VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVP SSPPVPTP +SPP      APE+APPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALLGPPA
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF
        PPSEAPG SE+GP+PAPSLDDKSGAEAL RNMQKVVGSLALG +AA  SFMF
Subjt:  PPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF

A0A6J1IK34 lysine-rich arabinogalactan protein 18-like7.1e-7578.54Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAV+AGVGGQSPAAAPT+ P A +P AAK+ PPAASPV PPTNSSP AAPQKPATPAPVSTPPAS P  VAPVASPP STPP ASVP 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-----------ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAP
        SSPPAASVPASSPP           A+VPASSPPV VPASSPPVPVP SSPPV TPTQSPP PTPT  PESA       PPVEVP+PAPS  KSKKHKAP
Subjt:  SSPPAASVPASSPP-----------ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAP

Query:  APSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFM
        APSPALLGPPAPPSEAPGASE+GPSP PSL+DKSGAEAL   QKV GSLALG AA A+SF+
Subjt:  APSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFM

A0A6J1KVL2 lysine-rich arabinogalactan protein 18-like6.6e-8988.67Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPA PAPVAAKYPPPAASPVVPPTNSSPAAAPQKP TPAPVSTPPA     VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTP---TIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALL
        SSPPAASVPASSPP A+VPASSPP  VPASSPPVPVPASSPPVP P  SPPVPTP     APESAPPAPVASPP EVPSPAPSKKKSKKHKAPAPSPALL
Subjt:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTP---TIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALL

Query:  GPPAPPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF
        GPPAPPSEAPG SE+GPSPAPSLDDKSGAEAL RNMQKVVGSLALG +AA  SFMF
Subjt:  GPPAPPSEAPGASEDGPSPAPSLDDKSGAEAL-RNMQKVVGSLALGLAAAAVSFMF

Q9XIV1 Arabinogalactan protein1.0e-8184.13Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAA  PVAA YPPPAASPV  PTN SPAAAPQKPATPAPVSTPPAS PP VAPVASPP STPP ASVPA
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP
        SSPPAASVP SSPP ATVPASSPPVPVP SSPPV VP SSPPVPTPT+SPP      APES+PPAPVASPPVEVP+PAPSKKKSKKH+APAPSPALLGPP
Subjt:  SSPPAASVPASSPP-ATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPP

Query:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF
        APPSEAP  SE+GP+P+PSL+DKSGAEAL    KV GSLALG AA AVS +F
Subjt:  APPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKVVGSLALGLAAAAVSFMF

SwissProt top hitse value%identityAlignment
Q9C5S0 Classical arabinogalactan protein 93.4e-1044.26Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        M R   IA++ I  ++AGV GQ+P + PT TPA P P     PPPAA+P  PP ++           P PV+T P   P T AP   PP + PP    P 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPP    PAS PPAT        P P +SPP PV  +SPP  TP   PPV TP       PPAP+ASPP +VP+PAP+ K      +P+PSP+   PP 
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPS---LDDKSGAEALRNMQKVVGSLALG
        P S+APG S D  SPAPS   ++D++GA       K+V SL  G
Subjt:  PPSEAPGASEDGPSPAPS---LDDKSGAEALRNMQKVVGSLALG

Q9FPQ6 Vegetative cell wall protein gp12.3e-0639.51Show/hide
Query:  GVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPASSPPAASVPASSPPATV
        G    +P + P+  P +PAP +   P PA     PP+ + P+ AP  PA P+P S  P S  P   P  SPP+  PP    PA   P+  VP S  P   
Subjt:  GVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPASSPPAASVPASSPPATV

Query:  PASSPPVPVPAS-SPPVP----VPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPAPPSEAPGASEDG
        P+ +PP P P S SPPVP     P+ +PPVP P+ +PP P P + P  APP+P +  P   PSPAP         +P P       PAPPS  P A    
Subjt:  PASSPPVPVPAS-SPPVP----VPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPAPPSEAPGASEDG

Query:  PSPAP
        PSP P
Subjt:  PSPAP

Q9FPR2 Lysine-rich arabinogalactan protein 181.5e-2144.14Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        M R  ++ + LIC VVAGVGGQSP ++PT +P  P           ++P   PT S                  PA   PT AP  +P  S    AS P 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSK-----KKSKKHK-APAPSPA
         SP        SP     +S PP PVP SSPPVP P  S PV     SPPVP P      +PPAPVA+P  +VP+PAPSK     KKSKKH+ APAP+P 
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSK-----KKSKKHK-APAPSPA

Query:  LLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKV-VGSLALGLAAAAVSF
        LLGPPAPP+E+PG + D  SP PS DD+SGA + R ++ V VG++A   A   ++F
Subjt:  LLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKV-VGSLALGLAAAAVSF

Q9S740 Lysine-rich arabinogalactan protein 191.3e-0441.8Show/hide
Query:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPAAPAPVAAKYPPPAASPVVPPTNSSP----AAAPQKPATPAPVSTPPASVPPTVAPVASPPTST
        M   S+I +L+L  A+++   V  Q PAA+P T+T  AP P  A  PP  A+P  PPT ++P    A  P  P TP P  TP +   P VAPV SP T  
Subjt:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPAAPAPVAAKYPPPAASPVVPPTNSSP----AAAPQKPATPAPVSTPPASVPPTVAPVASPPTST

Query:  P-PAASVPASSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSK---KHKA
        P P  S PAS+P  +  P S PPA  P S PP P   +SPP P PAS PP P    SPP P P   P    P+P++ PP   P+P   K+K K    H A
Subjt:  P-PAASVPASSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSK---KHKA

Query:  PAPSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQ
        PAP+P    PP+PPS          +PAPS  + +G  AL  ++
Subjt:  PAPSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQ

Arabidopsis top hitse value%identityAlignment
AT1G68725.1 arabinogalactan protein 199.0e-0641.8Show/hide
Query:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPAAPAPVAAKYPPPAASPVVPPTNSSP----AAAPQKPATPAPVSTPPASVPPTVAPVASPPTST
        M   S+I +L+L  A+++   V  Q PAA+P T+T  AP P  A  PP  A+P  PPT ++P    A  P  P TP P  TP +   P VAPV SP T  
Subjt:  MGRQSVI-ALVLICAVVA--GVGGQSPAAAP-TTTPAAPAPVAAKYPPPAASPVVPPTNSSP----AAAPQKPATPAPVSTPPASVPPTVAPVASPPTST

Query:  P-PAASVPASSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSK---KHKA
        P P  S PAS+P  +  P S PPA  P S PP P   +SPP P PAS PP P    SPP P P   P    P+P++ PP   P+P   K+K K    H A
Subjt:  P-PAASVPASSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSK---KHKA

Query:  PAPSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQ
        PAP+P    PP+PPS          +PAPS  + +G  AL  ++
Subjt:  PAPSPALLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQ

AT2G14890.1 arabinogalactan protein 92.4e-1144.26Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        M R   IA++ I  ++AGV GQ+P + PT TPA P P     PPPAA+P  PP ++           P PV+T P   P T AP   PP + PP    P 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPP    PAS PPAT        P P +SPP PV  +SPP  TP   PPV TP       PPAP+ASPP +VP+PAP+ K      +P+PSP+   PP 
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPS---LDDKSGAEALRNMQKVVGSLALG
        P S+APG S D  SPAPS   ++D++GA       K+V SL  G
Subjt:  PPSEAPGASEDGPSPAPS---LDDKSGAEALRNMQKVVGSLALG

AT2G14890.2 arabinogalactan protein 92.1e-1045.7Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        M R   IA++ I  ++AGV GQ+P + PT TPA P P     PPPAA+P  PP ++           P PV+T P   P T AP   PP + PP    P 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA
        SSPP    PAS PPAT        P P +SPP PV  +SPP  TP   PPV TP       PPAP+ASPP +VP+PAP+ K      +P+PSP+   PP 
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPA

Query:  PPSEAPGASEDGPSPAPSLDD
        P S+APG S D  SPAPS  D
Subjt:  PPSEAPGASEDGPSPAPSLDD

AT4G37450.1 arabinogalactan protein 181.1e-2244.14Show/hide
Query:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA
        M R  ++ + LIC VVAGVGGQSP ++PT +P  P           ++P   PT S                  PA   PT AP  +P  S    AS P 
Subjt:  MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPA

Query:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSK-----KKSKKHK-APAPSPA
         SP        SP     +S PP PVP SSPPVP P  S PV     SPPVP P      +PPAPVA+P  +VP+PAPSK     KKSKKH+ APAP+P 
Subjt:  SSPPAASVPASSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSK-----KKSKKHK-APAPSPA

Query:  LLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKV-VGSLALGLAAAAVSF
        LLGPPAPP+E+PG + D  SP PS DD+SGA + R ++ V VG++A   A   ++F
Subjt:  LLGPPAPPSEAPGASEDGPSPAPSLDDKSGAEALRNMQKV-VGSLALGLAAAAVSF

AT5G14920.1 Gibberellin-regulated family protein1.3e-0441.14Show/hide
Query:  PAAAPTTTPAAPA--PVAAKYPPPA--ASPVVPPTNSSPAAAPQKPATPA--PVSTPPASVPPTVAPVASPPTST--PPAASVPASSPPAASV--PASS-
        P+ +P T P +PA  P    Y PP    +P+ PPT   P   P  P TP   PVSTPP  +PP   P   PPT T  PP+   P   PP  +V  P +S 
Subjt:  PAAAPTTTPAAPA--PVAAKYPPPA--ASPVVPPTNSSPAAAPQKPATPA--PVSTPPASVPPTVAPVASPPTST--PPAASVPASSPPAASV--PASS-

Query:  --PPATVPASSPPVPVPASSP---PVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKK
          PP T P  SPPV  P   P   PV  P ++PPV  PT +PPV  PT  P + P  P  +PPV+ P+P P + +
Subjt:  --PPATVPASSPPVPVPASSP---PVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAGACAGTCCGTGATCGCACTTGTTCTGATCTGCGCCGTCGTCGCCGGTGTCGGTGGCCAGTCTCCGGCTGCAGCACCGACTACTACGCCGGCGGCTCCCGCGCC
CGTAGCAGCCAAGTATCCACCTCCAGCCGCATCTCCTGTAGTGCCACCGACAAATTCATCTCCCGCCGCCGCTCCCCAAAAGCCGGCCACTCCAGCACCGGTCTCCACGC
CGCCTGCATCGGTGCCACCAACAGTAGCTCCAGTGGCCTCTCCGCCGACCAGCACTCCTCCGGCTGCTTCCGTTCCAGCTAGCTCTCCACCGGCTGCTTCCGTTCCGGCT
AGCTCCCCGCCAGCTACCGTTCCAGCGAGCTCTCCGCCGGTTCCAGTTCCGGCAAGTTCTCCTCCCGTGCCAGTACCGGCGAGCTCTCCTCCCGTACCAACTCCGACGCA
ATCTCCTCCCGTTCCGACGCCTACAATCGCTCCGGAAAGTGCTCCTCCTGCTCCGGTCGCATCACCTCCCGTTGAGGTCCCGTCTCCGGCGCCAAGCAAGAAGAAGTCGA
AGAAGCACAAAGCACCAGCTCCTTCTCCTGCATTGCTCGGTCCACCTGCGCCTCCTTCTGAAGCCCCTGGAGCAAGCGAGGACGGTCCTTCACCCGCCCCTTCACTGGAT
GACAAGAGTGGAGCTGAAGCGCTGAGGAACATGCAGAAGGTGGTCGGAAGCTTGGCTCTGGGACTGGCCGCCGCCGCCGTCAGCTTCATGTTCTAG
mRNA sequenceShow/hide mRNA sequence
TACCGTTCTTTTGTTTGATTCCCTTCGTGGGCCCTTCTTCTCTCTCCCAACGGCTAGTACAGGCCCATTAATAGTGGGCCCCTTTTGTCTGGCCAGTTCTGTTGCCTTTT
TTTTATTTATTTATTTTTAATTCAAAAAATTCCTTTGGGGTGTTATTACTAGTCTATATATACACTATTGCATCTCGGGCTCGGAAACATTCACTTAATCTTCTACTAAC
GCTCACACACTAGAAATGGGGAGACAGTCCGTGATCGCACTTGTTCTGATCTGCGCCGTCGTCGCCGGTGTCGGTGGCCAGTCTCCGGCTGCAGCACCGACTACTACGCC
GGCGGCTCCCGCGCCCGTAGCAGCCAAGTATCCACCTCCAGCCGCATCTCCTGTAGTGCCACCGACAAATTCATCTCCCGCCGCCGCTCCCCAAAAGCCGGCCACTCCAG
CACCGGTCTCCACGCCGCCTGCATCGGTGCCACCAACAGTAGCTCCAGTGGCCTCTCCGCCGACCAGCACTCCTCCGGCTGCTTCCGTTCCAGCTAGCTCTCCACCGGCT
GCTTCCGTTCCGGCTAGCTCCCCGCCAGCTACCGTTCCAGCGAGCTCTCCGCCGGTTCCAGTTCCGGCAAGTTCTCCTCCCGTGCCAGTACCGGCGAGCTCTCCTCCCGT
ACCAACTCCGACGCAATCTCCTCCCGTTCCGACGCCTACAATCGCTCCGGAAAGTGCTCCTCCTGCTCCGGTCGCATCACCTCCCGTTGAGGTCCCGTCTCCGGCGCCAA
GCAAGAAGAAGTCGAAGAAGCACAAAGCACCAGCTCCTTCTCCTGCATTGCTCGGTCCACCTGCGCCTCCTTCTGAAGCCCCTGGAGCAAGCGAGGACGGTCCTTCACCC
GCCCCTTCACTGGATGACAAGAGTGGAGCTGAAGCGCTGAGGAACATGCAGAAGGTGGTCGGAAGCTTGGCTCTGGGACTGGCCGCCGCCGCCGTCAGCTTCATGTTCTA
GAGAGAGAGACGTGATCGTGCTTCATTTATTATATTTAATCTCTTTGTTTTTTGGTTAAAATTTTCCCCCATTGTACTCCATAACATAAGAGGTGCTTTATTGATTGAAT
TGATTGATGAAATTTGTTTTACAAAATTGAGCACTGGAGTGTTTGATTTGATGAACATTTTGGATGGAGATAGGATTTGGCATAATATGATTTGTAATTCTTTCTGTGGA
GAACTTGGGATTGGGGAAATCTTTTATTACATTTGGATCCTTTTGTGTTCTGTAATTAATTCTCGTTCATTTCTGTTCTTCTACATTTATGTCAATTAACTTTTACCATT
TCATGTCAATAAATAAATTTGGTTTGGAACTATT
Protein sequenceShow/hide protein sequence
MGRQSVIALVLICAVVAGVGGQSPAAAPTTTPAAPAPVAAKYPPPAASPVVPPTNSSPAAAPQKPATPAPVSTPPASVPPTVAPVASPPTSTPPAASVPASSPPAASVPA
SSPPATVPASSPPVPVPASSPPVPVPASSPPVPTPTQSPPVPTPTIAPESAPPAPVASPPVEVPSPAPSKKKSKKHKAPAPSPALLGPPAPPSEAPGASEDGPSPAPSLD
DKSGAEALRNMQKVVGSLALGLAAAAVSFMF