CuGenDBv2

ClCG01G023040 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G023040
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: non-classical arabinogalactan protein 31-like
Genome location: CG_Chr01:36466539..36469588
RNA-Seq Expression: ClCG01G023040
Synteny: ClCG01G023040
Gene Ontology terms: GO:0071944 - cell periphery (cellular component)
InterPro domains: NA
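The genome location above is written in a `sequence:start..end` form. As a minimal sketch, it can be split into its parts and the span length computed as shown below. This assumes the coordinates are 1-based and inclusive, which the page does not state. The function names `parse_location` and `span_length` are illustrative and are not part of CuGenDB.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CG_Chr01:36466539..36469588")
print(seqid, start, end, span_length(start, end))
# prints: CG_Chr01 36466539 36469588 3050
```

If the coordinates were instead 0-based and half-open, the length would be `end - start`, so the convention should be confirmed before relying on the number.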


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_004134701.1 non-classical arabinogalactan protein 31 [Cucumis sativus] | e-value: 3.3e-114 | % identity: 85.2
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP
        ++AAETLPVPH         HHHH HAP+PAP+PPPTHLPLHPP H PA PP+ HH HH HAQ+P HPP NAPSHHLPPTHP    PAP HHHHHH+V P
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP

Query:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
        + PP+HSPAPIYPPKPRLVRSFISVQGVVYCKSCKY GADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
Subjt:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        SP+CTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

XP_008439849.1 PREDICTED: non-classical arabinogalactan protein 31-like [Cucumis melo] | e-value: 3.1e-112 | % identity: 83.67
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR
        ++ AETLP+PHH+            HAPTPAP+PPPTHLPLHPPAH P      HH HH H Q P HPP NAPSHHLPPTH PAHSPAP  HHHHHH+V 
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR

Query:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS
        P+PPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKVVLG S
Subjt:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS

Query:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        PSP+C+KPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

XP_022926154.1 non-classical arabinogalactan protein 31-like [Cucurbita moschata] | e-value: 1.3e-107 | % identity: 81.25
Query:  TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHH
        +T    +AAETL  P H+ AAVPPAHHHH H+PTPAPV PPTHLPLHP    PAQPPS HHHHHH+H+Q P HP    P +HL    PPAH+PAPG HHH
Subjt:  TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHH

Query:  HHDVRPLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKV
        HHDV PLPPPTHSPAP++PPKPRL+RSFISVQGVVYCKSCKYAG DTLLGAT VAGA+VKLICQNTKYPL+QTATTDKNGYFFITAPKA+TSYAFHKCKV
Subjt:  HHDVRPLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKV

Query:  VLGSSPSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        VLGSSP+PSC+KPSALHGGAAGAPLRPQKSYIDANKLP+VLYSVGPFAFEPTC HH
Subjt:  VLGSSPSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

XP_022978745.1 non-classical arabinogalactan protein 31-like [Cucurbita maxima] | e-value: 8.9e-112 | % identity: 84
Query:  SAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP
        +AAETL  P H+ AAVPP HHHH H+PTPAPV PPTHLPLHP    PAQPPS HHHHHH+H Q P HP    P HHLPPTHPP H+PAPG HHHHHDV P
Subjt:  SAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP

Query:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
        LPPPTHSPAP++PPKPRL+RSFISVQGVVYCKSCKYAG DTLLGAT VAGA+VKLICQNTKYPL+QTATTDKNGYFFITAPKA+TSYAFHKCKVVLGSSP
Subjt:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        +PSC+KPSALHGGAAGAPLRPQKSYIDANKLP+VLYSVGPFAFEPTC HH
Subjt:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

XP_038881349.1 non-classical arabinogalactan protein 31-like [Benincasa hispida] | e-value: 1.7e-126 | % identity: 91.6
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP
        ++A ETLPVPHH  AAVPPAHHHH HAPTPAPVP PTHLP+H P H PAQPPS HHHHHVH Q P HPP NAPSHHLPPTHPPAHSPAPGHH HHHD+RP
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP

Query:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
        LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKV+LGSSP
Subjt:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        SPSC KPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A0A0KLH4 Structural constituent of cell wall | e-value: 1.6e-114 | % identity: 85.2
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP
        ++AAETLPVPH         HHHH HAP+PAP+PPPTHLPLHPP H PA PP+ HH HH HAQ+P HPP NAPSHHLPPTHP    PAP HHHHHH+V P
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP

Query:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
        + PP+HSPAPIYPPKPRLVRSFISVQGVVYCKSCKY GADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
Subjt:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        SP+CTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

A0A1S3B0F6 non-classical arabinogalactan protein 31-like | e-value: 1.5e-112 | % identity: 83.67
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR
        ++ AETLP+PHH+            HAPTPAP+PPPTHLPLHPPAH P      HH HH H Q P HPP NAPSHHLPPTH PAHSPAP  HHHHHH+V 
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR

Query:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS
        P+PPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKVVLG S
Subjt:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS

Query:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        PSP+C+KPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

A0A5A7U9U1 Non-classical arabinogalactan protein 31-like | e-value: 1.5e-112 | % identity: 83.67
Query:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR
        ++ AETLP+PHH+            HAPTPAP+PPPTHLPLHPPAH P      HH HH H Q P HPP NAPSHHLPPTH PAHSPAP  HHHHHH+V 
Subjt:  LSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPG-HHHHHHDVR

Query:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS
        P+PPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPL+QTATTDKNGYFFITAPKAITSYAFHKCKVVLG S
Subjt:  PLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSS

Query:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        PSP+C+KPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
Subjt:  PSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

A0A6J1EEB1 non-classical arabinogalactan protein 31-like | e-value: 6.5e-108 | % identity: 81.25
Query:  TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHH
        +T    +AAETL  P H+ AAVPPAHHHH H+PTPAPV PPTHLPLHP    PAQPPS HHHHHH+H+Q P HP    P +HL    PPAH+PAPG HHH
Subjt:  TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHH

Query:  HHDVRPLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKV
        HHDV PLPPPTHSPAP++PPKPRL+RSFISVQGVVYCKSCKYAG DTLLGAT VAGA+VKLICQNTKYPL+QTATTDKNGYFFITAPKA+TSYAFHKCKV
Subjt:  HHDVRPLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKV

Query:  VLGSSPSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        VLGSSP+PSC+KPSALHGGAAGAPLRPQKSYIDANKLP+VLYSVGPFAFEPTC HH
Subjt:  VLGSSPSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

A0A6J1IR37 non-classical arabinogalactan protein 31-like | e-value: 4.3e-112 | % identity: 84
Query:  SAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP
        +AAETL  P H+ AAVPP HHHH H+PTPAPV PPTHLPLHP    PAQPPS HHHHHH+H Q P HP    P HHLPPTHPP H+PAPG HHHHHDV P
Subjt:  SAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPS-HHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRP

Query:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
        LPPPTHSPAP++PPKPRL+RSFISVQGVVYCKSCKYAG DTLLGAT VAGA+VKLICQNTKYPL+QTATTDKNGYFFITAPKA+TSYAFHKCKVVLGSSP
Subjt:  LPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH
        +PSC+KPSALHGGAAGAPLRPQKSYIDANKLP+VLYSVGPFAFEPTC HH
Subjt:  SPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTCPHH

SwissProt top hits (columns: e-value, % identity, alignment)
P93013 Non-classical arabinogalactan protein 30 | e-value: 8.9e-30 | % identity: 41.55
Query:  PTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRPLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKS
        P HLPL PP   P  PP+         + P++PP  AP     PT PPA +P            P  PP   P   P+YPPK    ++ ++V+GVVYCK+
Subjt:  PTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRPLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKS

Query:  CKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQK----SYIDAN
        CKYAG + + GA PV  A V+L+C+N K  + +T  TDKNGYF + APK +T+Y    C+  L  SP   C+K S+LH G  G+ L+P      S     
Subjt:  CKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQK----SYIDAN

Query:  KLPFVLYSVGPFAFEPTCP
           + +Y+VGPFAFEPTCP
Subjt:  KLPFVLYSVGPFAFEPTCP

Q03211 Pistil-specific extensin-like protein | e-value: 1.1e-19 | % identity: 34.66
Query:  PKPCKCRPKPSHSIPLIPWL----------SLSPNLSFLSSSSPFSPSSPPTISRQPKHSPFPTTTT------PPCHLPTTTTPTLSAAETLPVPHHYDA
        P P    P P  +IPLIP            S  P+ + L    P  P  PP     P + P P++ +      PP   P   +P   +A+  P P     
Subjt:  PKPCKCRPKPSHSIPLIPWL----------SLSPNLSFLSSSSPFSPSSPPTISRQPKHSPFPTTTT------PPCHLPTTTTPTLSAAETLPVPHHYDA

Query:  AVPPAHHHHPHAPTPAPV--PPPTHLPLHPPAHSPA-QPPSHHHHHHVHAQSPSHPPTNAPSHHLPP----THPPAHSPAPGHHHHHHDVRPLPPPTHSP
          PP       AP+P+P   PPP   P+  P+ SPA QPP+         + P  PP    S  LPP     +PP  +P+P        + P P P  +P
Subjt:  AVPPAHHHHPHAPTPAPV--PPPTHLPLHPPAHSPA-QPPSHHHHHHVHAQSPSHPPTNAPSHHLPP----THPPAHSPAPGHHHHHHDVRPLPPPTHSP

Query:  --------APIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP
                 P+  P P L +  I V G+VYCKSC   G  TLL A+ + GA VKLIC   K  ++Q ATTD  G F I  PK++T+    KCKV L  SP
Subjt:  --------APIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSP

Query:  SPSCTKPSALHGGAAGA---PLRPQKSYIDANKLPFV-----LYSVGPFAFE
        +P+C  P+  +GG +G    PL P K  I    +P       LY VGPF FE
Subjt:  SPSCTKPSALHGGAAGA---PLRPQKSYIDANKLPFV-----LYSVGPFAFE

Q9FZA2 Non-classical arabinogalactan protein 31 | e-value: 2.3e-33 | % identity: 39.53
Query:  SPSSPPTISRQPKHSPFPTTTTPPCHLPT--TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQS
        SP  PP   + P   P      PP + PT     P        PV       V P  +    AP   P  PP   P++PP  +P +PP+         + 
Subjt:  SPSSPPTISRQPKHSPFPTTTTPPCHLPT--TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQS

Query:  PSHPPTNAPSHHLPPTHPPAHSPA----------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA
        P +PPT AP    PPT PP   P           P +      V+ P+ PPT  P   P+YPPK    RS ++V+G VYCKSCKYA  +TLLGA P+ GA
Subjt:  PSHPPTNAPSHHLPPTHPPAHSPA----------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA

Query:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC
        +VKL+C++ K    +T TTDKNGYF + APK +T++ F  C+V L  S    C+K S L GG  GA L+P+    KS +  NKL + L++VGPFAF P+C
Subjt:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC

Query:  P
        P
Subjt:  P

Arabidopsis top hits (columns: e-value, % identity, alignment)
AT1G28290.1 arabinogalactan protein 31 | e-value: 1.6e-34 | % identity: 39.53
Query:  SPSSPPTISRQPKHSPFPTTTTPPCHLPT--TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQS
        SP  PP   + P   P      PP + PT     P        PV       V P  +    AP   P  PP   P++PP  +P +PP+         + 
Subjt:  SPSSPPTISRQPKHSPFPTTTTPPCHLPT--TTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQS

Query:  PSHPPTNAPSHHLPPTHPPAHSPA----------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA
        P +PPT AP    PPT PP   P           P +      V+ P+ PPT  P   P+YPPK    RS ++V+G VYCKSCKYA  +TLLGA P+ GA
Subjt:  PSHPPTNAPSHHLPPTHPPAHSPA----------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA

Query:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC
        +VKL+C++ K    +T TTDKNGYF + APK +T++ F  C+V L  S    C+K S L GG  GA L+P+    KS +  NKL + L++VGPFAF P+C
Subjt:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC

Query:  P
        P
Subjt:  P

AT1G28290.2 arabinogalactan protein 31 | e-value: 2.0e-40 | % identity: 41.2
Query:  TTTPTLSAAETLPVPHHYDAAVP-PAHHHHPH----------------------------------APTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHV
        T TP+L+ A   P P+H+    P P HHHHPH                                   PT APV PPT  P+ PP   PA+PP        
Subjt:  TTTPTLSAAETLPVPHHYDAAVP-PAHHHHPH----------------------------------APTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHV

Query:  HAQSPSHPPTNAPSHHLPPTHPPAHSPA------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA
          + P +PPT AP    PPT PP   P       P +      V+ P+ PPT  P   P+YPPK    RS ++V+G VYCKSCKYA  +TLLGA P+ GA
Subjt:  HAQSPSHPPTNAPSHHLPPTHPPAHSPA------PGHHHHHHDVR-PLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGA

Query:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC
        +VKL+C++ K    +T TTDKNGYF + APK +T++ F  C+V L  S    C+K S L GG  GA L+P+    KS +  NKL + L++VGPFAF P+C
Subjt:  SVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQ----KSYIDANKLPFVLYSVGPFAFEPTC

Query:  P
        P
Subjt:  P

AT2G33790.1 arabinogalactan protein 30 | e-value: 6.3e-31 | % identity: 41.55
Query:  PTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRPLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKS
        P HLPL PP   P  PP+         + P++PP  AP     PT PPA +P            P  PP   P   P+YPPK    ++ ++V+GVVYCK+
Subjt:  PTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRPLPPPTHSPA--PIYPPKPRLVRSFISVQGVVYCKS

Query:  CKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQK----SYIDAN
        CKYAG + + GA PV  A V+L+C+N K  + +T  TDKNGYF + APK +T+Y    C+  L  SP   C+K S+LH G  G+ L+P      S     
Subjt:  CKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQK----SYIDAN

Query:  KLPFVLYSVGPFAFEPTCP
           + +Y+VGPFAFEPTCP
Subjt:  KLPFVLYSVGPFAFEPTCP

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein | e-value: 5.5e-35 | % identity: 50
Query:  SPAPIYPPK--PRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSP--
        S +P+ PP    ++ R  ++V+G+VYCKSCKY+G DTLL A+P+ GA+VKL C NTK  +     TDKNGYFF+ APK +T+YAFH C+    ++P P  
Subjt:  SPAPIYPPK--PRLVRSFISVQGVVYCKSCKYAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSP--

Query:  ---SCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTC
           +CT PS L+ G  GA L+P K+ I+  +  +VL+SVGPFAFEP C
Subjt:  ---SCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEPTC

AT5G53870.1 early nodulin-like protein 1 | e-value: 8.6e-04 | % identity: 36.79
Query:  PSHSIPLIPWLSLSPNLSFLSSS--SPFSPSSPPTISRQPKHSPFPTTTTPPCHLPTTTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPP---
        P+HS    P  S+SP     S S  SP +P+S P+ S+ P+ S  P    P    P + TP LS       P H       A  H P  P+P+P  P   
Subjt:  PSHSIPLIPWLSLSPNLSFLSSS--SPFSPSSPPTISRQPKHSPFPTTTTPPCHLPTTTTPTLSAAETLPVPHHYDAAVPPAHHHHPHAPTPAPVPP---

Query:  ---PTHLPLHPPAHSPAQPPSH---HHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPA------PGHHHHHHDVRPLPPPTHSPAPIYPPKP
           P+H P H P+HSPA  PSH   H   H  A +PSH P +APSH   P H P+HSPA      P          P P    SP+P+  P P
Subjt:  ---PTHLPLHPPAHSPAQPPSH---HHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPA------PGHHHHHHDVRPLPPPTHSPAPIYPPKP


Sequences
CDS sequence
ATGACGATGACAATGTGCAATCCCCAGACGCGGCTATATATTTCTGGCCCTAAACCCTGCAAATGTCGTCCTAAACCTTCTCATTCAATTCCACTCATCCCATGGCTTTC
ACTCTCTCCAAATCTTTCTTTTCTCTCTTCTTCTTCGCCATTTTCGCCCTCTTCGCCGCCGACCATATCACGGCAGCCGAAACACTCCCCGTTCCCCACCACTACGACGC
CGCCGTGCCACCTGCCCACCACCACCACCCCCACGCTCTCGGCAGCCGAAACACTCCCCGTTCCCCACCACTACGACGCCGCCGTGCCACCTGCCCACCACCACCACCCC
CACGCTCCTACGCCGGCACCAGTACCTCCACCGACCCACTTGCCGCTTCACCCTCCAGCCCATTCTCCGGCTCAGCCGCCTAGCCACCACCATCACCACCACGTCCACGC
CCAGTCTCCTTCTCACCCACCGACAAATGCGCCCTCTCACCATCTCCCTCCAACTCACCCGCCAGCTCACTCGCCGGCTCCCGGTCACCACCACCACCACCACGACGTCC
GCCCATTGCCGCCTCCGACTCACTCCCCGGCTCCGATCTACCCTCCGAAGCCTCGGTTGGTGAGGAGCTTCATTTCGGTTCAAGGCGTTGTCTATTGCAAGTCCTGTAAG
TATGCCGGAGCTGACACACTTCTCGGAGCTACCCCCGTCGCCGGTGCAAGCGTGAAGCTAATTTGCCAGAACACAAAATACCCACTCATTCAAACCGCCACCACCGACAA
AAACGGCTATTTCTTCATCACAGCCCCAAAGGCCATCACCAGCTACGCTTTCCACAAGTGCAAGGTCGTTCTCGGCTCATCCCCCTCCCCTTCCTGCACCAAGCCCTCCG
CCCTCCACGGCGGCGCCGCCGGAGCCCCCCTCAGGCCTCAGAAGTCTTACATCGACGCCAACAAGCTCCCCTTCGTCCTCTACTCTGTTGGCCCTTTTGCCTTCGAACCC
ACTTGCCCTCATCATTAG
mRNA sequence
ATGACGATGACAATGTGCAATCCCCAGACGCGGCTATATATTTCTGGCCCTAAACCCTGCAAATGTCGTCCTAAACCTTCTCATTCAATTCCACTCATCCCATGGCTTTC
ACTCTCTCCAAATCTTTCTTTTCTCTCTTCTTCTTCGCCATTTTCGCCCTCTTCGCCGCCGACCATATCACGGCAGCCGAAACACTCCCCGTTCCCCACCACTACGACGC
CGCCGTGCCACCTGCCCACCACCACCACCCCCACGCTCTCGGCAGCCGAAACACTCCCCGTTCCCCACCACTACGACGCCGCCGTGCCACCTGCCCACCACCACCACCCC
CACGCTCCTACGCCGGCACCAGTACCTCCACCGACCCACTTGCCGCTTCACCCTCCAGCCCATTCTCCGGCTCAGCCGCCTAGCCACCACCATCACCACCACGTCCACGC
CCAGTCTCCTTCTCACCCACCGACAAATGCGCCCTCTCACCATCTCCCTCCAACTCACCCGCCAGCTCACTCGCCGGCTCCCGGTCACCACCACCACCACCACGACGTCC
GCCCATTGCCGCCTCCGACTCACTCCCCGGCTCCGATCTACCCTCCGAAGCCTCGGTTGGTGAGGAGCTTCATTTCGGTTCAAGGCGTTGTCTATTGCAAGTCCTGTAAG
TATGCCGGAGCTGACACACTTCTCGGAGCTACCCCCGTCGCCGGTGCAAGCGTGAAGCTAATTTGCCAGAACACAAAATACCCACTCATTCAAACCGCCACCACCGACAA
AAACGGCTATTTCTTCATCACAGCCCCAAAGGCCATCACCAGCTACGCTTTCCACAAGTGCAAGGTCGTTCTCGGCTCATCCCCCTCCCCTTCCTGCACCAAGCCCTCCG
CCCTCCACGGCGGCGCCGCCGGAGCCCCCCTCAGGCCTCAGAAGTCTTACATCGACGCCAACAAGCTCCCCTTCGTCCTCTACTCTGTTGGCCCTTTTGCCTTCGAACCC
ACTTGCCCTCATCATTAG
Protein sequence
MTMTMCNPQTRLYISGPKPCKCRPKPSHSIPLIPWLSLSPNLSFLSSSSPFSPSSPPTISRQPKHSPFPTTTTPPCHLPTTTTPTLSAAETLPVPHHYDAAVPPAHHHHP
HAPTPAPVPPPTHLPLHPPAHSPAQPPSHHHHHHVHAQSPSHPPTNAPSHHLPPTHPPAHSPAPGHHHHHHDVRPLPPPTHSPAPIYPPKPRLVRSFISVQGVVYCKSCK
YAGADTLLGATPVAGASVKLICQNTKYPLIQTATTDKNGYFFITAPKAITSYAFHKCKVVLGSSPSPSCTKPSALHGGAAGAPLRPQKSYIDANKLPFVLYSVGPFAFEP
TCPHH
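
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below is one way to do that, assuming the standard nuclear genetic code and a reading frame that starts at the first base. The `translate` helper is illustrative, not a CuGenDB tool. Only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest): TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is. Codons containing
    characters other than A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the CDS shown above.
print(translate("ATGACGATGACAATGTGCAATCCCCAGACG"))  # MTMTMCNPQT
print(translate("ACTTGCCCTCATCATTAG"))              # TCPHH
```

Both fragments agree with the start (`MTMTMCNPQT`) and end (`TCPHH`) of the listed protein sequence. Passing the full CDS in the same way allows a complete comparison.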