CuGenDBv2

Sgr019522 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019522
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: non-classical arabinogalactan protein 31
Genome location: tig00153348:458712..465241
RNA-Seq Expression: Sgr019522
Synteny: Sgr019522
Gene Ontology terms: NA
InterPro domains: NA
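
The genome location above appears to follow a `contig:start..end` layout. As a minimal sketch, the string can be split into its parts and the genomic span computed. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption; the page does not state which convention it uses.

```python
import re

# Hypothetical helper: the "contig:start..end" layout is inferred from the
# single location string on this page, not from a documented format.
LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'contig:start..end' into (contig, start, end) with integer coordinates."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match["contig"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00153348:458712..465241")
print(contig, start, end, span_length(start, end))
# prints: tig00153348 458712 465241 6530
```

Under that assumption the gene spans 6530 bp on the contig. If the coordinates were 0-based half-open instead, the span would be one base shorter.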


Homology
GenBank top hits (e value, % identity, alignment)
XP_004146606.2 non-classical arabinogalactan protein 31 [Cucumis sativus] (e value: 5.4e-42, % identity: 50)
Query:  LFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPS------------HPTTTTTTT--RPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRT--
        L  C  +L A+    AA + TPAP+PT+H+ HHPVAAP+            H  T + T+   P SP  P  + ++ L     +  +       +P T  
Subjt:  LFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPS------------HPTTTTTTT--RPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRT--

Query:  ------LKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGG
              ++ +      K  D         ++GATVKLSCKNTKYAPAV+TAT+D+NGYFRLAAPKNVTSYAFHRCKVYLVKS +  C K S +NGGVDG 
Subjt:  ------LKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGG

Query:  ELKPEKAFYDAEKKPVVLYNVGPLAFEPTC
        ELKP +AF D EKKPVVLYNVGPLAFEPTC
Subjt:  ELKPEKAFYDAEKKPVVLYNVGPLAFEPTC

XP_008442660.1 PREDICTED: non-classical arabinogalactan protein 31 [Cucumis melo] (e value: 5.0e-40, % identity: 49.32)
Query:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS
        AA + TPAP+PT+H+ HHPVAAP+                       HP + +    P  P  PT     +   + + +          PR+  ++    
Subjt:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS

Query:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA
          K   +      L    ++GATVKLSCKNTKYAP V+TAT+DKNGYFRLAAPKNVTSYAFHRCKVYLV+S + +C K S LNGG DG ELKP +AF D 
Subjt:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA

Query:  EKKPVVLYNVGPLAFEPTC
        EKKPVVLYNVGPLAFEPTC
Subjt:  EKKPVVLYNVGPLAFEPTC

XP_023001750.1 non-classical arabinogalactan protein 31-like [Cucurbita maxima] (e value: 8.6e-40, % identity: 53.52)
Query:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAPSH-------PTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCR-PRTLKKLSEDSETKLADFL
        AA HGS     PTPAP P   H   PVAAPSH       P+ +      A    P+   ++       +  ++  KR    PR+  ++      K   + 
Subjt:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAPSH-------PTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCR-PRTLKKLSEDSETKLADFL

Query:  PPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLY
             L    + GA VKLSCKNTKYAP V+TATTDKNGYFRLAAPKNVTSYAFHRCKV+LVKS + SCSK S +NGGVDG ELKP KAF DAEKKPVVLY
Subjt:  PPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLY

Query:  NVGPLAFEPTCVH
        NVGPLAFEP+C H
Subjt:  NVGPLAFEPTCVH

XP_023519972.1 non-classical arabinogalactan protein 31-like [Cucurbita pepo subsp. pepo] (e value: 2.4e-42, % identity: 51.38)
Query:  AAYHGSPA-------AETPTPAPSPTYHHGHH--------PVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSE
        AA HGSP           P P  +P++HH HH        PV AP  P        P  P   +T +                     PR+  ++     
Subjt:  AAYHGSPA-------AETPTPAPSPTYHHGHH--------PVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSE

Query:  TKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAE
         K   +      L    + GATVKLSCKNTKYAP V+TATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKS + SCSK S +NGG DG ELKP KAF DAE
Subjt:  TKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAE

Query:  KKPVVLYNVGPLAFEPTC
        KKPVVLYNVGPLAFEPTC
Subjt:  KKPVVLYNVGPLAFEPTC

XP_038876988.1 non-classical arabinogalactan protein 31 [Benincasa hispida] (e value: 3.5e-41, % identity: 51.12)
Query:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAP--------------------SHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKL
        AAYHG       TPAP+P++H GHHPVAAP                    SH    +    P  P  PT     +   + + +          PR+  ++
Subjt:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAP--------------------SHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKL

Query:  SEDSETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKA
              K   +      L    ++GATVKLSCKNTKYAP V+TAT+DKNGYFRLAAPKNVTSYAFHRCKVYLVKS + SCSK S LNGG DG ELKP +A
Subjt:  SEDSETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKA

Query:  FYDAEKKPVVLYNVGPLAFEPTC
        F D EKKPVVLYNVGPLAFEPTC
Subjt:  FYDAEKKPVVLYNVGPLAFEPTC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LXF0 Structural constituent of cell wall (e value: 2.6e-42, % identity: 50)
Query:  LFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPS------------HPTTTTTTT--RPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRT--
        L  C  +L A+    AA + TPAP+PT+H+ HHPVAAP+            H  T + T+   P SP  P  + ++ L     +  +       +P T  
Subjt:  LFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPS------------HPTTTTTTT--RPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRT--

Query:  ------LKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGG
              ++ +      K  D         ++GATVKLSCKNTKYAPAV+TAT+D+NGYFRLAAPKNVTSYAFHRCKVYLVKS +  C K S +NGGVDG 
Subjt:  ------LKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGG

Query:  ELKPEKAFYDAEKKPVVLYNVGPLAFEPTC
        ELKP +AF D EKKPVVLYNVGPLAFEPTC
Subjt:  ELKPEKAFYDAEKKPVVLYNVGPLAFEPTC

A0A1S3B5Q3 non-classical arabinogalactan protein 31 (e value: 2.4e-40, % identity: 49.32)
Query:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS
        AA + TPAP+PT+H+ HHPVAAP+                       HP + +    P  P  PT     +   + + +          PR+  ++    
Subjt:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS

Query:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA
          K   +      L    ++GATVKLSCKNTKYAP V+TAT+DKNGYFRLAAPKNVTSYAFHRCKVYLV+S + +C K S LNGG DG ELKP +AF D 
Subjt:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA

Query:  EKKPVVLYNVGPLAFEPTC
        EKKPVVLYNVGPLAFEPTC
Subjt:  EKKPVVLYNVGPLAFEPTC

A0A5A7URZ5 Non-classical arabinogalactan protein 31 (e value: 2.4e-40, % identity: 49.32)
Query:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS
        AA + TPAP+PT+H+ HHPVAAP+                       HP + +    P  P  PT     +   + + +          PR+  ++    
Subjt:  AAETPTPAPSPTYHHGHHPVAAPS-----------------------HPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDS

Query:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA
          K   +      L    ++GATVKLSCKNTKYAP V+TAT+DKNGYFRLAAPKNVTSYAFHRCKVYLV+S + +C K S LNGG DG ELKP +AF D 
Subjt:  ETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA

Query:  EKKPVVLYNVGPLAFEPTC
        EKKPVVLYNVGPLAFEPTC
Subjt:  EKKPVVLYNVGPLAFEPTC

A0A6J1EHN2 non-classical arabinogalactan protein 31-like (e value: 3.5e-39, % identity: 50)
Query:  AAYHGSPA-------AETPTPAPSPTYHHGHH-----PVAAP-----SHPTTTTTTTRPASPR--VPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLS
        AA HGSP           P P  +P++HH HH     P  +P     SH    +    P  P    P  + +    R  T++          PR+  ++ 
Subjt:  AAYHGSPA-------AETPTPAPSPTYHHGHH-----PVAAP-----SHPTTTTTTTRPASPR--VPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLS

Query:  EDSETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAF
             K   +      L    + GATVKLSCKNTKYAP ++TATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKS + SC+K S +NGG DG ELKP KAF
Subjt:  EDSETKLADFLPPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAF

Query:  YDAEKKPVVLYNVGPLAFEPTC
         DAEKKPVVLYNVGPLAFEPTC
Subjt:  YDAEKKPVVLYNVGPLAFEPTC

A0A6J1KM21 non-classical arabinogalactan protein 31-like (e value: 4.2e-40, % identity: 53.52)
Query:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAPSH-------PTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCR-PRTLKKLSEDSETKLADFL
        AA HGS     PTPAP P   H   PVAAPSH       P+ +      A    P+   ++       +  ++  KR    PR+  ++      K   + 
Subjt:  AAYHGSPAAETPTPAPSPTYHHGHHPVAAPSH-------PTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCR-PRTLKKLSEDSETKLADFL

Query:  PPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLY
             L    + GA VKLSCKNTKYAP V+TATTDKNGYFRLAAPKNVTSYAFHRCKV+LVKS + SCSK S +NGGVDG ELKP KAF DAEKKPVVLY
Subjt:  PPSLCLS---IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLY

Query:  NVGPLAFEPTCVH
        NVGPLAFEP+C H
Subjt:  NVGPLAFEPTCVH

SwissProt top hits (e value, % identity, alignment)
P93013 Non-classical arabinogalactan protein 30 (e value: 1.3e-11, % identity: 29.3)
Query:  SNALPLSCDLPNLASSQHSSMNHLPQLNSQVAPLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVP
        S+   L  + P  +   HS   HL           P PP KL  L             P A+ P   P+  Y     P+  P+ P        P  P + 
Subjt:  SNALPLSCDLPNLASSQHSSMNHLPQLNSQVAPLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVP

Query:  TTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVY
           L  +    +   L+           ++ +      K A          +  A V+L CKN K   ++    TDKNGYF L APK VT+Y    C+ +
Subjt:  TTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVY

Query:  LVKSAEGSCSKPSNLNGGVDGGELKP--EKAFYDAEKK--PVVLYNVGPLAFEPTC
        LVKS +  CSK S+L+ G  G  LKP  +  F     +     +YNVGP AFEPTC
Subjt:  LVKSAEGSCSKPSNLNGGVDGGELKP--EKAFYDAEKK--PVVLYNVGPLAFEPTC

Q03211 Pistil-specific extensin-like protein (e value: 1.1e-08, % identity: 31.8)
Query:  PLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLR--LIKRRCRPRTLK
        P  P PP K            A   SPA + PT  P P       P+  P  P        P+    P+      ++  F        LI RR  P  +K
Subjt:  PLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLR--LIKRRCRPRTLK

Query:  KLSEDSETKLADFL-----------PPSLCLS-IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGG
         L    +  +   L           P  L  S + GA VKL C   K    VQ ATTD  G FR+  PK++T+    +CKVYLVKS   +C+ P+N NGG
Subjt:  KLSEDSETKLADFL-----------PPSLCLS-IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGG

Query:  VDGGELKP--------EKAFYDAEKKPVVLYNVGPLAFE
          GG LKP          A    +     LY VGP  FE
Subjt:  VDGGELKP--------EKAFYDAEKKPVVLYNVGPLAFE

Q9FZA2 Non-classical arabinogalactan protein 31 (e value: 1.1e-16, % identity: 37.63)
Query:  TPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCK
        T  P   P       PV  P +P T      P SP         +    F   L+           ++        K A F        I GATVKL CK
Subjt:  TPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCK

Query:  NTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC
        + K   A    TTDKNGYF L APK VT++ F  C+VYLVKS +  CSK S L GG  G ELKPEK    +     K    L+NVGP AF P+C
Subjt:  NTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC

Arabidopsis top hits (e value, % identity, alignment)
AT1G28290.1 arabinogalactan protein 31 (e value: 7.5e-18, % identity: 37.63)
Query:  TPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCK
        T  P   P       PV  P +P T      P SP         +    F   L+           ++        K A F        I GATVKL CK
Subjt:  TPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCK

Query:  NTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC
        + K   A    TTDKNGYF L APK VT++ F  C+VYLVKS +  CSK S L GG  G ELKPEK    +     K    L+NVGP AF P+C
Subjt:  NTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC

AT1G28290.2 arabinogalactan protein 31 (e value: 7.5e-18, % identity: 38.5)
Query:  PAAETPTPAP--SPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGAT
        P    PT AP   PT      PV  P+ P     T  P  P V   T   +   ++     R +        ++        K A F        I GAT
Subjt:  PAAETPTPAP--SPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGAT

Query:  VKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC
        VKL CK+ K   A    TTDKNGYF L APK VT++ F  C+VYLVKS +  CSK S L GG  G ELKPEK    +     K    L+NVGP AF P+C
Subjt:  VKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSNLNGGVDGGELKPEKAFYDA----EKKPVVLYNVGPLAFEPTC

AT2G33790.1 arabinogalactan protein 30 (e value: 9.5e-13, % identity: 29.3)
Query:  SNALPLSCDLPNLASSQHSSMNHLPQLNSQVAPLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVP
        S+   L  + P  +   HS   HL           P PP KL  L             P A+ P   P+  Y     P+  P+ P        P  P + 
Subjt:  SNALPLSCDLPNLASSQHSSMNHLPQLNSQVAPLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVP

Query:  TTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVY
           L  +    +   L+           ++ +      K A          +  A V+L CKN K   ++    TDKNGYF L APK VT+Y    C+ +
Subjt:  TTTLMHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVY

Query:  LVKSAEGSCSKPSNLNGGVDGGELKP--EKAFYDAEKK--PVVLYNVGPLAFEPTC
        LVKS +  CSK S+L+ G  G  LKP  +  F     +     +YNVGP AFEPTC
Subjt:  LVKSAEGSCSKPSNLNGGVDGGELKP--EKAFYDAEKK--PVVLYNVGPLAFEPTC

AT2G34700.1 Pollen Ole e 1 allergen and extensin family protein (e value: 8.1e-20, % identity: 47.62)
Query:  IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLYNVGPLA
        + GATVKL+C NTK    ++T  TDKNGYF + APK +T+YAFH C+ +       +A  +C+ PS LN G+ G  LKP K     E    VL++VGP A
Subjt:  IAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVK----SAEGSCSKPSNLNGGVDGGELKPEKAFYDAEKKPVVLYNVGPLA

Query:  FEPTC
        FEP C
Subjt:  FEPTC


Sequences
CDS sequence
ATGGCCCAAGCCTCTGAATCCAATGCTTTACCCCTTTCCTGTGATCTTCCCAACTTGGCTTCTTCTCAACACTCATCAATGAACCATCTCCCCCAACTGAATTCTCAAGT
TGCTCCTCTAATACCTTCACCACCTGCCAAACTCCGGCCTCTTTTCCGCTGCTTCAATGTCTTAGCCGCATATCACGGCAGCCCTGCGGCTGAGACGCCCACTCCAGCTC
CTTCACCGACTTACCACCACGGTCACCACCCAGTAGCCGCCCCAAGCCACCCCACCACCACCACCACCACCACCCGCCCAGCCAGTCCCCGAGTCCCCACTACCACCCTC
ATGCACCTACTGCTTCGCCTGTTTACCCACCTCCTCCTCCGGCTCATAAAGCGCCGGTGCCGTCCCCGAACTCTGAAGAAACTGTCTGAAGATTCTGAAACCAAGCTTGC
TGACTTTCTTCCCCCCTCCCTCTGTCTGTCCATTGCAGGTGCTACAGTTAAGCTTTCATGCAAGAACACCAAGTACGCTCCGGCCGTCCAAACCGCCACCACTGACAAGA
ACGGCTACTTCCGGCTGGCTGCGCCGAAGAATGTAACCAGCTACGCATTCCACCGGTGCAAGGTTTACCTGGTGAAGTCGGCGGAGGGCAGTTGCAGTAAGCCCTCTAAT
CTCAACGGCGGAGTCGACGGTGGGGAGTTGAAGCCGGAGAAGGCATTCTACGACGCCGAAAAGAAACCGGTGGTGCTTTACAATGTCGGGCCATTGGCGTTTGAACCCAC
CTGCGTGCACGTTGAACGGGAGGGCAACGGTCGGAAATTGATTGCAGTGTGGCAGAAAATGTGGCTAGTGAGCTGCCGTGTGTTGGGTGACGTGATTCGTAGGGTAGGGG
CTAGGGAATTGGGTTACTGTCTGCCGCTCCCACTGATGCTCTGTCGGGAGTCTCGAGCAGAATTTTGA
mRNA sequence
ATGGCCCAAGCCTCTGAATCCAATGCTTTACCCCTTTCCTGTGATCTTCCCAACTTGGCTTCTTCTCAACACTCATCAATGAACCATCTCCCCCAACTGAATTCTCAAGT
TGCTCCTCTAATACCTTCACCACCTGCCAAACTCCGGCCTCTTTTCCGCTGCTTCAATGTCTTAGCCGCATATCACGGCAGCCCTGCGGCTGAGACGCCCACTCCAGCTC
CTTCACCGACTTACCACCACGGTCACCACCCAGTAGCCGCCCCAAGCCACCCCACCACCACCACCACCACCACCCGCCCAGCCAGTCCCCGAGTCCCCACTACCACCCTC
ATGCACCTACTGCTTCGCCTGTTTACCCACCTCCTCCTCCGGCTCATAAAGCGCCGGTGCCGTCCCCGAACTCTGAAGAAACTGTCTGAAGATTCTGAAACCAAGCTTGC
TGACTTTCTTCCCCCCTCCCTCTGTCTGTCCATTGCAGGTGCTACAGTTAAGCTTTCATGCAAGAACACCAAGTACGCTCCGGCCGTCCAAACCGCCACCACTGACAAGA
ACGGCTACTTCCGGCTGGCTGCGCCGAAGAATGTAACCAGCTACGCATTCCACCGGTGCAAGGTTTACCTGGTGAAGTCGGCGGAGGGCAGTTGCAGTAAGCCCTCTAAT
CTCAACGGCGGAGTCGACGGTGGGGAGTTGAAGCCGGAGAAGGCATTCTACGACGCCGAAAAGAAACCGGTGGTGCTTTACAATGTCGGGCCATTGGCGTTTGAACCCAC
CTGCGTGCACGTTGAACGGGAGGGCAACGGTCGGAAATTGATTGCAGTGTGGCAGAAAATGTGGCTAGTGAGCTGCCGTGTGTTGGGTGACGTGATTCGTAGGGTAGGGG
CTAGGGAATTGGGTTACTGTCTGCCGCTCCCACTGATGCTCTGTCGGGAGTCTCGAGCAGAATTTTGA
Protein sequence
MAQASESNALPLSCDLPNLASSQHSSMNHLPQLNSQVAPLIPSPPAKLRPLFRCFNVLAAYHGSPAAETPTPAPSPTYHHGHHPVAAPSHPTTTTTTTRPASPRVPTTTL
MHLLLRLFTHLLLRLIKRRCRPRTLKKLSEDSETKLADFLPPSLCLSIAGATVKLSCKNTKYAPAVQTATTDKNGYFRLAAPKNVTSYAFHRCKVYLVKSAEGSCSKPSN
LNGGVDGGELKPEKAFYDAEKKPVVLYNVGPLAFEPTCVHVEREGNGRKLIAVWQKMWLVSCRVLGDVIRRVGARELGYCLPLPLMLCRESRAEF
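
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The sketch below is a minimal illustration that assumes the standard genetic code and a CDS beginning at the first base of the start codon. The function name `translate` and the choice to emit `X` for unrecognised codons are illustrative assumptions, not part of the database.

```python
from itertools import product

# Standard genetic code, written in the conventional T, C, A, G ordering.
# ASSUMPTION: the standard nuclear code applies to this gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# The first seven codons of the CDS listed above.
print(translate("ATGGCCCAAGCCTCTGAATCC"))
# prints: MAQASES
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence gives a quick consistency check between the two listings.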