CuGenDBv2

Sgr024549 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr024549
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: mucin-2-like isoform X1
Genome location: tig00001291:4177255..4178881
RNA-Seq Expression: Sgr024549
Synteny: Sgr024549
Gene Ontology terms: NA
InterPro domains: NA
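
The genome location field packs a contig name and a coordinate range into one string. A minimal Python sketch for splitting it is shown here; the helper name `parse_location` is hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00001291:4177255..4178881")
# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 1627 bp on the contig.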


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579620.1 hypothetical protein SDJN03_24068, partial [Cucurbita argyrosperma subsp. sororia]; e value: 7.8e-134; % identity: 78.83
Query:  NGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHN-NGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLE
        +GNGSK RWMMGLHLKGRKE DNEDLHLFRELHKR KERTAC LLPV  ELEH+  GNS FYRIQSIRKES FEL SEGNKNDYDWLKTPPATPLFPSLE
Subjt:  NGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHN-NGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLE

Query:  MEATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSN
        MEA A HM  QKET  +Q LSQPQSQAS+NSE TKRS+GIEKSPTT P+IPSRSITPS++PRINSSTE KNT+R T    NPNQR +QASSTDPTIKR+N
Subjt:  MEATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSN

Query:  IISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPR
           N  KS NLKES TD+LTSNLSK     AK NPN NPRSRTTSPIVRSTIAS I +FSNETPPNLRTDRSSSVTRGRQV        QK E   IN R
Subjt:  IISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPR

Query:  RQSCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE
        RQSCSPSVTRGRKVE KQEINRGGNLSNDQRRTE+TNI+GSRMVERVMNARKGN N ++
Subjt:  RQSCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE

XP_008437498.1 PREDICTED: mucin-2-like isoform X1 [Cucumis melo]; e value: 4.1e-135; % identity: 72.7
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTA LLL   D+LEHN+ GNSPFYRI SI+KES      E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA    N  +ETPL+QPLSQPQSQASSNSE TK+SSGIEKSP  K K+PSRS TPSHRPRINSS + KNTKRTT P+PNP+QR  Q S  D TIK
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  N+KES TD+LTSNLSKGS N  KPNPN   NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ       G  +    
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLSNDQRRTE+TNILGSRMVERVMNARK  G GNE+RD KP  RSGIGE             RQ V  LS +
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN

Query:  HLP
         LP
Subjt:  HLP

XP_008437499.1 PREDICTED: mucin-2-like isoform X2 [Cucumis melo]; e value: 5.4e-135; % identity: 75.86
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTA LLL   D+LEHN+ GNSPFYRI SI+KES      E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA    N  +ETPL+QPLSQPQSQASSNSE TK+SSGIEKSP  K K+PSRS TPSHRPRINSS + KNTKRTT P+PNP+QR  Q S  D TIK
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  N+KES TD+LTSNLSKGS N  KPNPN   NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ       G  +    
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLSNDQRRTE+TNILGSRMVERVMNARK  G GNE+RD KP  RSGIGE
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE

XP_023520514.1 putative uncharacterized protein DDB_G0282133 isoform X1 [Cucurbita pepo subsp. pepo]; e value: 2.8e-136; % identity: 79.39
Query:  NGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEH-NNGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLE
        +GNGSK RWMMGLHLKGRKE DNEDLHLFRELHKR KERTAC LLPV D LEH N GNS FYRIQSIRKES FEL SEGNKNDYDWLKTPPATPLFPSLE
Subjt:  NGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEH-NNGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLE

Query:  MEATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSN
        MEA A HM  QKET  +Q LSQPQSQAS+NSE TKRS+G+EKSPTT P+IPSRSITPS++PRINSSTE KNT+R T    NPNQR +QASSTDPTIKR+N
Subjt:  MEATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSN

Query:  IISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPR
           N  KS NLKESCTD+LTSNLSK     AK NPN NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQV        QK EA  IN R
Subjt:  IISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPR

Query:  RQSCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE
        RQSCSPSVTRGRKVE K+EINRGGNLSNDQRRTE+TNI+GSRMVERVMNARKGN N ++
Subjt:  RQSCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE

XP_031740890.1 serine/arginine repetitive matrix protein 1 isoform X3 [Cucumis sativus]; e value: 2.2e-136; % identity: 77.78
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTAC LLPV D+LEHN+ GNSPFYRI SI+KES F    EGNKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA  H N QKETPLVQPLSQPQSQASSNSE TK+SSGIEKSP TK KIPSRSITPS+RPRINSS + KNTKRTT P+PNPN R  Q S  D T+K
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  NLKES TD+LTSNL KGS N  KPN N   NPRSR TSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ         +K+EA 
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLS-NDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLS NDQRRTE TNILGSRMVERVMNARK    GNEERD KP  R GIGE
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLS-NDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJQ3 Uncharacterized protein; e value: 1.1e-136; % identity: 77.78
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTAC LLPV D+LEHN+ GNSPFYRI SI+KES F    EGNKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA  H N QKETPLVQPLSQPQSQASSNSE TK+SSGIEKSP TK KIPSRSITPS+RPRINSS + KNTKRTT P+PNPN R  Q S  D T+K
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  NLKES TD+LTSNL KGS N  KPN N   NPRSR TSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ         +K+EA 
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLS-NDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLS NDQRRTE TNILGSRMVERVMNARK    GNEERD KP  R GIGE
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLS-NDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE

A0A1S3AUR3 mucin-2-like isoform X2; e value: 2.6e-135; % identity: 75.86
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTA LLL   D+LEHN+ GNSPFYRI SI+KES      E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA    N  +ETPL+QPLSQPQSQASSNSE TK+SSGIEKSP  K K+PSRS TPSHRPRINSS + KNTKRTT P+PNP+QR  Q S  D TIK
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  N+KES TD+LTSNLSKGS N  KPNPN   NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ       G  +    
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLSNDQRRTE+TNILGSRMVERVMNARK  G GNE+RD KP  RSGIGE
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGE

A0A1S3AUS5 mucin-2-like isoform X1; e value: 2.0e-135; % identity: 72.7
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTA LLL   D+LEHN+ GNSPFYRI SI+KES      E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA    N  +ETPL+QPLSQPQSQASSNSE TK+SSGIEKSP  K K+PSRS TPSHRPRINSS + KNTKRTT P+PNP+QR  Q S  D TIK
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  N+KES TD+LTSNLSKGS N  KPNPN   NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ       G  +    
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLSNDQRRTE+TNILGSRMVERVMNARK  G GNE+RD KP  RSGIGE             RQ V  LS +
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN

Query:  HLP
         LP
Subjt:  HLP

A0A5D3C2G1 Mucin-2-like isoform X1; e value: 2.0e-135; % identity: 72.7
Query:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS
        MNNGNGSK RWMMGLH KGRKERDNEDLHLFREL+KR+KERTA LLL   D+LEHN+ GNSPFYRI SI+KES      E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK
        LEMEATA    N  +ETPL+QPLSQPQSQASSNSE TK+SSGIEKSP  K K+PSRS TPSHRPRINSS + KNTKRTT P+PNP+QR  Q S  D TIK
Subjt:  LEMEATA-LHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI
        R    +NN K  N+KES TD+LTSNLSKGS N  KPNPN   NPRSRTTSPIVRSTIAS I EFSNETPPNLRTDRSSSVTRGRQ       G  +    
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNS--NPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAI

Query:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN
        NPRRQSCSPSVTRGRKVE  KQE NRGGNLSNDQRRTE+TNILGSRMVERVMNARK  G GNE+RD KP  RSGIGE             RQ V  LS +
Subjt:  NPRRQSCSPSVTRGRKVE-GKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNN

Query:  HLP
         LP
Subjt:  HLP

A0A6J1ESX0 Uncharacterized protein; e value: 8.4e-134; % identity: 78.99
Query:  NGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEME
        NGSK RWMMGLHLKGRKE DNEDLHLFRELHKR KERTAC LLPV D LEH+N GNS FYRIQ IRKES FEL SEGNKNDYDWLKTPPATPLFPSLEME
Subjt:  NGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNN-GNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEME

Query:  ATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSNII
        A A HM  QKET  +Q LSQPQSQAS+NSE TKRS+GIEKSPTT P+IPSRSITPS++PRINSSTE KNT+R T    NPNQR +QASSTDPTIKR+N  
Subjt:  ATALHMNVQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSNII

Query:  SNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPRRQ
         N  KS NLKES TD+LTSNLSK     AK NPN NPRSRTTSPIVRSTIAS I +FSNETPPNLRTDRSSSVTRGRQV        QK E   IN RRQ
Subjt:  SNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEA--INPRRQ

Query:  SCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE
        SCSPSVTRGRKVE KQEINRGGNLSNDQRRTE+TNI+GSRMVERVMNARKGN N ++
Subjt:  SCSPSVTRGRKVEGKQEINRGGNLSNDQRRTEATNILGSRMVERVMNARKGNGNGNE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G27850.1 unknown protein; e value: 5.2e-11; % identity: 31.55
Query:  NEDLHLFRELHKREKERTACLLLPVSDELEHNNGNSPFYRIQ---SIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKETPLVQPL
        ++DL LF E+  +E++     LL  SD+LE        +  +    ++ ES   L +EG+KNDYDWL TPP TPLFPSL+ +  A  + V++     +P 
Subjt:  NEDLHLFRELHKREKERTACLLLPVSDELEHNNGNSPFYRIQ---SIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKETPLVQPL

Query:  SQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNT---KRTTTP----TPNPNQRTTQAS-STDPTIKRSNIISNNNKSINLK
        SQ     SS  E ++RSS    SP      P        R R  SS  H +    +R+ TP    +P P + +   S S  PT +R              
Subjt:  SQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNT---KRTTTP----TPNPNQRTTQAS-STDPTIKRSNIISNNNKSINLK

Query:  ESCTDFLTSNLSKGSVNMAKP-----NPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRT---DRSSSVTRGRQVHQTAGIGGQKAEAINPR-RQSCS
                  +S GS  MA P     +P S+ R  + SP ++    S+I  FS + PPNLRT   DR +S  RG       G      +A++ R R+S S
Subjt:  ESCTDFLTSNLSKGSVNMAKP-----NPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRT---DRSSSVTRGRQVHQTAGIGGQKAEAINPR-RQSCS

Query:  PSVTRGRKVEGKQEINR
        PS +R        E +R
Subjt:  PSVTRGRKVEGKQEINR

AT2G40070.1 BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); e value: 6.3e-17; % identity: 32.65
Query:  NEDLHLFRELHKREKERTACLLLPVSDELE----HNNGNSPFYRIQSIRKESRFE-----LFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKET
        +E+L LF E+ +REKE+   LL    DE E      +G SP + I S    SR       L SEG+KNDY+WL TPP TPLFPSLEME+    M+ Q   
Subjt:  NEDLHLFRELHKREKERTACLLLPVSDELE----HNNGNSPFYRIQSIRKESRFE-----LFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKET

Query:  PLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKI---------PSRSITPSHRP-----RINSSTEHKNTKRTTTPTPN-----------PNQRTTQA
           +P +     A+S++E   R+    +  T+ P +         PS S  P  RP     R ++ T +  + R +TPT              N R+T +
Subjt:  PLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKI---------PSRSITPSHRP-----RINSSTEHKNTKRTTTPTPN-----------PNQRTTQA

Query:  SSTDPT-IKRSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR--SRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTR
        ++T PT + RS  +S++  +    +  T   ++  S GSV  + P+  +     SR+T+P+ RST  S         PP+    RSS+ TR
Subjt:  SSTDPT-IKRSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR--SRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTR

AT2G40070.2 FUNCTIONS IN: molecular_function unknown; e value: 2.9e-14; % identity: 31.91
Query:  LHKREKERTACLLLPVSDELE----HNNGNSPFYRIQSIRKESRFE-----LFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKETPLVQPLSQP
        + +REKE+   LL    DE E      +G SP + I S    SR       L SEG+KNDY+WL TPP TPLFPSLEME+    M+ Q      +P +  
Subjt:  LHKREKERTACLLLPVSDELE----HNNGNSPFYRIQSIRKESRFE-----LFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMNVQKETPLVQPLSQP

Query:  QSQASSNSEPTKRSSGIEKSPTTKPKI---------PSRSITPSHRP-----RINSSTEHKNTKRTTTPTPN-----------PNQRTTQASSTDPT-IK
           A+S++E   R+    +  T+ P +         PS S  P  RP     R ++ T +  + R +TPT              N R+T +++T PT + 
Subjt:  QSQASSNSEPTKRSSGIEKSPTTKPKI---------PSRSITPSHRP-----RINSSTEHKNTKRTTTPTPN-----------PNQRTTQASSTDPT-IK

Query:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR--SRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTR
        RS  +S++  +    +  T   ++  S GSV  + P+  +     SR+T+P+ RST  S         PP+    RSS+ TR
Subjt:  RSNIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR--SRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTR

AT3G09000.1 proline-rich family protein; e value: 1.5e-13; % identity: 29.75
Query:  NEDLHLFRELHKREKERTACLLLPVSDELEHN----------------NGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEMEATALH
        +E+L LF E+ +REKE  A  LL  SD +  N                  +S  Y ++    E+   L+SE  K+DYDWL TPP TP F   E E+    
Subjt:  NEDLHLFRELHKREKERTACLLLPVSDELEHN----------------NGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEMEATALH

Query:  MNVQKETPLVQP-------------------------------LSQPQSQASSNS-----EPTKRSSGIEKSPTTKPKIP-------SRSITPSHRPRI-
        MN Q + P  +P                               L +P S  SS S      PT+RS+    +PTT    P       SRS TP+ R  + 
Subjt:  MNVQKETPLVQP-------------------------------LSQPQSQASSNS-----EPTKRSSGIEKSPTTKPKIP-------SRSITPSHRPRI-

Query:  -----------NSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRS------------NIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR
                    ++T    + R+ TPT   N R + ASS  P  + +            +I+S+   S     S T    ++LSK       P+P  N  
Subjt:  -----------NSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRS------------NIISNNNKSINLKESCTDFLTSNLSKGSVNMAKPNPNSNPR

Query:  SRTTSPIVRSTIASHIAEFSNETPPNLRT---DRSSSVTRGRQVHQTA----------GIGGQKAEAINPRRQSCSPSVTRGRKVEGKQEINRGGNLSND
        SR   P         +  FS E PPNLRT   DR  S +RGR    +A          G G     + N RRQSCSPS  RGR   G    N  G+L+  
Subjt:  SRTTSPIVRSTIASHIAEFSNETPPNLRT---DRSSSVTRGRQVHQTA----------GIGGQKAEAINPRRQSCSPSVTRGRKVEGKQEINRGGNLSND

Query:  QRRTEATN-----------ILGSRMVERVMNARKGNGNGNEERDPKPRGR--SGIGEIGF---MSKSSADNTLR
        + R +A+N            +G++MVERV+N RK       E   +  G+  S    +G+   +SKSS D  +R
Subjt:  QRRTEATN-----------ILGSRMVERVMNARKGNGNGNEERDPKPRGR--SGIGEIGF---MSKSSADNTLR

AT5G01280.1 BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1); e value: 5.4e-08; % identity: 27.95
Query:  LFSEGNKNDYDWLKTPPATP-----------------LFPSL--------EMEATALHMNVQKETPLVQPLSQPQSQASSNS-----EPTKRSSGIEKSP
        L+S+G K+DY+WL TPP +P                 L   L        E + T+LH      +  V  + +P S +SS S      PT++S    K+P
Subjt:  LFSEGNKNDYDWLKTPPATP-----------------LFPSL--------EMEATALHMNVQKETPLVQPLSQPQSQASSNS-----EPTKRSSGIEKSP

Query:  TTKPKIP-SRSITPSHRPRINSSTEHKNTK---------------------RTTTPTPNPNQRTTQASSTDPTIKRSNIISNNNKSINLKESCTDFLTSN
          +P  P SR+ + + R  + SS+   +T+                     R T PT + +Q+TT  S+T        + + N+K  +   + T   ++ 
Subjt:  TTKPKIP-SRSITPSHRPRINSSTEHKNTK---------------------RTTTPTPNPNQRTTQASSTDPTIKRSNIISNNNKSINLKESCTDFLTSN

Query:  LSKGSVNMAKP-NPNSNPR-SRTTSPIVRST--IASHIAEFSNETPPNLRT-----DRSSSVTRGRQVHQTAGIGGQKAEAINPRRQSCSPSVTR--GRK
            +V  +KP  P S P  S   SPIVRS       +  FS E P NLRT      +++S +R R    ++       E    +RQSCSPS +R     
Subjt:  LSKGSVNMAKP-NPNSNPR-SRTTSPIVRST--IASHIAEFSNETPPNLRT-----DRSSSVTRGRQVHQTAGIGGQKAEAINPRRQSCSPSVTR--GRK

Query:  VEGKQEINRG--GNLSNDQRRTEATNILGSRMVERVMNARK--------------GNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLR
        V G     RG     +ND  R  +    G++ VE+V+N RK              G G G+        G  G G    +SKSS D  LR
Subjt:  VEGKQEINRG--GNLSNDQRRTEATNILGSRMVERVMNARK--------------GNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLR


Sequences
CDS sequence
ATGAACAATGGCAATGGAAGCAAAGGCAGATGGATGATGGGTCTGCATTTAAAGGGTAGAAAGGAGAGAGACAATGAAGATCTCCATCTGTTTCGGGAGCTTCATAAGCG
CGAGAAGGAACGCACCGCCTGCCTTCTTCTGCCCGTCTCCGATGAGCTCGAGCACAACAATGGGAATTCTCCCTTCTACAGAATTCAATCAATCAGGAAGGAATCTAGAT
TTGAACTTTTTTCTGAGGGCAACAAAAACGATTATGATTGGCTGAAAACACCACCTGCAACTCCTCTGTTTCCATCTTTGGAAATGGAAGCCACTGCTCTTCATATGAAT
GTTCAGAAAGAGACACCACTTGTCCAACCTCTCTCACAGCCACAGTCACAGGCTTCAAGCAATTCAGAACCAACAAAGAGAAGCAGTGGAATAGAGAAATCTCCAACCAC
AAAACCAAAAATACCATCCAGATCCATCACTCCCAGTCATAGACCGCGCATCAATTCATCAACCGAACACAAAAACACCAAAAGAACCACAACCCCAACTCCAAATCCAA
ACCAAAGAACCACTCAGGCATCATCAACCGACCCCACGATCAAAAGAAGCAACATCATCAGCAACAACAACAAATCCATAAATCTGAAAGAAAGTTGCACGGATTTTCTA
ACCTCAAACCTGTCCAAAGGGTCAGTGAATATGGCCAAACCAAATCCAAATTCGAATCCCAGAAGTAGAACAACATCCCCAATTGTAAGATCTACAATAGCATCTCACAT
TGCAGAGTTCTCCAACGAAACGCCTCCAAATCTGAGGACCGACCGGTCGAGCTCCGTGACGAGAGGGCGGCAGGTCCATCAAACGGCAGGAATTGGGGGGCAGAAAGCAG
AGGCGATCAATCCCAGAAGGCAGTCTTGCTCGCCGAGCGTGACGAGGGGACGGAAGGTGGAAGGGAAGCAGGAGATCAACAGAGGCGGAAACTTGAGCAATGATCAGAGA
AGAACCGAAGCGACGAACATTCTTGGGAGTCGAATGGTAGAGAGAGTGATGAATGCGAGAAAAGGAAATGGAAATGGAAATGAGGAGAGAGATCCGAAGCCACGAGGACG
AAGTGGGATTGGAGAAATAGGGTTCATGTCTAAGAGTTCAGCAGATAACACGCTGAGGCAATTCGTAAGCTTTCTTTCAAACAATCATCTTCCATTGCATGTATTTGAAT
TCAGGGCAGTTAATTAA
mRNA sequence
ATGAACAATGGCAATGGAAGCAAAGGCAGATGGATGATGGGTCTGCATTTAAAGGGTAGAAAGGAGAGAGACAATGAAGATCTCCATCTGTTTCGGGAGCTTCATAAGCG
CGAGAAGGAACGCACCGCCTGCCTTCTTCTGCCCGTCTCCGATGAGCTCGAGCACAACAATGGGAATTCTCCCTTCTACAGAATTCAATCAATCAGGAAGGAATCTAGAT
TTGAACTTTTTTCTGAGGGCAACAAAAACGATTATGATTGGCTGAAAACACCACCTGCAACTCCTCTGTTTCCATCTTTGGAAATGGAAGCCACTGCTCTTCATATGAAT
GTTCAGAAAGAGACACCACTTGTCCAACCTCTCTCACAGCCACAGTCACAGGCTTCAAGCAATTCAGAACCAACAAAGAGAAGCAGTGGAATAGAGAAATCTCCAACCAC
AAAACCAAAAATACCATCCAGATCCATCACTCCCAGTCATAGACCGCGCATCAATTCATCAACCGAACACAAAAACACCAAAAGAACCACAACCCCAACTCCAAATCCAA
ACCAAAGAACCACTCAGGCATCATCAACCGACCCCACGATCAAAAGAAGCAACATCATCAGCAACAACAACAAATCCATAAATCTGAAAGAAAGTTGCACGGATTTTCTA
ACCTCAAACCTGTCCAAAGGGTCAGTGAATATGGCCAAACCAAATCCAAATTCGAATCCCAGAAGTAGAACAACATCCCCAATTGTAAGATCTACAATAGCATCTCACAT
TGCAGAGTTCTCCAACGAAACGCCTCCAAATCTGAGGACCGACCGGTCGAGCTCCGTGACGAGAGGGCGGCAGGTCCATCAAACGGCAGGAATTGGGGGGCAGAAAGCAG
AGGCGATCAATCCCAGAAGGCAGTCTTGCTCGCCGAGCGTGACGAGGGGACGGAAGGTGGAAGGGAAGCAGGAGATCAACAGAGGCGGAAACTTGAGCAATGATCAGAGA
AGAACCGAAGCGACGAACATTCTTGGGAGTCGAATGGTAGAGAGAGTGATGAATGCGAGAAAAGGAAATGGAAATGGAAATGAGGAGAGAGATCCGAAGCCACGAGGACG
AAGTGGGATTGGAGAAATAGGGTTCATGTCTAAGAGTTCAGCAGATAACACGCTGAGGCAATTCGTAAGCTTTCTTTCAAACAATCATCTTCCATTGCATGTATTTGAAT
TCAGGGCAGTTAATTAA
Protein sequence
MNNGNGSKGRWMMGLHLKGRKERDNEDLHLFRELHKREKERTACLLLPVSDELEHNNGNSPFYRIQSIRKESRFELFSEGNKNDYDWLKTPPATPLFPSLEMEATALHMN
VQKETPLVQPLSQPQSQASSNSEPTKRSSGIEKSPTTKPKIPSRSITPSHRPRINSSTEHKNTKRTTTPTPNPNQRTTQASSTDPTIKRSNIISNNNKSINLKESCTDFL
TSNLSKGSVNMAKPNPNSNPRSRTTSPIVRSTIASHIAEFSNETPPNLRTDRSSSVTRGRQVHQTAGIGGQKAEAINPRRQSCSPSVTRGRKVEGKQEINRGGNLSNDQR
RTEATNILGSRMVERVMNARKGNGNGNEERDPKPRGRSGIGEIGFMSKSSADNTLRQFVSFLSNNHLPLHVFEFRAVN
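
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (the page does not name a translation table); the function name `translate` and the constants are illustrative, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). Assumption: the standard table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence listed above.
print(translate("ATGAACAATGGCAATGGAAGCAAAGGC"))  # MNNGNGSKG
```

The printed residues match the start of the protein sequence listed above; passing the full CDS (with its line breaks) should reproduce the whole protein if the standard code applies.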