CuGenDBv2

Cla97C01G001200 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C01G001200
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: mucin-2-like isoform X1
Genome location: Cla97Chr01:901416..903724
RNA-Seq Expression: Cla97C01G001200
Synteny: Cla97C01G001200
Gene Ontology terms: NA
InterPro domains: NA
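
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span of the gene computed. The sketch assumes the coordinates are 1-based and inclusive, which this page does not state; `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


# Matches strings such as "Cla97Chr01:901416..903724".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("Cla97Chr01:901416..903724")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 2309 bp on Cla97Chr01. That span covers the whole gene model and is longer than the mRNA listed further down.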


Homology
GenBank top hits: e-value, % identity, alignment
KAG6579620.1 hypothetical protein SDJN03_24068, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.5e-136, identity: 78.12%)
Query:  TMNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        T  +G+GSK RWMMGLHLKG KE +NEDLHLFRELHKR KERTAC LLP+ +LEH+   NS FYRIQSI+KES F LLSEGNKNDYDWLKTPPATPLFPS
Subjt:  TMNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEA AP H  AQKET  +Q  SQPQ+QAS+N ESTKRSNGIEKSP TNPRIPSRSITPS++PRINSSTEPKNT+R T    NPNQRI+QASS DPTIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEA---N
              NNN  K  NLKESYTDYLTSNLSK      K K+NPNPNPRSRTTSPIVRSTIASQIP+FSNETPPNLRTDRSSSVTRGRQ G  +K E    N
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEA---N

Query:  LRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEK
         RRQSCSPSVTRGRKVEVKQE NRGGNLSNDQRR ESTNI+GSRMVERVMNARKGN N  K
Subjt:  LRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEK

XP_008437498.1 PREDICTED: mucin-2-like isoform X1 [Cucumis melo] (e-value: 2.3e-150, identity: 80.05%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTA  LLLP+DDLEHNH  NSPFYRI SIKKESG G L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEATAP   NA +ETPL+QP SQPQ+QASSN ESTK+S+GIEKSPI   ++PSRS TPSHRPRINSS +PKNTKRTT PSPNP+QRI+Q S ID TIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR
               NNN KP N+KESYTDYLTSNLSKG+ N+ KP  N NPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ G  EKSE N RR
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR

Query:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        QSCSPSVTRGRKVE  KQEKNRGGNLSNDQRR ESTNILGSRMVERVMNARKG GNE++D KP  R G G+
Subjt:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

XP_008437499.1 PREDICTED: mucin-2-like isoform X2 [Cucumis melo] (e-value: 2.3e-150, identity: 80.05%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTA  LLLP+DDLEHNH  NSPFYRI SIKKESG G L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEATAP   NA +ETPL+QP SQPQ+QASSN ESTK+S+GIEKSPI   ++PSRS TPSHRPRINSS +PKNTKRTT PSPNP+QRI+Q S ID TIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR
               NNN KP N+KESYTDYLTSNLSKG+ N+ KP  N NPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ G  EKSE N RR
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR

Query:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        QSCSPSVTRGRKVE  KQEKNRGGNLSNDQRR ESTNILGSRMVERVMNARKG GNE++D KP  R G G+
Subjt:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

XP_031740890.1 serine/arginine repetitive matrix protein 1 isoform X3 [Cucumis sativus] (e-value: 3.4e-154, identity: 81.94%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSL
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTAC LLP+DDLEHNH  NSPFYRI SIKKESGFG L EGNKNDYDWLKTPPATPLFPSL
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSL

Query:  EMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKS
        EMEATAPSH+NAQKETPLVQP SQPQ+QASSN ESTK+S+GIEKSPIT  +IPSRSITPS+RPRINSS +PKNTKRTT PSPNPN RI+Q S ID T+K 
Subjt:  EMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKS

Query:  NNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRRQ
              NNN KP NLKESYTDYLTSNL KG+ N+ KP  N NPNPRSR TSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ    EKSEAN RRQ
Subjt:  NNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRRQ

Query:  SCSPSVTRGRKVEV-KQEKNRGGNLS-NDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        SCSPSVTRGRKVEV KQEKNRGGNLS NDQRR E+TNILGSRMVERVMNARK  GNEE+D+KP  R G G+
Subjt:  SCSPSVTRGRKVEV-KQEKNRGGNLS-NDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

XP_038907224.1 BUD13 homolog [Benincasa hispida] (e-value: 1.5e-144, identity: 76.07%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLL-----SEGNKN-DYDW-------
        MNNGSGSKNRWMMGLHLKG KER+NEDLHLFR+LHKRDKERT C LLP+D+LEHNH      + +         G+L     ++  KN D D        
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLL-----SEGNKN-DYDW-------

Query:  -------------LKTPPATPLFPSLEMEATAPSHRNAQKETPLVQPFSQPQT---QASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEP
                     LKTPPATPLFPSLEMEATAPSHRNAQKETPLVQP SQPQ+   QASSN ESTK+SN IEKSP T P+IPSRSITPSHRPRINSS EP
Subjt:  -------------LKTPPATPLFPSLEMEATAPSHRNAQKETPLVQPFSQPQT---QASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEP

Query:  KNTKRTTTPSPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPP
        KNTKRTTTPSPNP QRINQASSIDPTIK       NNN KPINLKE YT+YLTSNL KG+ NTAKPKA  NPNPRSRTTSPIVRSTIASQIPEFSNETPP
Subjt:  KNTKRTTTPSPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPP

Query:  NLRTDRSSSVTRGRQAGIGEKSEANLRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSG
        NLRTDRSSSVTRGRQ G GEKSEAN RRQSCSPSVTRGRKVE KQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKG GNEE+DLKP GR G G
Subjt:  NLRTDRSSSVTRGRQAGIGEKSEANLRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSG

TrEMBL top hits: e-value, % identity, alignment
A0A0A0KJQ3 Uncharacterized protein (e-value: 1.7e-154, identity: 81.94%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSL
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTAC LLP+DDLEHNH  NSPFYRI SIKKESGFG L EGNKNDYDWLKTPPATPLFPSL
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSL

Query:  EMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKS
        EMEATAPSH+NAQKETPLVQP SQPQ+QASSN ESTK+S+GIEKSPIT  +IPSRSITPS+RPRINSS +PKNTKRTT PSPNPN RI+Q S ID T+K 
Subjt:  EMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKS

Query:  NNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRRQ
              NNN KP NLKESYTDYLTSNL KG+ N+ KP  N NPNPRSR TSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ    EKSEAN RRQ
Subjt:  NNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRRQ

Query:  SCSPSVTRGRKVEV-KQEKNRGGNLS-NDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        SCSPSVTRGRKVEV KQEKNRGGNLS NDQRR E+TNILGSRMVERVMNARK  GNEE+D+KP  R G G+
Subjt:  SCSPSVTRGRKVEV-KQEKNRGGNLS-NDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

A0A1S3AUR3 mucin-2-like isoform X2 (e-value: 1.1e-150, identity: 80.05%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTA  LLLP+DDLEHNH  NSPFYRI SIKKESG G L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEATAP   NA +ETPL+QP SQPQ+QASSN ESTK+S+GIEKSPI   ++PSRS TPSHRPRINSS +PKNTKRTT PSPNP+QRI+Q S ID TIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR
               NNN KP N+KESYTDYLTSNLSKG+ N+ KP  N NPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ G  EKSE N RR
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR

Query:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        QSCSPSVTRGRKVE  KQEKNRGGNLSNDQRR ESTNILGSRMVERVMNARKG GNE++D KP  R G G+
Subjt:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

A0A1S3AUS5 mucin-2-like isoform X1 (e-value: 1.1e-150, identity: 80.05%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTA  LLLP+DDLEHNH  NSPFYRI SIKKESG G L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEATAP   NA +ETPL+QP SQPQ+QASSN ESTK+S+GIEKSPI   ++PSRS TPSHRPRINSS +PKNTKRTT PSPNP+QRI+Q S ID TIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR
               NNN KP N+KESYTDYLTSNLSKG+ N+ KP  N NPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ G  EKSE N RR
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR

Query:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        QSCSPSVTRGRKVE  KQEKNRGGNLSNDQRR ESTNILGSRMVERVMNARKG GNE++D KP  R G G+
Subjt:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

A0A5D3C2G1 Mucin-2-like isoform X1 (e-value: 1.1e-150, identity: 80.05%)
Query:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS
        MNNG+GSKNRWMMGLH KG KER+NEDLHLFREL+KRDKERTA  LLLP+DDLEHNH  NSPFYRI SIKKESG G L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGSGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTA-CLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK
        LEMEATAP   NA +ETPL+QP SQPQ+QASSN ESTK+S+GIEKSPI   ++PSRS TPSHRPRINSS +PKNTKRTT PSPNP+QRI+Q S ID TIK
Subjt:  LEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIK

Query:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR
               NNN KP N+KESYTDYLTSNLSKG+ N+ KP  N NPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQ G  EKSE N RR
Subjt:  SNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRR

Query:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ
        QSCSPSVTRGRKVE  KQEKNRGGNLSNDQRR ESTNILGSRMVERVMNARKG GNE++D KP  R G G+
Subjt:  QSCSPSVTRGRKVE-VKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQ

A0A6J1ESX0 Uncharacterized protein (e-value: 2.7e-136, identity: 78.65%)
Query:  SGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEA
        +GSK RWMMGLHLKG KE +NEDLHLFRELHKR KERTAC LLP+ DLEH++  NS FYRIQ I+KES F LLSEGNKNDYDWLKTPPATPLFPSLEMEA
Subjt:  SGSKNRWMMGLHLKGGKERNNEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEA

Query:  TAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKSNNNN
         AP H  AQKET  +Q  SQPQ+QAS+N ESTKRSNGIEKSP TNPRIPSRSITPS++PRINSSTEPKNT+R T    NPNQRI+QASS DPTIK     
Subjt:  TAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKSNNNN

Query:  NNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEA---NLRRQS
         NNN  K  NLKESYTDYLTSNLSK      K K+NPNPNPRSRTTSPIVRSTIASQIP+FSNETPPNLRTDRSSSVTRGRQ G  +K E    N RRQS
Subjt:  NNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEA---NLRRQS

Query:  CSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEK
        CSPSVTRGRKVEVKQE NRGGNLSNDQRR ESTNI+GSRMVERVMNARKGN N  K
Subjt:  CSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEK

SwissProt top hits: e-value, % identity, alignment
No hits found

Arabidopsis top hits: e-value, % identity, alignment
AT1G27850.1 unknown protein (e-value: 7.7e-11, identity: 31.29%)
Query:  NEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSI--KKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSHRNAQKETPLVQPF
        ++DL LF E+  +DKER + LL   DDLE         +   +I  + ES   L +EG+KNDYDWL TPP TPLFPSL+ +  A S     +      P 
Subjt:  NEDLHLFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSI--KKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSHRNAQKETPLVQPF

Query:  SQPQTQASSNPESTKRSNGIEKSP---ITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTP----SPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINL
        SQ     SS  E ++RS+    SP    T+PR  +       RP       P + +R+ TP    SP P +     S   PT  S   +  +        
Subjt:  SQPQTQASSNPESTKRSNGIEKSP---ITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTP----SPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINL

Query:  KESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRT---DRSSSVTRGRQAGI--GEKSEANLRRQSCSPSVTRGR
               + S   +GT   +  + N +P+P+ +           S IP FS + PPNLRT   DR +S  RG       G  + +   R+S SPS +R  
Subjt:  KESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRT---DRSSSVTRGRQAGI--GEKSEANLRRQSCSPSVTRGR

Query:  KVEVKQEKNR
              E++R
Subjt:  KVEVKQEKNR

AT2G40070.1 BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1) (e-value: 2.2e-13, identity: 32.67%)
Query:  NEDLHLFRELHKRDKERTACLLLPLDDLEHNHDE----------NSPFYRIQ-----SIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSH
        +E+L LF E+ +R+KE+   L      L +N DE           SP + I      S K      L SEG+KNDY+WL TPP TPLFPSLEME    SH
Subjt:  NEDLHLFRELHKRDKERTACLLLPLDDLEHNHDE----------NSPFYRIQ-----SIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSH

Query:  RNAQKET--PLVQPFSQPQTQASSNPESTKRSNGIEKSPITNP---------RIPSRSITPSHRPRI---NSSTEPKNTKRTTTPSPNPNQRINQASSID
        R    +T     +P +     A+S+ ES  R++   +   ++P         R PS S  P  RP      SST   N+K ++ PS  P  R   +S+  
Subjt:  RNAQKET--PLVQPFSQPQTQASSNPESTKRSNGIEKSPITNP---------RIPSRSITPSHRPRI---NSSTEPKNTKRTTTPSPNPNQRINQASSID

Query:  PTIKSNNNNNNNNNNKPINLKES---YTDYLTSNLSKGTKNTAK---------PKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTR
        P++ +N+ +  +   KP  +  S    +  LT   SK T +TA+         P         SR+T+P+ RST  S  P      PP+    RSS+ TR
Subjt:  PTIKSNNNNNNNNNNKPINLKES---YTDYLTSNLSKGTKNTAK---------PKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTR

AT2G40070.2 FUNCTIONS IN: molecular_function unknown (e-value: 1.0e-10, identity: 31.96%)
Query:  LHKRDKERTACLLLPLDDLEHNHDE----------NSPFYRIQ-----SIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSHRNAQKET--
        + +R+KE+   L      L +N DE           SP + I      S K      L SEG+KNDY+WL TPP TPLFPSLEME    SHR    +T  
Subjt:  LHKRDKERTACLLLPLDDLEHNHDE----------NSPFYRIQ-----SIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSHRNAQKET--

Query:  PLVQPFSQPQTQASSNPESTKRSNGIEKSPITNP---------RIPSRSITPSHRPRI---NSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKSNNNN
           +P +     A+S+ ES  R++   +   ++P         R PS S  P  RP      SST   N+K ++ PS  P  R   +S+  P++ +N+ +
Subjt:  PLVQPFSQPQTQASSNPESTKRSNGIEKSPITNP---------RIPSRSITPSHRPRI---NSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKSNNNN

Query:  NNNNNNKPINLKES---YTDYLTSNLSKGTKNTAK---------PKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTR
          +   KP  +  S    +  LT   SK T +TA+         P         SR+T+P+ RST  S  P      PP+    RSS+ TR
Subjt:  NNNNNNKPINLKES---YTDYLTSNLSKGTKNTAK---------PKANPNPNPRSRTTSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTR

AT3G09000.1 proline-rich family protein (e-value: 2.1e-08, identity: 28.51%)
Query:  NEDLHLFRELHKRDKE-RTACLLLPLDDLEHNH-------------DENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFP-----SLEMEAT
        +E+L LF E+ +R+KE R   LL   D++  N               E +   R    +  +   L SE  K+DYDWL TPP TP F      S+  +  
Subjt:  NEDLHLFRELHKRDKE-RTACLLLPLDDLEHNH-------------DENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFP-----SLEMEAT

Query:  APSHR-------------------NAQKETP------LVQPFSQPQTQASSNPES-TKRSNGIEKS---PITNPRIPSRSITPSHRPRINSSTEPKNT--
        AP+ R                   N + +T       L +P S   ++++S P + T+RS     S   P+T     SRS TP+ R  + ++    +T  
Subjt:  APSHR-------------------NAQKETP------LVQPFSQPQTQASSNPES-TKRSNGIEKS---PITNPRIPSRSITPSHRPRINSSTEPKNT--

Query:  KRTTTPSPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPR--------SRTTSP-----IVRSTIASQ
         RTTT S         A S  PT +SN   ++ ++ KP++   + T   ++       ++  P    +P+P         SR TSP       R     +
Subjt:  KRTTTPSPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPR--------SRTTSP-----IVRSTIASQ

Query:  IPEFSNETPPNLRT---DRSSSVTRGR--------------QAGIGEKS--EANLRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTN-------
        +P FS E PPNLRT   DR  S +RGR              + G G  S    N RRQSCSPS  RGR        N  G+L+  + R +++N       
Subjt:  IPEFSNETPPNLRT---DRSSSVTRGR--------------QAGIGEKS--EANLRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTN-------

Query:  ----ILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQKYNN
             +G++MVERV+N RK       +    G G S   +N+
Subjt:  ----ILGSRMVERVMNARKGNGNEEKDLKPPGRGGSGQKYNN

AT5G01280.1 BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1) (e-value: 3.1e-04, identity: 26.26%)
Query:  IKKESGFGLL-SEGNKNDYDWLKTPPATPL-----------------------FPSLEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIE
        +++ +G  LL S+G K+DY+WL TPP +P                        +   E E    S  ++   + + +P S   ++++S P +  R +   
Subjt:  IKKESGFGLL-SEGNKNDYDWLKTPPATPL-----------------------FPSLEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRSNGIE

Query:  KSPITNPRIP-SRSITPSHRPRINSSTEPKNTKRTTTPSPN-----------------PNQRINQASSIDPT-IKSNNNNNNNNNNKPINLKESYTDYLT
        K+P   P  P SR+ + + R  + SS+   +T+  + PS +                 P    +Q ++   T  +SNN   +  N+KP +   + T   +
Subjt:  KSPITNPRIP-SRSITPSHRPRINSSTEPKNTKRTTTPSPN-----------------PNQRINQASSIDPT-IKSNNNNNNNNNNKPINLKESYTDYLT

Query:  SNLSKGTKNTAKPKANPNPNPRSRTTSPIVRST--IASQIPEFSNETPPNLRT---DR----SSSVTRGRQAGIGEKSEANLR----RQSCSPSVTRGRK
        +     T   +KP    +    S   SPIVRS      ++P FS E P NLRT   DR    SSS TR   A    +S +  R    RQSCSPS +R   
Subjt:  SNLSKGTKNTAKPKANPNPNPRSRTTSPIVRST--IASQIPEFSNETPPNLRT---DR----SSSVTRGRQAGIGEKSEANLR----RQSCSPSVTRGRK

Query:  VEVK----QEKNRGGNLSNDQRRNESTNILGSRMVERVMNARK----------------GNGNEEKDLKPPGRGGSG
          V       + +    +ND  R  S    G++ VE+V+N RK                G G+        G GG G
Subjt:  VEVK----QEKNRGGNLSNDQRRNESTNILGSRMVERVMNARK----------------GNGNEEKDLKPPGRGGSG
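
In the alignment blocks above, the middle line of each triplet marks exact matches with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below estimates percent identity for one displayed block by counting letters in that middle line. It assumes the middle line starts at the same column as the query residues, and `midline_identity` is a hypothetical helper, not part of any BLAST tool. The value for a single block will generally differ from the reported identity, which is computed over the full alignment.

```python
def midline_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    query   -- the aligned query residues for one block (gaps included)
    midline -- the match line printed under it, already stripped of the
               8-character left margin used by the 'Query:  ' prefix
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment block")
    if len(midline) > columns:
        raise ValueError("midline is longer than the query line")
    # Trailing spaces are often lost in extraction, so pad the midline back out.
    padded = midline.ljust(columns)
    matches = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * matches / columns


# First ten columns of the AT1G27850.1 alignment shown above.
print(midline_identity("NEDLHLFREL", "++DL LF E+"))
```

For these ten columns the midline holds five letters, so the function reports 50.0.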


Sequences
CDS sequence
ATGATACTTAGAGTCGCTTTCCCATTTTATACACTTTTCATAAATTCTAAAACAAAGACATTGAAGTTGCCGGCGGTGATTCTTCCATTTGGCTTAGTTTATGTTTCCAA
CAAAGCCAAAGTCTCCTCTTATTTGTTCTTAAGAACAACATCACACATCCTCTGTCACTCCAACCAGTCTCCATTAGCATTCATTCCAGATCCAAGAACAAATTTCAAGA
AAAAGAAGAAATTTAACTCCACAATGAACAATGGCAGTGGCAGCAAAAACAGATGGATGATGGGTCTGCATTTAAAGGGTGGAAAGGAAAGAAACAATGAAGATCTCCAT
CTATTTCGGGAGCTGCATAAGCGCGACAAGGAACGTACCGCCTGCCTTCTTCTGCCCCTTGACGACCTTGAACACAACCATGATGAGAATTCACCATTCTATAGAATTCA
ATCAATCAAGAAAGAATCTGGATTTGGACTCCTTTCTGAAGGCAACAAAAACGACTATGATTGGCTGAAAACACCACCTGCAACTCCTTTGTTTCCATCTTTGGAAATGG
AAGCCACTGCTCCTTCACATAGGAATGCTCAGAAAGAGACACCACTTGTCCAGCCTTTCTCACAGCCACAGACACAGGCTTCAAGCAATCCAGAATCAACAAAGAGAAGC
AATGGAATTGAGAAATCTCCAATTACAAATCCAAGAATACCATCCAGGTCCATCACTCCCAGTCATAGACCGCGCATCAATTCATCAACTGAACCCAAAAACACCAAAAG
AACCACAACCCCATCCCCCAACCCAAACCAGAGAATCAATCAGGCATCATCAATAGACCCCACCATCAAAAGCAACAACAATAACAACAACAACAACAACAACAAACCCA
TAAATCTTAAAGAAAGTTACACGGATTATCTAACATCAAACCTCTCCAAAGGAACAAAAAACACAGCGAAGCCAAAGGCAAATCCAAACCCAAATCCAAGAAGTAGAACG
ACATCCCCAATTGTGAGATCGACAATAGCATCTCAAATACCAGAGTTCTCAAACGAAACGCCTCCAAATCTGAGGACCGATCGGTCGAGCTCGGTGACGAGAGGGCGGCA
AGCAGGAATCGGAGAGAAATCAGAGGCGAACTTGAGAAGGCAATCGTGCTCGCCGAGCGTGACGAGGGGGCGGAAAGTGGAGGTGAAACAGGAGAAGAACAGAGGAGGAA
ACTTGAGCAATGATCAAAGAAGAAATGAATCGACGAACATTCTTGGAAGTAGAATGGTGGAGAGAGTGATGAATGCGAGAAAAGGAAATGGAAATGAGGAGAAAGATTTG
AAGCCACCAGGACGCGGCGGCAGCGGGCAAAAGTATAATAATTATACGGATGAGAATCTAACTTTCGACTTCTAG
mRNA sequence
GACAAATGTCTGTCTTGTATAGAGGAATATTGTTTTCCAATGATACTTAGAGTCGCTTTCCCATTTTATACACTTTTCATAAATTCTAAAACAAAGACATTGAAGTTGCC
GGCGGTGATTCTTCCATTTGGCTTAGTTTATGTTTCCAACAAAGCCAAAGTCTCCTCTTATTTGTTCTTAAGAACAACATCACACATCCTCTGTCACTCCAACCAGTCTC
CATTAGCATTCATTCCAGATCCAAGAACAAATTTCAAGAAAAAGAAGAAATTTAACTCCACAATGAACAATGGCAGTGGCAGCAAAAACAGATGGATGATGGGTCTGCAT
TTAAAGGGTGGAAAGGAAAGAAACAATGAAGATCTCCATCTATTTCGGGAGCTGCATAAGCGCGACAAGGAACGTACCGCCTGCCTTCTTCTGCCCCTTGACGACCTTGA
ACACAACCATGATGAGAATTCACCATTCTATAGAATTCAATCAATCAAGAAAGAATCTGGATTTGGACTCCTTTCTGAAGGCAACAAAAACGACTATGATTGGCTGAAAA
CACCACCTGCAACTCCTTTGTTTCCATCTTTGGAAATGGAAGCCACTGCTCCTTCACATAGGAATGCTCAGAAAGAGACACCACTTGTCCAGCCTTTCTCACAGCCACAG
ACACAGGCTTCAAGCAATCCAGAATCAACAAAGAGAAGCAATGGAATTGAGAAATCTCCAATTACAAATCCAAGAATACCATCCAGGTCCATCACTCCCAGTCATAGACC
GCGCATCAATTCATCAACTGAACCCAAAAACACCAAAAGAACCACAACCCCATCCCCCAACCCAAACCAGAGAATCAATCAGGCATCATCAATAGACCCCACCATCAAAA
GCAACAACAATAACAACAACAACAACAACAACAAACCCATAAATCTTAAAGAAAGTTACACGGATTATCTAACATCAAACCTCTCCAAAGGAACAAAAAACACAGCGAAG
CCAAAGGCAAATCCAAACCCAAATCCAAGAAGTAGAACGACATCCCCAATTGTGAGATCGACAATAGCATCTCAAATACCAGAGTTCTCAAACGAAACGCCTCCAAATCT
GAGGACCGATCGGTCGAGCTCGGTGACGAGAGGGCGGCAAGCAGGAATCGGAGAGAAATCAGAGGCGAACTTGAGAAGGCAATCGTGCTCGCCGAGCGTGACGAGGGGGC
GGAAAGTGGAGGTGAAACAGGAGAAGAACAGAGGAGGAAACTTGAGCAATGATCAAAGAAGAAATGAATCGACGAACATTCTTGGAAGTAGAATGGTGGAGAGAGTGATG
AATGCGAGAAAAGGAAATGGAAATGAGGAGAAAGATTTGAAGCCACCAGGACGCGGCGGCAGCGGGCAAAAGTATAATAATTATACGGATGAGAATCTAACTTTCGACTT
CTAG
Protein sequence
MILRVAFPFYTLFINSKTKTLKLPAVILPFGLVYVSNKAKVSSYLFLRTTSHILCHSNQSPLAFIPDPRTNFKKKKKFNSTMNNGSGSKNRWMMGLHLKGGKERNNEDLH
LFRELHKRDKERTACLLLPLDDLEHNHDENSPFYRIQSIKKESGFGLLSEGNKNDYDWLKTPPATPLFPSLEMEATAPSHRNAQKETPLVQPFSQPQTQASSNPESTKRS
NGIEKSPITNPRIPSRSITPSHRPRINSSTEPKNTKRTTTPSPNPNQRINQASSIDPTIKSNNNNNNNNNNKPINLKESYTDYLTSNLSKGTKNTAKPKANPNPNPRSRT
TSPIVRSTIASQIPEFSNETPPNLRTDRSSSVTRGRQAGIGEKSEANLRRQSCSPSVTRGRKVEVKQEKNRGGNLSNDQRRNESTNILGSRMVERVMNARKGNGNEEKDL
KPPGRGGSGQKYNNYTDENLTFDF
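
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1) and stops at the first stop codon; `translate` and `CODON_TABLE` are illustrative names, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    # Remove line breaks and other whitespace so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGATACTTAGAGTCGCTTTCCCATTTTAT"))
print(translate("TTCGACTTCTAG"))
```

The two fragments translate to `MILRVAFPFY` and `FDF`, matching the start and end of the protein sequence listed above. Pasting the full CDS into `translate` gives the complete comparison.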