; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0015081 (gene) of Snake gourd v1 genome

Gene IDTan0015081
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionmucin-2-like isoform X1
Genome locationLG07:4379545..4382683
RNA-Seq ExpressionTan0015081
SyntenyTan0015081
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579620.1 hypothetical protein SDJN03_24068, partial [Cucurbita argyrosperma subsp. sororia]4.8e-12874.59Show/hide
Query:  NGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEM
        +G GSK R+MMGLHLKGRKE DNEDLHLFRELHK  KER AC LLPV +LEH+ G NS FYRIQSIRKE  FELLSEGNKNDYDWLKTPPATPLFPSLEM
Subjt:  NGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEM

Query:  EATARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNII
        EA A HMKAQKE   +Q LSQPQS  QAS+NS++TKRSNG EKSP T P+  SRSITPS++PRINSS EPK T+R T P+ RI+Q SSTDP IKRN    
Subjt:  EATARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNII

Query:  NNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--ID
           NN  KS NLKESYTDYL+SNLS         KP +  NPNPNPRSRTTS IVRS IASQI +FSNETPPNLRTDRSSSVTRGR++G  QK E   I+
Subjt:  NNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--ID

Query:  SRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER
        SRRQSCSPSVTRGRKVE KQE  NRGGNLSNDQRRTESTNI+GSRMVERVMNARKGN N  +
Subjt:  SRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER

XP_008437498.1 PREDICTED: mucin-2-like isoform X1 [Cucumis melo]1.0e-13074.21Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER A  LLLPVDDLEHN+G NSPFYRI SI+KE G   L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA
        LEMEATA     A +E PL+Q LSQPQS  QASSNS++TK+S+G EKSPI K K  SRS TPSHRPRINSSI+PK TKRTT PSP    RI QTS  D  
Subjt:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA

Query:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ
        IKR        NN+MK  N+KESYTDYL+SNLSKGSTN  KP    NPN NPNPRSRTTS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+ G  +
Subjt:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ

Query:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        KSE  + RRQSCSPSVTRGRKVEA ++  NRGGNLSNDQRRTESTNILGSRMVERVMNARKG GNE+RD K   R+ IGE
Subjt:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

XP_008437499.1 PREDICTED: mucin-2-like isoform X2 [Cucumis melo]1.0e-13074.21Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER A  LLLPVDDLEHN+G NSPFYRI SI+KE G   L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA
        LEMEATA     A +E PL+Q LSQPQS  QASSNS++TK+S+G EKSPI K K  SRS TPSHRPRINSSI+PK TKRTT PSP    RI QTS  D  
Subjt:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA

Query:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ
        IKR        NN+MK  N+KESYTDYL+SNLSKGSTN  KP    NPN NPNPRSRTTS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+ G  +
Subjt:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ

Query:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        KSE  + RRQSCSPSVTRGRKVEA ++  NRGGNLSNDQRRTESTNILGSRMVERVMNARKG GNE+RD K   R+ IGE
Subjt:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

XP_022928960.1 putative uncharacterized protein DDB_G0282133 isoform X1 [Cucurbita moschata]2.8e-12875.21Show/hide
Query:  GSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEAT
        GSK R+MMGLHLKGRKE DNEDLHLFRELHK  KER AC LLPV DLEH+NG NS FYRIQ IRKE  FELLSEGNKNDYDWLKTPPATPLFPSLEMEA 
Subjt:  GSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEAT

Query:  ARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINND
        A HMKAQKE   +Q LSQPQS  QAS+NS++TKRSNG EKSP T P+  SRSITPS++PRINSS EPK T+R T P+ RI+Q SSTDP IKRN       
Subjt:  ARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINND

Query:  NNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--IDSRR
        NN  KS NLKESYTDYL+SNLS         KP +  NPNPNPRSRTTS IVRS IASQI +FSNETPPNLRTDRSSSVTRGR++G  QK E   I+SRR
Subjt:  NNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--IDSRR

Query:  QSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER
        QSCSPSVTRGRKVE KQE  NRGGNLSNDQRRTESTNI+GSRMVERVMNARKGN N  +
Subjt:  QSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER

XP_031740890.1 serine/arginine repetitive matrix protein 1 isoform X3 [Cucumis sativus]1.2e-13174.47Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSL
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER AC LLPVDDLEHN+G NSPFYRI SI+KE GF  L EGNKNDYDWLKTPPATPLFPSL
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSL

Query:  EMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPAI
        EMEATA  H  AQKE PLVQ LSQPQS  QASSNS++TK+S+G EKSPITK K  SRSITPS+RPRINSSI+PK TKRTT PSP    RI QTS  D  +
Subjt:  EMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPAI

Query:  KRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQK
        KRN+NI        K  NLKESYTDYL+SNL KGSTN  KP    N N NPNPRSR TS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+    +K
Subjt:  KRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQK

Query:  SEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLS-NDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        SE  + RRQSCSPSVTRGRKVE  ++  NRGGNLS NDQRRTE+TNILGSRMVERVMNARK  GNEERD+K   R  IGE
Subjt:  SEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLS-NDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ3 Uncharacterized protein5.9e-13274.47Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSL
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER AC LLPVDDLEHN+G NSPFYRI SI+KE GF  L EGNKNDYDWLKTPPATPLFPSL
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSL

Query:  EMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPAI
        EMEATA  H  AQKE PLVQ LSQPQS  QASSNS++TK+S+G EKSPITK K  SRSITPS+RPRINSSI+PK TKRTT PSP    RI QTS  D  +
Subjt:  EMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPAI

Query:  KRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQK
        KRN+NI        K  NLKESYTDYL+SNL KGSTN  KP    N N NPNPRSR TS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+    +K
Subjt:  KRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQK

Query:  SEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLS-NDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        SE  + RRQSCSPSVTRGRKVE  ++  NRGGNLS NDQRRTE+TNILGSRMVERVMNARK  GNEERD+K   R  IGE
Subjt:  SEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLS-NDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

A0A1S3AUR3 mucin-2-like isoform X25.0e-13174.21Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER A  LLLPVDDLEHN+G NSPFYRI SI+KE G   L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA
        LEMEATA     A +E PL+Q LSQPQS  QASSNS++TK+S+G EKSPI K K  SRS TPSHRPRINSSI+PK TKRTT PSP    RI QTS  D  
Subjt:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA

Query:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ
        IKR        NN+MK  N+KESYTDYL+SNLSKGSTN  KP    NPN NPNPRSRTTS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+ G  +
Subjt:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ

Query:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        KSE  + RRQSCSPSVTRGRKVEA ++  NRGGNLSNDQRRTESTNILGSRMVERVMNARKG GNE+RD K   R+ IGE
Subjt:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

A0A1S3AUS5 mucin-2-like isoform X15.0e-13174.21Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER A  LLLPVDDLEHN+G NSPFYRI SI+KE G   L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA
        LEMEATA     A +E PL+Q LSQPQS  QASSNS++TK+S+G EKSPI K K  SRS TPSHRPRINSSI+PK TKRTT PSP    RI QTS  D  
Subjt:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA

Query:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ
        IKR        NN+MK  N+KESYTDYL+SNLSKGSTN  KP    NPN NPNPRSRTTS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+ G  +
Subjt:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ

Query:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        KSE  + RRQSCSPSVTRGRKVEA ++  NRGGNLSNDQRRTESTNILGSRMVERVMNARKG GNE+RD K   R+ IGE
Subjt:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

A0A5D3C2G1 Mucin-2-like isoform X15.0e-13174.21Show/hide
Query:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS
        MNNG GSKNR+MMGLH KGRKERDNEDLHLFREL+K +KER A  LLLPVDDLEHN+G NSPFYRI SI+KE G   L E NKNDYDWLKTPPATPLFPS
Subjt:  MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNA-CLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPS

Query:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA
        LEMEATA     A +E PL+Q LSQPQS  QASSNS++TK+S+G EKSPI K K  SRS TPSHRPRINSSI+PK TKRTT PSP    RI QTS  D  
Subjt:  LEMEATA-RHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSP----RITQTSSTDPA

Query:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ
        IKR        NN+MK  N+KESYTDYL+SNLSKGSTN  KP    NPN NPNPRSRTTS IVRS IASQI EFSNETPPNLRTDRSSSVTRGR+ G  +
Subjt:  IKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQ

Query:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE
        KSE  + RRQSCSPSVTRGRKVEA ++  NRGGNLSNDQRRTESTNILGSRMVERVMNARKG GNE+RD K   R+ IGE
Subjt:  KSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGE

A0A6J1ESX0 Uncharacterized protein1.4e-12875.21Show/hide
Query:  GSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEAT
        GSK R+MMGLHLKGRKE DNEDLHLFRELHK  KER AC LLPV DLEH+NG NS FYRIQ IRKE  FELLSEGNKNDYDWLKTPPATPLFPSLEMEA 
Subjt:  GSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEAT

Query:  ARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINND
        A HMKAQKE   +Q LSQPQS  QAS+NS++TKRSNG EKSP T P+  SRSITPS++PRINSS EPK T+R T P+ RI+Q SSTDP IKRN       
Subjt:  ARHMKAQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINND

Query:  NNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--IDSRR
        NN  KS NLKESYTDYL+SNLS         KP +  NPNPNPRSRTTS IVRS IASQI +FSNETPPNLRTDRSSSVTRGR++G  QK E   I+SRR
Subjt:  NNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEV--IDSRR

Query:  QSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER
        QSCSPSVTRGRKVE KQE  NRGGNLSNDQRRTESTNI+GSRMVERVMNARKGN N  +
Subjt:  QSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTNILGSRMVERVMNARKGNGNEER

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27850.1 unknown protein8.7e-1131.67Show/hide
Query:  NEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSI--RKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEATARHMKAQKEAPLVQSLS
        ++DL LF E+   +KER++ LL   DDLE         +   +I  + E    L +EG+KNDYDWL TPP TPLFPSL+          Q  A  V    
Subjt:  NEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSI--RKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEATARHMKAQKEAPLVQSLS

Query:  QPQSK--LQASSNSQATKRSNGNEKSPITKPKTSSRSITPSH---RPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINNDNNSMKSINLKES
        +PQS+  L  SS  + ++RS+    SP  +  TS R+        RP       P   +R+ TP  RI+ T       K +  +  +   +         
Subjt:  QPQSK--LQASSNSQATKRSNGNEKSPITKPKTSSRSITPSH---RPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINNDNNSMKSINLKES

Query:  YTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRI-ASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVIDSR-RQSCSPSVTR
             S  +S GST +A P        +P   SR  S   + ++  S I  FS + PPNLRT   DR +S  RG         + + +R R+S SPS +R
Subjt:  YTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRI-ASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVIDSR-RQSCSPSVTR

AT2G40070.1 BEST Arabidopsis thaliana protein match is: proline-rich family protein (TAIR:AT3G09000.1)1.1e-1630.3Show/hide
Query:  NEDLHLFRELHKHEKERNACLL-LPVDDLEHNNGR---NSPFYRIQS----IRKEFGFELL-SEGNKNDYDWLKTPPATPLFPSLEMEA-----------
        +E+L LF E+ + EKE++  LL    D+ E   G     SP + I S     RK    + L SEG+KNDY+WL TPP TPLFPSLEME+           
Subjt:  NEDLHLFRELHKHEKERNACLL-LPVDDLEHNNGR---NSPFYRIQS----IRKEFGFELL-SEGNKNDYDWLKTPPATPLFPSLEMEA-----------

Query:  -------TARHMKAQKEAPLVQSLS--QPQSKLQASSNSQATKRSNGN------------EKSPITKPKTSSRSITPSHRPRINSSIEPKIT------KR
               T+R   +  E+     L+  Q  S    SS+S A++R + +              S +T    SSR  TP+ R  ++S+  P +T        
Subjt:  -------TARHMKAQKEAPLVQSLS--QPQSKLQASSNSQATKRSNGN------------EKSPITKPKTSSRSITPSHRPRINSSIEPKIT------KR

Query:  TTTPSP----------RITQTSS---TDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKP----------------------------
        TT P+P          R+T T+S   T  A    S   +  + + KS     S T  LS + ++ ST  ++P                            
Subjt:  TTTPSP----------RITQTSS---TDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKP----------------------------

Query:  --------KPNS------NPNPNPNPR-SRTTSLIVRSR--IASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVID--------SRRQSCSP
                KP+S       P P+ NP  SR  S  VRSR    S +  FS ETPPNLRT   +R  S TRGR      +S  ++         RRQSCSP
Subjt:  --------KPNS------NPNPNPNPR-SRTTSLIVRSR--IASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVID--------SRRQSCSP

Query:  SVTRGRKVEAKQENN----NRGGNLSNDQRRTESTNILGSRMVERVMNAR---------KGNGNEERDLKARGRNEIGELGFISKISADKTSRQM
        S  RGR       ++    NRG + ++D     S  ++G++MVERV+N R         KG+ +     K+   +  G    +SK S D   R M
Subjt:  SVTRGRKVEAKQENN----NRGGNLSNDQRRTESTNILGSRMVERVMNAR---------KGNGNEERDLKARGRNEIGELGFISKISADKTSRQM

AT2G40070.2 FUNCTIONS IN: molecular_function unknown3.8e-1429.84Show/hide
Query:  LHKHEKERNACLL-LPVDDLEHNNGR---NSPFYRIQS----IRKEFGFELL-SEGNKNDYDWLKTPPATPLFPSLEMEA------------------TA
        + + EKE++  LL    D+ E   G     SP + I S     RK    + L SEG+KNDY+WL TPP TPLFPSLEME+                  T+
Subjt:  LHKHEKERNACLL-LPVDDLEHNNGR---NSPFYRIQS----IRKEFGFELL-SEGNKNDYDWLKTPPATPLFPSLEMEA------------------TA

Query:  RHMKAQKEAPLVQSLS--QPQSKLQASSNSQATKRSNGN------------EKSPITKPKTSSRSITPSHRPRINSSIEPKIT------KRTTTPSP---
        R   +  E+     L+  Q  S    SS+S A++R + +              S +T    SSR  TP+ R  ++S+  P +T        TT P+P   
Subjt:  RHMKAQKEAPLVQSLS--QPQSKLQASSNSQATKRSNGN------------EKSPITKPKTSSRSITPSHRPRINSSIEPKIT------KRTTTPSP---

Query:  -------RITQTSS---TDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKP------------------------------------K
               R+T T+S   T  A    S   +  + + KS     S T  LS + ++ ST  ++P                                    K
Subjt:  -------RITQTSS---TDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKP------------------------------------K

Query:  PNS------NPNPNPNPR-SRTTSLIVRSR--IASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVID--------SRRQSCSPSVTRGRKVE
        P+S       P P+ NP  SR  S  VRSR    S +  FS ETPPNLRT   +R  S TRGR      +S  ++         RRQSCSPS  RGR   
Subjt:  PNS------NPNPNPNPR-SRTTSLIVRSR--IASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVID--------SRRQSCSPSVTRGRKVE

Query:  AKQENN----NRGGNLSNDQRRTESTNILGSRMVERVMNAR---------KGNGNEERDLKARGRNEIGELGFISKISADKTSRQM
            ++    NRG + ++D     S  ++G++MVERV+N R         KG+ +     K+   +  G    +SK S D   R M
Subjt:  AKQENN----NRGGNLSNDQRRTESTNILGSRMVERVMNAR---------KGNGNEERDLKARGRNEIGELGFISKISADKTSRQM

AT3G08670.1 unknown protein2.2e-0627.84Show/hide
Query:  SEGNKNDYDWLKTPPATPL-------FPSLEMEATARHMKAQKEAPLVQSLSQ--------------------------------PQSKLQASS------
        +EG KNDYDWL TPP TPL         + ++ ++AR   A K + L  S S+                                P S L  SS      
Subjt:  SEGNKNDYDWLKTPPATPL-------FPSLEMEATARHMKAQKEAPLVQSLSQ--------------------------------PQSKLQASS------

Query:  ---NSQATKRSNGNEKSPITKPKTSSRSITPSH-RPRINSS----IEPKITKRTTTPS--PRITQTSSTDPAIKRNSNIINNDNNSMKSINLKESYTDYL
           +S +++ S+    S  T+  ++SRS TPS  RP  +SS      P ++ R +TP+  P+++ +S    A + NS        S  S +L  +    +
Subjt:  ---NSQATKRSNGNEKSPITKPKTSSRSITPSH-RPRINSS----IEPKITKRTTTPS--PRITQTSSTDPAIKRNSNIINNDNNSMKSINLKESYTDYL

Query:  SSN--LSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVIDS-----RRQSCSPSVTR
        S     S G T  +  +P+S   P P  R+     IV       +A+F  +TPPNLRT   DR  S  R R +G    ++          R++ SP VTR
Subjt:  SSN--LSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRT---DRSSSVTRGREMGIGQKSEVIDS-----RRQSCSPSVTR

Query:  GRKVEAKQENNNRGGNLSN-----DQRRTESTNILGSRMVERVMNARKGNGN
        GR  E  Q     GGN  +     + RR  + + + SR   +       N N
Subjt:  GRKVEAKQENNNRGGNLSN-----DQRRTESTNILGSRMVERVMNARKGNGN

AT3G09000.1 proline-rich family protein2.5e-1027.39Show/hide
Query:  NEDLHLFRELHKHEKERNACLLLPVDDLEHNNG----------------RNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEATARH
        +E+L LF E+ + EKE  A  LL   D    N                  +S  Y ++    E  F L SE  K+DYDWL TPP TP F      +    
Subjt:  NEDLHLFRELHKHEKERNACLLLPVDDLEHNNG----------------RNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEATARH

Query:  MKAQKEAPLV----------QSLSQPQSKLQASSNSQATKR---SNGNEKS--------------------PITKPKTSSRSITPSHRPRINSSIEPKIT
          A    P V            +S   +K Q SS+S A  R   S+G+ +S                    P+T   ++SRS TP+ R  + ++   + T
Subjt:  MKAQKEAPLV----------QSLSQPQSKLQASSNSQATKR---SNGNEKS--------------------PITKPKTSSRSITPSHRPRINSSIEPKIT

Query:  KRTTTPSPRITQTSSTDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSL---------IVRSRIASQIA
          T  P    T + S   A    SN   +  +S K ++   + T   S+       +   P   ++P+P  N  S+  S            R     ++ 
Subjt:  KRTTTPSPRITQTSSTDPAIKRNSNIINNDNNSMKSINLKESYTDYLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSL---------IVRSRIASQIA

Query:  EFSNETPPNLRT---DRSSSVTRGR---EMGIGQKSEVID------------SRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTN--------
         FS E PPNLRT   DR  S +RGR       G +S  I+            +RRQSCSPS  RGR         N  G+L+  + R +++N        
Subjt:  EFSNETPPNLRT---DRSSSVTRGR---EMGIGQKSEVID------------SRRQSCSPSVTRGRKVEAKQENNNRGGNLSNDQRRTESTN--------

Query:  ---ILGSRMVERVMNARK-------GNGNEERDLKARGRNEIGELGFISKISADKTSRQM
            +G++MVERV+N RK        NG       +   N +G    +SK S D   R M
Subjt:  ---ILGSRMVERVMNARK-------GNGNEERDLKARGRNEIGELGFISKISADKTSRQM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACAATGGCACTGGAAGCAAAAACAGATTCATGATGGGTCTGCATTTAAAGGGTCGAAAGGAAAGAGACAATGAAGATCTCCATCTGTTCCGGGAGCTTCATAAGCA
CGAAAAGGAACGTAACGCCTGCCTTCTTCTGCCCGTCGATGACCTCGAACATAACAATGGTAGGAATTCTCCGTTCTACAGAATTCAATCAATCAGGAAAGAATTTGGAT
TTGAACTCCTTTCTGAAGGCAACAAAAACGACTATGATTGGCTTAAAACACCACCTGCAACTCCTTTGTTTCCATCTTTGGAAATGGAAGCCACTGCTCGACATATGAAA
GCTCAAAAAGAGGCTCCACTTGTTCAATCTCTCTCACAGCCACAGTCAAAGTTACAGGCTTCAAGCAATTCACAAGCAACAAAGAGAAGCAATGGAAATGAGAAATCCCC
AATTACAAAACCAAAAACATCATCCAGGTCCATCACTCCCAGTCATAGACCGCGCATCAATTCATCCATTGAACCCAAAATAACCAAAAGAACCACAACCCCATCTCCAA
GAATCACTCAGACATCGTCAACAGACCCCGCGATCAAAAGAAACAGCAACATCATCAACAACGACAACAACAGCATGAAATCCATAAATCTTAAAGAGAGTTACACGGAT
TACCTGAGCTCAAACCTGTCGAAAGGATCAACAAACGTTGCCAAGCCAAAGCCAAATTCCAATCCAAATCCAAATCCAAACCCAAGAAGCAGAACGACATCCCTAATAGT
GAGATCGAGAATAGCATCTCAAATTGCAGAGTTCTCGAACGAAACGCCTCCAAATCTGAGGACGGACCGGTCGAGCTCGGTGACGAGAGGGCGGGAGATGGGAATTGGAC
AGAAATCAGAGGTGATCGATTCCAGAAGGCAATCGTGCTCGCCGAGCGTGACGAGGGGGCGGAAAGTGGAGGCGAAACAGGAGAACAACAACAGAGGCGGAAACTTGAGC
AATGATCAGAGAAGAACAGAATCGACGAACATTCTTGGGAGTCGAATGGTGGAGAGAGTGATGAATGCAAGAAAAGGAAATGGAAATGAGGAGAGAGATTTGAAGGCAAG
AGGACGAAATGAGATTGGGGAATTAGGATTCATCTCCAAGATTTCTGCAGATAAAACGTCAAGGCAAATGGTAGAAATAGTATTTAAAGGCTTTAATTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACAATGGCACTGGAAGCAAAAACAGATTCATGATGGGTCTGCATTTAAAGGGTCGAAAGGAAAGAGACAATGAAGATCTCCATCTGTTCCGGGAGCTTCATAAGCA
CGAAAAGGAACGTAACGCCTGCCTTCTTCTGCCCGTCGATGACCTCGAACATAACAATGGTAGGAATTCTCCGTTCTACAGAATTCAATCAATCAGGAAAGAATTTGGAT
TTGAACTCCTTTCTGAAGGCAACAAAAACGACTATGATTGGCTTAAAACACCACCTGCAACTCCTTTGTTTCCATCTTTGGAAATGGAAGCCACTGCTCGACATATGAAA
GCTCAAAAAGAGGCTCCACTTGTTCAATCTCTCTCACAGCCACAGTCAAAGTTACAGGCTTCAAGCAATTCACAAGCAACAAAGAGAAGCAATGGAAATGAGAAATCCCC
AATTACAAAACCAAAAACATCATCCAGGTCCATCACTCCCAGTCATAGACCGCGCATCAATTCATCCATTGAACCCAAAATAACCAAAAGAACCACAACCCCATCTCCAA
GAATCACTCAGACATCGTCAACAGACCCCGCGATCAAAAGAAACAGCAACATCATCAACAACGACAACAACAGCATGAAATCCATAAATCTTAAAGAGAGTTACACGGAT
TACCTGAGCTCAAACCTGTCGAAAGGATCAACAAACGTTGCCAAGCCAAAGCCAAATTCCAATCCAAATCCAAATCCAAACCCAAGAAGCAGAACGACATCCCTAATAGT
GAGATCGAGAATAGCATCTCAAATTGCAGAGTTCTCGAACGAAACGCCTCCAAATCTGAGGACGGACCGGTCGAGCTCGGTGACGAGAGGGCGGGAGATGGGAATTGGAC
AGAAATCAGAGGTGATCGATTCCAGAAGGCAATCGTGCTCGCCGAGCGTGACGAGGGGGCGGAAAGTGGAGGCGAAACAGGAGAACAACAACAGAGGCGGAAACTTGAGC
AATGATCAGAGAAGAACAGAATCGACGAACATTCTTGGGAGTCGAATGGTGGAGAGAGTGATGAATGCAAGAAAAGGAAATGGAAATGAGGAGAGAGATTTGAAGGCAAG
AGGACGAAATGAGATTGGGGAATTAGGATTCATCTCCAAGATTTCTGCAGATAAAACGTCAAGGCAAATGGTAGAAATAGTATTTAAAGGCTTTAATTTTTAA
Protein sequenceShow/hide protein sequence
MNNGTGSKNRFMMGLHLKGRKERDNEDLHLFRELHKHEKERNACLLLPVDDLEHNNGRNSPFYRIQSIRKEFGFELLSEGNKNDYDWLKTPPATPLFPSLEMEATARHMK
AQKEAPLVQSLSQPQSKLQASSNSQATKRSNGNEKSPITKPKTSSRSITPSHRPRINSSIEPKITKRTTTPSPRITQTSSTDPAIKRNSNIINNDNNSMKSINLKESYTD
YLSSNLSKGSTNVAKPKPNSNPNPNPNPRSRTTSLIVRSRIASQIAEFSNETPPNLRTDRSSSVTRGREMGIGQKSEVIDSRRQSCSPSVTRGRKVEAKQENNNRGGNLS
NDQRRTESTNILGSRMVERVMNARKGNGNEERDLKARGRNEIGELGFISKISADKTSRQMVEIVFKGFNF