CuGenDBv2

Lsi06G016890 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi06G016890
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr06:27086085..27105714
RNA-Seq Expression: Lsi06G016890
Synteny: Lsi06G016890
Gene Ontology terms:
GO:0006486 - protein glycosylation (biological process)
GO:0009451 - RNA modification (biological process)
GO:0009834 - plant-type secondary cell wall biogenesis (biological process)
GO:0010417 - glucuronoxylan biosynthetic process (biological process)
GO:0000139 - Golgi membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0047517 - 1,4-beta-D-xylan synthase activity (molecular function)
GO:0080116 - glucuronoxylan glucuronosyltransferase activity (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR004263 - Exostosin-like
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain
IPR040911 - Exostosin, GT47 domain


Homology
GenBank top hits | e value | % identity | Alignment
TYJ96962.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 3.1e-256 | 87.95
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+ FP++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVH+LLD+L SK LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME ENV+PDA+TFLGILTACNHGGLID GRRYF+ M+SRY IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        P+EPDVVTWRTLLSGC+IY+N +LAEVAIANMSH KSGDYVLLSNIYCSLNRWE AE VRKMMKIN VRK  GKSWIELGGT   FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

XP_008443769.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50990 [Cucumis melo] | 7.4e-258 | 88.35
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+ FP++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVH+LLD+L SK LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME ENV+PDA+TFLGILTACNHGGLID GRRYF+ M+SRY IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGC+IY+N +LAEVAIANMSHCKSGDYVLLSNIYCSLNRWE AE VRKMMKIN VRK  GKSWIELGGT   FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

XP_011660258.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Cucumis sativus] | 1.3e-254 | 87.15
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQ+LYRVLEAC+ F ++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVHQLLD+L SK+LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WVHA MTQKKIELNS+L+CALIDAYSKCGSIQIAKE+FS +PH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME E+V+PDA+TFLG+LTACNHG LID GRRYF+ MKS Y IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGCRIY+N +LAEVAIANMS  KSGDYVLLSNIYCSLNRW+ AE VRKMMKIN VRK  GKSWIELGGTI +FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

XP_031736189.1 pentatricopeptide repeat-containing protein At5g50990 isoform X2 [Cucumis sativus] | 3.4e-255 | 87.35
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        NYQ+LYRVLEAC+ F ++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVHQLLD+L SK+LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WVHA MTQKKIELNS+L+CALIDAYSKCGSIQIAKE+FS +PH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME E+V+PDA+TFLG+LTACNHG LID GRRYF+ MKS Y IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGCRIY+N +LAEVAIANMS  KSGDYVLLSNIYCSLNRW+ AE VRKMMKIN VRK  GKSWIELGGTI +FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

XP_038879432.1 pentatricopeptide repeat-containing protein At5g50990 isoform X1 [Benincasa hispida] | 3.8e-270 | 91.37
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+  PL SKTVIETHARIIKFGYG+YP LITSLVSTYQ AGCLNRVHQLLDLL SK LDLV MNLL+ENF K+GECKFA +VFYKMPY 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA YDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLG PS TQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D+SVWN+MIKGLAIHGLA DALSLF MMERENV+PDAVTFLGILTACNHGGLIDQGRRYF+WM+SRY IQPQLEHYGV+VDL+SRAGFLEEAYSLIVAM
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGCRIYRNQELAEVAI NMSH KSGDYVLLSNIYCSLN+WEHA  VRKMMKINGVRK CGKSWIELGGTI NFKSGDRSHPESDA+Y
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        +VLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEK+ALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LXZ9 DYW_deaminase domain-containing protein | 6.3e-255 | 87.15
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQ+LYRVLEAC+ F ++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVHQLLD+L SK+LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WVHA MTQKKIELNS+L+CALIDAYSKCGSIQIAKE+FS +PH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME E+V+PDA+TFLG+LTACNHG LID GRRYF+ MKS Y IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGCRIY+N +LAEVAIANMS  KSGDYVLLSNIYCSLNRW+ AE VRKMMKIN VRK  GKSWIELGGTI +FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTR+EGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

A0A1S4DV22 pentatricopeptide repeat-containing protein At5g50990 | 3.6e-258 | 88.35
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+ FP++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVH+LLD+L SK LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME ENV+PDA+TFLGILTACNHGGLID GRRYF+ M+SRY IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGC+IY+N +LAEVAIANMSHCKSGDYVLLSNIYCSLNRWE AE VRKMMKIN VRK  GKSWIELGGT   FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

A0A5A7T674 Pentatricopeptide repeat-containing protein | 3.6e-258 | 88.35
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+ FP++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVH+LLD+L SK LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME ENV+PDA+TFLGILTACNHGGLID GRRYF+ M+SRY IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        PIEPDVVTWRTLLSGC+IY+N +LAEVAIANMSHCKSGDYVLLSNIYCSLNRWE AE VRKMMKIN VRK  GKSWIELGGT   FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

A0A5D3BAU8 Pentatricopeptide repeat-containing protein | 1.5e-256 | 87.95
Query:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC
        +YQTL+RVLEAC+ FP++SKTVIETHARIIKFGYGNYPTLI SLVSTYQ  GCLNRVH+LLD+L SK LDLVAMNLL+ NFMKIGECKFAKKVFYKMP+ 
Subjt:  NYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYC

Query:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH
        DVVTWNSIIGGCVKNA+YDEAFRFFRQML SNIQPDGFTFAS+LNACAQLGAPS T WV A MTQKKIELNS+LSCALIDAYSKCGSIQIAKE+FS VPH
Subjt:  DVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPH

Query:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM
        +D SVWN MIKGLAIHGLAMDALSLF  ME ENV+PDA+TFLGILTACNHGGLID GRRYF+ M+SRY IQPQLEHYGVMVDL+SRAGFLEEAYSLIV M
Subjt:  NDISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAM

Query:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY
        P+EPDVVTWRTLLSGC+IY+N +LAEVAIANMSH KSGDYVLLSNIYCSLNRWE AE VRKMMKIN VRK  GKSWIELGGT   FKSGDR HPESDAI 
Subjt:  PIEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIY

Query:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGA+ISISKNLRICDDCHTWIKLVSRVLCR IVVRDRIRFHQFEGG
Subjt:  KVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

A0A6J1E1Y0 pentatricopeptide repeat-containing protein At5g50990 isoform X1 | 2.6e-253 | 85.71
Query:  YQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCD
        YQTLYRVLEAC+  P +SKT  ETHAR+IKFGYGNYPTL+TSLVS YQ A CLNRVHQLL+LL SK LDLVAMNL ++NFMKIGECK AK+VF KMPY D
Subjt:  YQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCD

Query:  VVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPHN
        VVTWNSIIGGCVKNA+Y+EAF+FFRQMLNSNIQPDGFTFASVLNACAQLGAPS TQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKE+FS VP +
Subjt:  VVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPHN

Query:  DISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMP
        +ISVWN+MIKGLAIHGL+MDALS+F+MMERENV+PDAVTFLGILTACNHGGLI+QGRR+FDWMK+RY IQPQLEHYGVMVDL+SRAGFLEEAYS+IVAMP
Subjt:  DISVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMP

Query:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYK
        IE DVVTWR LLSGCRIYRNQELAEVAIANMSH  SGDYVLLSNIYCSLNRWEHAE VR+ MK NGVRK+CGKSWIELGG+I +FKSGDRSHPESDA+YK
Subjt:  IEPDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYK

Query:  VLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
        V+  LMKR+RSEGYMPVT+LV MDISEEEKEENLS+HSEK+ALAYAILKT PGA+ISISKNLR+CDDCH WIKLVS +LCR +VVRDRIRFHQFEGG
Subjt:  VLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

SwissProt top hits | e value | % identity | Alignment
Q6H4N0 Probable glucuronosyltransferase Os02g0520750 | 7.3e-208 | 87.96
Query:  HTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMR
        +TERISGSAGDVLEDNPVGRLKVFVY+LPSKYNK+I+ KDPRCLNHMFAAEIFMHRFLL+S VRTLNPE+ADWFY PVYTTCDLT  GLPLPFKSPRMMR
Subjt:  HTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPLPFKSPRMMR

Query:  SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFR
        SAIQ +S  WP+WNRT+GADHFFVVPHDFGACFHYQEEKAIERGILPLL+RATLVQTFGQ+NHVCLKEGSITIPPYAPPQKM AHLIP  TPRSIFVYFR
Subjt:  SAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPRSIFVYFR

Query:  GLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFL
        GLFYD GNDPEGGYYARGARA++WENFK+NPLFDISTEHP TYYEDMQR+VFCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAIPW+EIGVF+
Subjt:  GLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFL

Query:  DEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS
        DE+DV  LD+ILTSIP++ ILRKQRLLANPSMKQAMLFPQPAQP DAFHQ+LNGLARKLPH  SVYL  GEK LNWTAGPV+
Subjt:  DEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS

Q7XLG3 Probable glucuronosyltransferase Os04g0398600 | 7.8e-210 | 85.21
Query:  LFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTC
        + AA    S    R   QH+ERISGSAGDVLEDNPVGRLKVF+Y+LP KYNKK++ KDPRCLNHMFAAEIFMHRFLL+S VRTLNP+EADWFYTPVYTTC
Subjt:  LFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTC

Query:  DLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM
        DLTP GLPLPFKSPR+MRSAIQ IS  WP+WNRT+GADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQ NHVCLKEGSITIPPYAPPQKM
Subjt:  DLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKM

Query:  HAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD
         AHLIP  TPRSIFVYFRGLFYD GNDPEGGYYARGARA++WENFK+NPLFDIST+HP TYYEDMQRAVFCLCPLGWAPWSPRLVEAV+FGCIPVIIADD
Subjt:  HAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADD

Query:  IVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPV
        IVLPFADAIPWEEIGVF++EKDV  LDTILTS+P++ ILRKQRLLANPSMKQAMLFPQPAQP DAFHQ+LNGLARKLPH   VYL   +K LNWTAGPV
Subjt:  IVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPV

Q8S1X7 Probable glucuronosyltransferase Os01g0926700 | 8.6e-209 | 82.84
Query:  WAIFVPLFAA---FLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADW
        W + + + AA   F   + A    +   TERISGSAGDVLED+PVGRLKV+VY+LPSKYNKK+L+KDPRCLNHMFAAEIFMHRFLL+S VRT NPEEADW
Subjt:  WAIFVPLFAA---FLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADW

Query:  FYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITI
        FYTPVYTTCDLTP+GLPLPFKSPRMMRSAI+LI++NWPYWNR+EGADHFFV PHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQ+NHVCLK+GSITI
Subjt:  FYTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITI

Query:  PPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFG
        PPYAPPQKM AHLIP  TPRSIFVYFRGLFYD  NDPEGGYYARGARA+VWENFK+NPLFDIST+HP TYYEDMQR+VFCLCPLGWAPWSPRLVEAV+FG
Subjt:  PPYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFG

Query:  CIPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKI
        CIPVIIADDIVLPFADAIPWEEIGVF+ E+DV  LD+ILTSIP ++ILRKQRLLANPSMKQAMLFPQPAQ GDAFHQ+LNGLARKLPH  +V+L  GE+ 
Subjt:  CIPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKI

Query:  LNWTAGPV
        LNWTAGPV
Subjt:  LNWTAGPV

Q940Q8 Probable beta-1,4-xylosyltransferase IRX10L | 6.1e-223 | 89.71
Query:  LSWAIFVPLFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWF
        LS  + + L     ++ +A    RSQ TERISGSAGDVLED+PVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEI+M RFLL+SPVRTLNPEEADWF
Subjt:  LSWAIFVPLFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWF

Query:  YTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIP
        Y PVYTTCDLTPNGLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCLKEGSIT+P
Subjt:  YTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIP

Query:  PYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGC
        PYAPPQKM +HLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAVIFGC
Subjt:  PYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGC

Query:  IPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKIL
        IPVIIADDIVLPFADAIPWE+IGVF+DEKDV  LDTILTSIP E+ILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPH+RSVYL  GEK+L
Subjt:  IPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKIL

Query:  NWTAGPVS
        NWTAGPV+
Subjt:  NWTAGPVS

Q9FZJ1 Probable beta-1,4-xylosyltransferase IRX10 | 5.2e-214 | 87.98
Query:  NAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPL
        +A + +++  TERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+LQKDPRCL HMFAAEIFMHRFLL+SPVRT NP+EADWFYTP+Y TCDLTP GLPL
Subjt:  NAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPL

Query:  PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKT
        PFKSPRMMRS+IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL EGSITIPP+APPQKM AH IP   
Subjt:  PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKT

Query:  PRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAI
        PRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAI
Subjt:  PRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAI

Query:  PWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS
        PWEEIGVF+ EKDV  LDTILTSIP E+ILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S+YL  GEK LNWTAGPV+
Subjt:  PWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS

Arabidopsis top hits | e value | % identity | Alignment
AT1G27440.1 Exostosin family protein | 3.7e-215 | 87.98
Query:  NAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPL
        +A + +++  TERISGSAGDVLED+PVG+LKV+VYELPSKYNKK+LQKDPRCL HMFAAEIFMHRFLL+SPVRT NP+EADWFYTP+Y TCDLTP GLPL
Subjt:  NAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCDLTPNGLPL

Query:  PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKT
        PFKSPRMMRS+IQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCL EGSITIPP+APPQKM AH IP   
Subjt:  PFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKT

Query:  PRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAI
        PRSIFVYFRGLFYDV NDPEGGYYARGARAAVWENFK+NPLFDIST+HPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAV+FGCIPVIIADDIVLPFADAI
Subjt:  PRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAI

Query:  PWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS
        PWEEIGVF+ EKDV  LDTILTSIP E+ILRKQRLLANPSMK+AMLFPQPAQPGDAFHQ+LNGLARKLPHD+S+YL  GEK LNWTAGPV+
Subjt:  PWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVS

AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.5e-107 | 40.93
Query:  PLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVTWNSIIGGCVKN
        PLH      THA+I+ FG    P + TSL++ Y S G L    ++ D   SK  DL A N +V  + K G    A+K+F +MP  +V++W+ +I G V  
Subjt:  PLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVTWNSIIGGCVKN

Query:  AQYDEAFRFFRQML-----NSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCV-PHNDISVWNSM
         +Y EA   FR+M       + ++P+ FT ++VL+AC +LGA  + +WVHA + +  +E++ +L  ALID Y+KCGS++ AK +F+ +    D+  +++M
Subjt:  AQYDEAFRFFRQML-----NSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCV-PHNDISVWNSM

Query:  IKGLAIHGLAMDALSLFF-MMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIEPDVVT
        I  LA++GL  +   LF  M   +N+ P++VTF+GIL AC H GLI++G+ YF  M   + I P ++HYG MVDL+ R+G ++EA S I +MP+EPDV+ 
Subjt:  IKGLAIHGLAMDALSLFF-MMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIEPDVVT

Query:  WRTLLSGCRIYRNQELAEVA---IANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVLCS
        W +LLSG R+  + +  E A   +  +    SG YVLLSN+Y    RW   + +R  M++ G+ K  G S++E+ G +  F  GD S  ES+ IY +L  
Subjt:  WRTLLSGCRIYRNQELAEVA---IANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVLCS

Query:  LMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGGTKS
        +M+R R  GY+  T+ V +D++E++KE  LS+HSEK+A+A+ ++KT PG  + I KNLRIC DCH  +K++S++  R IVVRD  RFH F  G+ S
Subjt:  LMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGGTKS

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 2.4e-105 | 44.04
Query:  LVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIEL
        +   N L+  +   G+   A KVF KMP  D+V WNS+I G  +N + +EA   + +M +  I+PDGFT  S+L+ACA++GA +  + VH  M +  +  
Subjt:  LVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVTWNSIIGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIEL

Query:  NSILSCALIDAYSKCGSIQIAKEMF-SCVPHNDISVWNSMIKGLAIHGLAMDALSLF-FMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRY
        N   S  L+D Y++CG ++ AK +F   V  N +S W S+I GLA++G   +A+ LF +M   E ++P  +TF+GIL AC+H G++ +G  YF  M+  Y
Subjt:  NSILSCALIDAYSKCGSIQIAKEMF-SCVPHNDISVWNSMIKGLAIHGLAMDALSLF-FMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRY

Query:  LIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKI
         I+P++EH+G MVDL +RAG +++AY  I +MP++P+VV WRTLL  C ++ + +LAE A   I  +    SGDYVLLSN+Y S  RW   + +RK M  
Subjt:  LIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIEPDVVTWRTLLSGCRIYRNQELAEVA---IANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKI

Query:  NGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRI
        +GV+K  G S +E+G  +  F  GD+SHP+SDAIY  L  +  R RSEGY+P    V++D+ EEEKE  + +HSEK+A+A+ ++ T   + I++ KNLR+
Subjt:  NGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRI

Query:  CDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGGTKS
        C DCH  IKLVS+V  R IVVRDR RFH F+ G+ S
Subjt:  CDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGGTKS

AT5G50990.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 6.0e-157 | 54.34
Query:  LYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVT
        L +VLE+CK  P +SK V++ HA+I K GYG YP+L+ S V+ Y+         +LL    S    +  +NL++E+ MKIGE   AKKV       +V+T
Subjt:  LYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVT

Query:  WNSIIGGCVKNAQYDEAFRFFRQMLN-SNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPHNDI
        WN +IGG V+N QY+EA +  + ML+ ++I+P+ F+FAS L ACA+LG     +WVH+LM    IELN+ILS AL+D Y+KCG I  ++E+F  V  ND+
Subjt:  WNSIIGGCVKNAQYDEAFRFFRQMLN-SNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPHNDI

Query:  SVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIE
        S+WN+MI G A HGLA +A+ +F  ME E+V PD++TFLG+LT C+H GL+++G+ YF  M  R+ IQP+LEHYG MVDL  RAG ++EAY LI +MPIE
Subjt:  SVWNSMIKGLAIHGLAMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIE

Query:  PDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVL
        PDVV WR+LLS  R Y+N EL E+AI N+S  KSGDYVLLSNIY S  +WE A+ VR++M   G+RK  GKSW+E GG I  FK+GD SH E+ AIYKVL
Subjt:  PDVVTWRTLLSGCRIYRNQELAEVAIANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVL

Query:  CSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG
          L+++T+S+G++  T+LV MD+SEEEKEENL++HSEK+ALAY ILK+SPG  I I KN+R+C DCH WIK VS++L R I++RDRIRFH+FE G
Subjt:  CSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFHSEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGG

AT5G61840.1 Exostosin family protein (e value: 4.3e-224; %identity: 89.71)
Query:  LSWAIFVPLFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWF
        LS  + + L     ++ +A    RSQ TERISGSAGDVLED+PVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEI+M RFLL+SPVRTLNPEEADWF
Subjt:  LSWAIFVPLFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWF

Query:  YTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIP
        Y PVYTTCDLTPNGLPLPFKSPRMMRSAIQLI+SNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAI RGILPLLQRATLVQTFGQRNHVCLKEGSIT+P
Subjt:  YTPVYTTCDLTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIP

Query:  PYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGC
        PYAPPQKM +HLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRA+FCLCPLGWAPWSPRLVEAVIFGC
Subjt:  PYAPPQKMHAHLIPEKTPRSIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGC

Query:  IPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKIL
        IPVIIADDIVLPFADAIPWE+IGVF+DEKDV  LDTILTSIP E+ILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPH+RSVYL  GEK+L
Subjt:  IPVIIADDIVLPFADAIPWEEIGVFLDEKDVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKIL

Query:  NWTAGPVS
        NWTAGPV+
Subjt:  NWTAGPVS
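
The middle line of each alignment block above marks identical residues with a letter and conservative substitutions with `+`. A minimal sketch, assuming that BLAST-style midline convention, of how identities and positives could be tallied for one block; the function name `midline_stats` is hypothetical and not part of CuGenDB:

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes a BLAST-style midline: a letter marks an identical residue,
    '+' marks a conservative substitution, a space marks a mismatch or gap.
    """
    columns = len(query)
    # Trailing spaces are often stripped from the midline, so pad it back
    # to the width of the query line before counting.
    padded = midline.ljust(columns)[:columns]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives, columns


# Last block of the AT5G61840.1 alignment shown above.
ident, pos, cols = midline_stats("NWTAGPVS", "NWTAGPV+")
print(f"identities {ident}/{cols}, positives {pos}/{cols}")
```

Summing these counts over every block of a hit and dividing identities by total columns gives a per-alignment identity estimate; it need not match the reported %identity exactly if the page truncates the alignment.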


Sequences
CDS sequence
ATGGGCAATTATCAAACCCTTTATCGTGTTCTTGAAGCCTGCAAATTCTTCCCCTTGCATTCCAAAACTGTTATCGAAACGCATGCACGGATTATTAAATTTGGATATGG
AAACTACCCAACTCTCATCACCTCTCTAGTATCAACTTATCAAAGTGCCGGTTGCCTTAATCGGGTCCATCAACTTCTTGATCTACTCTACTCTAAGCGTCTTGATTTAG
TTGCAATGAACTTACTCGTTGAAAATTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTACTGTGATGTGGTAACATGGAACTCAATC
ATTGGGGGTTGTGTGAAGAATGCACAGTATGACGAGGCATTTAGATTCTTTAGACAGATGCTGAACTCAAATATTCAGCCGGATGGATTTACATTTGCTTCTGTATTGAA
TGCATGTGCTCAGCTCGGAGCTCCAAGTAAGACTCAGTGGGTTCATGCTTTGATGACTCAGAAAAAAATTGAGCTTAATTCTATATTAAGTTGTGCACTCATAGATGCAT
ACTCAAAGTGTGGTAGCATCCAAATTGCGAAGGAAATGTTTAGTTGCGTCCCTCACAATGACATCTCAGTTTGGAATTCGATGATCAAAGGGCTTGCAATTCATGGCCTT
GCAATGGATGCATTATCGTTATTTTTTATGATGGAGCGTGAGAATGTTGTGCCTGATGCTGTCACCTTTTTGGGCATTTTAACAGCCTGCAACCATGGTGGTTTAATTGA
CCAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATTTAATTCAGCCACAGCTTGAGCATTATGGAGTCATGGTTGATCTCTTTAGCCGAGCTGGGTTTCTGGAGG
AGGCCTATTCCCTAATTGTGGCAATGCCAATAGAGCCAGATGTTGTCACATGGAGGACGCTTTTGAGTGGTTGTAGAATTTACAGAAATCAAGAACTCGCAGAAGTTGCT
ATTGCAAATATGTCTCATTGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCTCTCAACAGATGGGAGCATGCAGAAGCAGTTAGAAAGATGATGAAAAT
CAATGGGGTTCGTAAGAATTGTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTTTAAACTTCAAGTCGGGTGATCGATCACATCCGGAAAGCGATGCAATATACAAAG
TGTTGTGCAGTTTGATGAAGAGAACTCGGTCAGAGGGATATATGCCTGTGACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATTTCAC
AGCGAAAAGATGGCATTGGCTTATGCGATCCTGAAAACTAGTCCTGGAGCAAGGATCAGTATATCAAAGAACCTGCGGATTTGTGATGATTGTCATACATGGATAAAACT
AGTTTCAAGGGTGCTCTGCAGAGCTATAGTAGTGAGGGATCGGATCCGGTTCCATCAATTTGAAGGTGGCACCAAGTCCATCTTCACAGTCCACAAAAATCGTTCTTGGC
CAGCTCATCTGCTTGTCCCAGGAATTTTCTCAACTGCTGGCAATCTTCTTCCTAAACGAATACACTCTTCAACAATACATATTATATTGGGTGTCTCTGCAGAAATGAAT
AGTCTAAGCTGGGCTATCTTCGTTCCTCTGTTTGCTGCCTTCTTGACCACATCTAATGCAGTCGCGCGGGAAAGGAGTCAACACACTGAACGAATATCAGGCAGTGCTGG
TGATGTTTTGGAGGATAATCCTGTGGGAAGGTTGAAGGTGTTTGTCTATGAGCTTCCTAGCAAATACAACAAGAAAATTCTCCAGAAAGACCCTAGATGCCTTAATCACA
TGTTCGCAGCTGAGATCTTTATGCATCGCTTTCTTTTAACTAGTCCTGTTCGAACCCTTAATCCTGAAGAAGCTGACTGGTTTTATACTCCAGTTTATACAACTTGCGAC
CTTACTCCAAATGGTCTCCCTTTGCCATTTAAATCCCCTCGAATGATGAGAAGTGCAATACAACTTATTTCTTCAAACTGGCCTTACTGGAACCGGACCGAAGGGGCTGA
TCACTTTTTTGTTGTACCCCACGATTTTGGAGCTTGCTTCCACTATCAAGAAGAGAAGGCAATTGAAAGAGGAATACTTCCTTTGCTACAACGTGCTACCTTGGTTCAGA
CTTTTGGACAACGAAATCATGTTTGCTTGAAGGAGGGATCAATTACAATTCCTCCTTATGCCCCTCCTCAGAAAATGCACGCCCACCTCATTCCTGAAAAAACTCCTAGG
TCCATCTTTGTTTACTTTCGTGGACTATTCTATGATGTCGGAAATGATCCAGAAGGTGGTTATTACGCAAGAGGCGCAAGAGCTGCAGTGTGGGAGAACTTCAAGGATAA
TCCTCTTTTTGATATATCAACGGAGCATCCAACTACATACTATGAGGACATGCAGCGAGCTGTGTTTTGTCTTTGCCCGCTTGGATGGGCTCCTTGGAGCCCTAGATTGG
TTGAGGCTGTGATATTTGGTTGCATCCCTGTTATCATAGCTGATGACATTGTTCTGCCCTTCGCAGATGCAATTCCTTGGGAAGAAATTGGGGTATTTTTAGATGAGAAA
GATGTTGCTAACTTGGACACAATATTAACGTCCATCCCACTTGAAATGATATTGAGAAAGCAGAGACTGTTAGCAAATCCTTCCATGAAACAAGCAATGTTATTTCCACA
GCCTGCTCAACCTGGAGATGCTTTTCACCAAGTCCTGAATGGACTTGCCCGTAAGTTGCCTCATGACAGGAGTGTCTACCTTACGGCCGGTGAGAAGATTCTAAATTGGA
CCGCAGGTCCGGTATCATATGTCCCAAACAAGAGGCCGCAGCGACCTCCATATGCAGCAGTATCAAGTAATCCCATCAAAGGATTTCGAGAAAAGCAGGAAGAACTTCAA
CTTACATTTCGCCTGCCGTTCAGCTTGCCCAACCTTGAGCTGACATTAACAATGCGACCACCAGCAGACGAAGGTTTCATCAAGGGGATCATAGCTTGA
mRNA sequence
ATGGGCAATTATCAAACCCTTTATCGTGTTCTTGAAGCCTGCAAATTCTTCCCCTTGCATTCCAAAACTGTTATCGAAACGCATGCACGGATTATTAAATTTGGATATGG
AAACTACCCAACTCTCATCACCTCTCTAGTATCAACTTATCAAAGTGCCGGTTGCCTTAATCGGGTCCATCAACTTCTTGATCTACTCTACTCTAAGCGTCTTGATTTAG
TTGCAATGAACTTACTCGTTGAAAATTTTATGAAAATCGGGGAGTGCAAATTTGCTAAAAAGGTATTTTATAAAATGCCTTACTGTGATGTGGTAACATGGAACTCAATC
ATTGGGGGTTGTGTGAAGAATGCACAGTATGACGAGGCATTTAGATTCTTTAGACAGATGCTGAACTCAAATATTCAGCCGGATGGATTTACATTTGCTTCTGTATTGAA
TGCATGTGCTCAGCTCGGAGCTCCAAGTAAGACTCAGTGGGTTCATGCTTTGATGACTCAGAAAAAAATTGAGCTTAATTCTATATTAAGTTGTGCACTCATAGATGCAT
ACTCAAAGTGTGGTAGCATCCAAATTGCGAAGGAAATGTTTAGTTGCGTCCCTCACAATGACATCTCAGTTTGGAATTCGATGATCAAAGGGCTTGCAATTCATGGCCTT
GCAATGGATGCATTATCGTTATTTTTTATGATGGAGCGTGAGAATGTTGTGCCTGATGCTGTCACCTTTTTGGGCATTTTAACAGCCTGCAACCATGGTGGTTTAATTGA
CCAGGGTCGCAGGTATTTTGATTGGATGAAAAGCCGTTATTTAATTCAGCCACAGCTTGAGCATTATGGAGTCATGGTTGATCTCTTTAGCCGAGCTGGGTTTCTGGAGG
AGGCCTATTCCCTAATTGTGGCAATGCCAATAGAGCCAGATGTTGTCACATGGAGGACGCTTTTGAGTGGTTGTAGAATTTACAGAAATCAAGAACTCGCAGAAGTTGCT
ATTGCAAATATGTCTCATTGTAAGAGTGGAGATTACGTGTTATTATCAAATATCTATTGTTCTCTCAACAGATGGGAGCATGCAGAAGCAGTTAGAAAGATGATGAAAAT
CAATGGGGTTCGTAAGAATTGTGGAAAAAGCTGGATTGAGTTGGGAGGTACCATTTTAAACTTCAAGTCGGGTGATCGATCACATCCGGAAAGCGATGCAATATACAAAG
TGTTGTGCAGTTTGATGAAGAGAACTCGGTCAGAGGGATATATGCCTGTGACAGAGTTGGTTTTCATGGATATCTCTGAGGAGGAGAAGGAAGAGAACTTATCATTTCAC
AGCGAAAAGATGGCATTGGCTTATGCGATCCTGAAAACTAGTCCTGGAGCAAGGATCAGTATATCAAAGAACCTGCGGATTTGTGATGATTGTCATACATGGATAAAACT
AGTTTCAAGGGTGCTCTGCAGAGCTATAGTAGTGAGGGATCGGATCCGGTTCCATCAATTTGAAGGTGGCACCAAGTCCATCTTCACAGTCCACAAAAATCGTTCTTGGC
CAGCTCATCTGCTTGTCCCAGGAATTTTCTCAACTGCTGGCAATCTTCTTCCTAAACGAATACACTCTTCAACAATACATATTATATTGGGTGTCTCTGCAGAAATGAAT
AGTCTAAGCTGGGCTATCTTCGTTCCTCTGTTTGCTGCCTTCTTGACCACATCTAATGCAGTCGCGCGGGAAAGGAGTCAACACACTGAACGAATATCAGGCAGTGCTGG
TGATGTTTTGGAGGATAATCCTGTGGGAAGGTTGAAGGTGTTTGTCTATGAGCTTCCTAGCAAATACAACAAGAAAATTCTCCAGAAAGACCCTAGATGCCTTAATCACA
TGTTCGCAGCTGAGATCTTTATGCATCGCTTTCTTTTAACTAGTCCTGTTCGAACCCTTAATCCTGAAGAAGCTGACTGGTTTTATACTCCAGTTTATACAACTTGCGAC
CTTACTCCAAATGGTCTCCCTTTGCCATTTAAATCCCCTCGAATGATGAGAAGTGCAATACAACTTATTTCTTCAAACTGGCCTTACTGGAACCGGACCGAAGGGGCTGA
TCACTTTTTTGTTGTACCCCACGATTTTGGAGCTTGCTTCCACTATCAAGAAGAGAAGGCAATTGAAAGAGGAATACTTCCTTTGCTACAACGTGCTACCTTGGTTCAGA
CTTTTGGACAACGAAATCATGTTTGCTTGAAGGAGGGATCAATTACAATTCCTCCTTATGCCCCTCCTCAGAAAATGCACGCCCACCTCATTCCTGAAAAAACTCCTAGG
TCCATCTTTGTTTACTTTCGTGGACTATTCTATGATGTCGGAAATGATCCAGAAGGTGGTTATTACGCAAGAGGCGCAAGAGCTGCAGTGTGGGAGAACTTCAAGGATAA
TCCTCTTTTTGATATATCAACGGAGCATCCAACTACATACTATGAGGACATGCAGCGAGCTGTGTTTTGTCTTTGCCCGCTTGGATGGGCTCCTTGGAGCCCTAGATTGG
TTGAGGCTGTGATATTTGGTTGCATCCCTGTTATCATAGCTGATGACATTGTTCTGCCCTTCGCAGATGCAATTCCTTGGGAAGAAATTGGGGTATTTTTAGATGAGAAA
GATGTTGCTAACTTGGACACAATATTAACGTCCATCCCACTTGAAATGATATTGAGAAAGCAGAGACTGTTAGCAAATCCTTCCATGAAACAAGCAATGTTATTTCCACA
GCCTGCTCAACCTGGAGATGCTTTTCACCAAGTCCTGAATGGACTTGCCCGTAAGTTGCCTCATGACAGGAGTGTCTACCTTACGGCCGGTGAGAAGATTCTAAATTGGA
CCGCAGGTCCGGTATCATATGTCCCAAACAAGAGGCCGCAGCGACCTCCATATGCAGCAGTATCAAGTAATCCCATCAAAGGATTTCGAGAAAAGCAGGAAGAACTTCAA
CTTACATTTCGCCTGCCGTTCAGCTTGCCCAACCTTGAGCTGACATTAACAATGCGACCACCAGCAGACGAAGGTTTCATCAAGGGGATCATAGCTTGA
Protein sequence
MGNYQTLYRVLEACKFFPLHSKTVIETHARIIKFGYGNYPTLITSLVSTYQSAGCLNRVHQLLDLLYSKRLDLVAMNLLVENFMKIGECKFAKKVFYKMPYCDVVTWNSI
IGGCVKNAQYDEAFRFFRQMLNSNIQPDGFTFASVLNACAQLGAPSKTQWVHALMTQKKIELNSILSCALIDAYSKCGSIQIAKEMFSCVPHNDISVWNSMIKGLAIHGL
AMDALSLFFMMERENVVPDAVTFLGILTACNHGGLIDQGRRYFDWMKSRYLIQPQLEHYGVMVDLFSRAGFLEEAYSLIVAMPIEPDVVTWRTLLSGCRIYRNQELAEVA
IANMSHCKSGDYVLLSNIYCSLNRWEHAEAVRKMMKINGVRKNCGKSWIELGGTILNFKSGDRSHPESDAIYKVLCSLMKRTRSEGYMPVTELVFMDISEEEKEENLSFH
SEKMALAYAILKTSPGARISISKNLRICDDCHTWIKLVSRVLCRAIVVRDRIRFHQFEGGTKSIFTVHKNRSWPAHLLVPGIFSTAGNLLPKRIHSSTIHIILGVSAEMN
SLSWAIFVPLFAAFLTTSNAVARERSQHTERISGSAGDVLEDNPVGRLKVFVYELPSKYNKKILQKDPRCLNHMFAAEIFMHRFLLTSPVRTLNPEEADWFYTPVYTTCD
LTPNGLPLPFKSPRMMRSAIQLISSNWPYWNRTEGADHFFVVPHDFGACFHYQEEKAIERGILPLLQRATLVQTFGQRNHVCLKEGSITIPPYAPPQKMHAHLIPEKTPR
SIFVYFRGLFYDVGNDPEGGYYARGARAAVWENFKDNPLFDISTEHPTTYYEDMQRAVFCLCPLGWAPWSPRLVEAVIFGCIPVIIADDIVLPFADAIPWEEIGVFLDEK
DVANLDTILTSIPLEMILRKQRLLANPSMKQAMLFPQPAQPGDAFHQVLNGLARKLPHDRSVYLTAGEKILNWTAGPVSYVPNKRPQRPPYAAVSSNPIKGFREKQEELQ
LTFRLPFSLPNLELTLTMRPPADEGFIKGIIA
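
The CDS and protein sequences listed above can be checked against each other by translating the CDS. A minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene; `CODON_TABLE` and `translate` are hypothetical names introduced for illustration:

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code in TCAG order
# (first base varies slowest, third base fastest); '*' is a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the listing
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGGGCAATTATCAAACCCTTTATCGTGTT"))
print(translate("GGGATCATAGCTTGA"))
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the protein sequence is a simple way to confirm that the two listings agree.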