; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014011 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014011
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionAvr9/Cf-9 rapidly elicited protein
Genome locationscaffold607:1612141..1613529
RNA-Seq ExpressionMS014011
SyntenyMS014011
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032674.1 Avr9/Cf-9 rapidly elicited protein [Cucumis melo var. makuwa]1.8e-19576.94Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DDIGIG  +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_004142198.1 uncharacterized protein LOC101204955 [Cucumis sativus]4.0e-19576.51Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+++NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  T+ELHSAME L EME SE+K+QKW++ +PKQ PPVNFE FDKK+A+QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C+IY RI TVFGP V D    L H+P +RILRDRVW WNFY G +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DD+GIGY +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP TTVGGSGLS+NYANVIL AERCLHAPATIG+EARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EE+GSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E+M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_008449997.1 PREDICTED: uncharacterized protein LOC103491711, partial [Cucumis melo]1.8e-19576.94Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DDIGIG  +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_022931864.1 uncharacterized protein LOC111438146 [Cucurbita moschata]3.2e-17670.11Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        +PP TL+IL+F  AKTMA L+SL+RSL+D +I  L++H + S+G++YLNS   E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD++FSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+ GKS+S  KNVAKL+ +MEKLV ATAELHSAME L+EMEASE+K+QK +   PKQ  P+ F+ FDKK+A+QRKDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG
        RL+CVIYARIF VF PFVSD    LD +P ++ L +RVW WNF+ G+HRK G G  E +LVTQSGPI K GKKEL+RFPSGIR +++  I Y +  S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR--GEEEMGSGGDGHSLAAGWREAV
           A +NRVYTSAP TTVGGSGLS+NYANVIL AERCL+   TIG++ARG+LY+MLPARIK  VRAKLRRNNW KR  G+EEM S  DGHSLA GWREA+
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR--GEEEMGSGGDGHSLAAGWREAV

Query:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        EE+M WLGPLAHDTVRWQSERN+EKQRFD   T LLMQTLHYSDLEK EAAIVEVLVGL CI+RY
Subjt:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

XP_038883935.1 uncharacterized protein LOC120074765 [Benincasa hispida]1.9e-19778.4Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++H LRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  TAELHSAME+L+EMEASE+K+QKWR+ +PKQ PPVNFE FDKK+A+Q+KDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C IYARI TVFGP   D    + H+P +RILRDRVW WNFYGG +RK GD  E RLVTQSGPI KKGKKELVRFPSGIR +DDIGIGY +  S TG 
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR-GEEEMGSGGDGHSLAAGWREAVEE
            +NRVYTSAP TTVGGSGLSMNYANVIL AERCL APATIGEEARG+LYEMLPARIK  VRAKLRRNNWVKR GEEEMG G DG+SLAAGWREAVEE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR-GEEEMGSGGDGHSLAAGWREAVEE

Query:  IMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        +M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  IMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

TrEMBL top hitse value%identityAlignment
A0A0A0L0Z9 Uncharacterized protein1.9e-19576.51Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+++NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+S  KNVAKL+ARMEKLV  T+ELHSAME L EME SE+K+QKW++ +PKQ PPVNFE FDKK+A+QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C+IY RI TVFGP V D    L H+P +RILRDRVW WNFY G +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DD+GIGY +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP TTVGGSGLS+NYANVIL AERCLHAPATIG+EARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EE+GSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E+M WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A1S3BMP0 uncharacterized protein LOC1034917118.7e-19676.94Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DDIGIG  +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A5D3DIU8 Avr9/Cf-9 rapidly elicited protein8.7e-19676.94Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        S P TL+IL F  AKTMA L+SL+RSLSDD+I  L++ TLRS+G+ YLNS R E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD+LFSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIFHS KS+   KNVAKL++RMEKLV  TAELHSAMEAL EMEASE+K+QKW+   PKQ PPVNFE FDKK+++QRKDVKH+KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG
        RL+C IYARI T+FGP V D    L H+P +RILRDRVW WNFYGG +RK     E RLVTQSGPI KKGKKELVRFPSGIR +DDIGIG  +  S T  
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGG

Query:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE
            +NRVYTSAP  TVGGSGLS+NYANVIL AERCLHAPATIGEEARG+LYEMLPA IK  VRAKLRRNNWVKRGE  EEMGSGGDGHSLAAGWREAVE
Subjt:  EAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGE--EEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        E++ WLGPLAHDTVRWQSERN+EKQRFD +PTALLMQTLHYSDLEK EAAIVEVLVGLSCI+RY
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A6J1EVF1 uncharacterized protein LOC1114381461.5e-17670.11Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        +PP TL+IL+F  AKTMA L+SL+RSL+D +I  L++H + S+G++YLNS   E FLL LACSERLE+L+NAASSVSRLSRKCADLGL+RFD++FSDMKL
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+ GKS+S  KNVAKL+ +MEKLV ATAELHSAME L+EMEASE+K+QK +   PKQ  P+ F+ FDKK+A+QRKDVKH+KEISLWNQSFD+AVGLMT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG
        RL+CVIYARIF VF PFVSD    LD +P ++ L +RVW WNF+ G+HRK G G  E +LVTQSGPI K GKKEL+RFPSGIR +++  I Y +  S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR--GEEEMGSGGDGHSLAAGWREAV
           A +NRVYTSAP TTVGGSGLS+NYANVIL AERCL+   TIG++ARG+LY+MLPARIK  VRAKLRRNNW KR  G+EEM S  DGHSLA GWREA+
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR--GEEEMGSGGDGHSLAAGWREAV

Query:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        EE+M WLGPLAHDTVRWQSERN+EKQRFD   T LLMQTLHYSDLEK EAAIVEVLVGL CI+RY
Subjt:  EEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

A0A6J1GEN0 uncharacterized protein LOC1114534843.8e-17570.04Show/hide
Query:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL
        SPP TL ILSF  AKTMA L+SL+RSLSD++I  L++  +RSQG+ YLNS + + FLL LACSERLE+L+NAASSVSRLS+KCADLGL+RFD++FS+MK 
Subjt:  SPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKL

Query:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT
        GIF+S KS S  KNVAKL+ RMEKLV +TAELHS+MEAL EMEASE+K+Q WR+ +P Q PPVNFE   +K+A QRKDVKH KEISLWNQSFD+AVG+MT
Subjt:  GIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMT

Query:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG
        RLICVIYARI TVF P+V + D    H+P   I RDR W WNFYGG HRK G G GE +  TQSGPI K+GKKELVRFPS IRV D  G    +L S   
Subjt:  RLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDG-GEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTG

Query:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWV-KRGEEEMGSGGDGHSLAAGWREAVE
             +NRVYTSAP TTVGG+GLS+NYANVIL AERCLH+P TIGEEARG  YEMLPARIK  +RAKLRRNNW+ K GEE+MGS  D  SLAAGW+EAVE
Subjt:  GEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWV-KRGEEEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
        ++M WLGPLAHDT+RWQSERN+EKQRFD SPT LLMQTLHYSDLEK +AAIVE+LVGLSCI+++
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 34.7e-0525.5Show/hide
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I+  +  +   ++I   AR  LY+ LP  IK  +R+K++  N             D        ++ +E  + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL
           W  E       F + P+    L ++TL+++  EK E  I+  ++ L
Subjt:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL

Q9XID5 Protein PSK SIMULATOR 12.9e-0728Show/hide
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I   +  +   +T+    R  LY+ LP  IK  +R++++      + +EE+             +  +E+ + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
               W S  +   QR  A  T L + TLH++D EK EA I++++V L
Subjt:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

Arabidopsis top hitse value%identityAlignment
AT1G34320.1 Protein of unknown function (DUF668)2.1e-0828Show/hide
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I   +  +   +T+    R  LY+ LP  IK  +R++++      + +EE+             +  +E+ + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL
               W S  +   QR  A  T L + TLH++D EK EA I++++V L
Subjt:  -------WQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGL

AT3G23160.1 Protein of unknown function (DUF668)2.1e-6435.66Show/hide
Query:  SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMK
        S P  T+ ILSF  A  M+  + LHRSLSD +I  LK     S+G+  L S+  EN LL L+ SE+L+ L   AS VSRL +KC +  L  F+ ++ D+ 
Subjt:  SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMK

Query:  LGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLM
         G     K     K++  +V +ME+ V+AT  L+  ME + E+E +  K+Q+ + +        + + F++K+  QR+DVK  ++ SLWNQ++D  V ++
Subjt:  LGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLM

Query:  TRLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRV----------------
         R +C IY RI TVFG         L     V + RDR  +        R    G +D   +++    + G      FP G                   
Subjt:  TRLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRV----------------

Query:  --EDDIGIGYRQLKSPTGGEAAPSN------RVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR
          +DD   G R     +      SN      R+   A A+T+GGS LS++YANV++V E+ L  P  IGEEAR DLY+MLP  +K  ++A LR  +++K 
Subjt:  --EDDIGIGYRQLKSPTGGEAAPSN------RVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKR

Query:  GEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPT-ALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
                     LA  W+E ++ I++WL PLAH+ +RWQSERN E+Q      T  LL+QTL+++D EK EAAI ++LVGL+ I  Y
Subjt:  GEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPT-ALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

AT5G04550.1 Protein of unknown function (DUF668)2.0e-4326.95Show/hide
Query:  ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSD-MKLGI
        A L +L+F  A  ++ L+ L +SLSD  +  L+D    S G+  L S   ++F++ L   E +E ++N A +V+RL+RKC D  L  F+  FSD MK G 
Subjt:  ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSD-MKLGI

Query:  FHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRL
           G      K + K   +ME+ +S+ A L+   E L ++E + ++++   S         N  ++ KK+  +R +VK+ +++SLWN+++D+ V L+ R 
Subjt:  FHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRL

Query:  ICVIYARIFTVFG---------PFVSDDDF---------------HLDHDPHV--------------------------------RILRDRVWSWNFYGG
        +  I +R   VFG            +D DF               H      +                                 +  +   S      
Subjt:  ICVIYARIFTVFG---------PFVSDDDF---------------HLDHDPHV--------------------------------RILRDRVWSWNFYGG

Query:  QHR--KYGDGGEDRLVTQSGPIAKKGK--KELVRFP-----------------------SGIRVEDDIGIGYRQLKSPTGGE------------------
        +H+  K+  G   ++ ++SGP+   GK  K++ + P                        G  V  D GI     ++  G                    
Subjt:  QHR--KYGDGGEDRLVTQSGPIAKKGK--KELVRFP-----------------------SGIRVEDDIGIGYRQLKSPTGGE------------------

Query:  ---AAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVE
             PS    + A   T+G + L+++YANVI+V ER + +P  IG++AR DLY MLPA ++  +R +L+  +           G     LA  W +A+ 
Subjt:  ---AAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVE

Query:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
         I+ WLGPLAH+ ++WQSER+ E Q   +    +L QTL +++ +K EA I E+LVGL+ ++R+
Subjt:  EIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY

AT5G08660.1 Protein of unknown function (DUF668)3.3e-0625.5Show/hide
Query:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---
        +G +GL+++YAN+I+  +  +   ++I   AR  LY+ LP  IK  +R+K++  N             D        ++ +E  + WL P+A +T +   
Subjt:  VGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVR---

Query:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL
           W  E       F + P+    L ++TL+++  EK E  I+  ++ L
Subjt:  ---WQSERNLEKQRFDASPTA---LLMQTLHYSDLEKAEAAIVEVLVGL

AT5G51670.1 Protein of unknown function (DUF668)3.0e-4730.98Show/hide
Query:  SSPP-----ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDIL
        SSPP     +++ +LSF  A+ M  LL L  SL+D  + T +DH+L  +GL  +     E F L+L C+E  + L +AA+SVSRLS +C    L  F  L
Subjt:  SSPP-----ATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDIL

Query:  FSDM-KLGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERK--------IQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEI
        F +   +G    G   +C    AK   ++E+ VS T  L+  ME +  +E S RK         ++   Y  K+   +       KI  Q++ VK+ K+ 
Subjt:  FSDM-KLGIFHSGKSESCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERK--------IQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEI

Query:  SLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVED
        SLWN+SFD  V ++ R +    AR+ +VF    +    ++       + R    S +     H    D   D+  T S  + +                 
Subjt:  SLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVSDDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVED

Query:  DIGIGYRQLKSPTGGEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGD
              R LK                 P TT+GG+G++++YAN+I+V E+ +  P  +G +AR DLY MLPA ++  +R++L+           +G    
Subjt:  DIGIGYRQLKSPTGGEAAPSNRVYTSAPATTVGGSGLSMNYANVILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGD

Query:  GHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRF----DASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY
           LA  W+ A+  I+ WL PLA + +RWQSER+ E+Q      ++    +L+QTL ++D  K EAAI E+LVGL+ I+R+
Subjt:  GHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRF----DASPTALLMQTLHYSDLEKAEAAIVEVLVGLSCIFRY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCCTCTCCCCCGGCCACCCTCCAAATCCTCTCCTTCCACGCCGCCAAAACCATGGCCACTCTCCTCTCCCTCCACCGCTCCCTCTCCGACGACCAAATCCGAACCCTCAA
AGACCACACCCTCCGCTCCCAGGGCCTCCTCTACTTGAATTCCACCCGCCACGAAAACTTCCTCCTTACCCTCGCCTGCTCCGAGCGCCTCGAGCAGCTCGACAATGCCG
CCTCCTCTGTCTCCCGCCTCAGCCGCAAGTGCGCCGATTTGGGCCTCTCCCGGTTCGACATCCTCTTCTCCGACATGAAACTCGGGATCTTCCATTCTGGGAAATCCGAG
TCCTGCTCCAAGAATGTGGCCAAGCTCGTCGCCAGGATGGAGAAGCTTGTTTCGGCCACCGCCGAGCTCCACTCTGCCATGGAAGCTCTGCTCGAGATGGAGGCCTCCGA
GAGGAAGATTCAGAAGTGGAGATCCTACGCCCCTAAGCAGTGCCCCCCGGTCAATTTCGAGCAATTTGATAAAAAAATCGCAGCCCAGAGGAAAGATGTGAAGCATTACA
AAGAAATTTCGCTATGGAACCAGAGTTTCGATCACGCCGTCGGGCTCATGACCCGCCTGATTTGCGTCATCTACGCGAGGATTTTCACCGTCTTCGGCCCGTTCGTTTCC
GACGATGATTTTCATCTGGATCACGACCCGCACGTTCGGATCCTCCGGGACCGGGTTTGGAGCTGGAATTTCTATGGCGGGCAGCATCGGAAGTACGGCGACGGCGGGGA
AGACAGGCTCGTGACGCAATCGGGCCCAATCGCGAAGAAAGGGAAGAAGGAATTGGTCCGATTTCCGAGTGGGATTAGAGTGGAAGACGATATTGGAATTGGGTACCGGC
AATTGAAATCGCCGACGGGGGGAGAAGCGGCGCCGAGTAACAGAGTATACACATCCGCGCCGGCAACGACGGTGGGCGGGTCCGGACTGTCGATGAACTACGCGAATGTG
ATACTGGTGGCGGAGAGGTGTCTGCACGCGCCGGCGACGATAGGGGAGGAGGCGCGTGGGGATCTGTACGAGATGCTGCCGGCGAGAATAAAAGGGATGGTGAGGGCGAA
GTTGAGGAGGAACAATTGGGTGAAGAGAGGGGAGGAGGAAATGGGGAGCGGCGGCGACGGGCATTCGCTGGCGGCGGGGTGGAGGGAGGCGGTGGAGGAGATTATGGCGT
GGTTGGGGCCGCTGGCGCACGACACTGTGCGGTGGCAATCGGAGAGAAATTTAGAGAAACAGAGGTTCGATGCGAGTCCCACGGCGCTGCTGATGCAGACGCTGCATTAT
TCCGACTTGGAGAAGGCGGAGGCGGCCATTGTCGAGGTTCTGGTGGGCCTCAGTTGTATATTTAGGTAC
mRNA sequenceShow/hide mRNA sequence
TCCTCTCCCCCGGCCACCCTCCAAATCCTCTCCTTCCACGCCGCCAAAACCATGGCCACTCTCCTCTCCCTCCACCGCTCCCTCTCCGACGACCAAATCCGAACCCTCAA
AGACCACACCCTCCGCTCCCAGGGCCTCCTCTACTTGAATTCCACCCGCCACGAAAACTTCCTCCTTACCCTCGCCTGCTCCGAGCGCCTCGAGCAGCTCGACAATGCCG
CCTCCTCTGTCTCCCGCCTCAGCCGCAAGTGCGCCGATTTGGGCCTCTCCCGGTTCGACATCCTCTTCTCCGACATGAAACTCGGGATCTTCCATTCTGGGAAATCCGAG
TCCTGCTCCAAGAATGTGGCCAAGCTCGTCGCCAGGATGGAGAAGCTTGTTTCGGCCACCGCCGAGCTCCACTCTGCCATGGAAGCTCTGCTCGAGATGGAGGCCTCCGA
GAGGAAGATTCAGAAGTGGAGATCCTACGCCCCTAAGCAGTGCCCCCCGGTCAATTTCGAGCAATTTGATAAAAAAATCGCAGCCCAGAGGAAAGATGTGAAGCATTACA
AAGAAATTTCGCTATGGAACCAGAGTTTCGATCACGCCGTCGGGCTCATGACCCGCCTGATTTGCGTCATCTACGCGAGGATTTTCACCGTCTTCGGCCCGTTCGTTTCC
GACGATGATTTTCATCTGGATCACGACCCGCACGTTCGGATCCTCCGGGACCGGGTTTGGAGCTGGAATTTCTATGGCGGGCAGCATCGGAAGTACGGCGACGGCGGGGA
AGACAGGCTCGTGACGCAATCGGGCCCAATCGCGAAGAAAGGGAAGAAGGAATTGGTCCGATTTCCGAGTGGGATTAGAGTGGAAGACGATATTGGAATTGGGTACCGGC
AATTGAAATCGCCGACGGGGGGAGAAGCGGCGCCGAGTAACAGAGTATACACATCCGCGCCGGCAACGACGGTGGGCGGGTCCGGACTGTCGATGAACTACGCGAATGTG
ATACTGGTGGCGGAGAGGTGTCTGCACGCGCCGGCGACGATAGGGGAGGAGGCGCGTGGGGATCTGTACGAGATGCTGCCGGCGAGAATAAAAGGGATGGTGAGGGCGAA
GTTGAGGAGGAACAATTGGGTGAAGAGAGGGGAGGAGGAAATGGGGAGCGGCGGCGACGGGCATTCGCTGGCGGCGGGGTGGAGGGAGGCGGTGGAGGAGATTATGGCGT
GGTTGGGGCCGCTGGCGCACGACACTGTGCGGTGGCAATCGGAGAGAAATTTAGAGAAACAGAGGTTCGATGCGAGTCCCACGGCGCTGCTGATGCAGACGCTGCATTAT
TCCGACTTGGAGAAGGCGGAGGCGGCCATTGTCGAGGTTCTGGTGGGCCTCAGTTGTATATTTAGGTAC
Protein sequenceShow/hide protein sequence
SSPPATLQILSFHAAKTMATLLSLHRSLSDDQIRTLKDHTLRSQGLLYLNSTRHENFLLTLACSERLEQLDNAASSVSRLSRKCADLGLSRFDILFSDMKLGIFHSGKSE
SCSKNVAKLVARMEKLVSATAELHSAMEALLEMEASERKIQKWRSYAPKQCPPVNFEQFDKKIAAQRKDVKHYKEISLWNQSFDHAVGLMTRLICVIYARIFTVFGPFVS
DDDFHLDHDPHVRILRDRVWSWNFYGGQHRKYGDGGEDRLVTQSGPIAKKGKKELVRFPSGIRVEDDIGIGYRQLKSPTGGEAAPSNRVYTSAPATTVGGSGLSMNYANV
ILVAERCLHAPATIGEEARGDLYEMLPARIKGMVRAKLRRNNWVKRGEEEMGSGGDGHSLAAGWREAVEEIMAWLGPLAHDTVRWQSERNLEKQRFDASPTALLMQTLHY
SDLEKAEAAIVEVLVGLSCIFRY